BLASTX nr result

ID: Zingiber25_contig00023099

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00023099
         (1601 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMS47407.1| Histone-lysine N-methyltransferase SETD1B-A [Trit...   260   1e-66
emb|CBI29431.3| unnamed protein product [Vitis vinifera]              259   2e-66
ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843...   256   1e-65
gb|EMT10715.1| BTB/POZ domain-containing protein [Aegilops tausc...   253   1e-64
ref|XP_006664740.1| PREDICTED: uncharacterized protein LOC102715...   250   1e-63
gb|EEE53601.1| hypothetical protein OsJ_36855 [Oryza sativa Japo...   246   3e-62
gb|EEC69667.1| hypothetical protein OsI_39097 [Oryza sativa Indi...   246   3e-62
ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group] g...   244   6e-62
gb|EOY15834.1| Set domain protein, putative isoform 4 [Theobroma...   241   6e-61
gb|EOY15831.1| Set domain protein, putative isoform 1 [Theobroma...   241   6e-61
ref|XP_004963230.1| PREDICTED: uncharacterized protein LOC101782...   234   1e-58
ref|XP_004963226.1| PREDICTED: uncharacterized protein LOC101782...   234   1e-58
gb|EXC31045.1| Histone-lysine N-methyltransferase SETD1B [Morus ...   228   4e-57
ref|XP_004301597.1| PREDICTED: uncharacterized protein LOC101295...   226   2e-56
gb|EMJ21490.1| hypothetical protein PRUPE_ppa000519mg [Prunus pe...   226   2e-56
ref|XP_002510762.1| set domain protein, putative [Ricinus commun...   220   2e-54
ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [M...   218   4e-54
ref|XP_006348442.1| PREDICTED: uncharacterized protein LOC102597...   217   1e-53
ref|XP_004487927.1| PREDICTED: uncharacterized protein LOC101514...   217   1e-53
ref|XP_004487926.1| PREDICTED: uncharacterized protein LOC101514...   217   1e-53
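
The one-line hit summaries above can be read into records with a few lines of Python. This is a minimal sketch, not part of the BLAST output: the `Hit` type and the `parse_summary_line` helper are hypothetical names introduced here, and the code assumes every summary line ends with the bit score and the E value as its last two whitespace-separated fields, as they do in this report.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table (hypothetical type)."""
    accession: str
    description: str
    bits: float
    evalue: float


def parse_summary_line(line: str) -> Hit:
    """Split a summary line into accession, description, bit score and E value.

    Assumes the last two whitespace-separated fields are the bit score and
    the E value, and the first field is the database identifier.
    """
    head, bits, evalue = line.rstrip().rsplit(None, 2)
    accession, description = head.split(None, 1)
    # Precaution (assumption): some legacy reports abbreviate 1e-100 as "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(accession, description.strip(), float(bits), float(evalue))


# Two lines copied from the table above.
sample = [
    "gb|EMS47407.1| Histone-lysine N-methyltransferase SETD1B-A [Trit...   260   1e-66",
    "emb|CBI29431.3| unnamed protein product [Vitis vinifera]              259   2e-66",
]
hits = [parse_summary_line(s) for s in sample]

for hit in hits:
    print(hit.accession, hit.bits, hit.evalue)
```

Descriptions in this table are truncated by BLAST with a trailing "..."; the sketch keeps them as printed, so the full names have to be taken from the ">" header lines of the alignments that follow.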

>gb|EMS47407.1| Histone-lysine N-methyltransferase SETD1B-A [Triticum urartu]
          Length = 1321

 Score =  260 bits (664), Expect = 1e-66
 Identities = 162/471 (34%), Positives = 245/471 (52%), Gaps = 16/471 (3%)
 Frame = -2

Query: 1390 RCSYDGSGPSCLHEQVDV-GPYVAMEGSSQSFSNICELPLPDGTGVTLTQNSC-DRYAQF 1217
            RCS      SC  +Q    G +VAM+G+    +N   +    GTG +  QN     Y Q 
Sbjct: 139  RCSERRQIASCSGDQPQSSGMFVAMQGNVCPVNNGGGIYSQSGTGHSNGQNGTYGAYPQH 198

Query: 1216 S-ISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFS 1040
                G MYVN++G MCGPY+ +QL EGLSSGFLP +L +Y ++ G +   V L  L+QF 
Sbjct: 199  QPFEGCMYVNEHGQMCGPYAPKQLYEGLSSGFLPRDLAIYALVGGQMLNPVALSSLEQFL 258

Query: 1039 SPRYYPSSVDATARD-ENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXXXX 863
            S      +V       EN  +A  D ++  ++ +  ++                      
Sbjct: 259  SQWNSAGAVTTPNEPKENKTVARTDKLALPNALSSDES---------------------- 296

Query: 862  XXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAI 683
                       CWMFED +G +HGPHSLAEL YW H+SYL+D  M+YHVD++ GPF+L  
Sbjct: 297  -----------CWMFEDSDGSRHGPHSLAELSYWLHSSYLQDLSMVYHVDSKFGPFTLVS 345

Query: 682  LVEEWSRINCEN--ISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVL 509
            LV+ WS  + E+   SEND  S  D          ++ +  +++S QLH+ I+K+AR++ 
Sbjct: 346  LVDWWSGGHTEHSEASENDSGSASD----------VMGDIVDDISHQLHAGIVKSARKIF 395

Query: 508  LDEIFSTIIPEFISTKKAQRHLRAES----IDKNAKTYELSKGKEQPPL------NTLQE 359
            +DEIFS ++PE I+ +K ++ L A+S    +  ++K     KGK           N+   
Sbjct: 396  VDEIFSCVLPEIIACRKTEKQLAAKSKLLAVKPDSKKASALKGKAHAKFTNHKKGNSYNT 455

Query: 358  VNSSQQTCSIILDIHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWS 179
            V ++         +H    ++L    +  YY+ MK +WD +LYDPVM+YCG+W+K+   S
Sbjct: 456  VRATSPVAVQSTAVHAKFSDILSAVWQTLYYESMKNIWDGILYDPVMDYCGSWIKRNHHS 515

Query: 178  GLQCSSLIVNSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGFGPNI 26
             L C+S+   S+  +  + D +     KV+  + V+   DMDFPPGFGP +
Sbjct: 516  SLPCTSIPGASDNANKQEADGL-----KVI-CDSVALECDMDFPPGFGPTL 560


>emb|CBI29431.3| unnamed protein product [Vitis vinifera]
          Length = 1127

 Score =  259 bits (662), Expect = 2e-66
 Identities = 169/467 (36%), Positives = 246/467 (52%), Gaps = 38/467 (8%)
 Frame = -2

Query: 1321 MEGSSQSFSNICELPLPDGTGVTLTQN-SCDRYAQFS-ISGWMYVNQNGHMCGPYSQEQL 1148
            ME S +S  N  ++      G TL Q+     YA    + GWMY+N+ G MCGPY Q+QL
Sbjct: 1    MEMSCRSNGNTDDILQSCNIGGTLNQDRGGSGYAPPPFVGGWMYINEQGQMCGPYIQQQL 60

Query: 1147 IEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQF-----SSPRYYPSSVDATARDENSQ 983
             EGLS+GFLP+ELPVYPV+NG++   V LKY KQF     +   Y  + + AT R  N  
Sbjct: 61   YEGLSTGFLPDELPVYPVVNGNLINPVPLKYFKQFPDHVATGFAYLSAGISATIRPTNLT 120

Query: 982  LAGKDPVSYFSS------RTEHDATVTASHGSGQGKSNNGXXXXXXXXXXXXXXXXSCWM 821
               +D    F++      ++     V+ S     G+  N                 SCW+
Sbjct: 121  AHRQDGTVEFAALDKGYLQSASQPCVSHSVYGFDGQMPNTEAANCSTSNPHLSGEASCWL 180

Query: 820  FEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAILVEEWSRINCENIS 641
            FED EG KHGPHS AEL  WHH  YL DS MIYH +N+ GPF+L  ++  W      +  
Sbjct: 181  FEDSEGRKHGPHSYAELYSWHHYGYLSDSSMIYHAENKCGPFTLLSMLNTWR----TDRP 236

Query: 640  ENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVLLDEIFSTIIPEFISTK 461
            E +  S+G++ E  +S  +L+   +EEVS QLHS I+KA+RR LLDEI S II EF+++K
Sbjct: 237  ETNPLSDGENNETGSSL-NLMSEIAEEVSSQLHSGIIKASRRALLDEIISNIIAEFVASK 295

Query: 460  KAQRHLRAESIDKNAKTYEL-SKGKEQPPLNTLQE-------VNSSQQTCSI-------- 329
            KAQR  + E+ +   +T+ + S G+    + + +           S QTC I        
Sbjct: 296  KAQRLRKLETAN---QTFNMCSDGRMSEIIGSRKNSVAPGGGTALSDQTCLINETPKESS 352

Query: 328  --ILDIH--ENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWSGLQCSS 161
              I  +   EN +   +   +  +  CM+V+W++V Y PV EYC  W K+K+WSG     
Sbjct: 353  EKIKSVGGIENFQHTCMVVCRTIFDSCMQVMWNAVFYAPVAEYCSTWRKRKRWSG---HP 409

Query: 160  LIVNSEVQDPDQMDMVLKNADKVVELEPVSSRH-----DMDFPPGFG 35
             I++  V+        ++ ++K+++ EP+   H     ++D PPGFG
Sbjct: 410  RIMHPAVEQAMLFRDNVEKSEKLID-EPLQEEHEYSVCEVDCPPGFG 455


>ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843412 [Brachypodium
            distachyon]
          Length = 1194

 Score =  256 bits (655), Expect = 1e-65
 Identities = 164/477 (34%), Positives = 248/477 (51%), Gaps = 23/477 (4%)
 Frame = -2

Query: 1387 CSYDGSGPSCLHEQVDVGPYVAMEGSSQSFSN---IC-ELPLPDGTGVTLTQNSCDRYAQ 1220
            CS DG      H + + G + AM+ +  S  N   IC +  +    G   T N+  ++ Q
Sbjct: 69   CSGDG------HTR-NTGLFAAMQENLCSVDNGGAICPQSVVGHSGGENGTLNAYPQHHQ 121

Query: 1219 FSISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFS 1040
             S+ G MY+N+NG MCGPY+ +QL EGLS+GFLP++L +Y ++ G +   V L  L+QF 
Sbjct: 122  -SLEGCMYMNENGQMCGPYAPKQLYEGLSTGFLPQDLAIYALVGGKMVNPVPLSLLEQFL 180

Query: 1039 SPRYYPSSVDATARD-ENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXXXX 863
            S      +V   +   EN  +A  D ++     T  +                       
Sbjct: 181  SQLNSGVAVSLPSESKENKTVARIDKMALPDVLTSEEP---------------------- 218

Query: 862  XXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAI 683
                       CWMFED EG +HGPHSLAEL YWHH+SYL+D  MI+HVD++ GPF+L  
Sbjct: 219  -----------CWMFEDTEGCRHGPHSLAELSYWHHSSYLQDLSMIHHVDSKFGPFTLVS 267

Query: 682  LVEEWSRINCEN--ISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVL 509
            L++ W+  + E+   +END            SF++L+ +  +++  QLH+ IMK ARR++
Sbjct: 268  LIDWWTGGHSEHSEATEND----------SGSFSTLMSDIIDDIGHQLHTGIMKRARRII 317

Query: 508  LDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYELS-------KGKEQPPL------NT 368
            +DEIFS+++PE I+  KA++ L A+S  +  K   +S       KGK           N+
Sbjct: 318  VDEIFSSVLPEIIAGMKAKKQLAAKSTSQAVKPDNVSSKNASALKGKIDARSSIHKKGNS 377

Query: 367  LQEVNSSQQTCSIILDIHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKK 188
               + ++       + +H    ++L E  +  YY+ MK +WD V+YDPVM YCG WLK+ 
Sbjct: 378  YNTIRATPSMAVQSIAVHAKFADILSEVWQTIYYESMKNIWDEVMYDPVMNYCGGWLKRN 437

Query: 187  QWSGLQCS---SLIVNSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGFGPNI 26
                L C+       N  +Q+ D + +      KV+  +P +   DMDFPPGFGP+I
Sbjct: 438  HQLSLPCTIVPGAPENRNMQETDGLSL------KVI-CDPEAIECDMDFPPGFGPSI 487


>gb|EMT10715.1| BTB/POZ domain-containing protein [Aegilops tauschii]
          Length = 1919

 Score =  253 bits (647), Expect = 1e-64
 Identities = 161/473 (34%), Positives = 250/473 (52%), Gaps = 18/473 (3%)
 Frame = -2

Query: 1390 RCSYDGSGPSCLHEQVDV-GPYVAMEGSSQSFSNICELPLPDGTGVTLTQNSC-DRYAQF 1217
            RCS      SC  +Q    G +VA +G++   +N   +    G   +  QN     Y Q 
Sbjct: 695  RCSERRQIASCSGDQPQSSGMFVATQGNACPVNNGGGIYSQSGMSYSNGQNGTYGAYPQH 754

Query: 1216 S-ISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFS 1040
                G MYVN++G MCGPY+ +QL EGLSS FLP++L +Y ++ G +   V L  L+QF 
Sbjct: 755  QPFEGCMYVNEHGQMCGPYAPKQLYEGLSSSFLPQDLAIYALVGGQMLNPVALSSLEQFL 814

Query: 1039 SPRYYPSSVDATARDE---NSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXX 869
            S   + S+  AT  +E   N  +A  D +++  + +  ++                    
Sbjct: 815  SQ--WNSAGAATMPNESKENKTVARTDKMAFPDALSSDES-------------------- 852

Query: 868  XXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSL 689
                         CWMFED +G +HGPHSLAEL YW H+SYL+D  M+YHVD++ GPF+L
Sbjct: 853  -------------CWMFEDSDGSRHGPHSLAELSYWLHSSYLQDLSMVYHVDSKFGPFTL 899

Query: 688  AILVEEWSRINCEN--ISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARR 515
              L++ WS  + E+   SEND  S  D          +L +  +++S QLH+ I+K+AR+
Sbjct: 900  VSLIDWWSGGHTEHSEASENDSGSASD----------VLGDIVDDISHQLHAGIVKSARK 949

Query: 514  VLLDEIFSTIIPEFISTKKAQRHLRAES----IDKNAKTYELSKGKEQPPLNTLQE---V 356
            + +DEIFS ++PE I+ +K ++ L A+S       ++K     KGK        ++    
Sbjct: 950  IFVDEIFSCVLPEIIACRKTEKQLAAKSKLLACKPDSKKASALKGKAHAKFTNHKKGSSY 1009

Query: 355  NSSQQTCSIILD---IHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQ 185
            N+ + T  + +    +H    +VL    +  YY+ MK +WD +LYDPVM+YCGAW+K+  
Sbjct: 1010 NTVRATSPVAVQSTAVHTKFADVLSAVWQTLYYESMKNIWDGILYDPVMDYCGAWIKRND 1069

Query: 184  WSGLQCSSLIVNSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGFGPNI 26
             S L C+S+  +S+ ++  + D +     KV+  +  +   DMDFPPGFGP +
Sbjct: 1070 HSSLPCTSIPGSSDNRNKQEADGL-----KVI-CDSEALECDMDFPPGFGPTL 1116


>ref|XP_006664740.1| PREDICTED: uncharacterized protein LOC102715167 [Oryza brachyantha]
          Length = 1221

 Score =  250 bits (639), Expect = 1e-63
 Identities = 153/416 (36%), Positives = 215/416 (51%), Gaps = 14/416 (3%)
 Frame = -2

Query: 1234 DRYAQF-SISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLK 1058
            D Y Q  ++ G MY+NQ+G MCGPYS EQL EGLS+GFLP +L +Y V  G ++  V L 
Sbjct: 121  DGYVQNQTLEGCMYMNQHGQMCGPYSPEQLYEGLSTGFLPHDLAIYAVFGGKMANPVHLS 180

Query: 1057 YLKQFSSPRYYPSSVDATARDENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGX 878
            +LKQF S     ++VD       ++ AG   ++  S     DA ++              
Sbjct: 181  FLKQFLSQWNSNAAVDT-----RNKSAGNKKLASVSKLLLPDALLSEES----------- 224

Query: 877  XXXXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGP 698
                            CWMFED EG +HGPHSLAEL YWHH+SYL D  MIYHVD++ GP
Sbjct: 225  ----------------CWMFEDAEGRRHGPHSLAELSYWHHSSYLHDLSMIYHVDSKFGP 268

Query: 697  FSLAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAAR 518
            F+L  L++ WS       SE+     G       S  +L+ +  E++S QLH+ IMK+ R
Sbjct: 269  FTLVSLIDWWS--GGAERSESAANDSG-------SLNALMSDIVEDISHQLHAGIMKSTR 319

Query: 517  RVLLDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYELSK-----------GKEQPP-- 377
            +V +DEIFS+++PE I+ +K ++ + A+   + AKT  +S            G    P  
Sbjct: 320  KVFIDEIFSSVLPEIIACRKTEKQMAAKLKSQAAKTDNVSNKNALVLMGKGDGTSTHPKK 379

Query: 376  LNTLQEVNSSQQTCSIILDIHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWL 197
            LN+  +V       +    +     ++L       Y + MK +WD VLYDPVM+YC AWL
Sbjct: 380  LNSFNKVLGDPSVAAQSTALQYEFADILSAVWTAIYNESMKSIWDEVLYDPVMDYCDAWL 439

Query: 196  KKKQWSGLQCSSLIVNSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGFGPN 29
            K+K  S L  + ++  S  Q     D +   A      +  +   DMDFPPGFGPN
Sbjct: 440  KRKNESNLLSTVVLGTSNNQKMQATDEMPPKA----ICDSDAPDGDMDFPPGFGPN 491


>gb|EEE53601.1| hypothetical protein OsJ_36855 [Oryza sativa Japonica Group]
          Length = 1165

 Score =  246 bits (627), Expect = 3e-62
 Identities = 171/480 (35%), Positives = 242/480 (50%), Gaps = 19/480 (3%)
 Frame = -2

Query: 1411 FASRTSIRCSYDGSGPSCLHEQVDVGPYVAMEGSSQSFSNICELPLPDGTGVTLTQNSCD 1232
            FA+     CS D  G  C   Q  +G      G+              G G  L QN   
Sbjct: 79   FAAMQENACSIDSKGVVC--PQSGLGYSAGQNGTH------------GGGGSMLHQN--- 121

Query: 1231 RYAQFSISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYL 1052
                  + G MY+NQ G MCGPY  EQL +GLS+GFL  +L +Y V  G ++  V+L  L
Sbjct: 122  ------LEGCMYMNQLGQMCGPYPPEQLYDGLSTGFLHRDLAIYAVFGGKMANPVSLGSL 175

Query: 1051 KQFSSPRYYPSSVDATARDENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXX 872
            KQF S ++   SV AT RDE+++     PV+           +   + S +         
Sbjct: 176  KQFLS-QWSSDSVVAT-RDESAENKKMAPVNKL---------ILPDNLSSEES------- 217

Query: 871  XXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFS 692
                          CWMFED EG +HGPHSLAEL YWHH+SYL D  MIYHVD++ GPF+
Sbjct: 218  --------------CWMFEDAEGRRHGPHSLAELSYWHHSSYLHDLSMIYHVDSKFGPFT 263

Query: 691  LAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRV 512
            L  L++ WS       +E+   S  D   L+    +L+ +  E++S QLH+ IMK+AR+V
Sbjct: 264  LVSLIDWWS-----GGTEHSESSANDSGSLN----ALMDDVVEDISHQLHAGIMKSARKV 314

Query: 511  LLDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYELS-------KGKEQPPLNTLQEVN 353
             +DEIFS+++PE I+ +K ++ + A+   + AKT  +S       KGK        + +N
Sbjct: 315  FIDEIFSSVLPEMIACRKTEKQMAAKRKSQAAKTDNVSNKNALVLKGKGDGTSTRPKSLN 374

Query: 352  SSQ----QTCSIILD---IHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLK 194
            S      +  S+ +    +     ++L    +  Y   MK +WD VLYDPVM+YC AWLK
Sbjct: 375  SYNNKVPEDPSVAVQSTAMQYEFADILSAVWETIYNKSMKSIWDEVLYDPVMDYCDAWLK 434

Query: 193  KKQWSGLQCSSLIV-----NSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGFGPN 29
            +K  S L   S +V     N ++QD D+M      + K +  +  +   DMDFPPGFGPN
Sbjct: 435  RKNESNL--LSTVVPGASDNQKMQDTDEM------SPKAI-CDSDAPESDMDFPPGFGPN 485


>gb|EEC69667.1| hypothetical protein OsI_39097 [Oryza sativa Indica Group]
          Length = 1167

 Score =  246 bits (627), Expect = 3e-62
 Identities = 171/480 (35%), Positives = 242/480 (50%), Gaps = 19/480 (3%)
 Frame = -2

Query: 1411 FASRTSIRCSYDGSGPSCLHEQVDVGPYVAMEGSSQSFSNICELPLPDGTGVTLTQNSCD 1232
            FA+     CS D  G  C   Q  +G      G+              G G  L QN   
Sbjct: 81   FAAMQENACSIDSKGVVC--PQSGLGYSAGQNGTH------------GGGGSMLHQN--- 123

Query: 1231 RYAQFSISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYL 1052
                  + G MY+NQ G MCGPY  EQL +GLS+GFL  +L +Y V  G ++  V+L  L
Sbjct: 124  ------LEGCMYMNQLGQMCGPYPPEQLYDGLSTGFLHRDLAIYAVFGGKMANPVSLGSL 177

Query: 1051 KQFSSPRYYPSSVDATARDENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXX 872
            KQF S ++   SV AT RDE+++     PV+           +   + S +         
Sbjct: 178  KQFLS-QWSSDSVVAT-RDESAENKKMAPVNKL---------ILPDNLSSEES------- 219

Query: 871  XXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFS 692
                          CWMFED EG +HGPHSLAEL YWHH+SYL D  MIYHVD++ GPF+
Sbjct: 220  --------------CWMFEDAEGRRHGPHSLAELSYWHHSSYLHDLSMIYHVDSKFGPFT 265

Query: 691  LAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRV 512
            L  L++ WS       +E+   S  D   L+    +L+ +  E++S QLH+ IMK+AR+V
Sbjct: 266  LVSLIDWWS-----GGTEHSESSANDSGSLN----ALMDDVVEDISHQLHAGIMKSARKV 316

Query: 511  LLDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYELS-------KGKEQPPLNTLQEVN 353
             +DEIFS+++PE I+ +K ++ + A+   + AKT  +S       KGK        + +N
Sbjct: 317  FIDEIFSSVLPEMIACRKTEKQMAAKRKSQAAKTDNVSNKNALVLKGKGDGTSTRPKSLN 376

Query: 352  SSQ----QTCSIILD---IHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLK 194
            S      +  S+ +    +     ++L    +  Y   MK +WD VLYDPVM+YC AWLK
Sbjct: 377  SYNNKVPEDPSVAVQSTAMQYEFADILSAVWETIYNKSMKSIWDEVLYDPVMDYCDAWLK 436

Query: 193  KKQWSGLQCSSLIV-----NSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGFGPN 29
            +K  S L   S +V     N ++QD D+M      + K +  +  +   DMDFPPGFGPN
Sbjct: 437  RKNESNL--LSTVVPGASDNQKMQDTDEM------SPKAI-CDSDAPESDMDFPPGFGPN 487


>ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group]
            gi|108862955|gb|ABA99391.2| SET domain containing
            protein, expressed [Oryza sativa Japonica Group]
            gi|113649769|dbj|BAF30281.1| Os12g0613200 [Oryza sativa
            Japonica Group]
          Length = 1212

 Score =  244 bits (624), Expect = 6e-62
 Identities = 171/480 (35%), Positives = 241/480 (50%), Gaps = 19/480 (3%)
 Frame = -2

Query: 1411 FASRTSIRCSYDGSGPSCLHEQVDVGPYVAMEGSSQSFSNICELPLPDGTGVTLTQNSCD 1232
            FA+     CS D  G  C   Q  +G      G+              G G  L QN   
Sbjct: 79   FAAMQENACSIDSKGVVC--PQSGLGYSAGQNGTH------------GGGGSMLHQN--- 121

Query: 1231 RYAQFSISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYL 1052
                  + G MY+NQ G MCGPY  EQL +GLS+GFL  +L +Y V  G ++  V+L  L
Sbjct: 122  ------LEGCMYMNQLGQMCGPYPPEQLYDGLSTGFLHRDLAIYAVFGGKMANPVSLGSL 175

Query: 1051 KQFSSPRYYPSSVDATARDENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXX 872
            KQF S ++   SV AT RDE+ +     PV+           +   + S +         
Sbjct: 176  KQFLS-QWSSDSVVAT-RDESVENKKMAPVNKL---------ILPDNLSSEES------- 217

Query: 871  XXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFS 692
                          CWMFED EG +HGPHSLAEL YWHH+SYL D  MIYHVD++ GPF+
Sbjct: 218  --------------CWMFEDAEGRRHGPHSLAELSYWHHSSYLHDLSMIYHVDSKFGPFT 263

Query: 691  LAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRV 512
            L  L++ WS       +E+   S  D   L+    +L+ +  E++S QLH+ IMK+AR+V
Sbjct: 264  LVSLIDWWS-----GGTEHSESSANDSGSLN----ALMDDVVEDISHQLHAGIMKSARKV 314

Query: 511  LLDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYELS-------KGKEQPPLNTLQEVN 353
             +DEIFS+++PE I+ +K ++ + A+   + AKT  +S       KGK        + +N
Sbjct: 315  FIDEIFSSVLPEMIACRKTEKQMAAKRKSQAAKTDNVSNKNALVLKGKGDGTSTRPKSLN 374

Query: 352  SSQ----QTCSIILD---IHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLK 194
            S      +  S+ +    +     ++L    +  Y   MK +WD VLYDPVM+YC AWLK
Sbjct: 375  SYNNKVPEDPSVAVQSTAMQYEFADILSAVWETIYNKSMKSIWDEVLYDPVMDYCDAWLK 434

Query: 193  KKQWSGLQCSSLIV-----NSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGFGPN 29
            +K  S L   S +V     N ++QD D+M      + K +  +  +   DMDFPPGFGPN
Sbjct: 435  RKNESNL--LSTVVPGASDNQKMQDTDEM------SPKAI-CDSDAPESDMDFPPGFGPN 485


>gb|EOY15834.1| Set domain protein, putative isoform 4 [Theobroma cacao]
          Length = 1235

 Score =  241 bits (615), Expect = 6e-61
 Identities = 165/451 (36%), Positives = 224/451 (49%), Gaps = 22/451 (4%)
 Frame = -2

Query: 1324 AMEGSSQSFSNICELPLP--DGTGVTLTQNSCDRYAQFSI-SGWMYVNQNGHMCGPYSQE 1154
            A E S QS  N   +P    DG G +    S   YA  S  SGWMYVN++G MCGPY Q+
Sbjct: 53   ATEMSCQSNGNSSGVPQSCNDGGG-SCQDKSYSSYAPSSFASGWMYVNEHGQMCGPYIQQ 111

Query: 1153 QLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSPRYYPSSVDATARDENSQLAG 974
            QL EGLS+GFLP+ELPVYPV+NG++S  V LKY +QF      P  V AT     S    
Sbjct: 112  QLYEGLSTGFLPDELPVYPVVNGTVSNPVPLKYFRQF------PGHV-ATGFVYLSSTTA 164

Query: 973  KDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXXXXXXXXXXXXXXSCWMFEDKEGMKH 794
             +      +  +H  + +  + +G   SN+                 +CW++ED +  KH
Sbjct: 165  SNCFKSSHTNFQHTLSQSQINRNGFDASND-----LISSSLLQSGEDACWLYEDDKSTKH 219

Query: 793  GPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAILVEEWSRINCENISENDIKSEGD 614
            GPHSL +L  WH   YL DS+MI+H +NR  P  L  ++  W          +   +  +
Sbjct: 220  GPHSLLQLYSWHRYGYLADSVMIHHAENRFRPIKLLSVLNAWKG--------SQAYAAEN 271

Query: 613  DAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVLLDEIFSTIIPEFISTKKAQRHLRAE 434
            + +L  +F S   + SEEVS QLHS IMKAARRV+LDEI S +I EF++ KK+QRHL  E
Sbjct: 272  ERDLSVNFIS---DISEEVSSQLHSGIMKAARRVVLDEIISNMISEFVTAKKSQRHLMVE 328

Query: 433  SIDKNAKTYELSKGKEQPP---------LNTLQEVNSSQQTC---------SI-ILDIHE 311
            S +++AK +   K  E  P           T    N S Q C         SI  +   E
Sbjct: 329  SFNQDAKRFPDGKRIENAPEIKMQCIPMFETAASHNVSDQPCIQESTCSPASIKYVGSIE 388

Query: 310  NIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWSGLQCSSLIVNSEVQDP 131
            N         K  +  CM+V+W++V YD + EY  +W + K W G      ++ S     
Sbjct: 389  NFWGSYTVVCKMLFDYCMQVMWNAVFYDSIAEYSSSWRRGKLWFG---HPNVMLSATDSR 445

Query: 130  DQMDMVLKNADKVVELEPVSSRHDMDFPPGF 38
            D  +   K  DK +        HD+D PPGF
Sbjct: 446  DHGNETEKVTDKPLLSGMELIAHDVDCPPGF 476


>gb|EOY15831.1| Set domain protein, putative isoform 1 [Theobroma cacao]
            gi|508723935|gb|EOY15832.1| Set domain protein, putative
            isoform 1 [Theobroma cacao] gi|508723936|gb|EOY15833.1|
            Set domain protein, putative isoform 1 [Theobroma cacao]
          Length = 1241

 Score =  241 bits (615), Expect = 6e-61
 Identities = 165/451 (36%), Positives = 224/451 (49%), Gaps = 22/451 (4%)
 Frame = -2

Query: 1324 AMEGSSQSFSNICELPLP--DGTGVTLTQNSCDRYAQFSI-SGWMYVNQNGHMCGPYSQE 1154
            A E S QS  N   +P    DG G +    S   YA  S  SGWMYVN++G MCGPY Q+
Sbjct: 53   ATEMSCQSNGNSSGVPQSCNDGGG-SCQDKSYSSYAPSSFASGWMYVNEHGQMCGPYIQQ 111

Query: 1153 QLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSPRYYPSSVDATARDENSQLAG 974
            QL EGLS+GFLP+ELPVYPV+NG++S  V LKY +QF      P  V AT     S    
Sbjct: 112  QLYEGLSTGFLPDELPVYPVVNGTVSNPVPLKYFRQF------PGHV-ATGFVYLSSTTA 164

Query: 973  KDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXXXXXXXXXXXXXXSCWMFEDKEGMKH 794
             +      +  +H  + +  + +G   SN+                 +CW++ED +  KH
Sbjct: 165  SNCFKSSHTNFQHTLSQSQINRNGFDASND-----LISSSLLQSGEDACWLYEDDKSTKH 219

Query: 793  GPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAILVEEWSRINCENISENDIKSEGD 614
            GPHSL +L  WH   YL DS+MI+H +NR  P  L  ++  W          +   +  +
Sbjct: 220  GPHSLLQLYSWHRYGYLADSVMIHHAENRFRPIKLLSVLNAWKG--------SQAYAAEN 271

Query: 613  DAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVLLDEIFSTIIPEFISTKKAQRHLRAE 434
            + +L  +F S   + SEEVS QLHS IMKAARRV+LDEI S +I EF++ KK+QRHL  E
Sbjct: 272  ERDLSVNFIS---DISEEVSSQLHSGIMKAARRVVLDEIISNMISEFVTAKKSQRHLMVE 328

Query: 433  SIDKNAKTYELSKGKEQPP---------LNTLQEVNSSQQTC---------SI-ILDIHE 311
            S +++AK +   K  E  P           T    N S Q C         SI  +   E
Sbjct: 329  SFNQDAKRFPDGKRIENAPEIKMQCIPMFETAASHNVSDQPCIQESTCSPASIKYVGSIE 388

Query: 310  NIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWSGLQCSSLIVNSEVQDP 131
            N         K  +  CM+V+W++V YD + EY  +W + K W G      ++ S     
Sbjct: 389  NFWGSYTVVCKMLFDYCMQVMWNAVFYDSIAEYSSSWRRGKLWFG---HPNVMLSATDSR 445

Query: 130  DQMDMVLKNADKVVELEPVSSRHDMDFPPGF 38
            D  +   K  DK +        HD+D PPGF
Sbjct: 446  DHGNETEKVTDKPLLSGMELIAHDVDCPPGF 476


>ref|XP_004963230.1| PREDICTED: uncharacterized protein LOC101782399 isoform X5 [Setaria
            italica]
          Length = 1145

 Score =  234 bits (596), Expect = 1e-58
 Identities = 150/452 (33%), Positives = 226/452 (50%), Gaps = 15/452 (3%)
 Frame = -2

Query: 1342 DVGPYVAMEGSSQSFSNICELPLPDGTGVTLTQNSC-DRYAQFS-ISGWMYVNQNGHMCG 1169
            + G + AM+ S  + +N   +    G G +  QN     Y Q   + G MY+N++G MCG
Sbjct: 64   NTGVFPAMQESVCTTANSGVVYPKSGLGFSAGQNGTYGAYLQHQYLEGCMYMNEHGQMCG 123

Query: 1168 PYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSPRYYPSSVDATARDEN 989
            PY  EQL EGLS+GFLP++L +Y V  G  +  V L +L QF S R +     AT    N
Sbjct: 124  PYPPEQLYEGLSTGFLPQDLAIYAVFGGKTADPVPLSFLNQFLSQRNF----GATVSTPN 179

Query: 988  SQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXXXXXXXXXXXXXXSCWMFEDK 809
            + +  K   S+       D +   S                            CWMFED 
Sbjct: 180  AYMETKKIPSHAKMVLPDDLSSEES----------------------------CWMFEDA 211

Query: 808  EGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAILVEEWSRINCENISENDI 629
            EG + GPHSLAEL YWHHNSY++D  MIYHVD + GPF+L  L+  WS  + E       
Sbjct: 212  EGCRQGPHSLAELSYWHHNSYIQDLSMIYHVDGKFGPFTLVSLIGSWSGEHAE------- 264

Query: 628  KSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVLLDEIFSTIIPEFISTKKAQR 449
                +    D+S   L+ +   +VS QLH+ IMK+ARRVL+DEIFS ++P+ I++KK ++
Sbjct: 265  --RSEATANDSSLNGLVGDIVGDVSHQLHAGIMKSARRVLIDEIFSCVLPDLIASKKTEK 322

Query: 448  HLRAESIDKNAKTYELS-------KGKEQPPLNTLQEVNSSQQ--TCSIILD---IHENI 305
             L A+  ++  K   +S       K K   P    +  NS++     S+ +    +H+  
Sbjct: 323  QLAAKLKNQATKPDSVSNMKISKLKVKINKPSTIPENGNSNRAPVDSSVAIQSTAVHDTF 382

Query: 304  KEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWSGLQCSSLIVNSE-VQDPD 128
             ++L    +  YY+ MK +WD +L DPVM+Y   W ++     L  + + V  + ++  D
Sbjct: 383  ADILSAVWQTIYYEAMKNIWDGILSDPVMDYSDVWFQRNCQLNLPSTIISVTPDNIKAQD 442

Query: 127  QMDMVLKNADKVVELEPVSSRHDMDFPPGFGP 32
              +M  K++D        ++  + +FPPGF P
Sbjct: 443  SHEMSSKDSD--------ATECETEFPPGFEP 466


>ref|XP_004963226.1| PREDICTED: uncharacterized protein LOC101782399 isoform X1 [Setaria
            italica] gi|514755227|ref|XP_004963227.1| PREDICTED:
            uncharacterized protein LOC101782399 isoform X2 [Setaria
            italica] gi|514755231|ref|XP_004963228.1| PREDICTED:
            uncharacterized protein LOC101782399 isoform X3 [Setaria
            italica] gi|514755235|ref|XP_004963229.1| PREDICTED:
            uncharacterized protein LOC101782399 isoform X4 [Setaria
            italica]
          Length = 1158

 Score =  234 bits (596), Expect = 1e-58
 Identities = 150/452 (33%), Positives = 226/452 (50%), Gaps = 15/452 (3%)
 Frame = -2

Query: 1342 DVGPYVAMEGSSQSFSNICELPLPDGTGVTLTQNSC-DRYAQFS-ISGWMYVNQNGHMCG 1169
            + G + AM+ S  + +N   +    G G +  QN     Y Q   + G MY+N++G MCG
Sbjct: 77   NTGVFPAMQESVCTTANSGVVYPKSGLGFSAGQNGTYGAYLQHQYLEGCMYMNEHGQMCG 136

Query: 1168 PYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSPRYYPSSVDATARDEN 989
            PY  EQL EGLS+GFLP++L +Y V  G  +  V L +L QF S R +     AT    N
Sbjct: 137  PYPPEQLYEGLSTGFLPQDLAIYAVFGGKTADPVPLSFLNQFLSQRNF----GATVSTPN 192

Query: 988  SQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXXXXXXXXXXXXXXSCWMFEDK 809
            + +  K   S+       D +   S                            CWMFED 
Sbjct: 193  AYMETKKIPSHAKMVLPDDLSSEES----------------------------CWMFEDA 224

Query: 808  EGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAILVEEWSRINCENISENDI 629
            EG + GPHSLAEL YWHHNSY++D  MIYHVD + GPF+L  L+  WS  + E       
Sbjct: 225  EGCRQGPHSLAELSYWHHNSYIQDLSMIYHVDGKFGPFTLVSLIGSWSGEHAE------- 277

Query: 628  KSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVLLDEIFSTIIPEFISTKKAQR 449
                +    D+S   L+ +   +VS QLH+ IMK+ARRVL+DEIFS ++P+ I++KK ++
Sbjct: 278  --RSEATANDSSLNGLVGDIVGDVSHQLHAGIMKSARRVLIDEIFSCVLPDLIASKKTEK 335

Query: 448  HLRAESIDKNAKTYELS-------KGKEQPPLNTLQEVNSSQQ--TCSIILD---IHENI 305
             L A+  ++  K   +S       K K   P    +  NS++     S+ +    +H+  
Sbjct: 336  QLAAKLKNQATKPDSVSNMKISKLKVKINKPSTIPENGNSNRAPVDSSVAIQSTAVHDTF 395

Query: 304  KEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWSGLQCSSLIVNSE-VQDPD 128
             ++L    +  YY+ MK +WD +L DPVM+Y   W ++     L  + + V  + ++  D
Sbjct: 396  ADILSAVWQTIYYEAMKNIWDGILSDPVMDYSDVWFQRNCQLNLPSTIISVTPDNIKAQD 455

Query: 127  QMDMVLKNADKVVELEPVSSRHDMDFPPGFGP 32
              +M  K++D        ++  + +FPPGF P
Sbjct: 456  SHEMSSKDSD--------ATECETEFPPGFEP 479


>gb|EXC31045.1| Histone-lysine N-methyltransferase SETD1B [Morus notabilis]
          Length = 1249

 Score =  228 bits (582), Expect = 4e-57
 Identities = 142/361 (39%), Positives = 182/361 (50%), Gaps = 12/361 (3%)
 Frame = -2

Query: 1210 SGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSPR 1031
            SGWMYVN  G MCGPY QEQL EGLS+GFLPE+LPVYP++NG I+ SV LKY K F   +
Sbjct: 183  SGWMYVNDCGQMCGPYIQEQLYEGLSTGFLPEDLPVYPLLNGKIANSVPLKYFKHFPD-Q 241

Query: 1030 YYPSSVDATARDENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNNGXXXXXXXXXX 851
                     A     Q A    V   S    H     AS  S +                
Sbjct: 242  VATGFAYLNANPLAYQSASYANVPISSPAPSHSLKPYASQSSKEA--------------- 286

Query: 850  XXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAILVEE 671
                   CW++ED E  KHGPHSL EL  WH   YL DS+MIYH +N   PF+L  L+  
Sbjct: 287  -------CWLYEDHERKKHGPHSLQELFSWHQYGYLRDSIMIYHTENTCTPFTLLSLLNA 339

Query: 670  WSRINCENISENDIKSEGDDAELDNSFTS-LLFNSSEEVSIQLHSAIMKAARRVLLDEIF 494
            W          +D  +   DA  + + +S  L   SEEVS QLH  IMKAARR++LDEI 
Sbjct: 340  WKP------DASDTATTTPDAATNETGSSPSLSEMSEEVSCQLHFGIMKAARRIVLDEII 393

Query: 493  STIIPEFISTKKAQRHLRAESIDKNAKTYELS---------KGKEQPPLNTLQEVNSSQQ 341
            S +I EF + KK+ R ++ E I++ A+T  L          K +  P   T     ++  
Sbjct: 394  SNVIAEFAAMKKSWREVKHEPINQAAETCSLDQRMLEFAGVKKRTAPLCETTTPSPAADN 453

Query: 340  TCSIILDIH--ENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWSGLQC 167
               II  +   EN        SK  +  CM+V+W++V YD + EY  AW K+K WSG+  
Sbjct: 454  KAIIIKSVGSIENFWGSHAVVSKVLFDYCMEVMWNAVFYDTLAEYSSAWRKRKLWSGIPI 513

Query: 166  S 164
            S
Sbjct: 514  S 514


>ref|XP_004301597.1| PREDICTED: uncharacterized protein LOC101295723 [Fragaria vesca
            subsp. vesca]
          Length = 1228

 Score =  226 bits (576), Expect = 2e-56
 Identities = 160/472 (33%), Positives = 236/472 (50%), Gaps = 37/472 (7%)
 Frame = -2

Query: 1342 DVGPYVAMEGSSQSFSNICEL-PLPDGTGVTLTQNSCDRYAQFS---ISGWMYVNQNGHM 1175
            +VG    ME S QS  N  ++ P+ +  G +    S   Y   S   +SGWMYVN+ G M
Sbjct: 22   EVGSNTRMEMSCQSNGNSSDIQPVCNSGGTSYQDKSYSGYMPPSPSFVSGWMYVNEQGQM 81

Query: 1174 CGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSPRYYPS-------- 1019
            CGPY Q+QL EGLS+GFLP+ELPVYPV+NG++   + LKY K F  P +  +        
Sbjct: 82   CGPYIQQQLYEGLSTGFLPDELPVYPVVNGALINPIPLKYFKLF--PNHVTTGFAYLSLA 139

Query: 1018 SVDATARDENS------QLAGKD---PVSYFSSRTEHDATVTASHGSGQGKSN--NGXXX 872
            S+ + +   NS       LA      P++      ++D+T  A+  +             
Sbjct: 140  SISSASTPTNSLKSCNGDLATSSIPTPIATSYPDLQNDSTSQANSNTDFSSKLILKSEAP 199

Query: 871  XXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFS 692
                         SCW++ED+EG ++GP+SL EL  WH   YL D+LMIYHV N+  PF+
Sbjct: 200  NQDTSYQSLSSKESCWLYEDEEGKRNGPYSLFELNSWHQYGYLRDTLMIYHVKNKCKPFT 259

Query: 691  LAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRV 512
            L+ +   W     E I++ D K          SF S++   +E+VS QLH  I+K+ARRV
Sbjct: 260  LSSVKCSWKLDGSETITKFDTK-----CNQSGSFVSIISEVAEDVSSQLHYGILKSARRV 314

Query: 511  LLDEIFSTIIPEFISTKKAQR-HLRAESIDKNAKTYELSKGKEQPPLNTLQEVNSSQQTC 335
            +LDEI S +I EF++T KAQR +   ++   +AK  E+    E P L++           
Sbjct: 315  VLDEIISNVIAEFVTTTKAQRLNQSMKTCSLDAKRSEID--GENPALSSEAGAADCVAQR 372

Query: 334  SIILDIH-ENIKEVLLETSKFCYYD------------CMKVLWDSVLYDPVMEYCGAWLK 194
            + I  +  E +       S   Y+D            CM+V+W++V YD V EY  AW +
Sbjct: 373  TFINQVSPEPLPNTKSVGSIHTYWDSYAVVCGMLFNHCMEVMWNAVFYDSVAEYSSAWRR 432

Query: 193  KKQWSGLQCSSLIVNSEVQDPDQMDMVLKNADKVVELEPVSSRHDMDFPPGF 38
            +K W+G     +  N   +  D+++ V      V+  E +S  +D D PPGF
Sbjct: 433  RKLWTGSPSFWIPPN---RCGDRVEKV-----TVLPHENLSDGYDDDCPPGF 476


>gb|EMJ21490.1| hypothetical protein PRUPE_ppa000519mg [Prunus persica]
          Length = 1116

 Score =  226 bits (576), Expect = 2e-56
 Identities = 140/356 (39%), Positives = 186/356 (52%), Gaps = 8/356 (2%)
 Frame = -2

Query: 1219 FSISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFS 1040
            F +SGW YVN+ G MCGPY QEQL EGLS+GFLP+ELPVYP++NGS+   V LKY KQF 
Sbjct: 5    FVVSGWTYVNELGQMCGPYIQEQLYEGLSTGFLPDELPVYPLVNGSLINPVPLKYFKQF- 63

Query: 1039 SPRYYPSSVDATARDENSQLAGKDPVSYFSSRTEHDATV-TASHGSGQGKSNNGXXXXXX 863
                 P  V AT              +Y S      AT  T S  S  G    G      
Sbjct: 64   -----PDHV-ATG------------FAYLSLGISTTATTPTNSFNSPHG----GDLPMCS 101

Query: 862  XXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSLAI 683
                      SCW++ D EG KHGPHSL EL  WH   YL+DS+MIYHV+N+  PF+L  
Sbjct: 102  TPAPAPPNEESCWLYADGEGQKHGPHSLFELYSWHRYGYLQDSVMIYHVENKCTPFTLLS 161

Query: 682  LVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVLLD 503
            +V  W     E ++ +D KS G      +S  S +   SE VS +LH  I+KAARRV+ D
Sbjct: 162  VVNAWKTDGPETVTNSDAKSNG-----TSSLGSFIAEISEGVSGELHHGILKAARRVVFD 216

Query: 502  EIFSTIIPEFISTKKAQR-HLRAESIDKNAKT------YELSKGKEQPPLNTLQEVNSSQ 344
            EI S +I EF +TKKAQR +   ++   ++KT       E +          + E ++  
Sbjct: 217  EIISNVINEFFTTKKAQRLNQTVKTCSSDSKTGCAASLCEAAASYYVADETCINEDSTEP 276

Query: 343  QTCSIILDIHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKKKQWSG 176
               +  +   EN         +  +  CM+V+W++V YD V EY  +W ++K WSG
Sbjct: 277  PPSTKSVGSIENFWGSYAAVCRMLFDYCMQVMWNAVFYDSVAEYSSSWRRRKLWSG 332


>ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
            gi|223551463|gb|EEF52949.1| set domain protein, putative
            [Ricinus communis]
          Length = 1258

 Score =  220 bits (560), Expect = 2e-54
 Identities = 140/383 (36%), Positives = 191/383 (49%), Gaps = 39/383 (10%)
 Frame = -2

Query: 1210 SGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQF---- 1043
            SGWMY+N NG MCGPY Q+QL EGLS+GFL E+LPVYPV+NG++   V LKY  QF    
Sbjct: 123  SGWMYLNVNGQMCGPYIQQQLYEGLSTGFLHEDLPVYPVLNGTLVNPVPLKYFNQFPDHV 182

Query: 1042 --------------SSPRYYPSSV--DATARDENSQLAGKDPVSYFSSRTE---HDATVT 920
                          S P  + +SV  D+    +   +     VS  S   E   H     
Sbjct: 183  ATGFAYLGIGISGTSMPMSHFTSVSMDSAIHRQEGCVPHAAQVSLCSDAQEMVSHSHVPH 242

Query: 919  ASHGSGQGKSNNGXXXXXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLE 740
             + GS Q  SN+                 SCWMFED  G KHGPHSL+EL  WH + YL 
Sbjct: 243  NTCGSNQPVSNS-MAASHDIPFSLLSGEDSCWMFEDDGGRKHGPHSLSELYSWHRHGYLR 301

Query: 739  DSLMIYHVDNRLGPFSLAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEE 560
            +SL IYH+ N+  PF L  +++ WS    E++  +D + E        S  S +   SEE
Sbjct: 302  NSLTIYHIQNKFRPFPLLSVIDAWSTDKHESVLASDAEGE------MGSLCSFVSEISEE 355

Query: 559  VSIQLHSAIMKAARRVLLDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYELSKGKEQ- 383
            VS QLH+ IMKAARRV LDEI S ++ EF  TKK+ R+L+   I      Y+     E+ 
Sbjct: 356  VSCQLHAGIMKAARRVALDEIISNVMSEFFDTKKSHRNLKRSPITTLCLFYQSEVTGERR 415

Query: 382  ----PPLNTLQEVNSSQQTCSIILD--IHENIKEV---------LLETSKFCYYDCMKVL 248
                P        ++S Q C   +   + +N K V              +  +  CM+V+
Sbjct: 416  NHAVPECKPAAFSHNSDQACVDGMSELLPKNTKSVGTIDNFWGSYAVVCRILFDYCMEVM 475

Query: 247  WDSVLYDPVMEYCGAWLKKKQWS 179
            W++V YD + +Y  +W ++K WS
Sbjct: 476  WNAVFYDAIADYSNSWRRRKLWS 498


>ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
            gi|355483953|gb|AES65156.1| Histone-lysine
            N-methyltransferase SETD1B [Medicago truncatula]
          Length = 1232

 Score =  218 bits (556), Expect = 4e-54
 Identities = 138/375 (36%), Positives = 193/375 (51%), Gaps = 30/375 (8%)
 Frame = -2

Query: 1213 ISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQF--- 1043
            +SGWMYVN++G MCGPY +EQL EGL++GFLP ELPVYPVING+I  SV L Y KQ+   
Sbjct: 85   VSGWMYVNEHGQMCGPYIKEQLHEGLTTGFLPFELPVYPVINGTIMNSVPLNYFKQYPDH 144

Query: 1042 -SSPRYYPSSVDATAR-DENSQLAGKDPVSYFSSRTEHDATV-------TASHGSGQGKS 890
             S+   Y S   + AR  +N   + +D V       E  A +       + SH +   K 
Sbjct: 145  VSTGFAYLSMDFSNARMSKNCSSSSQDMVDGQDRSVELAAVMAVNPDSKSVSHVNDCNKE 204

Query: 889  NN----GXXXXXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIY 722
            +N                      CW++EDK+GMKHGPHS++EL  WHH+ YLEDS +I 
Sbjct: 205  SNHVDLSSEAFSRIISSQMLGGECCWLYEDKKGMKHGPHSISELISWHHHGYLEDSTVIS 264

Query: 721  HVDNRLGPFSLAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLH 542
            H DN+ G F L   V       C  I  +D KS G          +L+   SE++S QLH
Sbjct: 265  HFDNKYGTFVLLSAVNAMKGDTCGTICGSDSKSNG-----VGDVMNLICEISEDISSQLH 319

Query: 541  SAIMKAARRVLLDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYEL-----SKGKEQPP 377
            + +MK++R+V+LD I   II EFI+ KK ++  + ES D+ ++T  L     +KG   P 
Sbjct: 320  TGVMKSSRKVVLDGIIGDIIAEFITEKKCKKQ-KLESADQTSETCTLNNKMMNKGASIPS 378

Query: 376  LNTLQEVNSSQQTCSIILDIHENIKEV---------LLETSKFCYYDCMKVLWDSVLYDP 224
                  + + Q    I      N+K V              K  +   ++V+W++V +D 
Sbjct: 379  EPAASRILNGQACHEISRPSSTNVKSVGSIENFWWSYAVVRKVLFDHSLQVMWNAVFFDT 438

Query: 223  VMEYCGAWLKKKQWS 179
            V E   +W KKK WS
Sbjct: 439  VTEVLFSWRKKKYWS 453


>ref|XP_006348442.1| PREDICTED: uncharacterized protein LOC102597311 isoform X5 [Solanum
            tuberosum]
          Length = 1618

 Score =  217 bits (553), Expect = 1e-53
 Identities = 155/438 (35%), Positives = 206/438 (47%), Gaps = 39/438 (8%)
 Frame = -2

Query: 1210 SGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSS-- 1037
            +GWMYVN+ G MCGPY +EQL EGLS+GFLPEEL VYPV+NG+IS +V LKY  QF    
Sbjct: 109  TGWMYVNEQGQMCGPYIKEQLYEGLSTGFLPEELHVYPVLNGTISNAVPLKYFNQFPDHV 168

Query: 1036 ----PRYYPSSVDATARDENSQLAGKD--------PVS--YFSSRTEHDATVTASHGSGQ 899
                     SS  A+   + S    KD        P +  Y +S  EH      +H S Q
Sbjct: 169  ATGFAYVMVSSSGASGPTDKSMGVAKDSGGNGVELPTTSPYSNSVAEH-----GTHYSNQ 223

Query: 898  GKSNNGXXXXXXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYH 719
              +  G                SCW FED EG KHGPHSL EL  W H  Y+ DS+MI H
Sbjct: 224  QMATAG-SAGTFAPSTSSVNEESCWFFEDHEGTKHGPHSLMELYSWCHYGYIVDSVMIRH 282

Query: 718  VDNRLGPFSLAILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHS 539
            V ++  PFSL  L+  W+       +   +     D     S    +   S+EV  QLH 
Sbjct: 283  VADKYRPFSLKSLISSWT-----TATPGALFLSNPDGHETASLQDFVSEISQEVCSQLHV 337

Query: 538  AIMKAARRVLLDEIFSTIIPEFISTKK------AQRHLRAESIDKNAKTYELSKG----- 392
             IMKAARR LLDEI S  I E IS KK       Q+ +  +S+  ++    +S G     
Sbjct: 338  VIMKAARRTLLDEIVSHAISECISEKKDLKKATNQKKVTNQSVKMSSPGTRMSAGFGGSK 397

Query: 391  --------KEQPPL----NTLQEVNSSQQTCSIILDIHENIKEVLLETSKFCYYDCMKVL 248
                     E P L    +   E+       S  +   EN  +      +  +  CM+ +
Sbjct: 398  ALIDPERSAEAPNLLNRKSPAAEIPLKSSGSSKSVGSFENYCDSYTVVCRKLFDSCMQNI 457

Query: 247  WDSVLYDPVMEYCGAWLKKKQWSGLQCSSLIVNSEVQDPDQMDMVLKNADKVVELEPVSS 68
            W++V YD V EY  AW K+K+WS      L+V S +      +   K + +V+++E  S 
Sbjct: 458  WNAVFYDHVSEYSSAWRKRKRWSP---PCLMVESNIPAISYANCTTKLSTEVLQVEEESF 514

Query: 67   RHDMDFPPGFGPNIGTLD 14
              D D+PPGF     T D
Sbjct: 515  GCDPDYPPGFEEKNMTAD 532


>ref|XP_004487927.1| PREDICTED: uncharacterized protein LOC101514300 isoform X5 [Cicer
            arietinum]
          Length = 1146

 Score =  217 bits (553), Expect = 1e-53
 Identities = 145/406 (35%), Positives = 204/406 (50%), Gaps = 19/406 (4%)
 Frame = -2

Query: 1213 ISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSP 1034
            +SGWMYVN+ G MCGPY +EQL EGL++GFLP ELPVYPVING+I   V L Y KQF  P
Sbjct: 34   VSGWMYVNEQGQMCGPYIKEQLYEGLTTGFLPFELPVYPVINGTIMNPVPLNYFKQF--P 91

Query: 1033 RYYPSSVDATARD-ENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNN----GXXXX 869
             +  +     + D   +++      S   +      +V  SH +   K +N         
Sbjct: 92   DHVSTGFAFLSMDFSGTRMPTNCSSSSLLAVNPDSMSVLPSHVNDCIKQSNHLNLNSEAF 151

Query: 868  XXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSL 689
                         CW++EDK+G+KHGPHS++EL  W+H+ YLEDS +I H DN+ G F L
Sbjct: 152  SRIISCQMVGGECCWLYEDKKGIKHGPHSISELISWYHHGYLEDSTVISHFDNKYGTFML 211

Query: 688  AILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVL 509
               V          I  +D KS G       +  +L+   SE +S QLH  IMKAARRV+
Sbjct: 212  LSAVNALKEDISGTICGSDSKSNG-----VGNVVNLVCEISENISSQLHMGIMKAARRVV 266

Query: 508  LDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYEL---------SKGKEQPPLNTL--- 365
            LD I   II EF++ KK  RH + ES D+ ++T  L         S   E  P + L   
Sbjct: 267  LDGIIGDIIAEFVTEKKYNRH-KLESADQTSETCMLDSKMMNKRTSISSEPAPSHILDGQ 325

Query: 364  --QEVNSSQQTCSIILDIHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKK 191
               E++    T    +   EN         K  +  C++V+W+++  D V EY  +W K+
Sbjct: 326  ACHEISRPSLTSVKSVGSIENFWWSYAAVRKVLFEHCLQVMWNAIFSDTVTEYVFSWRKR 385

Query: 190  KQWSGLQCSSLIVNSEVQDPDQMDMVLKNADKVVELEPVSSRHDMD 53
            K+WS     S +  S+    D +DM+   A   + L P SS  ++D
Sbjct: 386  KRWSHPTPQSSVNESK----DYVDMIKSEA---LVLRPGSSVCNVD 424


>ref|XP_004487926.1| PREDICTED: uncharacterized protein LOC101514300 isoform X4 [Cicer
            arietinum]
          Length = 1196

 Score =  217 bits (553), Expect = 1e-53
 Identities = 145/406 (35%), Positives = 204/406 (50%), Gaps = 19/406 (4%)
 Frame = -2

Query: 1213 ISGWMYVNQNGHMCGPYSQEQLIEGLSSGFLPEELPVYPVINGSISTSVTLKYLKQFSSP 1034
            +SGWMYVN+ G MCGPY +EQL EGL++GFLP ELPVYPVING+I   V L Y KQF  P
Sbjct: 34   VSGWMYVNEQGQMCGPYIKEQLYEGLTTGFLPFELPVYPVINGTIMNPVPLNYFKQF--P 91

Query: 1033 RYYPSSVDATARD-ENSQLAGKDPVSYFSSRTEHDATVTASHGSGQGKSNN----GXXXX 869
             +  +     + D   +++      S   +      +V  SH +   K +N         
Sbjct: 92   DHVSTGFAFLSMDFSGTRMPTNCSSSSLLAVNPDSMSVLPSHVNDCIKQSNHLNLNSEAF 151

Query: 868  XXXXXXXXXXXXSCWMFEDKEGMKHGPHSLAELCYWHHNSYLEDSLMIYHVDNRLGPFSL 689
                         CW++EDK+G+KHGPHS++EL  W+H+ YLEDS +I H DN+ G F L
Sbjct: 152  SRIISCQMVGGECCWLYEDKKGIKHGPHSISELISWYHHGYLEDSTVISHFDNKYGTFML 211

Query: 688  AILVEEWSRINCENISENDIKSEGDDAELDNSFTSLLFNSSEEVSIQLHSAIMKAARRVL 509
               V          I  +D KS G       +  +L+   SE +S QLH  IMKAARRV+
Sbjct: 212  LSAVNALKEDISGTICGSDSKSNG-----VGNVVNLVCEISENISSQLHMGIMKAARRVV 266

Query: 508  LDEIFSTIIPEFISTKKAQRHLRAESIDKNAKTYEL---------SKGKEQPPLNTL--- 365
            LD I   II EF++ KK  RH + ES D+ ++T  L         S   E  P + L   
Sbjct: 267  LDGIIGDIIAEFVTEKKYNRH-KLESADQTSETCMLDSKMMNKRTSISSEPAPSHILDGQ 325

Query: 364  --QEVNSSQQTCSIILDIHENIKEVLLETSKFCYYDCMKVLWDSVLYDPVMEYCGAWLKK 191
               E++    T    +   EN         K  +  C++V+W+++  D V EY  +W K+
Sbjct: 326  ACHEISRPSLTSVKSVGSIENFWWSYAAVRKVLFEHCLQVMWNAIFSDTVTEYVFSWRKR 385

Query: 190  KQWSGLQCSSLIVNSEVQDPDQMDMVLKNADKVVELEPVSSRHDMD 53
            K+WS     S +  S+    D +DM+   A   + L P SS  ++D
Sbjct: 386  KRWSHPTPQSSVNESK----DYVDMIKSEA---LVLRPGSSVCNVD 424

