BLASTX nr result

ID: Akebia25_contig00044217 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00044217
         (1272 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like ...   464   e-128
ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   433   e-119
ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   422   e-115
ref|XP_006430400.1| hypothetical protein CICLE_v10011537mg [Citr...   419   e-114
ref|XP_006430399.1| hypothetical protein CICLE_v10011537mg [Citr...   419   e-114
ref|XP_007032171.1| SET domain group 40, putative isoform 1 [The...   419   e-114
ref|XP_007215291.1| hypothetical protein PRUPE_ppa004975mg [Prun...   418   e-114
emb|CBI27360.3| unnamed protein product [Vitis vinifera]              414   e-113
gb|EXC05430.1| Protein SET DOMAIN GROUP 40 [Morus notabilis]          412   e-112
ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Popu...   412   e-112
ref|XP_007141970.1| hypothetical protein PHAVU_008G241400g [Phas...   408   e-111
ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   404   e-110
ref|XP_006596494.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   399   e-108
ref|XP_004167843.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   399   e-108
ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   392   e-106
ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatul...   392   e-106
gb|ACU19071.1| unknown [Glycine max]                                  389   e-105
ref|XP_006400256.1| hypothetical protein EUTSA_v10015946mg [Eutr...   382   e-103
ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Caps...   382   e-103
ref|XP_006596495.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   380   e-103

>ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score =  464 bits (1194), Expect = e-128
 Identities = 235/393 (59%), Positives = 284/393 (72%), Gaps = 1/393 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLLAEM KG+ S W+PYLMQLPR Y T+ +F++FE QALQVD AIW  E+A+ KAELEW+
Sbjct: 102  CLLAEMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWK 161

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            +A               FR+WLWAS+T+SSRTMH+PWDDAGCLCPVGDF+NYAAPGEE  
Sbjct: 162  KAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDAGCLCPVGDFYNYAAPGEEPC 221

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
             ++ ++     SSL+ S                    +QRLTD GY+ED+ AYCFYARK+
Sbjct: 222  GWEDLKGSRNESSLQDSSFWNKDATSNSDAEQDDV-LSQRLTDGGYKEDLAAYCFYARKN 280

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KGEQVLLSYGTYTNLELLEHYGFLL+ NPN+KAFI LE ++  S+SWPKDSLYI Q+G
Sbjct: 281  YKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQNG 340

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL+SALRLWATP + R+SVGHLVYSG QLS+ENEI VM+W+   C ++LE  P+S+
Sbjct: 341  KPSFALLSALRLWATPASQRRSVGHLVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSV 400

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  +DKM       EV   +   G E S F EA+ LK     V   LS KARRS
Sbjct: 401  EEDSLLLCALDKMQDPDLPMEVGNALRSSGVEFSAFLEAHDLKIGDGNVGLLLSEKARRS 460

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVLS 1176
            MERWKLAVQWRLR+K+ILV CIS CTE I+ LS
Sbjct: 461  MERWKLAVQWRLRHKRILVDCISRCTEIISSLS 493


>ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
            gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP,
            putative [Ricinus communis]
          Length = 510

 Score =  433 bits (1114), Expect = e-119
 Identities = 219/397 (55%), Positives = 274/397 (69%), Gaps = 2/397 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EM KG+ S W PYLM LPR Y+ + +F+ FE QALQVD AIW AEKA+SKAEL+ +
Sbjct: 108  CLLYEMSKGQSSFWYPYLMHLPRSYEILATFSEFEKQALQVDDAIWTAEKAISKAELDRK 167

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              T R+W+WA ATISSRTMH+PWD+AGCLCPVGDFFNYAAPGEES 
Sbjct: 168  EAYSLMQELRLKPQFLTLRAWIWACATISSRTMHIPWDEAGCLCPVGDFFNYAAPGEESS 227

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              +  E    +S LE +                     + LTD G++ED  AYCFYAR++
Sbjct: 228  SPENDESWKPASCLEDASLSSERSTSNFCSETFDV-QLKSLTDGGFDEDKAAYCFYARQN 286

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KG QVLLSYGTYTNLELLEHYGFLLN NPN+K FI LE  +  SN+WPK+S+YI QDG
Sbjct: 287  YKKGAQVLLSYGTYTNLELLEHYGFLLNENPNDKVFIPLELSMQSSNTWPKESMYIHQDG 346

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSF+L+ ALRLWATP N R+S+GHL YSG QLS ENE++++KW+  KC  +L+K P+++
Sbjct: 347  KPSFSLLCALRLWATPSNRRRSMGHLAYSGSQLSVENEVSILKWISRKCHAVLKKLPTTV 406

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEA-NGLKERSNVVEFPLSRKARR 1074
            E+D  +L  IDK+    S  E+ +M+    G+ S F EA N L  +       L  KA+R
Sbjct: 407  EEDSLLLSAIDKIQNCHSPLELGKMLHGFEGQASAFVEAHNLLNIKIGTESTMLCGKAKR 466

Query: 1075 SMERWKLAVQWRLRYKQILVSCISYCTESINVLSSQH 1185
            SMERWKLAV+WRL YK+ L+ CISYCTE I+ LS ++
Sbjct: 467  SMERWKLAVKWRLSYKKTLIDCISYCTEVIDSLSMEN 503


>ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Fragaria vesca subsp.
            vesca]
          Length = 511

 Score =  422 bits (1085), Expect = e-115
 Identities = 219/396 (55%), Positives = 271/396 (68%), Gaps = 2/396 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EMGKG+ S W PYL+ LPR Y  + +F  FE QALQV+ AIW A+KA+SKAE EW+
Sbjct: 113  CLLYEMGKGKTSWWYPYLINLPRSYDIIATFGEFEKQALQVEDAIWAADKAISKAEFEWK 172

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            E               TFR+WLWASAT+SSRT+H+PWD AGCLCPVGD FNY+AP E+S 
Sbjct: 173  ETNTLMEQLKLKPQLRTFRAWLWASATVSSRTLHIPWDGAGCLCPVGDLFNYSAPVEDSD 232

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              D VE  +   +L+                     ++ RLTD  +E +V AYCFYA+KS
Sbjct: 233  S-DNVELRTHELALQ-DMTTVKEETSCILDNEQLDSDSGRLTDGRFENNVGAYCFYAKKS 290

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIPSN-SWPKDSLYIQQDG 717
            YRKGEQVLLSYGTYTNLELLEHYGFLLN NPN+KA++ LE +I S+ SWPK+ LYI Q G
Sbjct: 291  YRKGEQVLLSYGTYTNLELLEHYGFLLNENPNDKAYVPLEPEIYSSCSWPKEFLYIHQSG 350

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL+SALRLWATP N R+SVGHL YSGLQLS ENEI VM+W+ NKC  +++  P++ 
Sbjct: 351  KPSFALLSALRLWATPANRRRSVGHLAYSGLQLSIENEIFVMRWISNKCNSIVKNLPTTF 410

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKA-RR 1074
            E+D  +L  IDK+    +  E   +  V   E+   + A  LK+ +   E  +SRK  +R
Sbjct: 411  EEDSLLLSVIDKIQNVNAPLEFANISSVSTDEIC-TYRAEVLKKGATDSETVVSRKTMQR 469

Query: 1075 SMERWKLAVQWRLRYKQILVSCISYCTESINVLSSQ 1182
            S ERW+LAVQWRL YK+ILV CIS+C E I+VL SQ
Sbjct: 470  SRERWRLAVQWRLSYKKILVDCISFCDEMIDVLRSQ 505


>ref|XP_006430400.1| hypothetical protein CICLE_v10011537mg [Citrus clementina]
            gi|557532457|gb|ESR43640.1| hypothetical protein
            CICLE_v10011537mg [Citrus clementina]
          Length = 503

 Score =  419 bits (1076), Expect = e-114
 Identities = 211/392 (53%), Positives = 265/392 (67%), Gaps = 1/392 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL E+GKG+ S W  YLM LPR Y+ + +F  FE QALQVD AIW AEKAVSKAE EW+
Sbjct: 104  CLLYEVGKGKSSRWYTYLMLLPRCYEILATFGPFEKQALQVDDAIWAAEKAVSKAESEWK 163

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            +A              +F++WLWASAT+SSRTMH+ WD+AGCLCPVGD FNYAAPGE   
Sbjct: 164  QAIKLMEELKLKPQLLSFKAWLWASATVSSRTMHISWDEAGCLCPVGDLFNYAAPGEGEE 223

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
                +E          +                  G+ +RLTD  +EEDV++YCFYAR +
Sbjct: 224  SNIGIE---DVEGWMPAPCLPKGDTTDVLDSEKFNGHLRRLTDGRFEEDVNSYCFYARNN 280

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIPSN-SWPKDSLYIQQDG 717
            Y++GEQVLLSYGTYTNLELLEHYGFLLN NPN+K FI LE  + S  SWP++S YI Q+G
Sbjct: 281  YKRGEQVLLSYGTYTNLELLEHYGFLLNENPNDKVFISLEPGMYSCCSWPRESQYIDQNG 340

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL+SALRLW TP N R+SVGHL YSG QLS +NEI+VMKW+ N   ++L   P+S 
Sbjct: 341  KPSFALLSALRLWMTPANQRRSVGHLAYSGHQLSVDNEISVMKWLSNNSRVMLNSLPTSK 400

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  IDK+    +  E+++++   GGEV  F E  G++ R    +  LSRK + S
Sbjct: 401  EEDALLLCAIDKIQDIYTAMELKKVLSDFGGEVCTFLENYGVQCRQRGAKLSLSRKTKLS 460

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVL 1173
            M+RWKLA+QWRLRYK+ L  CISYC  ++N L
Sbjct: 461  MQRWKLAIQWRLRYKKTLADCISYCDYTVNCL 492


>ref|XP_006430399.1| hypothetical protein CICLE_v10011537mg [Citrus clementina]
            gi|557532456|gb|ESR43639.1| hypothetical protein
            CICLE_v10011537mg [Citrus clementina]
          Length = 489

 Score =  419 bits (1076), Expect = e-114
 Identities = 211/392 (53%), Positives = 265/392 (67%), Gaps = 1/392 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL E+GKG+ S W  YLM LPR Y+ + +F  FE QALQVD AIW AEKAVSKAE EW+
Sbjct: 90   CLLYEVGKGKSSRWYTYLMLLPRCYEILATFGPFEKQALQVDDAIWAAEKAVSKAESEWK 149

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            +A              +F++WLWASAT+SSRTMH+ WD+AGCLCPVGD FNYAAPGE   
Sbjct: 150  QAIKLMEELKLKPQLLSFKAWLWASATVSSRTMHISWDEAGCLCPVGDLFNYAAPGEGEE 209

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
                +E          +                  G+ +RLTD  +EEDV++YCFYAR +
Sbjct: 210  SNIGIE---DVEGWMPAPCLPKGDTTDVLDSEKFNGHLRRLTDGRFEEDVNSYCFYARNN 266

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIPSN-SWPKDSLYIQQDG 717
            Y++GEQVLLSYGTYTNLELLEHYGFLLN NPN+K FI LE  + S  SWP++S YI Q+G
Sbjct: 267  YKRGEQVLLSYGTYTNLELLEHYGFLLNENPNDKVFISLEPGMYSCCSWPRESQYIDQNG 326

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL+SALRLW TP N R+SVGHL YSG QLS +NEI+VMKW+ N   ++L   P+S 
Sbjct: 327  KPSFALLSALRLWMTPANQRRSVGHLAYSGHQLSVDNEISVMKWLSNNSRVMLNSLPTSK 386

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  IDK+    +  E+++++   GGEV  F E  G++ R    +  LSRK + S
Sbjct: 387  EEDALLLCAIDKIQDIYTAMELKKVLSDFGGEVCTFLENYGVQCRQRGAKLSLSRKTKLS 446

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVL 1173
            M+RWKLA+QWRLRYK+ L  CISYC  ++N L
Sbjct: 447  MQRWKLAIQWRLRYKKTLADCISYCDYTVNCL 478


>ref|XP_007032171.1| SET domain group 40, putative isoform 1 [Theobroma cacao]
            gi|508711200|gb|EOY03097.1| SET domain group 40, putative
            isoform 1 [Theobroma cacao]
          Length = 498

 Score =  419 bits (1076), Expect = e-114
 Identities = 212/394 (53%), Positives = 265/394 (67%), Gaps = 1/394 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            C L EM KG+ S W+PYL+ LPR Y  + +F  FE QALQVD+AIW A+KA+SKAE EW+
Sbjct: 121  CFLYEMSKGKASPWHPYLLHLPRSYGILAAFGEFEKQALQVDYAIWAAQKALSKAEYEWK 180

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            +A              TFR+W+WA+ TISSRT+H+PWD+AGCLCPVGD FNYAAPGE+  
Sbjct: 181  KATPLMKELKLKLQFLTFRAWIWATGTISSRTLHIPWDEAGCLCPVGDLFNYAAPGEDLN 240

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
             +D V+      +L+                     ++QRLTD  +EED  AYCFYA+ +
Sbjct: 241  GFDNVDNLQNGYALD----------------DLDTQHSQRLTDGAFEEDAAAYCFYAKTN 284

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KGEQVLLSYGTYTNLELLE+YGFLL  NPNEK FI LE DI  S+SWP DSLYI Q+G
Sbjct: 285  YKKGEQVLLSYGTYTNLELLEYYGFLLEDNPNEKVFIPLEPDIHSSSSWPNDSLYIHQNG 344

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            +PSFALM+ALR+WATPP  RKS+ H  YSG QLS +NEI+VM W+  KC   L+  P+SI
Sbjct: 345  RPSFALMAALRVWATPPYQRKSIRHQAYSGSQLSQDNEISVMTWIAKKCHATLKAMPTSI 404

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E D  +L F DK+    ++ E  + M   GGE     +A  LK      E   SR+A+  
Sbjct: 405  EDDNLLLSFTDKIQEFDNLWEWGKAMPAFGGEFCNLLQATNLKRND---ESFASRRAKML 461

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVLSS 1179
            ++RWKLAV WRL YK++LV CISYCT++IN LSS
Sbjct: 462  IDRWKLAVHWRLIYKKVLVDCISYCTDTINSLSS 495


>ref|XP_007215291.1| hypothetical protein PRUPE_ppa004975mg [Prunus persica]
            gi|462411441|gb|EMJ16490.1| hypothetical protein
            PRUPE_ppa004975mg [Prunus persica]
          Length = 483

 Score =  418 bits (1074), Expect = e-114
 Identities = 212/396 (53%), Positives = 266/396 (67%), Gaps = 1/396 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EMGKG+ S W+PYLM LPR Y  + +F  FE QALQVD AIW AEKA  KAE EW+
Sbjct: 102  CLLYEMGKGKISWWHPYLMNLPRSYDILATFGEFEKQALQVDDAIWAAEKATLKAEYEWK 161

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              TF++WLWASATISSRT+H+PWD AGCLCPVGD FNY+APGEE  
Sbjct: 162  EANALMKQLKLKPQLLTFKAWLWASATISSRTLHIPWDAAGCLCPVGDLFNYSAPGEEPS 221

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              +++E                              +++RLTD G+E+DVDAYCFYA+KS
Sbjct: 222  RCESME--------HTMHDLVNEDTSGMADVEQLVSDSRRLTDGGFEKDVDAYCFYAKKS 273

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIPSN-SWPKDSLYIQQDG 717
            Y+KGEQVLLSYGTYTNLELLEHYGFLLN NPN+K +I LE +I S+ SWPK+SL+I Q+G
Sbjct: 274  YKKGEQVLLSYGTYTNLELLEHYGFLLNENPNDKVYIPLEPEIYSSCSWPKESLFIHQNG 333

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL+S LRLWATP N R+SVGHLVYSGL LS +NE+ +++W+  KCT +L+   +S 
Sbjct: 334  KPSFALLSTLRLWATPQNQRRSVGHLVYSGLHLSIQNEMFILRWISKKCTTILKNLSTSF 393

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E D  +L  IDK+    +  E+  +   C  E+   F+AN L++          R +  S
Sbjct: 394  EDDSLLLSAIDKIQNLDAPLELNNVSSTCRDEICA-FKANVLQKG--------ERSSMES 444

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVLSSQH 1185
             ERW+LAV+WRL YK+ILV CISYC E ++ L  Q+
Sbjct: 445  KERWRLAVEWRLSYKKILVDCISYCDEIVSSLFHQN 480


>emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score =  414 bits (1063), Expect = e-113
 Identities = 213/393 (54%), Positives = 258/393 (65%), Gaps = 1/393 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLLAEM KG+ S W+PYLMQLPR Y T+ +F++FE QALQVD AIW  E+A+ KAELEW+
Sbjct: 102  CLLAEMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWK 161

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            +A               FR+WLWAS+T+SSRTMH+PWDDAGCLCPVGDF+NYAAPGEE  
Sbjct: 162  KAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDAGCLCPVGDFYNYAAPGEEPC 221

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
             ++ ++   Q   L                       +QRLTD GY+ED+ AYCFYARK+
Sbjct: 222  GWEDLKDAEQDDVL-----------------------SQRLTDGGYKEDLAAYCFYARKN 258

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KGEQVLLSYGTYTNLELLEHYGFLL+ NPN+KAFI LE ++  S+SWPKDSLYI Q+G
Sbjct: 259  YKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQNG 318

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL+SALRLWATP + R+SVGHLVYSG QLS+ENEI VM+W+   C ++LE  P+S+
Sbjct: 319  KPSFALLSALRLWATPASQRRSVGHLVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSV 378

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L                                                    S
Sbjct: 379  EEDSLLL----------------------------------------------------S 386

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVLS 1176
            MERWKLAVQWRLR+K+ILV CIS CTE I+ LS
Sbjct: 387  MERWKLAVQWRLRHKRILVDCISRCTEIISSLS 419


>gb|EXC05430.1| Protein SET DOMAIN GROUP 40 [Morus notabilis]
          Length = 508

 Score =  412 bits (1060), Expect = e-112
 Identities = 215/426 (50%), Positives = 263/426 (61%), Gaps = 32/426 (7%)
 Frame = +1

Query: 4    LLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWRE 183
            LL EM KG  S W PYL+ LPR Y  + +F  FE QALQVD AIW AEKA  KAE EW+E
Sbjct: 107  LLYEMNKGRSSWWYPYLVNLPRGYDILATFGEFEKQALQVDDAIWTAEKATLKAESEWKE 166

Query: 184  AXXXXXXXXXXXXXXTFRSWLWASAT-------------------------------ISS 270
            A              TFR+WLWASAT                               ISS
Sbjct: 167  ANPLMKELNLKPQFLTFRAWLWASATFTLTEFHHHFNIIIPNVESNDVKFYASTLIKISS 226

Query: 271  RTMHVPWDDAGCLCPVGDFFNYAAPGEESFCYDAVECGSQSSSLEASXXXXXXXXXXXXX 450
            RT+HVPWD+AGCLCPVGD FNY APGEE          + +  LE               
Sbjct: 227  RTLHVPWDEAGCLCPVGDLFNYVAPGEED--------SAHTLDLE--------------- 263

Query: 451  XXXXXGNAQRLTDAGYEEDVDAYCFYARKSYRKGEQVLLSYGTYTNLELLEHYGFLLNAN 630
                  ++QRLTD G+EEDV AYCFYAR+ Y KGEQVLL YGTYTNLELLEHYGFLLN N
Sbjct: 264  --QLDSHSQRLTDGGFEEDVVAYCFYARRHYEKGEQVLLGYGTYTNLELLEHYGFLLNDN 321

Query: 631  PNEKAFIQLESDI-PSNSWPKDSLYIQQDGKPSFALMSALRLWATPPNHRKSVGHLVYSG 807
             NEK FI L+ +I  SN+WPKDS++I Q GKPSFAL+SALR+WATP N R+   HL YSG
Sbjct: 322  SNEKVFIPLQPEICSSNTWPKDSMFIHQSGKPSFALLSALRIWATPRNQRRPASHLAYSG 381

Query: 808  LQLSAENEIAVMKWMKNKCTILLEKFPSSIEKDVFILKFIDKMDGNPSVEEVEQMMLVCG 987
             QLSAENEI VM+W+   C  +L+  P+S E+D F+L  IDKM  + S  E+   +    
Sbjct: 382  SQLSAENEILVMRWISKNCNCILKSLPTSFEEDRFLLSAIDKMQDSCSPLELRNTVASST 441

Query: 988  GEVSGFFEANGLKERSNVVEFPLSRKARRSMERWKLAVQWRLRYKQILVSCISYCTESIN 1167
              +  F EANGL++  +V E   SRK +R M+RW+LA+QWR+RYK+IL++CIS+C+  I+
Sbjct: 442  AHIHAFLEANGLQDGEDVAELLSSRKTKREMDRWRLAIQWRVRYKEILINCISHCSRVID 501

Query: 1168 VLSSQH 1185
              + Q+
Sbjct: 502  SFTPQN 507


>ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Populus trichocarpa]
            gi|550340570|gb|EEE85750.2| hypothetical protein
            POPTR_0004s07950g [Populus trichocarpa]
          Length = 518

 Score =  412 bits (1059), Expect = e-112
 Identities = 212/402 (52%), Positives = 264/402 (65%), Gaps = 1/402 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EMGKG+ S W PYLM LPR Y  + SF                 +KAVSKA+ EW+
Sbjct: 137  CLLYEMGKGKSSWWYPYLMHLPRSYDVLASF-----------------KKAVSKAKSEWK 179

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              TFR+W+WASATISSR +H+PWD+AGCLCPVGD FNYAAPGEES 
Sbjct: 180  EANSLMDALKLKPQLLTFRAWIWASATISSRALHIPWDEAGCLCPVGDLFNYAAPGEESN 239

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              + V     +SSLE +                  G  +RLTD G+ E++ AYCFYARK+
Sbjct: 240  DLENVVHLMNASSLEDTSLSNGETTDDFIGDQPDIG-LERLTDGGFNENMAAYCFYARKN 298

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIPSN-SWPKDSLYIQQDG 717
            Y+KG QVLL YGTYTNLELLEHYGFLLN NPN+K FI LE  + S  SWPK S+YI QDG
Sbjct: 299  YKKGTQVLLGYGTYTNLELLEHYGFLLNENPNDKVFIPLEPSMYSFISWPKVSMYIHQDG 358

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL+SALRLWATPPN R+S+ HLVYSG +LS  NEI+V+KW+   C ++L   P+ I
Sbjct: 359  KPSFALLSALRLWATPPNQRRSISHLVYSGSRLSVYNEISVLKWISKNCALILSNLPTVI 418

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  I+K++   + ++  +++   GGE   F EA+ L++  N  E   S K +R 
Sbjct: 419  EEDSLLLSTINKIE---NFDKPTELVCTSGGEARAFLEASDLQKGKNGSELMFSGKTKRV 475

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVLSSQHFLCKRT 1203
            +ERWKLAVQWR+ YK+ L+ CISYCT +IN LSSQ  L  RT
Sbjct: 476  IERWKLAVQWRISYKKTLIDCISYCTVTINSLSSQTILAMRT 517


>ref|XP_007141970.1| hypothetical protein PHAVU_008G241400g [Phaseolus vulgaris]
            gi|561015103|gb|ESW13964.1| hypothetical protein
            PHAVU_008G241400g [Phaseolus vulgaris]
          Length = 497

 Score =  408 bits (1048), Expect = e-111
 Identities = 203/390 (52%), Positives = 263/390 (67%), Gaps = 1/390 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL E+ KG+ S W+PYLM LP  Y  +  F  FE +ALQVD A+W  EKA+ KA+ EW+
Sbjct: 106  CLLYEVCKGKTSRWHPYLMHLPHTYDILAMFDEFEKRALQVDEAVWVTEKAILKAKSEWK 165

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              TF++W+WA+ATISSRT+HVPWD+AGCLCPVGD FNY APGEES 
Sbjct: 166  EAHALMEDLMFRPQFLTFKAWVWAAATISSRTLHVPWDEAGCLCPVGDLFNYDAPGEESS 225

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              + +E    +SS+                      ++QRLTD G+EE+V+AYCFYAR  
Sbjct: 226  DIEDLEHLLSNSSIH-DTNLLNGDKNIVVDAEQLDSHSQRLTDGGFEENVNAYCFYARAH 284

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIP-SNSWPKDSLYIQQDG 717
            Y+KG+QVLL YGTYTNLELLEHYGFLL  NPN+K FI L+  +  S SW  +SLYI  +G
Sbjct: 285  YKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLDPAVYFSTSWSMESLYIHHNG 344

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL++ALRLWATP N RKSVGHLVYSG QLS +NEI + KW+   C  +L+  P+SI
Sbjct: 345  KPSFALLAALRLWATPQNKRKSVGHLVYSGSQLSTDNEIFITKWLSKTCATVLKNLPTSI 404

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            ++D  +L  +D      +  E+ ++M     E+  F E + +++  ++ E  LSRKARRS
Sbjct: 405  DEDTLLLNAMDSSQDIFTFMEITKLM-SSKDEIFTFLETHNMRDAHSLTEVILSRKARRS 463

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESIN 1167
            M+RWKLAVQWRL+YK++L  CISYC E ++
Sbjct: 464  MDRWKLAVQWRLKYKKVLFDCISYCNEILD 493


>ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cucumis sativus]
          Length = 483

 Score =  404 bits (1037), Expect = e-110
 Identities = 211/394 (53%), Positives = 258/394 (65%), Gaps = 1/394 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL E+ KG  S W PYL  LP+ Y  + +F  FE QALQVD+AIW  EKA  K+  +WR
Sbjct: 101  CLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWR 160

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
                            TF++WLWASATISSRT++VPWD+AGCLCPVGD FNYAAP  ESF
Sbjct: 161  GVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESF 220

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              +AV+  S  S    +                   +   LTD G+EE+  AYCFYAR+S
Sbjct: 221  --NAVDVLSFPSHASLNDELELLEEQRD--------SQWALTDGGFEENASAYCFYARES 270

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            YRKGEQVLLSYGTYTNLELLE+YGFLL  NPN+K FI +E DI  S+SWPK+SLYI Q+G
Sbjct: 271  YRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSWPKESLYIHQNG 330

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
             PSFAL+SALRLWAT PN R+ VGHL Y+G QLS +NEI VM+W+   C  +L   P+SI
Sbjct: 331  NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSI 390

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  I K+       E+++ +L  GGE   F E NG+  R    E   S+K +RS
Sbjct: 391  EEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDE-AESHSSQKLKRS 449

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVLSS 1179
            ++RWKLAVQWRL YK+ LV CI YCT +I  LSS
Sbjct: 450  LDRWKLAVQWRLLYKKALVDCIGYCTTTICSLSS 483


>ref|XP_006596494.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X1 [Glycine max]
          Length = 497

 Score =  399 bits (1026), Expect = e-108
 Identities = 197/392 (50%), Positives = 260/392 (66%), Gaps = 1/392 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EMGKG+ S W+PYLM LP  Y  +  F  FE  ALQVD A+W  EKA+ KA+ EW+
Sbjct: 106  CLLYEMGKGKTSRWHPYLMHLPHTYDVLAMFGEFEKHALQVDEAMWVTEKAMLKAKSEWK 165

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              TF++W+WA+ATISSRT+H+PWD+AGCLCPVGD FNY APG E  
Sbjct: 166  EAHSLMQDLMFKPQFFTFKAWVWAAATISSRTLHIPWDEAGCLCPVGDLFNYDAPGIEPS 225

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              + ++    ++S+                      ++ RLTD G+EED +AYCFYAR+ 
Sbjct: 226  GIEDLDRLLSNTSI-PDTIVLNGDKNIMVDAEQLDSHSWRLTDGGFEEDANAYCFYAREH 284

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KG+QVLL YGTYTNLELLEHYGFLL  NPN+K FI LE  +  S SW K+SLYI  +G
Sbjct: 285  YKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSSTSWSKESLYIHHNG 344

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL++ALRLWATP N R+SVGHLVYSG ++S +NEI +MKW+   C  +L   P+S+
Sbjct: 345  KPSFALLAALRLWATPQNRRRSVGHLVYSGSRVSTDNEIFIMKWLSKTCDAVLRNLPTSL 404

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  +D      +  E+ + ++    E   F E + +K+  +  +  LSRKARRS
Sbjct: 405  EEDTLLLNAMDNSQDFSTFMEITK-LVSSREETYTFLETHNMKDTHSFTDVILSRKARRS 463

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVL 1173
            M+RWKLAVQWRL+YK+++  CISYC + ++ L
Sbjct: 464  MDRWKLAVQWRLKYKKVIFDCISYCNKILDSL 495


>ref|XP_004167843.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cucumis sativus]
          Length = 389

 Score =  399 bits (1025), Expect = e-108
 Identities = 208/394 (52%), Positives = 257/394 (65%), Gaps = 1/394 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL E+ KG  S W PYL  LP+ Y  + +F  FE QALQVD+AIW  EKA  K+  +WR
Sbjct: 7    CLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWR 66

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
                            TF++WLWASATISSRT++VPWD+AGCLCPVGD FNYAAP  ESF
Sbjct: 67   GVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESF 126

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              +AV+  S  S    +                   +   LTD G+EE+  AYCFYAR++
Sbjct: 127  --NAVDVLSFPSHASLNDELELLEEQRD--------SQWALTDGGFEENASAYCFYAREN 176

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            YRKGEQVLLSYGTYTNLELLE+YGFLL  NPN+K FI +E DI  S+SWP++SLYI Q+G
Sbjct: 177  YRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSWPEESLYIHQNG 236

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
             PSFAL+SALRLWAT PN R+ VGHL Y+G QLS +NE  VM+W+   C  +L   P+SI
Sbjct: 237  NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSI 296

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  I K+       E+++ +L  GGE   F E NG+  R    E   S+K +RS
Sbjct: 297  EEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDE-AESHSSQKLKRS 355

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVLSS 1179
            ++RWKLAVQWRL YK+ LV CI YCT +I  LSS
Sbjct: 356  LDRWKLAVQWRLLYKKALVDCIGYCTTTICSLSS 389


>ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cicer arietinum]
          Length = 494

 Score =  392 bits (1008), Expect = e-106
 Identities = 202/386 (52%), Positives = 254/386 (65%), Gaps = 2/386 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL E+GKG+ S W+PYLM LP+ Y  +  F  FE  ALQVD AIW  EKAV KA+ EW+
Sbjct: 102  CLLYEVGKGKTSRWHPYLMHLPQSYDVLAMFGEFEKNALQVDEAIWITEKAVLKAKSEWK 161

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              TF++W+WA+ATISSRT+H+PWD+AGCLCPVGD FNY APGEE  
Sbjct: 162  EAHALMEDLMFKPQLLTFKAWVWAAATISSRTLHIPWDEAGCLCPVGDLFNYDAPGEELS 221

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              + V+    +SS+  +                   ++QRLTD G++ED +AYCFYAR  
Sbjct: 222  GIEDVDNFLSNSSIPVTTLSNGDKNIVVDEEQVDF-HSQRLTDGGFDEDANAYCFYARTH 280

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KG+QVLL YGTYTNLELLEHYGFLL  NPN+K FI LE  +  S SW K+SLYI  +G
Sbjct: 281  YKKGDQVLLCYGTYTNLELLEHYGFLLQGNPNDKVFIPLEPAMYTSTSWSKESLYIHHNG 340

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL++ALRLWATP N R+SVGHL YSG QLSA+NE  VMKW+   C  +L+   +SI
Sbjct: 341  KPSFALLAALRLWATPHNKRRSVGHLAYSGSQLSADNETFVMKWLLKTCKAVLKNMSTSI 400

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEA-NGLKERSNVVEFPLSRKARR 1074
            E D  ++  +D      +  E+ ++M     EV  F EA N   +  +     LS+K RR
Sbjct: 401  EDDTLLVNALDSSKEFFTFMEIAKLM-TSKDEVYTFLEAHNVTTDAHSFTGILLSKKVRR 459

Query: 1075 SMERWKLAVQWRLRYKQILVSCISYC 1152
             M+RWKLAV WRLRYK++LV CI+YC
Sbjct: 460  LMDRWKLAVVWRLRYKKVLVDCIAYC 485


>ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatula]
            gi|355517485|gb|AES99108.1| Protein SET DOMAIN GROUP
            [Medicago truncatula]
          Length = 532

 Score =  392 bits (1007), Expect = e-106
 Identities = 202/398 (50%), Positives = 257/398 (64%), Gaps = 14/398 (3%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL E+GKG+ S W+PYL+ LP+ Y  +  F  FE QALQVD A+W  EKAV KA+ EW+
Sbjct: 138  CLLYEVGKGKTSRWHPYLVHLPQSYDLLAMFGEFEKQALQVDEAMWVTEKAVQKAKSEWK 197

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASAT-------------ISSRTMHVPWDDAGCLCPVG 321
            EA              TF++W+WA+AT             ISSRT+H+PWD+AGCLCPVG
Sbjct: 198  EAHALMEDLMFKPQLLTFKAWVWAAATGRTVPETFHLPGLISSRTLHIPWDEAGCLCPVG 257

Query: 322  DFFNYAAPGEESFCYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYE 501
            D FNY APGEE    + V+    +  +                      N+QRLTD G+E
Sbjct: 258  DLFNYDAPGEELSGVEDVDHFLSNGDMNVVIDEGQIDF-----------NSQRLTDGGFE 306

Query: 502  EDVDAYCFYARKSYRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSN 678
            ED +AYCFYAR +Y+KG+QVLL YGTYTNLELLEHYGFLL  NPN+K FI LE  +  S 
Sbjct: 307  EDANAYCFYARTNYKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKIFIPLEPAMYTST 366

Query: 679  SWPKDSLYIQQDGKPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKN 858
            SW K+SLYI  +GKPSFAL++ALRLWATP N R+S+GHL YSG QLSA+NEI VMKW+  
Sbjct: 367  SWSKESLYIHPNGKPSFALLAALRLWATPHNKRRSIGHLAYSGSQLSADNEIIVMKWLSK 426

Query: 859  KCTILLEKFPSSIEKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSN 1038
             C  +L+  P+SIE D  +L  +D      +  ++ ++M     EV  F EA+ + +  +
Sbjct: 427  TCDAVLKNMPTSIEDDTLLLNALDCSQDFITFMKIVKLM-SSRDEVYTFLEAHNITDALS 485

Query: 1039 VVEFPLSRKARRSMERWKLAVQWRLRYKQILVSCISYC 1152
              +   S+K RRSM+RWKLAV WRLRYK++LV CISYC
Sbjct: 486  FCDTISSKKTRRSMDRWKLAVLWRLRYKRVLVDCISYC 523


>gb|ACU19071.1| unknown [Glycine max]
          Length = 497

 Score =  389 bits (999), Expect = e-105
 Identities = 194/392 (49%), Positives = 256/392 (65%), Gaps = 1/392 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EMGKG+ S W+PYLM LP  Y  +  F  FE  ALQVD A+W  EKA+ KA+ EW+
Sbjct: 106  CLLYEMGKGKTSRWHPYLMHLPHTYDVLAMFGEFEKHALQVDEAMWVTEKAMLKAKSEWK 165

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              TF++W+ A+ATISSRT+H+PWD+AGCLCPVGD FNY APG E  
Sbjct: 166  EAHSLMQDLMFKPQFFTFKAWVRAAATISSRTLHIPWDEAGCLCPVGDLFNYDAPGIEPS 225

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              + ++    ++S+                      ++ RLTD G+EED +AYCFYAR+ 
Sbjct: 226  GIEDLDRLLSNTSI-PDTIVLNGDKNIVVDAEQLDSHSWRLTDGGFEEDANAYCFYAREH 284

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KG+QVLL YGTYTNLELLEHYGFLL  NPN+K FI LE  +  S SW K+SLYI  +G
Sbjct: 285  YKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSSTSWSKESLYIHHNG 344

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL++ALRLWATP N R+SVGHLVY G ++S +NEI +MKW+   C  +L   P+ +
Sbjct: 345  KPSFALLAALRLWATPQNRRRSVGHLVYFGSRVSTDNEIFIMKWLSKTCDAVLRNLPTFL 404

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  +D      +  E+ +++     E   F E + +K+  +  +  LSRKARRS
Sbjct: 405  EEDTLLLNAMDNSQDFSTFMEITKLVF-SREETYTFLETHNMKDTHSFTDVILSRKARRS 463

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVL 1173
            M+RWKLAVQWRL+YK++   CISYC + ++ L
Sbjct: 464  MDRWKLAVQWRLKYKKVTFDCISYCNKILDSL 495


>ref|XP_006400256.1| hypothetical protein EUTSA_v10015946mg [Eutrema salsugineum]
            gi|557101346|gb|ESQ41709.1| hypothetical protein
            EUTSA_v10015946mg [Eutrema salsugineum]
          Length = 506

 Score =  382 bits (980), Expect = e-103
 Identities = 202/406 (49%), Positives = 259/406 (63%), Gaps = 13/406 (3%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EM KG+ S W PYL+ LPR Y    +F  FE QALQV+ A+W AEKA++K++ EW+
Sbjct: 103  CLLYEMSKGKKSFWYPYLVHLPRDYDLSSTFGEFEKQALQVEDAVWAAEKAIAKSQSEWK 162

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              + ++WLWASATISSRT+H+PWD AGCLCPVGD FNY APG++  
Sbjct: 163  EAVTLMKVLDLKPKFQSLQAWLWASATISSRTLHIPWDSAGCLCPVGDLFNYDAPGDDLN 222

Query: 361  CYDAVECGSQSSSLE-ASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARK 537
              +  E   Q+SS +  S                    ++RLTD G++ED +AYC YAR+
Sbjct: 223  TSEGPELVIQTSSPKPVSTTHHECRNNAEEAGHVVETQSERLTDGGFDEDANAYCLYARR 282

Query: 538  SYRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIPS--NSWPKDSLYIQQ 711
            +Y+ GEQVLL YGTYTNLELLEHYGF+L  N N+K FI LE+ + S  +SWPKDSLYI Q
Sbjct: 283  NYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLYSLASSWPKDSLYIHQ 342

Query: 712  DGKPSFALMSALRLWATPPNHR-KSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFP 888
            DGKPSFAL+S LRLW  P N R K+   LVY+G Q+S +NEI VMKWM +KC  +L   P
Sbjct: 343  DGKPSFALVSTLRLWLIPQNQRDKTAMRLVYAGSQISVKNEILVMKWMSDKCGRVLRDLP 402

Query: 889  SSIEKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSN---------V 1041
            +S+ +D  +L+ I  +  +P V   ++     G EV  F + N L +  N          
Sbjct: 403  TSLLEDTVLLQDIKNLQ-DPEVCLKQKETEAFGSEVRAFLDVNHLWDLINGDVIGLSGKA 461

Query: 1042 VEFPLSRKARRSMERWKLAVQWRLRYKQILVSCISYCTESINVLSS 1179
            VEF  SRK  R + +W+L+VQWRLRYK+ LV CISYC E +N LSS
Sbjct: 462  VEF--SRKTNRIISKWRLSVQWRLRYKRTLVDCISYCNEKMNHLSS 505


>ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Capsella rubella]
            gi|482558148|gb|EOA22340.1| hypothetical protein
            CARUB_v10002957mg [Capsella rubella]
          Length = 503

 Score =  382 bits (980), Expect = e-103
 Identities = 198/399 (49%), Positives = 253/399 (63%), Gaps = 8/399 (2%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EM KG+ S W PYL+ LPR Y  + +F  FE QALQV+ A+W  EKA +K + EW+
Sbjct: 103  CLLYEMSKGKKSFWYPYLVHLPRDYDLLATFGEFEKQALQVEDAVWVTEKATAKCQSEWK 162

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              +F++WLWASATISSRT+H+PWD AGCLCP GD FNY APG++  
Sbjct: 163  EAGTLMKELDLKPKFQSFQAWLWASATISSRTLHIPWDSAGCLCPAGDLFNYDAPGDDLN 222

Query: 361  CYDAVECGSQSSSLE-ASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARK 537
              +  E   Q+SS + AS                    ++RLTD G+EED +AYC YAR+
Sbjct: 223  YSEGPESAIQTSSPQPASITNLECRNNEEEAGLNVEIQSERLTDGGFEEDANAYCLYARR 282

Query: 538  SYRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDIPS--NSWPKDSLYIQQ 711
            +Y+ GEQVLL YGTYTNLELLEHYGF+L  N N+K FI LE+ + S  +SWPKDSLYI Q
Sbjct: 283  NYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLYSLASSWPKDSLYIHQ 342

Query: 712  DGKPSFALMSALRLWATPPNHR-KSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFP 888
            DGKPSFAL+S LRLW  P + R KSV  LVY+G Q+S +NEI VMKWM  KC  +L   P
Sbjct: 343  DGKPSFALVSTLRLWLVPQSQRDKSVMRLVYAGSQISVKNEILVMKWMSEKCGSVLRNLP 402

Query: 889  SSIEKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKE----RSNVVEFPL 1056
            +S+ +D  +L  IDK+  +P +   ++     G E+  F + N L +        VEFP 
Sbjct: 403  TSVSEDNLLLHNIDKLQ-DPKIRLEQKETEAFGSEMRAFLDVNRLWDVIGFSGKDVEFP- 460

Query: 1057 SRKARRSMERWKLAVQWRLRYKQILVSCISYCTESINVL 1173
             R+  R M +W+L+VQWRL YK+ L  CI YC E +N L
Sbjct: 461  -RRTNRMMSKWRLSVQWRLSYKRTLADCIYYCNEKMNNL 498


>ref|XP_006596495.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X2 [Glycine max]
          Length = 483

 Score =  380 bits (977), Expect = e-103
 Identities = 191/392 (48%), Positives = 254/392 (64%), Gaps = 1/392 (0%)
 Frame = +1

Query: 1    CLLAEMGKGEGSLWNPYLMQLPRHYQTVPSFTRFEIQALQVDFAIWDAEKAVSKAELEWR 180
            CLL EMGKG+ S W+PYLM LP  Y               VD A+W  EKA+ KA+ EW+
Sbjct: 106  CLLYEMGKGKTSRWHPYLMHLPHTYD--------------VDEAMWVTEKAMLKAKSEWK 151

Query: 181  EAXXXXXXXXXXXXXXTFRSWLWASATISSRTMHVPWDDAGCLCPVGDFFNYAAPGEESF 360
            EA              TF++W+WA+ATISSRT+H+PWD+AGCLCPVGD FNY APG E  
Sbjct: 152  EAHSLMQDLMFKPQFFTFKAWVWAAATISSRTLHIPWDEAGCLCPVGDLFNYDAPGIEPS 211

Query: 361  CYDAVECGSQSSSLEASXXXXXXXXXXXXXXXXXXGNAQRLTDAGYEEDVDAYCFYARKS 540
              + ++    ++S+  +                   ++ RLTD G+EED +AYCFYAR+ 
Sbjct: 212  GIEDLDRLLSNTSIPDTIVLNGDKNIMVDAEQLD-SHSWRLTDGGFEEDANAYCFYAREH 270

Query: 541  YRKGEQVLLSYGTYTNLELLEHYGFLLNANPNEKAFIQLESDI-PSNSWPKDSLYIQQDG 717
            Y+KG+QVLL YGTYTNLELLEHYGFLL  NPN+K FI LE  +  S SW K+SLYI  +G
Sbjct: 271  YKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSSTSWSKESLYIHHNG 330

Query: 718  KPSFALMSALRLWATPPNHRKSVGHLVYSGLQLSAENEIAVMKWMKNKCTILLEKFPSSI 897
            KPSFAL++ALRLWATP N R+SVGHLVYSG ++S +NEI +MKW+   C  +L   P+S+
Sbjct: 331  KPSFALLAALRLWATPQNRRRSVGHLVYSGSRVSTDNEIFIMKWLSKTCDAVLRNLPTSL 390

Query: 898  EKDVFILKFIDKMDGNPSVEEVEQMMLVCGGEVSGFFEANGLKERSNVVEFPLSRKARRS 1077
            E+D  +L  +D      +  E+ + ++    E   F E + +K+  +  +  LSRKARRS
Sbjct: 391  EEDTLLLNAMDNSQDFSTFMEITK-LVSSREETYTFLETHNMKDTHSFTDVILSRKARRS 449

Query: 1078 MERWKLAVQWRLRYKQILVSCISYCTESINVL 1173
            M+RWKLAVQWRL+YK+++  CISYC + ++ L
Sbjct: 450  MDRWKLAVQWRLKYKKVIFDCISYCNKILDSL 481

