BLASTX nr result

ID: Rheum21_contig00018347 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00018347
         (1421 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like ...   530   e-148
gb|EMJ16490.1| hypothetical protein PRUPE_ppa004975mg [Prunus pe...   526   e-146
gb|ESW13964.1| hypothetical protein PHAVU_008G241400g [Phaseolus...   513   e-143
ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   513   e-143
ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   509   e-142
ref|XP_006596494.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   508   e-141
ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Popu...   506   e-141
ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   503   e-140
ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   502   e-139
ref|XP_006430400.1| hypothetical protein CICLE_v10011537mg [Citr...   498   e-138
gb|ACU19071.1| unknown [Glycine max]                                  497   e-138
ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Caps...   494   e-137
gb|EXC05430.1| Protein SET DOMAIN GROUP 40 [Morus notabilis]          494   e-137
gb|EOY03097.1| SET domain group 40, putative isoform 1 [Theobrom...   493   e-137
ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatul...   492   e-136
emb|CBI27360.3| unnamed protein product [Vitis vinifera]              491   e-136
ref|XP_006481945.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   488   e-135
ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thalia...   487   e-135
ref|XP_006596495.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   485   e-134
ref|XP_006400256.1| hypothetical protein EUTSA_v10015946mg [Eutr...   481   e-133

>ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score =  530 bits (1364), Expect = e-148
 Identities = 274/485 (56%), Positives = 348/485 (71%), Gaps = 14/485 (2%)
 Frame = -2

Query: 1417 ELGVSDAPLNSATSSYS-----SCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRA 1253
            ELG+SD      T          C+GH+L ++ FP AGGRGLAA RDL +GELIL VP++
Sbjct: 10   ELGISDFTTTPTTVPSRLQIPHCCVGHSLCVSHFPHAGGRGLAAARDLSQGELILTVPKS 69

Query: 1252 ALMTRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDL 1073
            ALMT  S+L KDEKLS A+ RH+SLSS QIL + LLAE+SK K S+W+PYL+ LPRSYD 
Sbjct: 70   ALMTSQSLL-KDEKLSVAVKRHTSLSSPQILTICLLAEMSKGKSSWWHPYLMQLPRSYDT 128

Query: 1072 LPTFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASAT 893
            L  F  FE +ALQV+DA W TE+AI K E +WK+AI LME+LK KPQL  F+AWLWAS+T
Sbjct: 129  LANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIPLMEELKLKPQLQNFRAWLWASST 188

Query: 892  ISSRTLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSS-GYASYKDNSSDDGDECED 716
            +SSRT+H+PWDDAGCLCPVGD +NYA PGE+    + ++ S   +S +D+S  + D   +
Sbjct: 189  VSSRTMHIPWDDAGCLCPVGDFYNYAAPGEEPCGWEDLKGSRNESSLQDSSFWNKDATSN 248

Query: 715  STVNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFI 536
            S   +D     S RLTDGGY+E LAAYCFYA+KNY+KG+QVLL YGTYTNLELLEHYGF+
Sbjct: 249  SDAEQDDV--LSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFL 306

Query: 535  LGNNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVG 356
            L  NPNDK FIPL  EV    S SWP DS  I  NG PSF+LLS+LRLWATP +QRRSVG
Sbjct: 307  LDENPNDKAFIPLEPEVY--ASSSWPKDSLYIHQNGKPSFALLSALRLWATPASQRRSVG 364

Query: 355  HLAFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQGSF--SEPGN 182
            HL +SG+QLS ENE+ VM+WIA +C  +L+NL TS+++D  +L ++D+ Q      E GN
Sbjct: 365  HLVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSVEEDSLLLCALDKMQDPDLPMEVGN 424

Query: 181  LPCAVATELSSFLSSMGLHNGEIGESN------SRVQRCLDKWKLSVNWRLGYKKILVDC 20
               +   E S+FL +   H+ +IG+ N       + +R +++WKL+V WRL +K+ILVDC
Sbjct: 425  ALRSSGVEFSAFLEA---HDLKIGDGNVGLLLSEKARRSMERWKLAVQWRLRHKRILVDC 481

Query: 19   ISFCT 5
            IS CT
Sbjct: 482  ISRCT 486


>gb|EMJ16490.1| hypothetical protein PRUPE_ppa004975mg [Prunus persica]
          Length = 483

 Score =  526 bits (1354), Expect = e-146
 Identities = 270/475 (56%), Positives = 347/475 (73%), Gaps = 4/475 (0%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            AE+G+SD+     T    SCLGH+L ++ FP AGGRGL A RDLR GEL+L+VP++ LMT
Sbjct: 16   AEIGISDS-----TCCGDSCLGHSLDVSYFPSAGGRGLGAARDLREGELLLKVPKSVLMT 70

Query: 1240 RDSILEKDEKLSDAIN--RHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLP 1067
            ++S+L KDEKLS ++N   H SLS TQILAV LL E+ K K S+W+PYL++LPRSYD+L 
Sbjct: 71   KESLLLKDEKLSLSVNDYAHHSLSPTQILAVCLLYEMGKGKISWWHPYLMNLPRSYDILA 130

Query: 1066 TFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATIS 887
            TFG FE +ALQV+DA WA EKA  K E +WKEA ALM+QLK KPQL+TFKAWLWASATIS
Sbjct: 131  TFGEFEKQALQVDDAIWAAEKATLKAEYEWKEANALMKQLKLKPQLLTFKAWLWASATIS 190

Query: 886  SRTLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTV 707
            SRTLH+PWD AGCLCPVGDLFNY+ PGE+ S  +S+  + +    +++S   D       
Sbjct: 191  SRTLHIPWDAAGCLCPVGDLFNYSAPGEEPSRCESMEHTMHDLVNEDTSGMAD------- 243

Query: 706  NEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGN 527
              +Q    S RLTDGG+E+ + AYCFYAKK+Y+KG+QVLL YGTYTNLELLEHYGF+L  
Sbjct: 244  -VEQLVSDSRRLTDGGFEKDVDAYCFYAKKSYKKGEQVLLSYGTYTNLELLEHYGFLLNE 302

Query: 526  NPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLA 347
            NPNDKV+IPL  E+    SCSWP +S  I  NG PSF+LLS+LRLWATP NQRRSVGHL 
Sbjct: 303  NPNDKVYIPLEPEIYS--SCSWPKESLFIHQNGKPSFALLSTLRLWATPQNQRRSVGHLV 360

Query: 346  FSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQGSFS--EPGNLPC 173
            +SG  LS +NE+ +++WI+  C TIL+NL+TS +DD  +L+++D+ Q   +  E  N+  
Sbjct: 361  YSGLHLSIQNEMFILRWISKKCTTILKNLSTSFEDDSLLLSAIDKIQNLDAPLELNNVSS 420

Query: 172  AVATELSSFLSSMGLHNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFC 8
                E+ +F +++ L  GE     S+     ++W+L+V WRL YKKILVDCIS+C
Sbjct: 421  TCRDEICAFKANV-LQKGERSSMESK-----ERWRLAVEWRLSYKKILVDCISYC 469


>gb|ESW13964.1| hypothetical protein PHAVU_008G241400g [Phaseolus vulgaris]
          Length = 497

 Score =  513 bits (1322), Expect = e-143
 Identities = 266/482 (55%), Positives = 347/482 (71%), Gaps = 11/482 (2%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYS--SCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAAL 1247
            A+LG+SD+   +    +S  SCLG +L +A FP +GGRGL A RDLRRGE++L VP++AL
Sbjct: 16   AQLGISDSTTRTDQPQHSPSSCLGSSLCVAHFPHSGGRGLGAVRDLRRGEIVLSVPKSAL 75

Query: 1246 MTRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLP 1067
            MTR++++E D+KL  A+NRHS LSS QIL V LL EV K K S W+PYL+HLP +YD+L 
Sbjct: 76   MTRENVME-DKKLCFAVNRHSCLSSAQILIVCLLYEVCKGKTSRWHPYLMHLPHTYDILA 134

Query: 1066 TFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATIS 887
             F  FE +ALQV++A W TEKAI K +S+WKEA ALME L F+PQ +TFKAW+WA+ATIS
Sbjct: 135  MFDEFEKRALQVDEAVWVTEKAILKAKSEWKEAHALMEDLMFRPQFLTFKAWVWAAATIS 194

Query: 886  SRTLHVPWDDAGCLCPVGDLFNYAPPGEDFSD-KDSVRSSGYASYKDNSSDDGDECEDST 710
            SRTLHVPWD+AGCLCPVGDLFNY  PGE+ SD +D       +S  D +  +GD  ++  
Sbjct: 195  SRTLHVPWDEAGCLCPVGDLFNYDAPGEESSDIEDLEHLLSNSSIHDTNLLNGD--KNIV 252

Query: 709  VNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILG 530
            V+ +Q +  S RLTDGG+EE++ AYCFYA+ +Y+KG QVLL YGTYTNLELLEHYGF+L 
Sbjct: 253  VDAEQLDSHSQRLTDGGFEENVNAYCFYARAHYKKGDQVLLCYGTYTNLELLEHYGFLLQ 312

Query: 529  NNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHL 350
             NPNDKVFIPL   V    S SW  +S  I +NG PSF+LL++LRLWATP N+R+SVGHL
Sbjct: 313  ENPNDKVFIPLDPAVY--FSTSWSMESLYIHHNGKPSFALLAALRLWATPQNKRKSVGHL 370

Query: 349  AFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQG--SFSEPGNLP 176
             +SGSQLS +NE+ + +W++  C T+L+NL TSID+D  +L +MD  Q   +F E   L 
Sbjct: 371  VYSGSQLSTDNEIFITKWLSKTCATVLKNLPTSIDEDTLLLNAMDSSQDIFTFMEITKL- 429

Query: 175  CAVATELSSFLSSMGLHNGEIGES------NSRVQRCLDKWKLSVNWRLGYKKILVDCIS 14
             +   E+ +FL +   HN     S      + + +R +D+WKL+V WRL YKK+L DCIS
Sbjct: 430  MSSKDEIFTFLET---HNMRDAHSLTEVILSRKARRSMDRWKLAVQWRLKYKKVLFDCIS 486

Query: 13   FC 8
            +C
Sbjct: 487  YC 488


>ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cicer arietinum]
          Length = 494

 Score =  513 bits (1322), Expect = e-143
 Identities = 265/479 (55%), Positives = 351/479 (73%), Gaps = 8/479 (1%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            +++G+SD+  +S    + SCLGH+L ++ FP +GGRGL A RDLRRGE++LRVP++ALMT
Sbjct: 16   SQIGISDSTNHS--QHFFSCLGHSLCVSIFPHSGGRGLGAVRDLRRGEIVLRVPKSALMT 73

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
            R+S++E D+KL  A+N+H SLSS QIL V LL EV K K S W+PYL+HLP+SYD+L  F
Sbjct: 74   RESVME-DKKLCIAVNKHPSLSSVQILTVCLLYEVGKGKTSRWHPYLMHLPQSYDVLAMF 132

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            G FE  ALQV++A W TEKA+ K +S+WKEA ALME L FKPQL+TFKAW+WA+ATISSR
Sbjct: 133  GEFEKNALQVDEAIWITEKAVLKAKSEWKEAHALMEDLMFKPQLLTFKAWVWAAATISSR 192

Query: 880  TLHVPWDDAGCLCPVGDLFNYAPPGEDFS---DKDSVRSSGYASYKDNSSDDGDECEDST 710
            TLH+PWD+AGCLCPVGDLFNY  PGE+ S   D D+  S+  +S    +  +GD  ++  
Sbjct: 193  TLHIPWDEAGCLCPVGDLFNYDAPGEELSGIEDVDNFLSN--SSIPVTTLSNGD--KNIV 248

Query: 709  VNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILG 530
            V+E+Q +  S RLTDGG++E   AYCFYA+ +Y+KG QVLL YGTYTNLELLEHYGF+L 
Sbjct: 249  VDEEQVDFHSQRLTDGGFDEDANAYCFYARTHYKKGDQVLLCYGTYTNLELLEHYGFLLQ 308

Query: 529  NNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHL 350
             NPNDKVFIPL  E     S SW  +S  I +NG PSF+LL++LRLWATP N+RRSVGHL
Sbjct: 309  GNPNDKVFIPL--EPAMYTSTSWSKESLYIHHNGKPSFALLAALRLWATPHNKRRSVGHL 366

Query: 349  AFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQG--SFSEPGNLP 176
            A+SGSQLS +NE  VM+W+   C  +L+N++TSI+DD  ++ ++D  +   +F E   L 
Sbjct: 367  AYSGSQLSADNETFVMKWLLKTCKAVLKNMSTSIEDDTLLVNALDSSKEFFTFMEIAKLM 426

Query: 175  CA---VATELSSFLSSMGLHNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFC 8
             +   V T L +   +   H+      + +V+R +D+WKL+V WRL YKK+LVDCI++C
Sbjct: 427  TSKDEVYTFLEAHNVTTDAHSFTGILLSKKVRRLMDRWKLAVVWRLRYKKVLVDCIAYC 485


>ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
            gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP,
            putative [Ricinus communis]
          Length = 510

 Score =  509 bits (1312), Expect = e-142
 Identities = 259/479 (54%), Positives = 348/479 (72%), Gaps = 7/479 (1%)
 Frame = -2

Query: 1420 AELGVSDAPLNSAT-SSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALM 1244
            AELG+SD+  +S +    +SCLG +L+++ FPDAGGRGL A RDL++GEL+LRVP++AL+
Sbjct: 19   AELGISDSSNSSQSLEEPNSCLGISLTVSHFPDAGGRGLGAARDLKKGELVLRVPKSALL 78

Query: 1243 TRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPT 1064
            T+DS L KD  L  AIN HS+LS TQ L V LL E+SK + SFWYPYL+HLPRSY++L T
Sbjct: 79   TKDSFL-KDGLLLSAINNHSALSPTQTLTVCLLYEMSKGQSSFWYPYLMHLPRSYEILAT 137

Query: 1063 FGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISS 884
            F  FE +ALQV+DA W  EKAISK E D KEA +LM++L+ KPQ +T +AW+WA ATISS
Sbjct: 138  FSEFEKQALQVDDAIWTAEKAISKAELDRKEAYSLMQELRLKPQFLTLRAWIWACATISS 197

Query: 883  RTLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTVN 704
            RT+H+PWD+AGCLCPVGD FNYA PGE+ S  ++  S   AS  +++S   +    +  +
Sbjct: 198  RTMHIPWDEAGCLCPVGDFFNYAAPGEESSSPENDESWKPASCLEDASLSSERSTSNFCS 257

Query: 703  EDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNN 524
            E   +     LTDGG++E  AAYCFYA++NY+KG QVLL YGTYTNLELLEHYGF+L  N
Sbjct: 258  E-TFDVQLKSLTDGGFDEDKAAYCFYARQNYKKGAQVLLSYGTYTNLELLEHYGFLLNEN 316

Query: 523  PNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLAF 344
            PNDKVFIPL  E+    S +WP +S  I  +G PSFSLL +LRLWATP N+RRS+GHLA+
Sbjct: 317  PNDKVFIPL--ELSMQSSNTWPKESMYIHQDGKPSFSLLCALRLWATPSNRRRSMGHLAY 374

Query: 343  SGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQGSFS--EPGNLPCA 170
            SGSQLS ENEV +++WI+  C  +L+ L T++++D  +L+++D+ Q   S  E G +   
Sbjct: 375  SGSQLSVENEVSILKWISRKCHAVLKKLPTTVEEDSLLLSAIDKIQNCHSPLELGKMLHG 434

Query: 169  VATELSSFLSSMGLHNGEIGESNS----RVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
               + S+F+ +  L N +IG  ++    + +R +++WKL+V WRL YKK L+DCIS+CT
Sbjct: 435  FEGQASAFVEAHNLLNIKIGTESTMLCGKAKRSMERWKLAVKWRLSYKKTLIDCISYCT 493


>ref|XP_006596494.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X1 [Glycine max]
          Length = 497

 Score =  508 bits (1309), Expect = e-141
 Identities = 260/481 (54%), Positives = 345/481 (71%), Gaps = 8/481 (1%)
 Frame = -2

Query: 1420 AELGVSDAPL--NSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAAL 1247
            A+LG+SD+    N    S SSCLG +LS++ FP +GGRGL A RDLRRGE++LRVP++AL
Sbjct: 16   AQLGISDSTTRTNQPQHSLSSCLGSSLSVSHFPHSGGRGLGAVRDLRRGEIVLRVPKSAL 75

Query: 1246 MTRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLP 1067
            MTR++++E D+KL DA+NRHSSLSS QIL V LL E+ K K S W+PYL+HLP +YD+L 
Sbjct: 76   MTRETVME-DKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHPYLMHLPHTYDVLA 134

Query: 1066 TFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATIS 887
             FG FE  ALQV++A W TEKA+ K +S+WKEA +LM+ L FKPQ  TFKAW+WA+ATIS
Sbjct: 135  MFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVWAAATIS 194

Query: 886  SRTLHVPWDDAGCLCPVGDLFNYAPPG-EDFSDKDSVRSSGYASYKDNSSDDGDECEDST 710
            SRTLH+PWD+AGCLCPVGDLFNY  PG E    +D  R     S  D    +GD  ++  
Sbjct: 195  SRTLHIPWDEAGCLCPVGDLFNYDAPGIEPSGIEDLDRLLSNTSIPDTIVLNGD--KNIM 252

Query: 709  VNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILG 530
            V+ +Q +  S RLTDGG+EE   AYCFYA+++Y+KG QVLL YGTYTNLELLEHYGF+L 
Sbjct: 253  VDAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYGTYTNLELLEHYGFLLQ 312

Query: 529  NNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHL 350
             NPNDKVFIPL   +    S SW  +S  I +NG PSF+LL++LRLWATP N+RRSVGHL
Sbjct: 313  ENPNDKVFIPLEPALYS--STSWSKESLYIHHNGKPSFALLAALRLWATPQNRRRSVGHL 370

Query: 349  AFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQ--GSFSEPGNLP 176
             +SGS++S +NE+ +M+W++  C  +L+NL TS+++D  +L +MD  Q   +F E   L 
Sbjct: 371  VYSGSRVSTDNEIFIMKWLSKTCDAVLRNLPTSLEEDTLLLNAMDNSQDFSTFMEITKL- 429

Query: 175  CAVATELSSFLSSMGL---HNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
             +   E  +FL +  +   H+      + + +R +D+WKL+V WRL YKK++ DCIS+C 
Sbjct: 430  VSSREETYTFLETHNMKDTHSFTDVILSRKARRSMDRWKLAVQWRLKYKKVIFDCISYCN 489

Query: 4    R 2
            +
Sbjct: 490  K 490


>ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Populus trichocarpa]
            gi|550340570|gb|EEE85750.2| hypothetical protein
            POPTR_0004s07950g [Populus trichocarpa]
          Length = 518

 Score =  506 bits (1304), Expect = e-141
 Identities = 259/479 (54%), Positives = 341/479 (71%), Gaps = 7/479 (1%)
 Frame = -2

Query: 1420 AELGVSDAPLNSAT--SSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAAL 1247
            A LG+SD   N +    S +SCLGH+L+++ FPDAGGRGLAA RDL++GEL+LRVP++ L
Sbjct: 45   ANLGISDCTTNLSLHPQSPTSCLGHSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVL 104

Query: 1246 MTRDSILEKDEKLSDAINR--HSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDL 1073
            +TRDS+L KDEKL   +N   +SSLS TQILAV LL E+ K K S+WYPYL+HLPRSYD+
Sbjct: 105  ITRDSLL-KDEKLCSFVNNNTYSSLSPTQILAVCLLYEMGKGKSSWWYPYLMHLPRSYDV 163

Query: 1072 LPTFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASAT 893
            L +F                 +KA+SK +S+WKEA +LM+ LK KPQL+TF+AW+WASAT
Sbjct: 164  LASF-----------------KKAVSKAKSEWKEANSLMDALKLKPQLLTFRAWIWASAT 206

Query: 892  ISSRTLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDS 713
            ISSR LH+PWD+AGCLCPVGDLFNYA PGE+ +D ++V     AS  +++S    E  D 
Sbjct: 207  ISSRALHIPWDEAGCLCPVGDLFNYAAPGEESNDLENVVHLMNASSLEDTSLSNGETTDD 266

Query: 712  TVNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFIL 533
             + +    G   RLTDGG+ E++AAYCFYA+KNY+KG QVLL YGTYTNLELLEHYGF+L
Sbjct: 267  FIGDQPDIGLE-RLTDGGFNENMAAYCFYARKNYKKGTQVLLGYGTYTNLELLEHYGFLL 325

Query: 532  GNNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGH 353
              NPNDKVFIPL   +      SWP  S  I  +G PSF+LLS+LRLWATPPNQRRS+ H
Sbjct: 326  NENPNDKVFIPLEPSMYS--FISWPKVSMYIHQDGKPSFALLSALRLWATPPNQRRSISH 383

Query: 352  LAFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQGSFSEPGNLPC 173
            L +SGS+LS  NE+ V++WI+ NC  IL NL T I++D  +L+++++ + +F +P  L C
Sbjct: 384  LVYSGSRLSVYNEISVLKWISKNCALILSNLPTVIEEDSLLLSTINKIE-NFDKPTELVC 442

Query: 172  AVATELSSFLSSMGLHNGEIGES---NSRVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
                E  +FL +  L  G+ G     + + +R +++WKL+V WR+ YKK L+DCIS+CT
Sbjct: 443  TSGGEARAFLEASDLQKGKNGSELMFSGKTKRVIERWKLAVQWRISYKKTLIDCISYCT 501


>ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Fragaria vesca subsp.
            vesca]
          Length = 511

 Score =  503 bits (1296), Expect = e-140
 Identities = 252/453 (55%), Positives = 329/453 (72%), Gaps = 5/453 (1%)
 Frame = -2

Query: 1351 ALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMTRDSILEKDEKLSDAINRHSSLSS 1172
            +L ++ F  AGGRGL A RDL +GEL+L+VP++AL+TR+++L KD+ LS A+N H+SLS 
Sbjct: 47   SLVVSYFHGAGGRGLGAARDLEKGELVLKVPKSALITRETLLLKDDHLSLAVNAHTSLSP 106

Query: 1171 TQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTFGPFELKALQVEDATWATEKAISK 992
             Q L V LL E+ K K S+WYPYLI+LPRSYD++ TFG FE +ALQVEDA WA +KAISK
Sbjct: 107  IQTLCVCLLYEMGKGKTSWWYPYLINLPRSYDIIATFGEFEKQALQVEDAIWAADKAISK 166

Query: 991  VESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSRTLHVPWDDAGCLCPVGDLFNYAP 812
             E +WKE   LMEQLK KPQL TF+AWLWASAT+SSRTLH+PWD AGCLCPVGDLFNY+ 
Sbjct: 167  AEFEWKETNTLMEQLKLKPQLRTFRAWLWASATVSSRTLHIPWDGAGCLCPVGDLFNYSA 226

Query: 811  PGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTVNEDQCNGCSGRLTDGGYEEHLAAYC 632
            P ED SD D+V    +     + +   +E     ++ +Q +  SGRLTDG +E ++ AYC
Sbjct: 227  PVED-SDSDNVELRTHELALQDMTTVKEE-TSCILDNEQLDSDSGRLTDGRFENNVGAYC 284

Query: 631  FYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNNPNDKVFIPLPHEVIGSCSCSWPAD 452
            FYAKK+YRKG+QVLL YGTYTNLELLEHYGF+L  NPNDK ++PL  E+    SCSWP +
Sbjct: 285  FYAKKSYRKGEQVLLSYGTYTNLELLEHYGFLLNENPNDKAYVPLEPEIYS--SCSWPKE 342

Query: 451  SFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLAFSGSQLSPENEVIVMQWIATNCGTI 272
               I  +G PSF+LLS+LRLWATP N+RRSVGHLA+SG QLS ENE+ VM+WI+  C +I
Sbjct: 343  FLYIHQSGKPSFALLSALRLWATPANRRRSVGHLAYSGLQLSIENEIFVMRWISNKCNSI 402

Query: 271  LQNLATSIDDDLKVLASMDEFQGSFS--EPGNLPCAVATELSSFLSSM---GLHNGEIGE 107
            ++NL T+ ++D  +L+ +D+ Q   +  E  N+      E+ ++ + +   G  + E   
Sbjct: 403  VKNLPTTFEEDSLLLSVIDKIQNVNAPLEFANISSVSTDEICTYRAEVLKKGATDSETVV 462

Query: 106  SNSRVQRCLDKWKLSVNWRLGYKKILVDCISFC 8
            S   +QR  ++W+L+V WRL YKKILVDCISFC
Sbjct: 463  SRKTMQRSRERWRLAVQWRLSYKKILVDCISFC 495


>ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cucumis sativus]
          Length = 483

 Score =  502 bits (1293), Expect = e-139
 Identities = 262/478 (54%), Positives = 342/478 (71%), Gaps = 6/478 (1%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            A+ G+SD+ ++  TS   SCLGH+L ++ FPD GGRGLAA R L++GEL+LR P++ L+T
Sbjct: 15   ADHGISDS-VDQPTSH--SCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRAPKSILLT 71

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
              S+  +DEKL  A+ R+ SLSSTQ L   LL E+SK   S+W+PYL HLP+SYD+L TF
Sbjct: 72   TQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILATF 131

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            G FE +ALQV+ A WATEKA  K  +DW+    LM++   K QL TFKAWLWASATISSR
Sbjct: 132  GEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWASATISSR 191

Query: 880  TLHVPWDDAGCLCPVGDLFNY-APPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTVN 704
            TL+VPWD+AGCLCPVGDLFNY AP GE F+  D +    +AS  D          +  + 
Sbjct: 192  TLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLND----------ELELL 241

Query: 703  EDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNN 524
            E+Q       LTDGG+EE+ +AYCFYA+++YRKG+QVLL YGTYTNLELLE+YGF+L  N
Sbjct: 242  EEQ-RDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 523  PNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLAF 344
            PNDKVFIP+ H++ G  S SWP +S  I  NG PSF+LLS+LRLWAT PN+RR VGHLA+
Sbjct: 301  PNDKVFIPIEHDIYG--SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAY 358

Query: 343  SGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQGSFSEPGNLPCAVA 164
            +GSQLS +NE++VMQW++ NC T+L NL TSI++D ++L ++ + Q     P  L   + 
Sbjct: 359  AGSQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQ-DLQVPRELQKTLL 417

Query: 163  T---ELSSFLSSMGLHNGEIGESNS--RVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
            T   E  +FL + G+ N +  ES+S  +++R LD+WKL+V WRL YKK LVDCI +CT
Sbjct: 418  TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 475


>ref|XP_006430400.1| hypothetical protein CICLE_v10011537mg [Citrus clementina]
            gi|557532457|gb|ESR43640.1| hypothetical protein
            CICLE_v10011537mg [Citrus clementina]
          Length = 503

 Score =  498 bits (1282), Expect = e-138
 Identities = 255/477 (53%), Positives = 337/477 (70%), Gaps = 6/477 (1%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            AE+G++D+ + + + S  +CLGH+L+++ FP+AGGRGLAA RDL +GELILRVP+ AL T
Sbjct: 16   AEMGITDSTIQNPSRS-RNCLGHSLTVSHFPEAGGRGLAAARDLTKGELILRVPKTALFT 74

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
             + +L+ D+K S A+NRH  LS +QIL V LL EV K K S WY YL+ LPR Y++L TF
Sbjct: 75   TECLLKSDQKRSLAVNRHLFLSPSQILIVCLLYEVGKGKSSRWYTYLMLLPRCYEILATF 134

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            GPFE +ALQV+DA WA EKA+SK ES+WK+AI LME+LK KPQL++FKAWLWASAT+SSR
Sbjct: 135  GPFEKQALQVDDAIWAAEKAVSKAESEWKQAIKLMEELKLKPQLLSFKAWLWASATVSSR 194

Query: 880  TLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRS-SGYASYKDNSSDDGDECEDSTVN 704
            T+H+ WD+AGCLCPVGDLFNYA PGE       +    G+         D  +  DS   
Sbjct: 195  TMHISWDEAGCLCPVGDLFNYAAPGEGEESNIGIEDVEGWMPAPCLPKGDTTDVLDS--- 251

Query: 703  EDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNN 524
             ++ NG   RLTDG +EE + +YCFYA+ NY++G+QVLL YGTYTNLELLEHYGF+L  N
Sbjct: 252  -EKFNGHLRRLTDGRFEEDVNSYCFYARNNYKRGEQVLLSYGTYTNLELLEHYGFLLNEN 310

Query: 523  PNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLAF 344
            PNDKVFI L   +  SC CSWP +S  I  NG PSF+LLS+LRLW TP NQRRSVGHLA+
Sbjct: 311  PNDKVFISLEPGMY-SC-CSWPRESQYIDQNGKPSFALLSALRLWMTPANQRRSVGHLAY 368

Query: 343  SGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQGSFS--EPGNLPCA 170
            SG QLS +NE+ VM+W++ N   +L +L TS ++D  +L ++D+ Q  ++  E   +   
Sbjct: 369  SGHQLSVDNEISVMKWLSNNSRVMLNSLPTSKEEDALLLCAIDKIQDIYTAMELKKVLSD 428

Query: 169  VATELSSFLSSMGLHNGEIGESNS---RVQRCLDKWKLSVNWRLGYKKILVDCISFC 8
               E+ +FL + G+   + G   S   + +  + +WKL++ WRL YKK L DCIS+C
Sbjct: 429  FGGEVCTFLENYGVQCRQRGAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLADCISYC 485


>gb|ACU19071.1| unknown [Glycine max]
          Length = 497

 Score =  497 bits (1280), Expect = e-138
 Identities = 257/481 (53%), Positives = 341/481 (70%), Gaps = 8/481 (1%)
 Frame = -2

Query: 1420 AELGVSDAPL--NSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAAL 1247
            A+LG+SD+    N    S SSCLG +LS++ FP +GGRGL A RDLRRGE++LRVP++AL
Sbjct: 16   AQLGISDSTTRTNQPQHSLSSCLGSSLSVSHFPHSGGRGLGAVRDLRRGEIVLRVPKSAL 75

Query: 1246 MTRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLP 1067
            MTR++++E D+KL DA+NRHSSLSS QIL V LL E+ K K S W+PYL+HLP +YD+L 
Sbjct: 76   MTRETVME-DKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHPYLMHLPHTYDVLA 134

Query: 1066 TFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATIS 887
             FG FE  ALQV++A W TEKA+ K +S+WKEA +LM+ L FKPQ  TFKAW+ A+ATIS
Sbjct: 135  MFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVRAAATIS 194

Query: 886  SRTLHVPWDDAGCLCPVGDLFNYAPPG-EDFSDKDSVRSSGYASYKDNSSDDGDECEDST 710
            SRTLH+PWD+AGCLCPVGDLFNY  PG E    +D  R     S  D    +GD  ++  
Sbjct: 195  SRTLHIPWDEAGCLCPVGDLFNYDAPGIEPSGIEDLDRLLSNTSIPDTIVLNGD--KNIV 252

Query: 709  VNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILG 530
            V+ +Q +  S RLTDGG+EE   AYCFYA+++Y+KG QVLL YGTYTNLELLEHYGF+L 
Sbjct: 253  VDAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYGTYTNLELLEHYGFLLQ 312

Query: 529  NNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHL 350
             NPNDKVFIPL   +    S SW  +S  I +NG PSF+LL++LRLWATP N+RRSVGHL
Sbjct: 313  ENPNDKVFIPLEPALYS--STSWSKESLYIHHNGKPSFALLAALRLWATPQNRRRSVGHL 370

Query: 349  AFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQ--GSFSEPGNLP 176
             + GS++S +NE+ +M+W++  C  +L+NL T +++D  +L +MD  Q   +F E   L 
Sbjct: 371  VYFGSRVSTDNEIFIMKWLSKTCDAVLRNLPTFLEEDTLLLNAMDNSQDFSTFMEITKLV 430

Query: 175  CAVATELSSFLSSMGL---HNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
             +   E  +FL +  +   H+      + + +R +D+WKL+V WRL YKK+  DCIS+C 
Sbjct: 431  FS-REETYTFLETHNMKDTHSFTDVILSRKARRSMDRWKLAVQWRLKYKKVTFDCISYCN 489

Query: 4    R 2
            +
Sbjct: 490  K 490


>ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Capsella rubella]
            gi|482558148|gb|EOA22340.1| hypothetical protein
            CARUB_v10002957mg [Capsella rubella]
          Length = 503

 Score =  494 bits (1273), Expect = e-137
 Identities = 257/482 (53%), Positives = 337/482 (69%), Gaps = 11/482 (2%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            A++G+SD+  +S  S   SCLGH+LS+A FP AGGRGL A R+LR+GEL+L+VPR ALMT
Sbjct: 16   ADIGISDSIDSSRCSD--SCLGHSLSVADFPLAGGRGLRAVRELRKGELVLKVPRNALMT 73

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
             +S++  D+KL+DA+N H SLSSTQIL+V LL E+SK K+SFWYPYL+HLPR YDLL TF
Sbjct: 74   TESMVANDQKLNDAVNLHGSLSSTQILSVCLLYEMSKGKKSFWYPYLVHLPRDYDLLATF 133

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            G FE +ALQVEDA W TEKA +K +S+WKEA  LM++L  KP+  +F+AWLWASATISSR
Sbjct: 134  GEFEKQALQVEDAVWVTEKATAKCQSEWKEAGTLMKELDLKPKFQSFQAWLWASATISSR 193

Query: 880  TLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTVNE 701
            TLH+PWD AGCLCP GDLFNY  PG+D +  +   S+   S    +S    EC +   NE
Sbjct: 194  TLHIPWDSAGCLCPAGDLFNYDAPGDDLNYSEGPESAIQTSSPQPASITNLECRN---NE 250

Query: 700  DQC----NGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFIL 533
            ++        S RLTDGG+EE   AYC YA++NY+ G+QVLL YGTYTNLELLEHYGF+L
Sbjct: 251  EEAGLNVEIQSERLTDGGFEEDANAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFML 310

Query: 532  GNNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQR-RSVG 356
              N NDKVFIPL   +  S + SWP DS  I  +G PSF+L+S+LRLW  P +QR +SV 
Sbjct: 311  EENSNDKVFIPLETSLY-SLASSWPKDSLYIHQDGKPSFALVSTLRLWLVPQSQRDKSVM 369

Query: 355  HLAFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQG-SFSEPGNL 179
             L ++GSQ+S +NE++VM+W++  CG++L+NL TS+ +D  +L ++D+ Q          
Sbjct: 370  RLVYAGSQISVKNEILVMKWMSEKCGSVLRNLPTSVSEDNLLLHNIDKLQDPKIRLEQKE 429

Query: 178  PCAVATELSSFLSSMGL-----HNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCIS 14
              A  +E+ +FL    L      +G+  E   R  R + KW+LSV WRL YK+ L DCI 
Sbjct: 430  TEAFGSEMRAFLDVNRLWDVIGFSGKDVEFPRRTNRMMSKWRLSVQWRLSYKRTLADCIY 489

Query: 13   FC 8
            +C
Sbjct: 490  YC 491


>gb|EXC05430.1| Protein SET DOMAIN GROUP 40 [Morus notabilis]
          Length = 508

 Score =  494 bits (1272), Expect = e-137
 Identities = 262/510 (51%), Positives = 347/510 (68%), Gaps = 37/510 (7%)
 Frame = -2

Query: 1420 AELGVSDAPLN-SATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALM 1244
            +E+G+S++P++ S  S  SSCL H+L ++ FPDAGGRGLAA R LRRGEL+LRVP++ALM
Sbjct: 17   SEIGISNSPISLSDRSCLSSCLCHSLFVSHFPDAGGRGLAAARPLRRGELVLRVPKSALM 76

Query: 1243 TRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPT 1064
            TR+S L KD++ S  +N  SSLS  QIL V LL E++K + S+WYPYL++LPR YD+L T
Sbjct: 77   TRES-LSKDQRFSIVVNAPSSLSPIQILIVGLLYEMNKGRSSWWYPYLVNLPRGYDILAT 135

Query: 1063 FGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASAT--- 893
            FG FE +ALQV+DA W  EKA  K ES+WKEA  LM++L  KPQ +TF+AWLWASAT   
Sbjct: 136  FGEFEKQALQVDDAIWTAEKATLKAESEWKEANPLMKELNLKPQFLTFRAWLWASATFTL 195

Query: 892  ----------------------------ISSRTLHVPWDDAGCLCPVGDLFNYAPPGEDF 797
                                        ISSRTLHVPWD+AGCLCPVGDLFNY  PGE+ 
Sbjct: 196  TEFHHHFNIIIPNVESNDVKFYASTLIKISSRTLHVPWDEAGCLCPVGDLFNYVAPGEE- 254

Query: 796  SDKDSVRSSGYASYKDNSSDDGDECEDSTVNEDQCNGCSGRLTDGGYEEHLAAYCFYAKK 617
               DS                       T++ +Q +  S RLTDGG+EE + AYCFYA++
Sbjct: 255  ---DSAH---------------------TLDLEQLDSHSQRLTDGGFEEDVVAYCFYARR 290

Query: 616  NYRKGQQVLLIYGTYTNLELLEHYGFILGNNPNDKVFIPLPHEVIGSCSCSWPADSFCIQ 437
            +Y KG+QVLL YGTYTNLELLEHYGF+L +N N+KVFIPL  E+    S +WP DS  I 
Sbjct: 291  HYEKGEQVLLGYGTYTNLELLEHYGFLLNDNSNEKVFIPLQPEICS--SNTWPKDSMFIH 348

Query: 436  YNGIPSFSLLSSLRLWATPPNQRRSVGHLAFSGSQLSPENEVIVMQWIATNCGTILQNLA 257
             +G PSF+LLS+LR+WATP NQRR   HLA+SGSQLS ENE++VM+WI+ NC  IL++L 
Sbjct: 349  QSGKPSFALLSALRIWATPRNQRRPASHLAYSGSQLSAENEILVMRWISKNCNCILKSLP 408

Query: 256  TSIDDDLKVLASMDEFQGSFS--EPGNLPCAVATELSSFLSSMGLHNGE-IGE--SNSRV 92
            TS ++D  +L+++D+ Q S S  E  N   +    + +FL + GL +GE + E  S+ + 
Sbjct: 409  TSFEEDRFLLSAIDKMQDSCSPLELRNTVASSTAHIHAFLEANGLQDGEDVAELLSSRKT 468

Query: 91   QRCLDKWKLSVNWRLGYKKILVDCISFCTR 2
            +R +D+W+L++ WR+ YK+IL++CIS C+R
Sbjct: 469  KREMDRWRLAIQWRVRYKEILINCISHCSR 498


>gb|EOY03097.1| SET domain group 40, putative isoform 1 [Theobroma cacao]
          Length = 498

 Score =  493 bits (1270), Expect = e-137
 Identities = 251/474 (52%), Positives = 326/474 (68%), Gaps = 2/474 (0%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            A LGVSD+P         SCLGH+L ++ FPDAGGRGL A RD+ RGEL+L+VP++AL+T
Sbjct: 37   AGLGVSDSP----NPDSCSCLGHSLGVSYFPDAGGRGLGAVRDITRGELLLKVPKSALIT 92

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
              S+L  DE+LS A+  H SLS  Q+L +  L E+SK K S W+PYL+HLPRSY +L  F
Sbjct: 93   THSLLN-DERLSTALKAHPSLSPAQVLTICFLYEMSKGKASPWHPYLLHLPRSYGILAAF 151

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            G FE +ALQV+ A WA +KA+SK E +WK+A  LM++LK K Q +TF+AW+WA+ TISSR
Sbjct: 152  GEFEKQALQVDYAIWAAQKALSKAEYEWKKATPLMKELKLKLQFLTFRAWIWATGTISSR 211

Query: 880  TLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTVNE 701
            TLH+PWD+AGCLCPVGDLFNYA PGED                 N  D+ D  ++    +
Sbjct: 212  TLHIPWDEAGCLCPVGDLFNYAAPGEDL----------------NGFDNVDNLQNGYALD 255

Query: 700  DQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNNP 521
            D     S RLTDG +EE  AAYCFYAK NY+KG+QVLL YGTYTNLELLE+YGF+L +NP
Sbjct: 256  DLDTQHSQRLTDGAFEEDAAAYCFYAKTNYKKGEQVLLSYGTYTNLELLEYYGFLLEDNP 315

Query: 520  NDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLAFS 341
            N+KVFIPL  ++    S SWP DS  I  NG PSF+L+++LR+WATPP QR+S+ H A+S
Sbjct: 316  NEKVFIPLEPDI--HSSSSWPNDSLYIHQNGRPSFALMAALRVWATPPYQRKSIRHQAYS 373

Query: 340  GSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQ--GSFSEPGNLPCAV 167
            GSQLS +NE+ VM WIA  C   L+ + TSI+DD  +L+  D+ Q   +  E G    A 
Sbjct: 374  GSQLSQDNEISVMTWIAKKCHATLKAMPTSIEDDNLLLSFTDKIQEFDNLWEWGKAMPAF 433

Query: 166  ATELSSFLSSMGLHNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
              E  + L +  L   +   ++ R +  +D+WKL+V+WRL YKK+LVDCIS+CT
Sbjct: 434  GGEFCNLLQATNLKRNDESFASRRAKMLIDRWKLAVHWRLIYKKVLVDCISYCT 487


>ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatula]
            gi|355517485|gb|AES99108.1| Protein SET DOMAIN GROUP
            [Medicago truncatula]
          Length = 532

 Score =  492 bits (1266), Expect = e-136
 Identities = 267/522 (51%), Positives = 349/522 (66%), Gaps = 51/522 (9%)
 Frame = -2

Query: 1420 AELGVSDAPL-NSATSSYS-SCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAAL 1247
            + LG+SD+P  N+  S +S S LGH+L ++ FP +GGRGL A RDL+RGE+ILRVP++AL
Sbjct: 16   SHLGISDSPTTNTDQSQHSLSSLGHSLCVSTFPHSGGRGLGAVRDLKRGEIILRVPKSAL 75

Query: 1246 MTRDSILEKDEKLSDAINRHSSLSSTQ-------------------------------IL 1160
            MT +S++ +D+KL  A+NRHSSLSS Q                               IL
Sbjct: 76   MTSESVIMEDKKLCLAVNRHSSLSSVQRNTPNPKRCHVTERSRVQVLETASCVKQGKAIL 135

Query: 1159 AVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTFGPFELKALQVEDATWATEKAISKVESD 980
             V LL EV K K S W+PYL+HLP+SYDLL  FG FE +ALQV++A W TEKA+ K +S+
Sbjct: 136  TVCLLYEVGKGKTSRWHPYLVHLPQSYDLLAMFGEFEKQALQVDEAMWVTEKAVQKAKSE 195

Query: 979  WKEAIALMEQLKFKPQLVTFKAWLWASAT-------------ISSRTLHVPWDDAGCLCP 839
            WKEA ALME L FKPQL+TFKAW+WA+AT             ISSRTLH+PWD+AGCLCP
Sbjct: 196  WKEAHALMEDLMFKPQLLTFKAWVWAAATGRTVPETFHLPGLISSRTLHIPWDEAGCLCP 255

Query: 838  VGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTVNEDQCNGCSGRLTDGG 659
            VGDLFNY  PGE+ S  + V         D+   +GD   +  ++E Q +  S RLTDGG
Sbjct: 256  VGDLFNYDAPGEELSGVEDV---------DHFLSNGD--MNVVIDEGQIDFNSQRLTDGG 304

Query: 658  YEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNNPNDKVFIPLPHEVIG 479
            +EE   AYCFYA+ NY+KG QVLL YGTYTNLELLEHYGF+L  NPNDK+FIPL  E   
Sbjct: 305  FEEDANAYCFYARTNYKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKIFIPL--EPAM 362

Query: 478  SCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLAFSGSQLSPENEVIVMQ 299
              S SW  +S  I  NG PSF+LL++LRLWATP N+RRS+GHLA+SGSQLS +NE+IVM+
Sbjct: 363  YTSTSWSKESLYIHPNGKPSFALLAALRLWATPHNKRRSIGHLAYSGSQLSADNEIIVMK 422

Query: 298  WIATNCGTILQNLATSIDDDLKVLASMDEFQG--SFSEPGNLPCAVATELSSFLSSMGLH 125
            W++  C  +L+N+ TSI+DD  +L ++D  Q   +F +   L  +   E+ +FL +  + 
Sbjct: 423  WLSKTCDAVLKNMPTSIEDDTLLLNALDCSQDFITFMKIVKL-MSSRDEVYTFLEAHNIT 481

Query: 124  NGEI---GESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFC 8
            +        S+ + +R +D+WKL+V WRL YK++LVDCIS+C
Sbjct: 482  DALSFCDTISSKKTRRSMDRWKLAVLWRLRYKRVLVDCISYC 523


>emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score =  491 bits (1263), Expect = e-136
 Identities = 258/476 (54%), Positives = 314/476 (65%), Gaps = 5/476 (1%)
 Frame = -2

Query: 1417 ELGVSDAPLNSATSSYS-----SCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRA 1253
            ELG+SD      T          C+GH+L ++ FP AGGRGLAA RDL +GELIL VP++
Sbjct: 10   ELGISDFTTTPTTVPSRLQIPHCCVGHSLCVSHFPHAGGRGLAAARDLSQGELILTVPKS 69

Query: 1252 ALMTRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDL 1073
            ALMT  S+L KDEKLS A+ RH+SLSS QIL + LLAE+SK K S+W+PYL+ LPRSYD 
Sbjct: 70   ALMTSQSLL-KDEKLSVAVKRHTSLSSPQILTICLLAEMSKGKSSWWHPYLMQLPRSYDT 128

Query: 1072 LPTFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASAT 893
            L  F  FE +ALQV+DA W TE+AI K E +WK+AI LME+LK KPQL  F+AWLWAS+T
Sbjct: 129  LANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIPLMEELKLKPQLQNFRAWLWASST 188

Query: 892  ISSRTLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDS 713
            +SSRT+H+PWDDAGCLCPVGD +NYA PGE+          G+   KD   DD       
Sbjct: 189  VSSRTMHIPWDDAGCLCPVGDFYNYAAPGEE--------PCGWEDLKDAEQDD------- 233

Query: 712  TVNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFIL 533
                      S RLTDGGY+E LAAYCFYA+KNY+KG+QVLL YGTYTNLELLEHYGF+L
Sbjct: 234  --------VLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFLL 285

Query: 532  GNNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGH 353
              NPNDK FIPL  EV    S SWP DS  I  NG PSF+LLS+LRLWATP +QRRSVGH
Sbjct: 286  DENPNDKAFIPLEPEVY--ASSSWPKDSLYIHQNGKPSFALLSALRLWATPASQRRSVGH 343

Query: 352  LAFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQGSFSEPGNLPC 173
            L +SG+QLS ENE+ VM+WIA +C  +L+NL TS+++D  +L+                 
Sbjct: 344  LVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSVEEDSLLLS----------------- 386

Query: 172  AVATELSSFLSSMGLHNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
                                          +++WKL+V WRL +K+ILVDCIS CT
Sbjct: 387  ------------------------------MERWKLAVQWRLRHKRILVDCISRCT 412


>ref|XP_006481945.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X1 [Citrus
            sinensis] gi|568856762|ref|XP_006481946.1| PREDICTED:
            protein SET DOMAIN GROUP 40-like isoform X2 [Citrus
            sinensis] gi|568856764|ref|XP_006481947.1| PREDICTED:
            protein SET DOMAIN GROUP 40-like isoform X3 [Citrus
            sinensis]
          Length = 503

 Score =  488 bits (1256), Expect = e-135
 Identities = 250/477 (52%), Positives = 333/477 (69%), Gaps = 6/477 (1%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            AE+G++D+ + + + S  +CLGH+L+++ FP+AGGRGLAA RDL +GELILRVP+ AL T
Sbjct: 16   AEMGITDSTIQNPSRS-RNCLGHSLTVSHFPEAGGRGLAAARDLTKGELILRVPKTALFT 74

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
             + +L+ D+KLS A+NRH  LS +QIL V LL EV K K S W+ YL+ LPR Y++L TF
Sbjct: 75   TECLLKSDQKLSLAVNRHLFLSPSQILIVCLLYEVGKGKSSRWHAYLMLLPRCYEILATF 134

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            GPFE +ALQV+DA WA EKA+SK ES+WK+AI LME+LK KPQL++FKAWLWASAT+SSR
Sbjct: 135  GPFEKQALQVDDAIWAAEKAVSKAESEWKQAIKLMEELKLKPQLLSFKAWLWASATVSSR 194

Query: 880  TLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRS-SGYASYKDNSSDDGDECEDSTVN 704
            T+H+ WD+AGCLCPVGDLFNYA PGE       +    G+         D  +  DS   
Sbjct: 195  TMHISWDEAGCLCPVGDLFNYAAPGEGEESNIGIEDVEGWMPAPCLPKGDTTDVLDSEKF 254

Query: 703  EDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNN 524
             D  +    RLTDG +EE + +YCFYA+ NY++G+QVLL YGTYTNLELLEHYGF+L  N
Sbjct: 255  NDHLH----RLTDGRFEEDVNSYCFYARNNYKRGKQVLLSYGTYTNLELLEHYGFLLNEN 310

Query: 523  PNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHLAF 344
            PNDKVFI L   +     CSWP +S  +  +G PSF+LLS+LRLW TP NQRRSVGHLA+
Sbjct: 311  PNDKVFISLEPGMYS--GCSWPRESQYVDQDGKPSFALLSALRLWMTPANQRRSVGHLAY 368

Query: 343  SGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQ--GSFSEPGNLPCA 170
            SG QLS  NE+ VM+ ++ NC  +L +L TS ++D  +L ++D+ Q   + +E   +   
Sbjct: 369  SGYQLSVNNEISVMKCLSNNCCVMLNSLPTSKEEDALLLCAIDKIQDINTATELKKVLSD 428

Query: 169  VATELSSFLSSMGLHNGEIGESNS---RVQRCLDKWKLSVNWRLGYKKILVDCISFC 8
               E+S+FL +  +   + G   S   + +  + +WKL++ WRL YKK L DCIS+C
Sbjct: 429  FGGEVSTFLENYYVQCRQRGAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLADCISYC 485


>ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
            gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein
            SET DOMAIN GROUP 40 gi|34222078|gb|AAQ62875.1| At5g17240
            [Arabidopsis thaliana] gi|51969984|dbj|BAD43684.1|
            unknown protein [Arabidopsis thaliana]
            gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40
            [Arabidopsis thaliana]
          Length = 491

 Score =  487 bits (1254), Expect = e-135
 Identities = 256/478 (53%), Positives = 338/478 (70%), Gaps = 7/478 (1%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            AE+G+SD+  +S      SCLGH+LS++ FPDAGGRGL A R+L++GEL+L+VPR ALMT
Sbjct: 16   AEIGISDSIDSSRFRD--SCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVPRKALMT 73

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
             +SI+ KD KLSDA+N H+SLSSTQIL+V LL E+SK K+SFWYPYL H+PR YDLL TF
Sbjct: 74   TESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDLLATF 133

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            G FE +ALQVEDA WATEKA +K +S+WKEA +LM++L+ KP+  +F+AWLWASATISSR
Sbjct: 134  GNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWASATISSR 193

Query: 880  TLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDSTVNE 701
            TLHVPWD AGCLCPVGDLFNY  PG D+S+      S      +N  + G   E      
Sbjct: 194  TLHVPWDSAGCLCPVGDLFNYDAPG-DYSNTPQGPESA-----NNVEEAGLVVETH---- 243

Query: 700  DQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILGNNP 521
                  S RLTDGG+EE + AYC YA++NY+ G+QVLL YGTYTNLELLEHYGF+L  N 
Sbjct: 244  ------SERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENS 297

Query: 520  NDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQR-RSVGHLAF 344
            NDKVFIPL   +  S + SWP DS  I  +G  SF+L+S+LRLW  P +QR +SV  L +
Sbjct: 298  NDKVFIPLETSLF-SLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVY 356

Query: 343  SGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQG-SFSEPGNLPCAV 167
            +GSQ+S +NE++VM+W++  CG++L++L TS+ +D  +L ++D+ Q            A 
Sbjct: 357  AGSQISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLEQKETEAF 416

Query: 166  ATELSSFLSS-----MGLHNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFC 8
             +E+ +FL +     + + +G+  E + +  R L KW+ SV WRL YK+ L DCIS+C
Sbjct: 417  GSEVRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYC 474


>ref|XP_006596495.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X2 [Glycine max]
          Length = 483

 Score =  485 bits (1249), Expect = e-134
 Identities = 252/481 (52%), Positives = 336/481 (69%), Gaps = 8/481 (1%)
 Frame = -2

Query: 1420 AELGVSDAPL--NSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAAL 1247
            A+LG+SD+    N    S SSCLG +LS++ FP +GGRGL A RDLRRGE++LRVP++AL
Sbjct: 16   AQLGISDSTTRTNQPQHSLSSCLGSSLSVSHFPHSGGRGLGAVRDLRRGEIVLRVPKSAL 75

Query: 1246 MTRDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLP 1067
            MTR++++E D+KL DA+NRHSSLSS QIL V LL E+ K K S W+PYL+HLP +YD   
Sbjct: 76   MTRETVME-DKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHPYLMHLPHTYD--- 131

Query: 1066 TFGPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATIS 887
                       V++A W TEKA+ K +S+WKEA +LM+ L FKPQ  TFKAW+WA+ATIS
Sbjct: 132  -----------VDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVWAAATIS 180

Query: 886  SRTLHVPWDDAGCLCPVGDLFNYAPPG-EDFSDKDSVRSSGYASYKDNSSDDGDECEDST 710
            SRTLH+PWD+AGCLCPVGDLFNY  PG E    +D  R     S  D    +GD  ++  
Sbjct: 181  SRTLHIPWDEAGCLCPVGDLFNYDAPGIEPSGIEDLDRLLSNTSIPDTIVLNGD--KNIM 238

Query: 709  VNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFILG 530
            V+ +Q +  S RLTDGG+EE   AYCFYA+++Y+KG QVLL YGTYTNLELLEHYGF+L 
Sbjct: 239  VDAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYGTYTNLELLEHYGFLLQ 298

Query: 529  NNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQRRSVGHL 350
             NPNDKVFIPL   +    S SW  +S  I +NG PSF+LL++LRLWATP N+RRSVGHL
Sbjct: 299  ENPNDKVFIPLEPALYS--STSWSKESLYIHHNGKPSFALLAALRLWATPQNRRRSVGHL 356

Query: 349  AFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQ--GSFSEPGNLP 176
             +SGS++S +NE+ +M+W++  C  +L+NL TS+++D  +L +MD  Q   +F E   L 
Sbjct: 357  VYSGSRVSTDNEIFIMKWLSKTCDAVLRNLPTSLEEDTLLLNAMDNSQDFSTFMEITKL- 415

Query: 175  CAVATELSSFLSSMGL---HNGEIGESNSRVQRCLDKWKLSVNWRLGYKKILVDCISFCT 5
             +   E  +FL +  +   H+      + + +R +D+WKL+V WRL YKK++ DCIS+C 
Sbjct: 416  VSSREETYTFLETHNMKDTHSFTDVILSRKARRSMDRWKLAVQWRLKYKKVIFDCISYCN 475

Query: 4    R 2
            +
Sbjct: 476  K 476


>ref|XP_006400256.1| hypothetical protein EUTSA_v10015946mg [Eutrema salsugineum]
            gi|557101346|gb|ESQ41709.1| hypothetical protein
            EUTSA_v10015946mg [Eutrema salsugineum]
          Length = 506

 Score =  481 bits (1238), Expect = e-133
 Identities = 255/489 (52%), Positives = 332/489 (67%), Gaps = 18/489 (3%)
 Frame = -2

Query: 1420 AELGVSDAPLNSATSSYSSCLGHALSLACFPDAGGRGLAATRDLRRGELILRVPRAALMT 1241
            AELG+SD+    ++ S  SCLGH+LS+A FP AGGRGL A R+LR+GEL+L+VPR AL+T
Sbjct: 16   AELGLSDSI--DSSRSLDSCLGHSLSVADFPLAGGRGLGAVRELRKGELVLKVPRNALLT 73

Query: 1240 RDSILEKDEKLSDAINRHSSLSSTQILAVLLLAEVSKRKRSFWYPYLIHLPRSYDLLPTF 1061
             +S++ KD+KL DAIN H S+SSTQ L V LL E+SK K+SFWYPYL+HLPR YDL  TF
Sbjct: 74   TESMVAKDQKLRDAINLHGSISSTQRLGVCLLYEMSKGKKSFWYPYLVHLPRDYDLSSTF 133

Query: 1060 GPFELKALQVEDATWATEKAISKVESDWKEAIALMEQLKFKPQLVTFKAWLWASATISSR 881
            G FE +ALQVEDA WA EKAI+K +S+WKEA+ LM+ L  KP+  + +AWLWASATISSR
Sbjct: 134  GEFEKQALQVEDAVWAAEKAIAKSQSEWKEAVTLMKVLDLKPKFQSLQAWLWASATISSR 193

Query: 880  TLHVPWDDAGCLCPVGDLFNYAPPGEDFSDKDSVRSSGYASYKDNSSDDGDECEDST--- 710
            TLH+PWD AGCLCPVGDLFNY  PG+D +  +        S     S    EC ++    
Sbjct: 194  TLHIPWDSAGCLCPVGDLFNYDAPGDDLNTSEGPELVIQTSSPKPVSTTHHECRNNAEEA 253

Query: 709  --VNEDQCNGCSGRLTDGGYEEHLAAYCFYAKKNYRKGQQVLLIYGTYTNLELLEHYGFI 536
              V E Q    S RLTDGG++E   AYC YA++NY+ G+QVLL YGTYTNLELLEHYGF+
Sbjct: 254  GHVVETQ----SERLTDGGFDEDANAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFM 309

Query: 535  LGNNPNDKVFIPLPHEVIGSCSCSWPADSFCIQYNGIPSFSLLSSLRLWATPPNQR-RSV 359
            L  N NDKVFIPL   +  S + SWP DS  I  +G PSF+L+S+LRLW  P NQR ++ 
Sbjct: 310  LEENSNDKVFIPLETSLY-SLASSWPKDSLYIHQDGKPSFALVSTLRLWLIPQNQRDKTA 368

Query: 358  GHLAFSGSQLSPENEVIVMQWIATNCGTILQNLATSIDDDLKVLASMDEFQG-SFSEPGN 182
              L ++GSQ+S +NE++VM+W++  CG +L++L TS+ +D  +L  +   Q         
Sbjct: 369  MRLVYAGSQISVKNEILVMKWMSDKCGRVLRDLPTSLLEDTVLLQDIKNLQDPEVCLKQK 428

Query: 181  LPCAVATELSSFL-----------SSMGLHNGEIGESNSRVQRCLDKWKLSVNWRLGYKK 35
               A  +E+ +FL             +GL +G+  E + +  R + KW+LSV WRL YK+
Sbjct: 429  ETEAFGSEVRAFLDVNHLWDLINGDVIGL-SGKAVEFSRKTNRIISKWRLSVQWRLRYKR 487

Query: 34   ILVDCISFC 8
             LVDCIS+C
Sbjct: 488  TLVDCISYC 496

