BLASTX nr result
ID: Rehmannia23_contig00001485
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00001485
         (1130 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ16490.1| hypothetical protein PRUPE_ppa004975mg [Prunus pe...   372   e-115
ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Popu...   377   e-115
ref|XP_006430400.1| hypothetical protein CICLE_v10011537mg [Citr...   369   e-112
ref|XP_006430399.1| hypothetical protein CICLE_v10011537mg [Citr...   369   e-112
ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   361   e-111
ref|XP_006481945.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   364   e-110
ref|XP_006481948.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   364   e-110
ref|XP_004232670.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   352   e-109
ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   359   e-109
ref|XP_006348182.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   346   e-106
ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like ...   389   e-105
gb|EXC05430.1| Protein SET DOMAIN GROUP 40 [Morus notabilis]          335   e-105
gb|EOY03097.1| SET domain group 40, putative isoform 1 [Theobrom...   347   e-105
ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatul...   348   e-104
ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Caps...   332   e-104
ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   333   e-102
ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thalia...   330   e-101
ref|XP_002871756.1| SET domain-containing protein [Arabidopsis l...   324   e-101
ref|XP_006400256.1| hypothetical protein EUTSA_v10015946mg [Eutr...
329 e-100 ref|XP_004167843.1| PREDICTED: protein SET DOMAIN GROUP 40-like ... 331 e-100 >gb|EMJ16490.1| hypothetical protein PRUPE_ppa004975mg [Prunus persica] Length = 483 Score = 372 bits (956), Expect(2) = e-115 Identities = 188/337 (55%), Positives = 238/337 (70%), Gaps = 9/337 (2%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 AEKA LK + EW+EA LM +L LKP+L TF AWLWASATISSRT+HIPWD AGCLCPVG Sbjct: 149 AEKATLKAEYEWKEANALMKQLKLKPQLLTFKAWLWASATISSRTLHIPWDAAGCLCPVG 208 Query: 804 DYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGN--SMEQLSDANTDRLTDAGY 631 D FNY+ P E+P S H L E+ +EQL +++ RLTD G+ Sbjct: 209 DLFNYSAPGEEPSRCE--------SMEHTMHDLVNEDTSGMADVEQLV-SDSRRLTDGGF 259 Query: 630 DETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSL 451 ++ V +YCFYAK++Y KGEQVLLSYGTYTNLELLEHYGF+L ENPNDK +I LE E+YS Sbjct: 260 EKDVDAYCFYAKKSYKKGEQVLLSYGTYTNLELLEHYGFLLNENPNDKVYIPLEPEIYSS 319 Query: 450 CSWPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIA 271 CSWPKESL+I Q+GKPSFALLST+RLWATP +RRSV H+ YSG +S +NE+ ++ WI+ Sbjct: 320 CSWPKESLFIHQNGKPSFALLSTLRLWATPQNQRRSVGHLVYSGLHLSIQNEMFILRWIS 379 Query: 270 KKCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLD---CSTALCDEVRTFVESTKLDLCS 100 KKC +L + STS ++D+L++ ID+IQ+ D L+ S+ DE+ F K ++ Sbjct: 380 KKCTTILKNLSTSFEDDSLLLSAIDKIQNLDAPLELNNVSSTCRDEICAF----KANVLQ 435 Query: 99 KTRRSIY----RWKLAVQWRHRYKKILSDCISYCTKV 1 K RS RW+LAV+WR YKKIL DCISYC ++ Sbjct: 436 KGERSSMESKERWRLAVEWRLSYKKILVDCISYCDEI 472 Score = 71.6 bits (174), Expect(2) = e-115 Identities = 31/50 (62%), Positives = 42/50 (84%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLS TQIL++ LL E+ KG+ SWW+ YL+ LPR+YD+LA+FG+FE +ALQ Sbjct: 92 SLSPTQILAVCLLYEMGKGKISWWHPYLMNLPRSYDILATFGEFEKQALQ 141 >ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Populus trichocarpa] gi|550340570|gb|EEE85750.2| hypothetical protein POPTR_0004s07950g [Populus trichocarpa] Length = 518 Score = 377 bits (969), Expect(2) = e-115 Identities = 185/334 (55%), 
Positives = 229/334 (68%), Gaps = 9/334 (2%) Frame = -3 Query: 981 EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVGD 802 +KAV K K EW+EA LM L LKP+L TF AW+WASATISSR +HIPWD AGCLCPVGD Sbjct: 168 KKAVSKAKSEWKEANSLMDALKLKPQLLTFRAWIWASATISSRALHIPWDEAGCLCPVGD 227 Query: 801 YFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDET 622 FNY P E+ +L N S TS GE + + D +RLTD G++E Sbjct: 228 LFNYAAPGEESNDLENVVHLMNASSLEDTSLSNGETTDDFIGDQPDIGLERLTDGGFNEN 287 Query: 621 VASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSW 442 +A+YCFYA++NY KG QVLL YGTYTNLELLEHYGF+L ENPNDK FI LE MYS SW Sbjct: 288 MAAYCFYARKNYKKGTQVLLGYGTYTNLELLEHYGFLLNENPNDKVFIPLEPSMYSFISW 347 Query: 441 PKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKKC 262 PK S+YI QDGKPSFALLS +RLWATP +RRS+ H+ YSG R+S NEI+V++WI+K C Sbjct: 348 PKVSMYIHQDGKPSFALLSALRLWATPPNQRRSISHLVYSGSRLSVYNEISVLKWISKNC 407 Query: 261 QVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALCDEVRTFVESTKLD--------- 109 ++LS+ T I+ED+L++ I++I+++D + E R F+E++ L Sbjct: 408 ALILSNLPTVIEEDSLLLSTINKIENFDKPTELVCTSGGEARAFLEASDLQKGKNGSELM 467 Query: 108 LCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCT 7 KT+R I RWKLAVQWR YKK L DCISYCT Sbjct: 468 FSGKTKRVIERWKLAVQWRISYKKTLIDCISYCT 501 Score = 64.7 bits (156), Expect(2) = e-115 Identities = 28/41 (68%), Positives = 35/41 (85%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASF 1008 SLS TQIL++ LL E+ KG+SSWWY YL+ LPR+YD+LASF Sbjct: 127 SLSPTQILAVCLLYEMGKGKSSWWYPYLMHLPRSYDVLASF 167 >ref|XP_006430400.1| hypothetical protein CICLE_v10011537mg [Citrus clementina] gi|557532457|gb|ESR43640.1| hypothetical protein CICLE_v10011537mg [Citrus clementina] Length = 503 Score = 369 bits (946), Expect(2) = e-112 Identities = 190/338 (56%), Positives = 235/338 (69%), Gaps = 13/338 (3%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 AEKAV K + EW++A LM EL LKP+L +F AWLWASAT+SSRTMHI WD AGCLCPVG Sbjct: 151 AEKAVSKAESEWKQAIKLMEELKLKPQLLSFKAWLWASATVSSRTMHISWDEAGCLCPVG 210 Query: 804 
DYFNYTPPEE-DPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYD 628 D FNY P E + N+ V G + +G+ + + + RLTD ++ Sbjct: 211 DLFNYAAPGEGEESNIGIEDVEG---WMPAPCLPKGDTTDVLDSEKFNGHLRRLTDGRFE 267 Query: 627 ETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLC 448 E V SYCFYA+ NY +GEQVLLSYGTYTNLELLEHYGF+L ENPNDK FISLE MYS C Sbjct: 268 EDVNSYCFYARNNYKRGEQVLLSYGTYTNLELLEHYGFLLNENPNDKVFISLEPGMYSCC 327 Query: 447 SWPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAK 268 SWP+ES YI Q+GKPSFALLS +RLW TP +RRSV H+AYSG ++S +NEI+VM+W++ Sbjct: 328 SWPRESQYIDQNGKPSFALLSALRLWMTPANQRRSVGHLAYSGHQLSVDNEISVMKWLSN 387 Query: 267 KCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALCD---EVRTFVES-------- 121 +VML+S TS +EDAL++ ID+IQD ++ L D EV TF+E+ Sbjct: 388 NSRVMLNSLPTSKEEDALLLCAIDKIQDIYTAMELKKVLSDFGGEVCTFLENYGVQCRQR 447 Query: 120 -TKLDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYC 10 KL L KT+ S+ RWKLA+QWR RYKK L+DCISYC Sbjct: 448 GAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLADCISYC 485 Score = 65.5 bits (158), Expect(2) = e-112 Identities = 31/49 (63%), Positives = 39/49 (79%) Frame = -1 Query: 1127 LSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 LS +QIL + LL EV KG+SS WYTYL+ LPR Y++LA+FG FE +ALQ Sbjct: 95 LSPSQILIVCLLYEVGKGKSSRWYTYLMLLPRCYEILATFGPFEKQALQ 143 >ref|XP_006430399.1| hypothetical protein CICLE_v10011537mg [Citrus clementina] gi|557532456|gb|ESR43639.1| hypothetical protein CICLE_v10011537mg [Citrus clementina] Length = 489 Score = 369 bits (946), Expect(2) = e-112 Identities = 190/338 (56%), Positives = 235/338 (69%), Gaps = 13/338 (3%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 AEKAV K + EW++A LM EL LKP+L +F AWLWASAT+SSRTMHI WD AGCLCPVG Sbjct: 137 AEKAVSKAESEWKQAIKLMEELKLKPQLLSFKAWLWASATVSSRTMHISWDEAGCLCPVG 196 Query: 804 DYFNYTPPEE-DPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYD 628 D FNY P E + N+ V G + +G+ + + + RLTD ++ Sbjct: 197 DLFNYAAPGEGEESNIGIEDVEG---WMPAPCLPKGDTTDVLDSEKFNGHLRRLTDGRFE 253 Query: 627 
ETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLC 448 E V SYCFYA+ NY +GEQVLLSYGTYTNLELLEHYGF+L ENPNDK FISLE MYS C Sbjct: 254 EDVNSYCFYARNNYKRGEQVLLSYGTYTNLELLEHYGFLLNENPNDKVFISLEPGMYSCC 313 Query: 447 SWPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAK 268 SWP+ES YI Q+GKPSFALLS +RLW TP +RRSV H+AYSG ++S +NEI+VM+W++ Sbjct: 314 SWPRESQYIDQNGKPSFALLSALRLWMTPANQRRSVGHLAYSGHQLSVDNEISVMKWLSN 373 Query: 267 KCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALCD---EVRTFVES-------- 121 +VML+S TS +EDAL++ ID+IQD ++ L D EV TF+E+ Sbjct: 374 NSRVMLNSLPTSKEEDALLLCAIDKIQDIYTAMELKKVLSDFGGEVCTFLENYGVQCRQR 433 Query: 120 -TKLDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYC 10 KL L KT+ S+ RWKLA+QWR RYKK L+DCISYC Sbjct: 434 GAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLADCISYC 471 Score = 65.5 bits (158), Expect(2) = e-112 Identities = 31/49 (63%), Positives = 39/49 (79%) Frame = -1 Query: 1127 LSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 LS +QIL + LL EV KG+SS WYTYL+ LPR Y++LA+FG FE +ALQ Sbjct: 81 LSPSQILIVCLLYEVGKGKSSRWYTYLMLLPRCYEILATFGPFEKQALQ 129 >ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Fragaria vesca subsp. 
vesca] Length = 511 Score = 361 bits (927), Expect(2) = e-111 Identities = 182/340 (53%), Positives = 237/340 (69%), Gaps = 12/340 (3%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 A+KA+ K + EW+E LM +L LKP+L TF AWLWASAT+SSRT+HIPWD AGCLCPVG Sbjct: 160 ADKAISKAEFEWKETNTLMEQLKLKPQLRTFRAWLWASATVSSRTLHIPWDGAGCLCPVG 219 Query: 804 DYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDE 625 D FNY+ P ED + N + +T+ E + EQL D+++ RLTD ++ Sbjct: 220 DLFNYSAPVEDSDSDNVELRTHELALQDMTTVKEETSCILDNEQL-DSDSGRLTDGRFEN 278 Query: 624 TVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCS 445 V +YCFYAK++Y KGEQVLLSYGTYTNLELLEHYGF+L ENPNDKA++ LE E+YS CS Sbjct: 279 NVGAYCFYAKKSYRKGEQVLLSYGTYTNLELLEHYGFLLNENPNDKAYVPLEPEIYSSCS 338 Query: 444 WPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKK 265 WPKE LYI Q GKPSFALLS +RLWATP +RRSV H+AYSG ++S ENEI VM WI+ K Sbjct: 339 WPKEFLYIHQSGKPSFALLSALRLWATPANRRRSVGHLAYSGLQLSIENEIFVMRWISNK 398 Query: 264 CQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLD---CSTALCDEVRTF---------VES 121 C ++ + T+ +ED+L++ +ID+IQ+ + L+ S+ DE+ T+ +S Sbjct: 399 CNSIVKNLPTTFEEDSLLLSVIDKIQNVNAPLEFANISSVSTDEICTYRAEVLKKGATDS 458 Query: 120 TKLDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTKV 1 + +RS RW+LAVQWR YKKIL DCIS+C ++ Sbjct: 459 ETVVSRKTMQRSRERWRLAVQWRLSYKKILVDCISFCDEM 498 Score = 68.9 bits (167), Expect(2) = e-111 Identities = 29/50 (58%), Positives = 40/50 (80%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLS Q L + LL E+ KG++SWWY YL+ LPR+YD++A+FG+FE +ALQ Sbjct: 103 SLSPIQTLCVCLLYEMGKGKTSWWYPYLINLPRSYDIIATFGEFEKQALQ 152 >ref|XP_006481945.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X1 [Citrus sinensis] gi|568856762|ref|XP_006481946.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X2 [Citrus sinensis] gi|568856764|ref|XP_006481947.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X3 [Citrus sinensis] Length = 503 Score = 364 bits (935), Expect(2) = e-110 Identities = 189/338 (55%), 
Positives = 233/338 (68%), Gaps = 13/338 (3%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 AEKAV K + EW++A LM EL LKP+L +F AWLWASAT+SSRTMHI WD AGCLCPVG Sbjct: 151 AEKAVSKAESEWKQAIKLMEELKLKPQLLSFKAWLWASATVSSRTMHISWDEAGCLCPVG 210 Query: 804 DYFNYTPPEE-DPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYD 628 D FNY P E + N+ V G + +G+ + + + RLTD ++ Sbjct: 211 DLFNYAAPGEGEESNIGIEDVEG---WMPAPCLPKGDTTDVLDSEKFNDHLHRLTDGRFE 267 Query: 627 ETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLC 448 E V SYCFYA+ NY +G+QVLLSYGTYTNLELLEHYGF+L ENPNDK FISLE MYS C Sbjct: 268 EDVNSYCFYARNNYKRGKQVLLSYGTYTNLELLEHYGFLLNENPNDKVFISLEPGMYSGC 327 Query: 447 SWPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAK 268 SWP+ES Y+ QDGKPSFALLS +RLW TP +RRSV H+AYSG ++S NEI+VM+ ++ Sbjct: 328 SWPRESQYVDQDGKPSFALLSALRLWMTPANQRRSVGHLAYSGYQLSVNNEISVMKCLSN 387 Query: 267 KCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALCD---EVRTFVES-------- 121 C VML+S TS +EDAL++ ID+IQD + + L D EV TF+E+ Sbjct: 388 NCCVMLNSLPTSKEEDALLLCAIDKIQDINTATELKKVLSDFGGEVSTFLENYYVQCRQR 447 Query: 120 -TKLDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYC 10 KL L KT+ S+ RWKLA+QWR RYKK L+DCISYC Sbjct: 448 GAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLADCISYC 485 Score = 61.6 bits (148), Expect(2) = e-110 Identities = 29/49 (59%), Positives = 38/49 (77%) Frame = -1 Query: 1127 LSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 LS +QIL + LL EV KG+SS W+ YL+ LPR Y++LA+FG FE +ALQ Sbjct: 95 LSPSQILIVCLLYEVGKGKSSRWHAYLMLLPRCYEILATFGPFEKQALQ 143 >ref|XP_006481948.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X4 [Citrus sinensis] gi|568856768|ref|XP_006481949.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X5 [Citrus sinensis] Length = 489 Score = 364 bits (935), Expect(2) = e-110 Identities = 189/338 (55%), Positives = 233/338 (68%), Gaps = 13/338 (3%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 AEKAV K + EW++A LM EL LKP+L +F AWLWASAT+SSRTMHI WD AGCLCPVG Sbjct: 137 
AEKAVSKAESEWKQAIKLMEELKLKPQLLSFKAWLWASATVSSRTMHISWDEAGCLCPVG 196 Query: 804 DYFNYTPPEE-DPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYD 628 D FNY P E + N+ V G + +G+ + + + RLTD ++ Sbjct: 197 DLFNYAAPGEGEESNIGIEDVEG---WMPAPCLPKGDTTDVLDSEKFNDHLHRLTDGRFE 253 Query: 627 ETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLC 448 E V SYCFYA+ NY +G+QVLLSYGTYTNLELLEHYGF+L ENPNDK FISLE MYS C Sbjct: 254 EDVNSYCFYARNNYKRGKQVLLSYGTYTNLELLEHYGFLLNENPNDKVFISLEPGMYSGC 313 Query: 447 SWPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAK 268 SWP+ES Y+ QDGKPSFALLS +RLW TP +RRSV H+AYSG ++S NEI+VM+ ++ Sbjct: 314 SWPRESQYVDQDGKPSFALLSALRLWMTPANQRRSVGHLAYSGYQLSVNNEISVMKCLSN 373 Query: 267 KCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALCD---EVRTFVES-------- 121 C VML+S TS +EDAL++ ID+IQD + + L D EV TF+E+ Sbjct: 374 NCCVMLNSLPTSKEEDALLLCAIDKIQDINTATELKKVLSDFGGEVSTFLENYYVQCRQR 433 Query: 120 -TKLDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYC 10 KL L KT+ S+ RWKLA+QWR RYKK L+DCISYC Sbjct: 434 GAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLADCISYC 471 Score = 61.6 bits (148), Expect(2) = e-110 Identities = 29/49 (59%), Positives = 38/49 (77%) Frame = -1 Query: 1127 LSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 LS +QIL + LL EV KG+SS W+ YL+ LPR Y++LA+FG FE +ALQ Sbjct: 81 LSPSQILIVCLLYEVGKGKSSRWHAYLMLLPRCYEILATFGPFEKQALQ 129 >ref|XP_004232670.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Solanum lycopersicum] Length = 488 Score = 352 bits (902), Expect(2) = e-109 Identities = 178/338 (52%), Positives = 221/338 (65%), Gaps = 10/338 (2%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 A+KA K ++EW E T LM EL LKP+ AWLWAS +ISSRTMHIPWD AGCLCPVG Sbjct: 152 AQKASRKAEQEWNEVTQLMHELKLKPQFLALKAWLWASGSISSRTMHIPWDEAGCLCPVG 211 Query: 804 DYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDE 625 D+FNY PEE+ ++ + G F S+L+ E E +S T RL DAGY++ Sbjct: 212 DFFNYAAPEEET-SIYEDQGAGKPYFMQENSTLKSETELDS--------TTRLIDAGYEK 262 Query: 624 
TVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCS 445 V+SY FYA+RNY KG+QVLLSYGTYTNLELL+HYGF+L ENPNDKAFI LE +MYSLCS Sbjct: 263 DVSSYHFYARRNYRKGDQVLLSYGTYTNLELLQHYGFLLTENPNDKAFIPLEPDMYSLCS 322 Query: 444 WPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKK 265 W ESLYI DGKPSFALLST+R WA P T R+SV H+ YSG R+S E+E+ M W+ K Sbjct: 323 WDNESLYIHPDGKPSFALLSTLRFWAVPKTSRKSVVHLVYSGNRLSTESEVVAMRWLIMK 382 Query: 264 CQVMLSSFSTSIDEDALIMRIIDQIQD---YDGDLDCSTALCDEVRTFVESTK------- 115 C+ L T+ ED ++ I+ + QD + + L E+ F+E K Sbjct: 383 CRTTLEVLQTTAPEDCRLLNILYKFQDIHKFPEVKEIPPPLASELCAFIEKNKNVASEGI 442 Query: 114 LDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTKV 1 L S RRS RWKLA+ WR+ YK+IL CI +C+ V Sbjct: 443 CSLSSVARRSTERWKLAILWRYLYKQILCSCIIHCSAV 480 Score = 73.2 bits (178), Expect(2) = e-109 Identities = 34/50 (68%), Positives = 42/50 (84%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLSS QIL++ LLNEVNKG+SS W+ YL Q PR+Y+ LA FG+FEI+ALQ Sbjct: 95 SLSSAQILAVGLLNEVNKGKSSRWWPYLKQFPRSYETLADFGKFEIQALQ 144 >ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis] gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis] Length = 510 Score = 359 bits (922), Expect(2) = e-109 Identities = 178/341 (52%), Positives = 227/341 (66%), Gaps = 13/341 (3%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 AEKA+ K + + +EA LM EL LKP+ T AW+WA ATISSRTMHIPWD AGCLCPVG Sbjct: 155 AEKAISKAELDRKEAYSLMQELRLKPQFLTLRAWIWACATISSRTMHIPWDEAGCLCPVG 214 Query: 804 DYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDE 625 D+FNY P E+ + N + S S + N + D LTD G+DE Sbjct: 215 DFFNYAAPGEESSSPENDESWKPASCLEDASLSSERSTSNFCSETFDVQLKSLTDGGFDE 274 Query: 624 TVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCS 445 A+YCFYA++NY KG QVLLSYGTYTNLELLEHYGF+L ENPNDK FI LE M S + Sbjct: 275 DKAAYCFYARQNYKKGAQVLLSYGTYTNLELLEHYGFLLNENPNDKVFIPLELSMQSSNT 334 Query: 444 
WPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKK 265 WPKES+YI QDGKPSF+LL +RLWATP +RRS+ H+AYSG ++S ENE+++++WI++K Sbjct: 335 WPKESMYIHQDGKPSFSLLCALRLWATPSNRRRSMGHLAYSGSQLSVENEVSILKWISRK 394 Query: 264 CQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALC---DEVRTFVESTKL------ 112 C +L T+++ED+L++ ID+IQ+ L+ L + FVE+ L Sbjct: 395 CHAVLKKLPTTVEEDSLLLSAIDKIQNCHSPLELGKMLHGFEGQASAFVEAHNLLNIKIG 454 Query: 111 ----DLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTKV 1 LC K +RS+ RWKLAV+WR YKK L DCISYCT+V Sbjct: 455 TESTMLCGKAKRSMERWKLAVKWRLSYKKTLIDCISYCTEV 495 Score = 65.1 bits (157), Expect(2) = e-109 Identities = 28/50 (56%), Positives = 42/50 (84%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 +LS TQ L++ LL E++KG+SS+WY YL+ LPR+Y++LA+F +FE +ALQ Sbjct: 98 ALSPTQTLTVCLLYEMSKGQSSFWYPYLMHLPRSYEILATFSEFEKQALQ 147 >ref|XP_006348182.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Solanum tuberosum] Length = 488 Score = 346 bits (888), Expect(2) = e-106 Identities = 180/338 (53%), Positives = 221/338 (65%), Gaps = 10/338 (2%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 A+KA + +EEW E T LM EL LKP+ AWLWAS +ISSRTMHIPWD AGCLCPVG Sbjct: 152 AQKASRRAEEEWNEVTQLMHELKLKPQFLALKAWLWASGSISSRTMHIPWDEAGCLCPVG 211 Query: 804 DYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDE 625 D+FNY PEE+ N + GAG SL+ S +L A RL DAGY++ Sbjct: 212 DFFNYAAPEEETSNYEDQ---GAGK----PYSLQENGTLKSETELDAAA--RLIDAGYEK 262 Query: 624 TVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCS 445 V+SY FYA+RNY KG+QVLLSYGTYTNLELL+HYGF+L ENPNDKAFI LE +MYSLCS Sbjct: 263 DVSSYHFYARRNYRKGDQVLLSYGTYTNLELLQHYGFLLTENPNDKAFIPLEPDMYSLCS 322 Query: 444 WPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKK 265 W ESLYI DGKPSFALLST+R WA P T R+SV H+ YSG R+S E+E+ M W+ K Sbjct: 323 WDNESLYIHPDGKPSFALLSTLRFWAVPKTSRKSVVHLVYSGNRLSTESEVVAMRWLITK 382 Query: 264 CQVMLSSFSTSIDEDALIMRIIDQIQD---YDGDLDCSTALCDEVRTFVESTK----LDL 106 C+ L T+ ED ++ I+++ QD + + L E+ F+E K + Sbjct: 383 
CRTALEVLQTTAPEDCKLLNILNKFQDNHKFPEIKEMPPPLASELCAFIEKNKNVVSEGI 442 Query: 105 CSKT---RRSIYRWKLAVQWRHRYKKILSDCISYCTKV 1 CS + RRSI RWKLA WR YK+IL CI +C+ V Sbjct: 443 CSMSCVARRSIERWKLATLWRFLYKQILCSCIIHCSAV 480 Score = 68.2 bits (165), Expect(2) = e-106 Identities = 32/49 (65%), Positives = 39/49 (79%) Frame = -1 Query: 1127 LSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 L STQIL++ LLNE NKG+SS W+ YL Q PR+Y LA FG+FEI+ALQ Sbjct: 96 LCSTQILAVGLLNEANKGKSSRWWPYLKQFPRSYYTLADFGKFEIQALQ 144 >ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera] Length = 504 Score = 389 bits (998), Expect = e-105 Identities = 192/339 (56%), Positives = 243/339 (71%), Gaps = 12/339 (3%) Frame = -3 Query: 981 EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVGD 802 E+A+LK + EW++A PLM EL LKP+L F AWLWAS+T+SSRTMHIPWD AGCLCPVGD Sbjct: 150 ERAILKAELEWKKAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDAGCLCPVGD 209 Query: 801 YFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDET 622 ++NY P E+P + K S +S + NS + D + RLTD GY E Sbjct: 210 FYNYAAPGEEPCGWEDLKGSRNESSLQDSSFWNKDATSNSDAEQDDVLSQRLTDGGYKED 269 Query: 621 VASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSW 442 +A+YCFYA++NY KGEQVLLSYGTYTNLELLEHYGF+L ENPNDKAFI LE E+Y+ SW Sbjct: 270 LAAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSW 329 Query: 441 PKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKKC 262 PK+SLYI Q+GKPSFALLS +RLWATP ++RRSV H+ YSG ++S+ENEI VMEWIAK C Sbjct: 330 PKDSLYIHQNGKPSFALLSALRLWATPASQRRSVGHLVYSGTQLSSENEIFVMEWIAKSC 389 Query: 261 QVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALCD---EVRTFVESTKLD------ 109 V+L + TS++ED+L++ +D++QD D ++ AL E F+E+ L Sbjct: 390 HVVLENLPTSVEEDSLLLCALDKMQDPDLPMEVGNALRSSGVEFSAFLEAHDLKIGDGNV 449 Query: 108 ---LCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTKV 1 L K RRS+ RWKLAVQWR R+K+IL DCIS CT++ Sbjct: 450 GLLLSEKARRSMERWKLAVQWRLRHKRILVDCISRCTEI 488 Score = 74.7 bits (182), Expect = 6e-11 Identities = 39/75 (52%), Positives = 52/75 (69%), Gaps = 8/75 (10%) Frame = -1 
Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ--------KK 975 SLSS QIL+I LL E++KG+SSWW+ YL+QLPR+YD LA+F QFE +ALQ ++ Sbjct: 92 SLSSPQILTICLLAEMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTER 151 Query: 974 LFLKVKRSGKKLLRL 930 LK + KK + L Sbjct: 152 AILKAELEWKKAIPL 166 >gb|EXC05430.1| Protein SET DOMAIN GROUP 40 [Morus notabilis] Length = 508 Score = 335 bits (859), Expect(2) = e-105 Identities = 176/371 (47%), Positives = 227/371 (61%), Gaps = 43/371 (11%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASAT-------------------- 865 AEKA LK + EW+EA PLM ELNLKP+ TF AWLWASAT Sbjct: 153 AEKATLKAESEWKEANPLMKELNLKPQFLTFRAWLWASATFTLTEFHHHFNIIIPNVESN 212 Query: 864 -----------ISSRTMHIPWDTAGCLCPVGDYFNYTPPEEDPYNLNNGKVCGAGSFSHV 718 ISSRT+H+PWD AGCLCPVGD FNY P E Sbjct: 213 DVKFYASTLIKISSRTLHVPWDEAGCLCPVGDLFNYVAPGE------------------- 253 Query: 717 TSSLEGENEGNSMEQLSDANTDRLTDAGYDETVASYCFYAKRNYGKGEQVLLSYGTYTNL 538 E +EQL D+++ RLTD G++E V +YCFYA+R+Y KGEQVLL YGTYTNL Sbjct: 254 ----EDSAHTLDLEQL-DSHSQRLTDGGFEEDVVAYCFYARRHYEKGEQVLLGYGTYTNL 308 Query: 537 ELLEHYGFILQENPNDKAFISLETEMYSLCSWPKESLYISQDGKPSFALLSTVRLWATPV 358 ELLEHYGF+L +N N+K FI L+ E+ S +WPK+S++I Q GKPSFALLS +R+WATP Sbjct: 309 ELLEHYGFLLNDNSNEKVFIPLQPEICSSNTWPKDSMFIHQSGKPSFALLSALRIWATPR 368 Query: 357 TKRRSVKHIAYSGQRISNENEIAVMEWIAKKCQVMLSSFSTSIDEDALIMRIIDQIQDYD 178 +RR H+AYSG ++S ENEI VM WI+K C +L S TS +ED ++ ID++QD Sbjct: 369 NQRRPASHLAYSGSQLSAENEILVMRWISKNCNCILKSLPTSFEEDRFLLSAIDKMQDSC 428 Query: 177 GDLDCSTALCD---EVRTFVESTKL-------DLCS--KTRRSIYRWKLAVQWRHRYKKI 34 L+ + + F+E+ L +L S KT+R + RW+LA+QWR RYK+I Sbjct: 429 SPLELRNTVASSTAHIHAFLEANGLQDGEDVAELLSSRKTKREMDRWRLAIQWRVRYKEI 488 Query: 33 LSDCISYCTKV 1 L +CIS+C++V Sbjct: 489 LINCISHCSRV 499 Score = 74.3 bits (181), Expect(2) = e-105 Identities = 34/50 (68%), Positives = 41/50 (82%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLS QIL + LL E+NKGRSSWWY YL+ LPR YD+LA+FG+FE +ALQ Sbjct: 96 
SLSPIQILIVGLLYEMNKGRSSWWYPYLVNLPRGYDILATFGEFEKQALQ 145 >gb|EOY03097.1| SET domain group 40, putative isoform 1 [Theobroma cacao] Length = 498 Score = 347 bits (891), Expect(2) = e-105 Identities = 171/335 (51%), Positives = 230/335 (68%), Gaps = 9/335 (2%) Frame = -3 Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805 A+KA+ K + EW++ATPLM EL LK + TF AW+WA+ TISSRT+HIPWD AGCLCPVG Sbjct: 168 AQKALSKAEYEWKKATPLMKELKLKLQFLTFRAWIWATGTISSRTLHIPWDEAGCLCPVG 227 Query: 804 DYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDE 625 D FNY P ED +N ++ G +++ L ++ RLTD ++E Sbjct: 228 DLFNYAAPGEDLNGFDN---------------VDNLQNGYALDDLDTQHSQRLTDGAFEE 272 Query: 624 TVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCS 445 A+YCFYAK NY KGEQVLLSYGTYTNLELLE+YGF+L++NPN+K FI LE +++S S Sbjct: 273 DAAAYCFYAKTNYKKGEQVLLSYGTYTNLELLEYYGFLLEDNPNEKVFIPLEPDIHSSSS 332 Query: 444 WPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKK 265 WP +SLYI Q+G+PSFAL++ +R+WATP +R+S++H AYSG ++S +NEI+VM WIAKK Sbjct: 333 WPNDSLYIHQNGRPSFALMAALRVWATPPYQRKSIRHQAYSGSQLSQDNEISVMTWIAKK 392 Query: 264 CQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTAL---CDEVRTFVESTKL---DLC 103 C L + TSI++D L++ D+IQ++D + A+ E +++T L D Sbjct: 393 CHATLKAMPTSIEDDNLLLSFTDKIQEFDNLWEWGKAMPAFGGEFCNLLQATNLKRNDES 452 Query: 102 SKTRRS---IYRWKLAVQWRHRYKKILSDCISYCT 7 +RR+ I RWKLAV WR YKK+L DCISYCT Sbjct: 453 FASRRAKMLIDRWKLAVHWRLIYKKVLVDCISYCT 487 Score = 62.0 bits (149), Expect(2) = e-105 Identities = 28/50 (56%), Positives = 40/50 (80%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLS Q+L+I L E++KG++S W+ YLL LPR+Y +LA+FG+FE +ALQ Sbjct: 111 SLSPAQVLTICFLYEMSKGKASPWHPYLLHLPRSYGILAAFGEFEKQALQ 160 >ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatula] gi|355517485|gb|AES99108.1| Protein SET DOMAIN GROUP [Medicago truncatula] Length = 532 Score = 348 bits (894), Expect(2) = e-104 Identities = 183/351 (52%), Positives = 228/351 (64%), Gaps = 24/351 (6%) Frame = -3 Query: 981 
EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASAT-------------ISSRTMHI 841 EKAV K K EW+EA LM +L KP+L TF AW+WA+AT ISSRT+HI Sbjct: 186 EKAVQKAKSEWKEAHALMEDLMFKPQLLTFKAWVWAAATGRTVPETFHLPGLISSRTLHI 245 Query: 840 PWDTAGCLCPVGDYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDA 661 PWD AGCLCPVGD FNY P E+ + G H S+ G+ E D Sbjct: 246 PWDEAGCLCPVGDLFNYDAPGEE--------LSGVEDVDHFLSN--GDMNVVIDEGQIDF 295 Query: 660 NTDRLTDAGYDETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAF 481 N+ RLTD G++E +YCFYA+ NY KG+QVLL YGTYTNLELLEHYGF+LQENPNDK F Sbjct: 296 NSQRLTDGGFEEDANAYCFYARTNYKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKIF 355 Query: 480 ISLETEMYSLCSWPKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNE 301 I LE MY+ SW KESLYI +GKPSFALL+ +RLWATP KRRS+ H+AYSG ++S + Sbjct: 356 IPLEPAMYTSTSWSKESLYIHPNGKPSFALLAALRLWATPHNKRRSIGHLAYSGSQLSAD 415 Query: 300 NEIAVMEWIAKKCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALC--DEVRTFV 127 NEI VM+W++K C +L + TSI++D L++ +D QD+ + + DEV TF+ Sbjct: 416 NEIIVMKWLSKTCDAVLKNMPTSIEDDTLLLNALDCSQDFITFMKIVKLMSSRDEVYTFL 475 Query: 126 E----STKLDLC-----SKTRRSIYRWKLAVQWRHRYKKILSDCISYCTKV 1 E + L C KTRRS+ RWKLAV WR RYK++L DCISYC + Sbjct: 476 EAHNITDALSFCDTISSKKTRRSMDRWKLAVLWRLRYKRVLVDCISYCNGI 526 Score = 58.5 bits (140), Expect(2) = e-104 Identities = 26/44 (59%), Positives = 36/44 (81%) Frame = -1 Query: 1112 ILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 IL++ LL EV KG++S W+ YL+ LP++YDLLA FG+FE +ALQ Sbjct: 134 ILTVCLLYEVGKGKTSRWHPYLVHLPQSYDLLAMFGEFEKQALQ 177 >ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Capsella rubella] gi|482558148|gb|EOA22340.1| hypothetical protein CARUB_v10002957mg [Capsella rubella] Length = 503 Score = 332 bits (851), Expect(2) = e-104 Identities = 171/344 (49%), Positives = 231/344 (67%), Gaps = 18/344 (5%) Frame = -3 Query: 981 EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVGD 802 EKA K + EW+EA LM EL+LKP+ +F AWLWASATISSRT+HIPWD+AGCLCP GD Sbjct: 151 EKATAKCQSEWKEAGTLMKELDLKPKFQSFQAWLWASATISSRTLHIPWDSAGCLCPAGD 210 Query: 801 
YFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLS---DANTDRLTDAGY 631 FNY P +D N + G + S +S+ N+ E+ + ++RLTD G+ Sbjct: 211 LFNYDAPGDD-LNYSEGPESAIQTSSPQPASITNLECRNNEEEAGLNVEIQSERLTDGGF 269 Query: 630 DETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSL 451 +E +YC YA+RNY GEQVLL YGTYTNLELLEHYGF+L+EN NDK FI LET +YSL Sbjct: 270 EEDANAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLYSL 329 Query: 450 C-SWPKESLYISQDGKPSFALLSTVRLWATPVTKR-RSVKHIAYSGQRISNENEIAVMEW 277 SWPK+SLYI QDGKPSFAL+ST+RLW P ++R +SV + Y+G +IS +NEI VM+W Sbjct: 330 ASSWPKDSLYIHQDGKPSFALVSTLRLWLVPQSQRDKSVMRLVYAGSQISVKNEILVMKW 389 Query: 276 IAKKCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDC--STALCDEVRTFVESTKL--- 112 +++KC +L + TS+ ED L++ ID++QD L+ + A E+R F++ +L Sbjct: 390 MSEKCGSVLRNLPTSVSEDNLLLHNIDKLQDPKIRLEQKETEAFGSEMRAFLDVNRLWDV 449 Query: 111 --------DLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTK 4 + +T R + +W+L+VQWR YK+ L+DCI YC + Sbjct: 450 IGFSGKDVEFPRRTNRMMSKWRLSVQWRLSYKRTLADCIYYCNE 493 Score = 73.6 bits (179), Expect(2) = e-104 Identities = 34/50 (68%), Positives = 43/50 (86%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLSSTQILS+ LL E++KG+ S+WY YL+ LPR YDLLA+FG+FE +ALQ Sbjct: 93 SLSSTQILSVCLLYEMSKGKKSFWYPYLVHLPRDYDLLATFGEFEKQALQ 142 >ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cucumis sativus] Length = 483 Score = 333 bits (853), Expect(2) = e-102 Identities = 171/336 (50%), Positives = 221/336 (65%), Gaps = 11/336 (3%) Frame = -3 Query: 981 EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVGD 802 EKA LK + +W LM E N+K +L TF AWLWASATISSRT+++PWD AGCLCPVGD Sbjct: 149 EKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGD 208 Query: 801 YFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDET 622 FNY PE + +N + V S + + LE +E+ D+ LTD G++E Sbjct: 209 LFNYAAPEGESFNAVD--VLSFPSHASLNDELE------LLEEQRDSQW-ALTDGGFEEN 259 Query: 621 VASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSW 442 ++YCFYA+ +Y KGEQVLLSYGTYTNLELLE+YGF+LQENPNDK FI +E ++Y SW 
Sbjct: 260 ASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSW 319 Query: 441 PKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKKC 262 PKESLYI Q+G PSFALLS +RLWAT KRR V H+AY+G ++S +NEI VM+W++K C Sbjct: 320 PKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNEILVMQWLSKNC 379 Query: 261 QVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALC---DEVRTFVES--------TK 115 +L++ TSI+ED ++ I ++QD + L E F+E+ + Sbjct: 380 HTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDEAE 439 Query: 114 LDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCT 7 K +RS+ RWKLAVQWR YKK L DCI YCT Sbjct: 440 SHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 475 Score = 68.9 bits (167), Expect(2) = e-102 Identities = 31/50 (62%), Positives = 41/50 (82%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLSSTQ L+ LL E++KG SSWW+ YL LP++YD+LA+FG+FE +ALQ Sbjct: 91 SLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQ 140 >ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana] gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40 gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana] gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana] gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana] Length = 491 Score = 330 bits (846), Expect(2) = e-101 Identities = 167/341 (48%), Positives = 225/341 (65%), Gaps = 15/341 (4%) Frame = -3 Query: 981 EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVGD 802 EKA K + EW+EA LM EL LKP+ +F AWLWASATISSRT+H+PWD+AGCLCPVGD Sbjct: 151 EKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWASATISSRTLHVPWDSAGCLCPVGD 210 Query: 801 YFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDET 622 FNY P G +S+ E N + + +++RLTD G++E Sbjct: 211 LFNYDAP---------------GDYSNTPQGPESANNVEEAGLVVETHSERLTDGGFEED 255 Query: 621 VASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLC-S 445 V +YC YA+RNY GEQVLL YGTYTNLELLEHYGF+L+EN NDK FI LET ++SL S Sbjct: 256 VNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASS 315 Query: 
444 WPKESLYISQDGKPSFALLSTVRLWATPVTKR-RSVKHIAYSGQRISNENEIAVMEWIAK 268 WPK+SLYI QDGK SFAL+ST+RLW P ++R +SV + Y+G +IS +NEI VM+W+++ Sbjct: 316 WPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQISVKNEILVMKWMSE 375 Query: 267 KCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDC--STALCDEVRTFVEST-------- 118 KC +L TS+ ED +++ ID++QD + L+ + A EVR F+++ Sbjct: 376 KCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLEQKETEAFGSEVRAFLDANCLWDVTVL 435 Query: 117 ---KLDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTK 4 ++ KT R + +W+ +VQWR YK+ L+DCISYC + Sbjct: 436 SGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNE 476 Score = 68.6 bits (166), Expect(2) = e-101 Identities = 32/50 (64%), Positives = 40/50 (80%) Frame = -1 Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981 SLSSTQILS+ LL E++K + S+WY YL +PR YDLLA+FG FE +ALQ Sbjct: 93 SLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDLLATFGNFEKQALQ 142 >ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. 
lyrata]
          Length = 493

 Score = 324 bits (830), Expect(2) = e-101
 Identities = 169/344 (49%), Positives = 229/344 (66%), Gaps = 18/344 (5%)
 Frame = -3

Query: 981 EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVGD 802
           EKA+ K + EW+E LM EL LK + +F AWLWASATISSRT+H+PWD+AGCLCPVGD
Sbjct: 154 EKAIAKCQFEWKEVGLLMEELELKSKFRSFQAWLWASATISSRTLHVPWDSAGCLCPVGD 213

Query: 801 YFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLS---DANTDRLTDAGY 631
           FNY P +D + +LEG N +E+ + +++RLTD G+
Sbjct: 214 LFNYDAPGDDLH------------------TLEGPESANDVEEAGLVVETHSERLTDGGF 255

Query: 630 DETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSL 451
           +E V +YC YA+RNY GEQVLL YGTYTNLELLEHYGF+L+EN NDK FI LET ++SL
Sbjct: 256 EEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSL 315

Query: 450 C-SWPKESLYISQDGKPSFALLSTVRLWATPVTKR-RSVKHIAYSGQRISNENEIAVMEW 277
           SWPK+SLYI QDGKPSFAL+ST+RLW P ++R +SV + Y+G +IS +NEI VM+W
Sbjct: 316 ASSWPKDSLYIHQDGKPSFALVSTLRLWLIPQSQRDKSVMRLVYAGTQISVKNEILVMKW 375

Query: 276 IAKKCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDC--STALCDEVRTFVESTKL--- 112
           I++KC + TS+ ED L++ ID++QD + L+ + A EV F+++ +L
Sbjct: 376 ISEKCG-SVRDLPTSVLEDTLLLHNIDKLQDPELRLEQKETEAFGSEVCAFLDANRLRDV 434

Query: 111 --------DLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTK 4
           + KT R + +W+L+V WR YK+ L+DCISYC++
Sbjct: 435 TGFSGKPVEFSRKTSRMLSKWRLSVLWRLSYKRTLADCISYCSE 478

 Score = 72.8 bits (177), Expect(2) = e-101
 Identities = 34/50 (68%), Positives = 42/50 (84%)
 Frame = -1

Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981
            SLSSTQILS+ LL E+ KG+ S+WY YL+ LPR YDLLA+FG+FE +ALQ
Sbjct: 96   SLSSTQILSVCLLYEMGKGKRSFWYPYLVHLPRDYDLLATFGEFEKQALQ 145

>ref|XP_006400256.1| hypothetical protein EUTSA_v10015946mg [Eutrema salsugineum]
 gi|557101346|gb|ESQ41709.1| hypothetical protein EUTSA_v10015946mg [Eutrema salsugineum]
          Length = 506

 Score = 329 bits (843), Expect(2) = e-100
 Identities = 172/349 (49%), Positives = 225/349 (64%), Gaps = 22/349 (6%)
 Frame = -3

Query: 984 AEKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVG 805
           AEKA+ K + EW+EA LM L+LKP+ + AWLWASATISSRT+HIPWD+AGCLCPVG
Sbjct: 150 AEKAIAKSQSEWKEAVTLMKVLDLKPKFQSLQAWLWASATISSRTLHIPWDSAGCLCPVG 209

Query: 804 DYFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSME--QLSDANTDRLTDAGY 631
           D FNY P +D ++ S S+ E N+ E + + ++RLTD G+
Sbjct: 210 DLFNYDAPGDDLNTSEGPELVIQTSSPKPVSTTHHECRNNAEEAGHVVETQSERLTDGGF 269

Query: 630 DETVASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSL 451
           DE +YC YA+RNY GEQVLL YGTYTNLELLEHYGF+L+EN NDK FI LET +YSL
Sbjct: 270 DEDANAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLYSL 329

Query: 450 C-SWPKESLYISQDGKPSFALLSTVRLWATPVTKR-RSVKHIAYSGQRISNENEIAVMEW 277
           SWPK+SLYI QDGKPSFAL+ST+RLW P +R ++ + Y+G +IS +NEI VM+W
Sbjct: 330 ASSWPKDSLYIHQDGKPSFALVSTLRLWLIPQNQRDKTAMRLVYAGSQISVKNEILVMKW 389

Query: 276 IAKKCQVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDC--STALCDEVRTFVESTKL--- 112
           ++ KC +L TS+ ED ++++ I +QD + L + A EVR F++ L
Sbjct: 390 MSDKCGRVLRDLPTSLLEDTVLLQDIKNLQDPEVCLKQKETEAFGSEVRAFLDVNHLWDL 449

Query: 111 -------------DLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCTK 4
           + KT R I +W+L+VQWR RYK+ L DCISYC +
Sbjct: 450 INGDVIGLSGKAVEFSRKTNRIISKWRLSVQWRLRYKRTLVDCISYCNE 498

 Score = 65.1 bits (157), Expect(2) = e-100
 Identities = 29/50 (58%), Positives = 40/50 (80%)
 Frame = -1

Query: 1130 SLSSTQILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981
            S+SSTQ L + LL E++KG+ S+WY YL+ LPR YDL ++FG+FE +ALQ
Sbjct: 93   SISSTQRLGVCLLYEMSKGKKSFWYPYLVHLPRDYDLSSTFGEFEKQALQ 142

>ref|XP_004167843.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cucumis sativus]
          Length = 389

 Score = 331 bits (849), Expect(2) = e-100
 Identities = 170/336 (50%), Positives = 220/336 (65%), Gaps = 11/336 (3%)
 Frame = -3

Query: 981 EKAVLKGKEEWEEATPLMSELNLKPRLTTFNAWLWASATISSRTMHIPWDTAGCLCPVGD 802
           EKA LK + +W LM E N+K +L TF AWLWASATISSRT+++PWD AGCLCPVGD
Sbjct: 55  EKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGD 114

Query: 801 YFNYTPPEEDPYNLNNGKVCGAGSFSHVTSSLEGENEGNSMEQLSDANTDRLTDAGYDET 622
           FNY PE + +N + V S + + LE +E+ D+ LTD G++E
Sbjct: 115 LFNYAAPEGESFNAVD--VLSFPSHASLNDELE------LLEEQRDSQW-ALTDGGFEEN 165

Query: 621 VASYCFYAKRNYGKGEQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSW 442
           ++YCFYA+ NY KGEQVLLSYGTYTNLELLE+YGF+LQENPNDK FI +E ++Y SW
Sbjct: 166 ASAYCFYARENYRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSW 225

Query: 441 PKESLYISQDGKPSFALLSTVRLWATPVTKRRSVKHIAYSGQRISNENEIAVMEWIAKKC 262
           P+ESLYI Q+G PSFALLS +RLWAT KRR V H+AY+G ++S +NE VM+W++K C
Sbjct: 226 PEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNC 285

Query: 261 QVMLSSFSTSIDEDALIMRIIDQIQDYDGDLDCSTALC---DEVRTFVES--------TK 115
           +L++ TSI+ED ++ I ++QD + L E F+E+ +
Sbjct: 286 HTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDEAE 345

Query: 114 LDLCSKTRRSIYRWKLAVQWRHRYKKILSDCISYCT 7
           K +RS+ RWKLAVQWR YKK L DCI YCT
Sbjct: 346 SHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 381

 Score = 60.8 bits (146), Expect(2) = e-100
 Identities = 26/45 (57%), Positives = 36/45 (80%)
 Frame = -1

Query: 1115 QILSIALLNEVNKGRSSWWYTYLLQLPRTYDLLASFGQFEIEALQ 981
            Q L+ LL E++KG SSWW+ YL LP++YD+LA+FG+FE +ALQ
Sbjct: 2    QKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQ 46