BLASTX nr result
ID: Perilla23_contig00026876
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00026876
         (1037 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011074750.1| PREDICTED: protein SET DOMAIN GROUP 41 [Sesa...   269   3e-69
ref|XP_012839220.1| PREDICTED: protein SET DOMAIN GROUP 41 [Eryt...   233   2e-58
gb|EYU36834.1| hypothetical protein MIMGU_mgv1a023205mg [Erythra...   219   4e-54
ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41 [Sola...   158   8e-36
ref|XP_009786354.1| PREDICTED: protein SET DOMAIN GROUP 41 [Nico...   157   1e-35
ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   156   3e-35
ref|XP_009605470.1| PREDICTED: protein SET DOMAIN GROUP 41 [Nico...   155   7e-35
ref|XP_008237847.1| PREDICTED: protein SET DOMAIN GROUP 41 [Prun...   153   2e-34
ref|XP_007199300.1| hypothetical protein PRUPE_ppa023162mg, part...   147   2e-32
ref|XP_007019535.1| SET domain-containing protein, putative isof...   145   5e-32
ref|XP_007019533.1| SET domain protein, putative isoform 1 [Theo...   145   5e-32
ref|XP_010665142.1| PREDICTED: protein SET DOMAIN GROUP 41 isofo...   144   9e-32
ref|XP_010665141.1| PREDICTED: protein SET DOMAIN GROUP 41 isofo...   144   9e-32
ref|XP_010260979.1| PREDICTED: protein SET DOMAIN GROUP 41 isofo...   125   4e-26
ref|XP_010260978.1| PREDICTED: protein SET DOMAIN GROUP 41 isofo...   125   4e-26
ref|XP_012080733.1| PREDICTED: protein SET DOMAIN GROUP 41 isofo...   109   4e-21
ref|XP_012080731.1| PREDICTED: protein SET DOMAIN GROUP 41 isofo...   109   4e-21
gb|KJB66309.1| hypothetical protein B456_010G134600 [Gossypium r...
107 2e-20 gb|KJB66308.1| hypothetical protein B456_010G134600 [Gossypium r... 107 2e-20 ref|XP_010062107.1| PREDICTED: protein SET DOMAIN GROUP 41 isofo... 107 2e-20 >ref|XP_011074750.1| PREDICTED: protein SET DOMAIN GROUP 41 [Sesamum indicum] Length = 734 Score = 269 bits (687), Expect = 3e-69 Identities = 157/330 (47%), Positives = 198/330 (60%), Gaps = 37/330 (11%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTF----PKNSRHVPT 864 MEMRA+E+IAIG+DLT PLPPLA VL+DSA+SSHCSACFS LPP F +N HVP Sbjct: 1 MEMRAVEDIAIGQDLTPPLPPLAVVLNDSALSSHCSACFSTLPPHPFFPTTLQNLSHVPK 60 Query: 863 DALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQN-- 690 + TPLYCS RC S E + L++IF+ Sbjct: 61 NIPTPLYCSPRCSSIDSPLHFSSGEPYLLSLFLQSPPATWDDSSDLRLSLRLLYIFREHL 120 Query: 689 -----LP---------QKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENL-------- 576 LP ++ LGGKK+ FQ+N+ GP DY++N E + EN+ Sbjct: 121 KECSFLPSFPGSTTSERRLLLGGKKDYSFQQNQNVAGPDDYLENKEEQRENMKSCCHNDI 180 Query: 575 --ERIAGLMTNRENLVVGEIEENQFE-------CDENPSNCEDCSENSGNVLQRIREGAK 423 ERIAGLMTNRE L+ GE ++QF EN N E E++ +V +RIREGA+ Sbjct: 181 VMERIAGLMTNRERLIFGENRKDQFRGSYEEDNLKENRENREGSCESTESVSERIREGAE 240 Query: 422 MMAKARRKCVGKDVNVEKEECFSLEEMVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWIN 243 +MAKARR C+ +D NVEK+E F +EEMVLCLVLTNAVEVQ G S G+AVY SWIN Sbjct: 241 LMAKARRMCLDEDGNVEKQEEFVVEEMVLCLVLTNAVEVQDKSGCSTGVAVYGTAISWIN 300 Query: 242 HSCSPNSCYRFLVGPQNDQRQPLRIAPAAE 153 HSCSPN+CYRF +G +N+++ LRI AA+ Sbjct: 301 HSCSPNACYRFSLGLENNEQPRLRIVSAAK 330 >ref|XP_012839220.1| PREDICTED: protein SET DOMAIN GROUP 41 [Erythranthe guttatus] Length = 657 Score = 233 bits (594), Expect = 2e-58 Identities = 154/337 (45%), Positives = 183/337 (54%), Gaps = 15/337 (4%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPK-------NSRH 873 MEMRA+E+IAIGEDLT LPPLA VL ++AVSS+CSACFS LPPQ FP N H Sbjct: 1 MEMRAVEDIAIGEDLTPALPPLAFVLLETAVSSYCSACFSILPPQPFPPLNPNSRPNCSH 60 Query: 872 VPTDALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQ 693 P+ TPLYCS+ C S E L+H+FQ 
Sbjct: 61 FPSP--TPLYCSVNCSSIDSPLHFSSGELRLLSLFRQSPPFAWEDSSDLRLSLRLIHLFQ 118 Query: 692 NLPQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEEN 513 + E+ E P +E +ERI GLMTNRE L+ E + Sbjct: 119 KI-----------------EKIECPE--------ASEIIERIGGLMTNREKLIFEESRNS 153 Query: 512 QFE-CDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKE-ECFSLEEMV 339 + + DEN +C D ENS NV Q+IR GAKMMA+ARR VN EK+ + F LEEMV Sbjct: 154 KSKFSDENLRDCGDSGENSENVYQKIRSGAKMMAEARRASTDHYVNAEKKRDDFVLEEMV 213 Query: 338 LCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQNDQRQPLRIAP- 162 LCLVLTNAVEVQ G +IGIAVYD FSWINHSCSPNSCYRF+ +N Q+ LRIA Sbjct: 214 LCLVLTNAVEVQDKNGCTIGIAVYDTAFSWINHSCSPNSCYRFVSRLENHQQSSLRIASY 273 Query: 161 AAEGDR-----ANENGDGLRYLFSGISCE*CGGELTV 66 A G R NG G R + I G E+T+ Sbjct: 274 ATSGCRHGYGDIERNGYGPRVIVRSIKAVQKGEEVTI 310 >gb|EYU36834.1| hypothetical protein MIMGU_mgv1a023205mg [Erythranthe guttata] Length = 635 Score = 219 bits (557), Expect = 4e-54 Identities = 150/336 (44%), Positives = 175/336 (52%), Gaps = 14/336 (4%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPK-------NSRH 873 MEMRA+E+IAIGEDLT LPPLA VL ++AVSS+CSACFS LPPQ FP N H Sbjct: 1 MEMRAVEDIAIGEDLTPALPPLAFVLLETAVSSYCSACFSILPPQPFPPLNPNSRPNCSH 60 Query: 872 VPTDALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQ 693 P+ TPLYCS+ C S E L+H+FQ Sbjct: 61 FPSP--TPLYCSVNCSSIDSPLHFSSGELRLLSLFRQSPPFAWEDSSDLRLSLRLIHLFQ 118 Query: 692 NLPQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEEN 513 + E+ E P +E +ERI GLMTNRE L+ E Sbjct: 119 KI-----------------EKIECPE--------ASEIIERIGGLMTNREKLIFEE---- 149 Query: 512 QFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKE-ECFSLEEMVL 336 SENS NV Q+IR GAKMMA+ARR VN EK+ + F LEEMVL Sbjct: 150 --------------SENSENVYQKIRSGAKMMAEARRASTDHYVNAEKKRDDFVLEEMVL 195 Query: 335 CLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQNDQRQPLRIAP-A 159 CLVLTNAVEVQ G +IGIAVYD FSWINHSCSPNSCYRF+ +N Q+ LRIA A Sbjct: 196 CLVLTNAVEVQDKNGCTIGIAVYDTAFSWINHSCSPNSCYRFVSRLENHQQSSLRIASYA 255 Query: 158 
AEGDR-----ANENGDGLRYLFSGISCE*CGGELTV 66 G R NG G R + I G E+T+ Sbjct: 256 TSGCRHGYGDIERNGYGPRVIVRSIKAVQKGEEVTI 291 >ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41 [Solanum lycopersicum] Length = 677 Score = 158 bits (399), Expect = 8e-36 Identities = 112/300 (37%), Positives = 144/300 (48%), Gaps = 8/300 (2%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTF--------PKNSR 876 MEMRA E I+IG+DLT P+PPL+ LH S + SHCS+CFS LPP PKN Sbjct: 1 MEMRAKEAISIGQDLTPPIPPLSLCLHHSTLLSHCSSCFSPLPPPPSLHYPPFFSPKN-- 58 Query: 875 HVPTDALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIF 696 P + YCSL+C S+E H L+H+F Sbjct: 59 --PNSNHSIRYCSLQCSSLDSPIHFSSSEFHFFHLFPQPLYTNFPTSSDLRLSLRLLHLF 116 Query: 695 QNLPQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEE 516 Q L +E+ G + NLERI GLMTN ++ E Sbjct: 117 QTL--------------HLIQESNGSL----------LNLERIGGLMTNFRKVMFLE--- 149 Query: 515 NQFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEECFSLEEMVL 336 E C++N ++ RIR+GAK +A +RR VG + N E +++E VL Sbjct: 150 ------------EHCNDN--DLSGRIRDGAKALAASRRMRVGLETNGE----YTVEAAVL 191 Query: 335 CLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQNDQRQPLRIAPAA 156 CLVLTNAVEV G S+G+ VYD FSW+NHSCSPN+ YRF + RI PAA Sbjct: 192 CLVLTNAVEVYDKDGRSLGVGVYDVPFSWVNHSCSPNASYRFCTASDSGGILESRICPAA 251 >ref|XP_009786354.1| PREDICTED: protein SET DOMAIN GROUP 41 [Nicotiana sylvestris] Length = 662 Score = 157 bits (397), Expect = 1e-35 Identities = 109/292 (37%), Positives = 138/292 (47%), Gaps = 1/292 (0%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDALT 852 ME+RA EEI IG+DLT P+PPL+ LH S + SHCS+CFS LPP F P Sbjct: 1 MEIRANEEIPIGQDLTPPIPPLSLSLHHSILLSHCSSCFSPLPPSPFYPT---YPNPDHF 57 Query: 851 PLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQNLPQKSY 672 YCSL+C S+E H F PQ Y Sbjct: 58 VRYCSLQCSSLDSPLHFSSSE---------------------------FHFFHLFPQPLY 90 Query: 671 LGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEENQFECDEN 492 + + + + + NLERI GLMTN + L + +EE Q+ D++ Sbjct: 91 
TTSPTSTDLRLSLRL---IHRFQEANVSFSNLERIGGLMTNFKKLTL--LEEQQYYNDDD 145 Query: 491 PSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEEC-FSLEEMVLCLVLTNA 315 + RIR+GAK MA ARR G D NVE +++E VLCLVLTN+ Sbjct: 146 DG-----------LSGRIRDGAKAMAVARRMRDGLDTNVELSAAEYAVEAAVLCLVLTNS 194 Query: 314 VEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQNDQRQPLRIAPA 159 VEV G SIG+ VYD FS++NHSCSPN+ YRF RI PA Sbjct: 195 VEVHDKDGRSIGVGVYDLAFSYVNHSCSPNASYRFCTAFDCGGELEFRICPA 246 >ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum tuberosum] Length = 681 Score = 156 bits (394), Expect = 3e-35 Identities = 113/303 (37%), Positives = 141/303 (46%), Gaps = 11/303 (3%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDALT 852 MEMRA E I IG+DLT P+PPL+ LH S + SHCS+CFS LPP P S H P + Sbjct: 1 MEMRAKEAIPIGQDLTPPIPPLSLSLHHSTLLSHCSSCFSPLPP---PPPSLHYPP-FFS 56 Query: 851 PL---------YCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHI 699 P YCSL+C S+E H L+H Sbjct: 57 PKNPNPNHFIRYCSLQCSSLDSPIHFSSSEFHFFHLFPQPLHTNFPTSSDLRLSLRLLHR 116 Query: 698 FQNLPQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIE 519 FQ L + I + NLERI GL+TN ++ E Sbjct: 117 FQTL------------------------NLIQESNGSFLNLERIGGLVTNFRKVMFLE-- 150 Query: 518 ENQFECDENPSNCEDCSENSGNVLQ-RIREGAKMMAKARRKCVGKDVNVEK-EECFSLEE 345 E C++N + L RIR GAK +A +RR +G D N E E +++E Sbjct: 151 -------------EHCNDNDDDDLSGRIRHGAKALAASRRMRLGLDTNRELLYEEYTVEA 197 Query: 344 MVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQNDQRQPLRIA 165 VLCLVLTNAVEV G S+G+ VYD FSW+NHSCSPN+ YRF + RI Sbjct: 198 AVLCLVLTNAVEVHDKDGRSLGVGVYDVPFSWVNHSCSPNASYRFCTASDSGGISECRIC 257 Query: 164 PAA 156 PAA Sbjct: 258 PAA 260 >ref|XP_009605470.1| PREDICTED: protein SET DOMAIN GROUP 41 [Nicotiana tomentosiformis] Length = 657 Score = 155 bits (391), Expect = 7e-35 Identities = 112/303 (36%), Positives = 143/303 (47%), Gaps = 3/303 (0%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDALT 852 ME+RA ++I IG+DL P+PPL+ LH S + SHCS+CFS LP F + P + Sbjct: 
1 MEVRATKDIPIGQDLNPPIPPLSLSLHYSTLLSHCSSCFSPLPSSPFSPTNNPSPNHFIR 60 Query: 851 PLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQNLPQKSY 672 YCSL+C S+E H L+H FQ Sbjct: 61 --YCSLQCSSHDSPLHFSSSEFHFFHLFPEPLFTTSPTSTDLRLSLRLIHRFQ------- 111 Query: 671 LGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEENQFECDEN 492 E G NLERI GLMTN + ++ +EE Q D++ Sbjct: 112 -------------EANGSFS----------NLERICGLMTNFKKIMF--LEEQQCYNDDH 146 Query: 491 PSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEEC-FSLEEMVLCLVLTNA 315 E SG +L +GAK MA ARR G D NVE +S+E VLCLVLTN+ Sbjct: 147 EE------ELSGRIL----DGAKAMAIARRTRDGLDTNVELSAAEYSVEAAVLCLVLTNS 196 Query: 314 VEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQNDQRQPLRIAPAAE--GDRA 141 VEV G S+G+ VYD FS+INHSCSPN+CY+F + RI PAA G Sbjct: 197 VEVHDKDGRSLGVGVYDLAFSYINHSCSPNACYKFCTTLDSGGELEFRICPAASETGAAG 256 Query: 140 NEN 132 NE+ Sbjct: 257 NES 259 >ref|XP_008237847.1| PREDICTED: protein SET DOMAIN GROUP 41 [Prunus mume] Length = 680 Score = 153 bits (387), Expect = 2e-34 Identities = 113/316 (35%), Positives = 148/316 (46%), Gaps = 20/316 (6%) Frame = -1 Query: 1037 LKMEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQ---------TFPK 885 ++MEMRA E+I IGED+T PL PLA LHDS +SSHCS+CFS LPP TFP Sbjct: 1 MEMEMRAEEDIEIGEDITPPLTPLAFALHDSLLSSHCSSCFSLLPPHPFPPLHFNPTFPH 60 Query: 884 NSRHVPTDALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLV 705 N HV + + + YCS C SAE H + Sbjct: 61 NPHHVLSSS-SSFYCSPLCSTSDSPLHVSSAEPHL------------------------L 95 Query: 704 HIFQNLPQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGE 525 H+ Q+ P +Y G + + + + + + RIAGL+TN L+ Sbjct: 96 HLLQSHPS-TYPHGD-------SSDLRAALRLLHSLPATRPSA-RIAGLLTNHHKLLHHH 146 Query: 524 IEENQFECDENPSNCEDCSENSGNVLQRIREGA-------KMMAKARRKCVGKDVNVEKE 366 RIR+GA KM +A C +V + Sbjct: 147 DHH------------------------RIRDGARAMFLASKMRDEAPNVCSDNSSSVSPD 182 Query: 365 ECFSLEEMVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQ--- 195 + LEE LCLVLTNAVEVQ G ++GI+VY +F WINHSCSPN+CYRFLV P Sbjct: 183 DAV-LEEAALCLVLTNAVEVQDKTGRTLGISVYGPSFCWINHSCSPNACYRFLVSPPPPT 
241 Query: 194 -NDQRQPLRIAPAAEG 150 + ++ PLRIAP +G Sbjct: 242 CSAEKTPLRIAPFGQG 257 >ref|XP_007199300.1| hypothetical protein PRUPE_ppa023162mg, partial [Prunus persica] gi|462394700|gb|EMJ00499.1| hypothetical protein PRUPE_ppa023162mg, partial [Prunus persica] Length = 635 Score = 147 bits (370), Expect = 2e-32 Identities = 111/313 (35%), Positives = 146/313 (46%), Gaps = 16/313 (5%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPK---------NS 879 MEMRA E+I IGED+T PL PL LHDS +SSHCS+CFS LPP FP N Sbjct: 1 MEMRAEEDIEIGEDITPPLTPLGFALHDSLLSSHCSSCFSLLPPHPFPPLHFTPPFPHNP 60 Query: 878 RHVPTDALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHI 699 HV + + YCS C SAE H +H+ Sbjct: 61 HHVLSSSS---YCSPLCSTSDSPLHVSSAELHL------------------------LHL 93 Query: 698 FQNLPQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIE 519 Q+ P +Y G + + + + + + RIAGL+TN + Sbjct: 94 LQSHPS-TYPHGD-------SSDLRAALRLLHSLPATGPSA-RIAGLLTNHHKFL----- 139 Query: 518 ENQFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEECFS--LEE 345 D++ RIR+GA+ M AR+ + E + LEE Sbjct: 140 ----HHDDH---------------HRIRDGARAMFLARK------MRDEAPNVYDAVLEE 174 Query: 344 MVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGPQ-----NDQRQ 180 LCLVLTNAVEVQ G ++GI+VY +F WINHSCSPN+CYRFLV P + +R Sbjct: 175 AALCLVLTNAVEVQDKTGRTLGISVYGPSFCWINHSCSPNACYRFLVSPPPPPPCSAERT 234 Query: 179 PLRIAPAAEGDRA 141 PLRIAP +G ++ Sbjct: 235 PLRIAPLGQGTQS 247 >ref|XP_007019535.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|590600821|ref|XP_007019537.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|590600825|ref|XP_007019538.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|590600830|ref|XP_007019539.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724863|gb|EOY16760.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724865|gb|EOY16762.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] 
gi|508724866|gb|EOY16763.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724867|gb|EOY16764.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] Length = 625 Score = 145 bits (366), Expect = 5e-32 Identities = 106/306 (34%), Positives = 140/306 (45%), Gaps = 7/306 (2%) Frame = -1 Query: 1034 KMEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDAL 855 +MEMRA +++ G+D+T P+ PL++ L+DS +SSHCS+CFS LPP TFP RHVP Sbjct: 12 EMEMRAKQDLDYGQDITPPILPLSSSLYDSFLSSHCSSCFSPLPP-TFPHIPRHVP---- 66 Query: 854 TPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQNLPQKS 675 LYCS C + H + + Q+LP Sbjct: 67 --LYCSPTC-----------SSSHSPLHSSSAESLLPPTCPDSSDLRTALRLLQSLPSTP 113 Query: 674 YLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEENQFECDE 495 +L RI GL+TN L Sbjct: 114 ------------------------------PHLHRIDGLLTNHHML-------------- 129 Query: 494 NPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEECFSLEEMVLCLVLTNA 315 + +S V +IR+GA MA AR K +D N + + F LEE VL LV+TNA Sbjct: 130 --------TSSSPEVAAKIRQGAIAMAAAR-KSRNRD-NEGQSDGFLLEEAVLSLVITNA 179 Query: 314 VEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGP-------QNDQRQPLRIAPAA 156 VEVQ G S+GIAVYD +FSWINHSCSPN+CYRF + + D LRI P+ Sbjct: 180 VEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSSSTLRIVPSV 239 Query: 155 EGDRAN 138 G+ + Sbjct: 240 LGEECD 245 >ref|XP_007019533.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|590600784|ref|XP_007019534.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|590600816|ref|XP_007019536.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|508724861|gb|EOY16758.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|508724862|gb|EOY16759.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|508724864|gb|EOY16761.1| SET domain protein, putative isoform 1 [Theobroma cacao] Length = 658 Score = 145 bits (366), Expect = 5e-32 Identities = 106/306 (34%), Positives = 140/306 (45%), Gaps = 7/306 (2%) Frame = -1 Query: 1034 
KMEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDAL 855 +MEMRA +++ G+D+T P+ PL++ L+DS +SSHCS+CFS LPP TFP RHVP Sbjct: 12 EMEMRAKQDLDYGQDITPPILPLSSSLYDSFLSSHCSSCFSPLPP-TFPHIPRHVP---- 66 Query: 854 TPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQNLPQKS 675 LYCS C + H + + Q+LP Sbjct: 67 --LYCSPTC-----------SSSHSPLHSSSAESLLPPTCPDSSDLRTALRLLQSLPSTP 113 Query: 674 YLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEENQFECDE 495 +L RI GL+TN L Sbjct: 114 ------------------------------PHLHRIDGLLTNHHML-------------- 129 Query: 494 NPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEECFSLEEMVLCLVLTNA 315 + +S V +IR+GA MA AR K +D N + + F LEE VL LV+TNA Sbjct: 130 --------TSSSPEVAAKIRQGAIAMAAAR-KSRNRD-NEGQSDGFLLEEAVLSLVITNA 179 Query: 314 VEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLVGP-------QNDQRQPLRIAPAA 156 VEVQ G S+GIAVYD +FSWINHSCSPN+CYRF + + D LRI P+ Sbjct: 180 VEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSSSTLRIVPSV 239 Query: 155 EGDRAN 138 G+ + Sbjct: 240 LGEECD 245 >ref|XP_010665142.1| PREDICTED: protein SET DOMAIN GROUP 41 isoform X2 [Vitis vinifera] Length = 635 Score = 144 bits (364), Expect = 9e-32 Identities = 109/304 (35%), Positives = 139/304 (45%), Gaps = 4/304 (1%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDALT 852 MEMR E+ +G DLT PLPPLA+ LHDS + SHCSACFS LPP + P+ + Sbjct: 1 MEMRMREDTEMGLDLTHPLPPLASSLHDSHLRSHCSACFSPLPPTVLVNTN---PSSSFL 57 Query: 851 PLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQNLPQKSY 672 YCS C SAE H +HI P Sbjct: 58 -CYCSPPCSASDSPLHFSSAEHHLFLLLRHSHPSTAHSSDLRAALRL-LHILHLPP---- 111 Query: 671 LGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEENQFECDEN 492 + + L RI GL+TN +L+ + Sbjct: 112 --------------------------LHTQPLHRICGLLTNLHHLI-------------S 132 Query: 491 PSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEECFSLEEMVLCLVLTNAV 312 PS+ + S L RIR+G K MA AR G + + + + LEE +LCLVLTNAV Sbjct: 133 PSH----NSESDETLTRIRDGGKAMAVARCMRDGTEFSGDSK----LEEALLCLVLTNAV 184 Query: 311 
EVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLV----GPQNDQRQPLRIAPAAEGDR 144 EVQ + GS++GIAVYD FSWINHSCSPN+CYRFL+ PQ L+I P + Sbjct: 185 EVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFLLRSPETPQFSGESRLQIIPGGNDEI 244 Query: 143 ANEN 132 +N Sbjct: 245 EIQN 248 >ref|XP_010665141.1| PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Vitis vinifera] Length = 668 Score = 144 bits (364), Expect = 9e-32 Identities = 109/304 (35%), Positives = 139/304 (45%), Gaps = 4/304 (1%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDALT 852 MEMR E+ +G DLT PLPPLA+ LHDS + SHCSACFS LPP + P+ + Sbjct: 1 MEMRMREDTEMGLDLTHPLPPLASSLHDSHLRSHCSACFSPLPPTVLVNTN---PSSSFL 57 Query: 851 PLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXXLVHIFQNLPQKSY 672 YCS C SAE H +HI P Sbjct: 58 -CYCSPPCSASDSPLHFSSAEHHLFLLLRHSHPSTAHSSDLRAALRL-LHILHLPP---- 111 Query: 671 LGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNRENLVVGEIEENQFECDEN 492 + + L RI GL+TN +L+ + Sbjct: 112 --------------------------LHTQPLHRICGLLTNLHHLI-------------S 132 Query: 491 PSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEECFSLEEMVLCLVLTNAV 312 PS+ + S L RIR+G K MA AR G + + + + LEE +LCLVLTNAV Sbjct: 133 PSH----NSESDETLTRIRDGGKAMAVARCMRDGTEFSGDSK----LEEALLCLVLTNAV 184 Query: 311 EVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRFLV----GPQNDQRQPLRIAPAAEGDR 144 EVQ + GS++GIAVYD FSWINHSCSPN+CYRFL+ PQ L+I P + Sbjct: 185 EVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFLLRSPETPQFSGESRLQIIPGGNDEI 244 Query: 143 ANEN 132 +N Sbjct: 245 EIQN 248 >ref|XP_010260979.1| PREDICTED: protein SET DOMAIN GROUP 41 isoform X2 [Nelumbo nucifera] Length = 661 Score = 125 bits (315), Expect = 4e-26 Identities = 102/291 (35%), Positives = 135/291 (46%), Gaps = 17/291 (5%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQT-------------F 891 MEMRA E+I +G D+T P+PPLA L DS + SHCSACF L P+ F Sbjct: 1 MEMRAKEDIDMGHDVTPPIPPLAFSLSDSFLRSHCSACFLPLNPRIPPFLPIDPSIVHPF 60 Query: 890 PKNSRHVPTDALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXX 711 P S + + + LYCS C S E H Sbjct: 61 
PATSTS-SSSSPSVLYCSPECSKADSDRHMSSGEHHLFLILQSEFTTWQGDTSDLRASLR 119 Query: 710 LVHIFQNL---PQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNREN 540 L+ F+ L P+++ D + N RI GL++NRE Sbjct: 120 LLLCFEKLGLLPRQN--------------------DLLPNISC------RIGGLISNREK 153 Query: 539 LVVGEIEENQFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEEC 360 L+ + E DE S RI EG ++M+ ARR G D VE+E+ Sbjct: 154 LIGAD------EFDETFS--------------RILEGGRLMSLARRWRDG-DFAVEEEKG 192 Query: 359 FSL-EEMVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRF 210 +L EE+VLC V+TN++EVQ + G +GIAVY +FSWINHSCSPN+CYRF Sbjct: 193 DTLLEEIVLCQVITNSIEVQVNEGRPLGIAVYGPSFSWINHSCSPNACYRF 243 >ref|XP_010260978.1| PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Nelumbo nucifera] Length = 694 Score = 125 bits (315), Expect = 4e-26 Identities = 102/291 (35%), Positives = 135/291 (46%), Gaps = 17/291 (5%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQT-------------F 891 MEMRA E+I +G D+T P+PPLA L DS + SHCSACF L P+ F Sbjct: 1 MEMRAKEDIDMGHDVTPPIPPLAFSLSDSFLRSHCSACFLPLNPRIPPFLPIDPSIVHPF 60 Query: 890 PKNSRHVPTDALTPLYCSLRCXXXXXXXXXXSAERHXXXXXXXXXXXXXXXXXXXXXXXX 711 P S + + + LYCS C S E H Sbjct: 61 PATSTS-SSSSPSVLYCSPECSKADSDRHMSSGEHHLFLILQSEFTTWQGDTSDLRASLR 119 Query: 710 LVHIFQNL---PQKSYLGGKKESFFQKNEETEGPVDYIDNTEMKNENLERIAGLMTNREN 540 L+ F+ L P+++ D + N RI GL++NRE Sbjct: 120 LLLCFEKLGLLPRQN--------------------DLLPNISC------RIGGLISNREK 153 Query: 539 LVVGEIEENQFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCVGKDVNVEKEEC 360 L+ + E DE S RI EG ++M+ ARR G D VE+E+ Sbjct: 154 LIGAD------EFDETFS--------------RILEGGRLMSLARRWRDG-DFAVEEEKG 192 Query: 359 FSL-EEMVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCYRF 210 +L EE+VLC V+TN++EVQ + G +GIAVY +FSWINHSCSPN+CYRF Sbjct: 193 DTLLEEIVLCQVITNSIEVQVNEGRPLGIAVYGPSFSWINHSCSPNACYRF 243 >ref|XP_012080733.1| PREDICTED: protein SET DOMAIN GROUP 41 isoform X2 [Jatropha curcas] Length = 610 Score = 109 bits (272), Expect = 4e-21 Identities = 78/182 (42%), Positives = 91/182 (50%), Gaps = 7/182 (3%) Frame = -1 Query: 
572 RIAGLMTNRENLVVGEIEENQFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCV 393 RI+GL+TNRE L+ + RIR+GAK +A RR Sbjct: 119 RISGLLTNREKLMT-----------------------DNEIFTRIRDGAKAIAATRRLRD 155 Query: 392 GKDVNVEKEEC--FSLEEMVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSC 219 GK E SLEE LCLVLTNAVEVQ + G ++GIAVYD TFSWINHSCSPN+C Sbjct: 156 GKVAVTAANENDEVSLEESALCLVLTNAVEVQDNEGRTLGIAVYDHTFSWINHSCSPNAC 215 Query: 218 YRFLVGPQNDQRQPLRIAPAAEGDR-----ANENGDGLRYLFSGISCE*CGGELTVCLSS 54 YRFL+ PL IAP R A NG+ + FS I ELT Sbjct: 216 YRFLI-------SPLSIAPFPSESRQAIVPAGSNGE--KSAFSNI-------ELTKGHGE 259 Query: 53 YG 48 YG Sbjct: 260 YG 261 Score = 65.5 bits (158), Expect = 7e-08 Identities = 36/70 (51%), Positives = 43/70 (61%) Frame = -1 Query: 1037 LKMEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDA 858 ++MEM A E+I IGED+T PL PL+ LHDS + SHCSACFS LP + H T Sbjct: 9 MEMEMEAGEDIGIGEDITLPLFPLSFSLHDSFLHSHCSACFSPLP-------NPHSSTSC 61 Query: 857 LTPLYCSLRC 828 LYCS C Sbjct: 62 PPFLYCSPIC 71 >ref|XP_012080731.1| PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Jatropha curcas] Length = 643 Score = 109 bits (272), Expect = 4e-21 Identities = 78/182 (42%), Positives = 91/182 (50%), Gaps = 7/182 (3%) Frame = -1 Query: 572 RIAGLMTNRENLVVGEIEENQFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKCV 393 RI+GL+TNRE L+ + RIR+GAK +A RR Sbjct: 119 RISGLLTNREKLMT-----------------------DNEIFTRIRDGAKAIAATRRLRD 155 Query: 392 GKDVNVEKEEC--FSLEEMVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSC 219 GK E SLEE LCLVLTNAVEVQ + G ++GIAVYD TFSWINHSCSPN+C Sbjct: 156 GKVAVTAANENDEVSLEESALCLVLTNAVEVQDNEGRTLGIAVYDHTFSWINHSCSPNAC 215 Query: 218 YRFLVGPQNDQRQPLRIAPAAEGDR-----ANENGDGLRYLFSGISCE*CGGELTVCLSS 54 YRFL+ PL IAP R A NG+ + FS I ELT Sbjct: 216 YRFLI-------SPLSIAPFPSESRQAIVPAGSNGE--KSAFSNI-------ELTKGHGE 259 Query: 53 YG 48 YG Sbjct: 260 YG 261 Score = 65.5 bits (158), Expect = 7e-08 Identities = 36/70 (51%), Positives = 43/70 (61%) Frame = -1 Query: 1037 LKMEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDA 858 ++MEM A E+I IGED+T PL PL+ LHDS + 
SHCSACFS LP + H T Sbjct: 9 MEMEMEAGEDIGIGEDITLPLFPLSFSLHDSFLHSHCSACFSPLP-------NPHSSTSC 61 Query: 857 LTPLYCSLRC 828 LYCS C Sbjct: 62 PPFLYCSPIC 71 >gb|KJB66309.1| hypothetical protein B456_010G134600 [Gossypium raimondii] Length = 595 Score = 107 bits (267), Expect = 2e-20 Identities = 58/122 (47%), Positives = 79/122 (64%), Gaps = 12/122 (9%) Frame = -1 Query: 443 RIREGAKMMAKARRKCVGKDVNVEKEECFSLEEMVLCLVLTNAVEVQGDYGSSIGIAVYD 264 +IR+GA MA AR+ + K +++++ + LEE VLCLV+TNAVEVQ + G S+GIAVYD Sbjct: 126 QIRQGAIAMAAARK--LRKGLSLDQSDDVLLEEAVLCLVVTNAVEVQDESGRSLGIAVYD 183 Query: 263 ATFSWINHSCSPNSCYRFLVGPQN------DQRQPLRIAPAAEGDR------ANENGDGL 120 +FSWINHSCSPN+CYRF+V P N D LRI P+ + + N +G Sbjct: 184 PSFSWINHSCSPNACYRFIVSPPNATSFGEDSASALRIVPSVSEENFGVCSCSEYNKEGY 243 Query: 119 RY 114 +Y Sbjct: 244 KY 245 Score = 67.0 bits (162), Expect = 2e-08 Identities = 35/68 (51%), Positives = 45/68 (66%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDALT 852 MEMRA ++I IG+D+T PL PL+ LHDS +SSHCS+CFS P +FP + H + Sbjct: 1 MEMRAKQDIEIGDDITPPLLPLSFSLHDSFLSSHCSSCFS---PLSFPPSPHHYGS---- 53 Query: 851 PLYCSLRC 828 LYCS C Sbjct: 54 -LYCSAPC 60 >gb|KJB66308.1| hypothetical protein B456_010G134600 [Gossypium raimondii] Length = 628 Score = 107 bits (267), Expect = 2e-20 Identities = 58/122 (47%), Positives = 79/122 (64%), Gaps = 12/122 (9%) Frame = -1 Query: 443 RIREGAKMMAKARRKCVGKDVNVEKEECFSLEEMVLCLVLTNAVEVQGDYGSSIGIAVYD 264 +IR+GA MA AR+ + K +++++ + LEE VLCLV+TNAVEVQ + G S+GIAVYD Sbjct: 126 QIRQGAIAMAAARK--LRKGLSLDQSDDVLLEEAVLCLVVTNAVEVQDESGRSLGIAVYD 183 Query: 263 ATFSWINHSCSPNSCYRFLVGPQN------DQRQPLRIAPAAEGDR------ANENGDGL 120 +FSWINHSCSPN+CYRF+V P N D LRI P+ + + N +G Sbjct: 184 PSFSWINHSCSPNACYRFIVSPPNATSFGEDSASALRIVPSVSEENFGVCSCSEYNKEGY 243 Query: 119 RY 114 +Y Sbjct: 244 KY 245 Score = 67.0 bits (162), Expect = 2e-08 Identities = 35/68 (51%), Positives = 45/68 (66%) Frame = -1 Query: 1031 MEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSALPPQTFPKNSRHVPTDALT 852 MEMRA ++I 
IG+D+T PL PL+ LHDS +SSHCS+CFS P +FP + H + Sbjct: 1 MEMRAKQDIEIGDDITPPLLPLSFSLHDSFLSSHCSSCFS---PLSFPPSPHHYGS---- 53 Query: 851 PLYCSLRC 828 LYCS C Sbjct: 54 -LYCSAPC 60 >ref|XP_010062107.1| PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Eucalyptus grandis] Length = 673 Score = 107 bits (266), Expect = 2e-20 Identities = 71/158 (44%), Positives = 84/158 (53%), Gaps = 6/158 (3%) Frame = -1 Query: 572 RIAGLMTNRENLVVGEIEENQFECDENPSNCEDCSENSGNVLQRIREGAKMMAKARRKC- 396 RI GL+TNRE L GE S G V++RIR GA+++A ARRK Sbjct: 128 RIGGLLTNREKLTSGE------------------SSGDGEVVERIRSGARVLATARRKLG 169 Query: 395 VGKDVNVEKEECFSLEEMVLCLVLTNAVEVQGDYGSSIGIAVYDATFSWINHSCSPNSCY 216 +G + E LEE LC +TNAVEVQ G +GIAVY FSWINHSCS N+CY Sbjct: 170 LGGGGGGDVEGDVVLEEAALCATITNAVEVQDVDGRGLGIAVYGTEFSWINHSCSANACY 229 Query: 215 RF-LVGPQNDQRQP----LRIAPAAEGDRANENGDGLR 117 RF GP+ P LRI P GDRA + G R Sbjct: 230 RFQFSGPEISAPAPGESRLRIVP--YGDRAQMDSVGCR 265 Score = 72.4 bits (176), Expect = 6e-10 Identities = 33/72 (45%), Positives = 51/72 (70%), Gaps = 2/72 (2%) Frame = -1 Query: 1037 LKMEMRAIEEIAIGEDLTSPLPPLAAVLHDSAVSSHCSACFSAL--PPQTFPKNSRHVPT 864 ++M MRA E++A+GED+T P+ PL+ L+D+ + SHCS+CFS+L PP P++ + Sbjct: 1 MEMAMRATEDVAMGEDVTPPIAPLSLALYDAFLPSHCSSCFSSLSPPPPPPPRSPPAGTS 60 Query: 863 DALTPLYCSLRC 828 + +PLYCS RC Sbjct: 61 RSSSPLYCSARC 72