BLASTX nr result
ID: Achyranthes23_contig00037032
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00037032
         (854 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   132   2e-28
ref|XP_006434476.1| hypothetical protein CICLE_v10000601mg [Citr...   132   2e-28
ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like ...   126   9e-27
gb|EOY16760.1| SET domain-containing protein, putative isoform 3...   125   1e-26
gb|EOY16758.1| SET domain protein, putative isoform 1 [Theobroma...   125   1e-26
ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   124   4e-26
gb|EMJ00499.1| hypothetical protein PRUPE_ppa023162mg, partial [...   122   2e-25
ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Popu...   117   4e-24
gb|ESW24006.1| hypothetical protein PHAVU_004G094200g, partial [...   115   2e-23
ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutr...   114   3e-23
ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutr...   114   4e-23
ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   112   1e-22
emb|CBI18219.3| unnamed protein product [Vitis vinifera]              112   2e-22
gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]          111   3e-22
ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   111   3e-22
ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...   111   3e-22
ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arab...   107   4e-21
ref|XP_006303699.1| hypothetical protein CARUB_v10011796mg [Caps...   107   5e-21
ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...
105 2e-20 ref|NP_683372.2| SET domain-containing protein [Arabidopsis thal... 105 3e-20 >ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Citrus sinensis] Length = 619 Score = 132 bits (332), Expect = 2e-28 Identities = 84/179 (46%), Positives = 105/179 (58%), Gaps = 10/179 (5%) Frame = -3 Query: 510 FGLLTNRDALLS--DLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVI 337 FGLLTNRD L+S D S+I+ GAR +A AR EEA LCLV+ Sbjct: 81 FGLLTNRDKLMSSSDSDVASKIREGAREMARARG---------NLSDDVAWEEAALCLVM 131 Query: 336 TNAVEININ--GERLGIGVYDWRFSWINHSCSPNSCFRF------IPSFVVSEGPDCSLS 181 TNAVE+ + G LGI VYD FSWINHSCSPN+C+RF PSF ++ Sbjct: 132 TNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSF--RNEKKMRIA 189 Query: 180 LLRIFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 +F S++A+ C +CEL +G +GPR+IVRSIK I + EEVT+ YTDLLQP Sbjct: 190 PHVVFDSTEAETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQP 248 >ref|XP_006434476.1| hypothetical protein CICLE_v10000601mg [Citrus clementina] gi|557536598|gb|ESR47716.1| hypothetical protein CICLE_v10000601mg [Citrus clementina] Length = 619 Score = 132 bits (332), Expect = 2e-28 Identities = 84/179 (46%), Positives = 106/179 (59%), Gaps = 10/179 (5%) Frame = -3 Query: 510 FGLLTNRDALLS--DLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVI 337 FGLLTNRD L+S D S+I+ GAR +A AR EEA LCLV+ Sbjct: 81 FGLLTNRDKLMSSSDSDVASKIREGAREMARARG---------NLSDDVAWEEAALCLVM 131 Query: 336 TNAVEININ--GERLGIGVYDWRFSWINHSCSPNSCFRF------IPSFVVSEGPDCSLS 181 TNAVE+ + G LGI VYD FSWINHSCSPN+C+RF PSF + + Sbjct: 132 TNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSFRDEKKKRIAPH 191 Query: 180 LLRIFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 + +F S++A+ C +CEL +G +GPR+IVRSIK I + EEVT+ YTDLLQP Sbjct: 192 V--VFDSTEAETQGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQP 248 >ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like [Vitis vinifera] Length = 660 Score = 126 bits (317), Expect = 9e-27 Identities = 82/176 (46%), Positives = 103/176 (58%), Gaps = 8/176 (4%) Frame = -3 
Query: 507 GLLTNRDALLSDLLQ------FSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLC 346 GLLTN L+S +RI+ G + +A+AR MRD TE G +LEEA+LC Sbjct: 122 GLLTNLHHLISPSHNSESDETLTRIRDGGKAMAVARCMRD----GTEFSGDSKLEEALLC 177 Query: 345 LVITNAVEINING-ERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCS-LSLLR 172 LV+TNAVE+ +NG LGI VYDW FSWINHSCSPN+C+RF+ E P S S L+ Sbjct: 178 LVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFL--LRSPETPQFSGESRLQ 235 Query: 171 IFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 I G E+ + GPR+IVRSIK I++ EEV + Y DLLQP Sbjct: 236 IIPG------------GNDEIEVKKNRSGPRIIVRSIKAIKKGEEVWVAYIDLLQP 279 >gb|EOY16760.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724865|gb|EOY16762.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724866|gb|EOY16763.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724867|gb|EOY16764.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] Length = 625 Score = 125 bits (315), Expect = 1e-26 Identities = 85/175 (48%), Positives = 107/175 (61%), Gaps = 7/175 (4%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFS-RIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITN 331 GLLTN L S + + +I+ GA IAMA A + + + + GF LEEAVL LVITN Sbjct: 121 GLLTNHHMLTSSSPEVAAKIRQGA--IAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITN 178 Query: 330 AVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRF---IPSFVVSEGPDCSLSLLRIFA 163 AVE+ + +G LGI VYD FSWINHSCSPN+C+RF P +S D S S LRI Sbjct: 179 AVEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSS-STLRIVP 237 Query: 162 SSDADKGDGRGDCGTCELADGVDNY--GPRVIVRSIKDIQRCEEVTITYTDLLQP 4 S ++ D C E G Y GP++IVRSIK I++ EEV ++YTDLLQP Sbjct: 238 SVLGEECDA---CSCVEHTKGNKGYELGPKIIVRSIKRIRKGEEVCVSYTDLLQP 289 >gb|EOY16758.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|508724862|gb|EOY16759.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|508724864|gb|EOY16761.1| SET domain protein, putative isoform 1 [Theobroma cacao] Length = 658 Score = 125 bits (315), Expect = 1e-26 Identities = 85/175 (48%), 
Positives = 107/175 (61%), Gaps = 7/175 (4%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFS-RIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITN 331 GLLTN L S + + +I+ GA IAMA A + + + + GF LEEAVL LVITN Sbjct: 121 GLLTNHHMLTSSSPEVAAKIRQGA--IAMAAARKSRNRDNEGQSDGFLLEEAVLSLVITN 178 Query: 330 AVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRF---IPSFVVSEGPDCSLSLLRIFA 163 AVE+ + +G LGI VYD FSWINHSCSPN+C+RF P +S D S S LRI Sbjct: 179 AVEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSS-STLRIVP 237 Query: 162 SSDADKGDGRGDCGTCELADGVDNY--GPRVIVRSIKDIQRCEEVTITYTDLLQP 4 S ++ D C E G Y GP++IVRSIK I++ EEV ++YTDLLQP Sbjct: 238 SVLGEECDA---CSCVEHTKGNKGYELGPKIIVRSIKRIRKGEEVCVSYTDLLQP 289 >ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Fragaria vesca subsp. vesca] Length = 645 Score = 124 bits (311), Expect = 4e-26 Identities = 83/171 (48%), Positives = 99/171 (57%), Gaps = 3/171 (1%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFEL-EEAVLCLVITN 331 GLLTNR L DL RI+ GAR + +AR M D + + EEA LCLV+TN Sbjct: 115 GLLTNRRKLDDDL----RIRDGARAMFLARTMPDDNDAVLDVAHDDAVSEEAALCLVLTN 170 Query: 330 AVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVS-EGPDCSLSLLRIFASS 157 AVE+ + G LGI VYD FSWINHSCSPN+C+RF+ S P C + LRI + Sbjct: 171 AVEVQDHTGRTLGIAVYDSCFSWINHSCSPNACYRFLLSSPSQPTPPQCDETPLRIVPA- 229 Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 G + + +GPRVIVRSIK I R EEVTITYTDLLQP Sbjct: 230 -----------GQLIVNAECEKFGPRVIVRSIKRINRGEEVTITYTDLLQP 269 >gb|EMJ00499.1| hypothetical protein PRUPE_ppa023162mg, partial [Prunus persica] Length = 635 Score = 122 bits (305), Expect = 2e-25 Identities = 81/171 (47%), Positives = 98/171 (57%), Gaps = 3/171 (1%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNA 328 GLLTN L RI+ GAR + +AR MRD+ ++ LEEA LCLV+TNA Sbjct: 130 GLLTNHHKFLHHD-DHHRIRDGARAMFLARKMRDEAPNVYDA----VLEEAALCLVLTNA 184 Query: 327 VEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSL--LRIFASS 157 VE+ + G LGI VY F WINHSCSPN+C+RF+ S P CS LRI Sbjct: 185 
VEVQDKTGRTLGISVYGPSFCWINHSCSPNACYRFLVS--PPPPPPCSAERTPLRIAPLG 242 Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 + G C + YGPRVIVRSIK I++ EEVT+TYTDLLQP Sbjct: 243 QGTQSCGIDICCRLRVVFVAIIYGPRVIVRSIKRIKKGEEVTVTYTDLLQP 293 >ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Populus trichocarpa] gi|550339461|gb|EEE93699.2| hypothetical protein POPTR_0005s21560g [Populus trichocarpa] Length = 626 Score = 117 bits (294), Expect = 4e-24 Identities = 79/171 (46%), Positives = 97/171 (56%), Gaps = 3/171 (1%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARA--MRDQRKFDTESGGGFELEEAVLCLVIT 334 GLLTNR+ L++D + ++ GA+ IA AR M + K D L EA LCLV+T Sbjct: 106 GLLTNREKLMADEEISAHVRYGAKAIAAARRIEMVENEKNDAV------LLEAALCLVLT 159 Query: 333 NAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157 NAVE++ N G +GI VY FSWINHSCSPN+C+R I S + P S LRI + Sbjct: 160 NAVEVHDNEGRSIGIAVYGPNFSWINHSCSPNACYRSIISPPDNVLPFSDESRLRILPAG 219 Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 K GPRVIVRSIK I+R EEVT+ YTDLLQP Sbjct: 220 TEVKS---------------HESGPRVIVRSIKRIKRGEEVTVAYTDLLQP 255 >gb|ESW24006.1| hypothetical protein PHAVU_004G094200g, partial [Phaseolus vulgaris] Length = 530 Score = 115 bits (288), Expect = 2e-23 Identities = 76/170 (44%), Positives = 98/170 (57%), Gaps = 3/170 (1%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAV--LCLVIT 334 GLL+NR L S ++ MA A+ +QR + LEEA LC V+T Sbjct: 107 GLLSNRRILTSHHHDHVSERIRLDATVMAEAIAEQRAVPHDDA---VLEEATIALCAVLT 163 Query: 333 NAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157 NAVE++ N G LGI V+D FSWINHSCSPN+C+RFI S S P+ LLRI + Sbjct: 164 NAVEVHDNEGRALGIAVFDPTFSWINHSCSPNACYRFILSSFPSNEPE----LLRI--AP 217 Query: 156 DADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQ 7 G G + E A + YGPR++VRSIK I++ EEVT+ YTD+LQ Sbjct: 218 HPQMGSGGVCVSSDEFAKEMLGYGPRLVVRSIKKIKKGEEVTVAYTDILQ 267 >ref|XP_006395990.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum] gi|557092629|gb|ESQ33276.1| 
hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum] Length = 572 Score = 114 bits (286), Expect = 3e-23 Identities = 73/174 (41%), Positives = 99/174 (56%), Gaps = 3/174 (1%) Frame = -3 Query: 513 FFGLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVIT 334 F GLLTN L++D I+ A IA+ +R RK ELEEA +C V+T Sbjct: 106 FGGLLTNHHRLMADSSFSVAIQCAANFIAVV--LRSDRK-------NTELEEAAICSVLT 156 Query: 333 NAVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157 NAVE+ + +G LGI VYD RFSWINHSCSPN+C+RF+ S + P + ++ Sbjct: 157 NAVELQDSSGRALGIAVYDTRFSWINHSCSPNACYRFVISPHSTTTPSFQDYPKMLPHTT 216 Query: 156 DADKGDGRGDCGTCELADGV--DNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1 + +K G C + YGP+V+ RSIK I+ EE+TI+Y DL+QPT Sbjct: 217 NTEK----EQIGVCSRITSLWEVRYGPKVVARSIKRIKSGEEITISYIDLMQPT 266 >ref|XP_006395991.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum] gi|557092630|gb|ESQ33277.1| hypothetical protein EUTSA_v10003905mg [Eutrema salsugineum] Length = 575 Score = 114 bits (285), Expect = 4e-23 Identities = 75/174 (43%), Positives = 102/174 (58%), Gaps = 3/174 (1%) Frame = -3 Query: 513 FFGLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVIT 334 F GLLTN L++D I+ A IA+ +R RK ELEEA +C V+T Sbjct: 106 FGGLLTNHHRLMADSSFSVAIQCAANFIAVV--LRSDRK-------NTELEEAAICSVLT 156 Query: 333 NAVEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASS 157 NAVE+ + +G LGI VYD RFSWINHSCSPN+C+RF+ S + P + ++ Sbjct: 157 NAVELQDSSGRALGIAVYDTRFSWINHSCSPNACYRFVISPHSTTTPSFQDYPKMLPHTT 216 Query: 156 DADKGDGRGDCG-TCELADG-VDNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1 + +K + G C L +G YGP+V+ RSIK I+ EE+TI+Y DL+QPT Sbjct: 217 NTEK-EQIGVCSRITSLWEGKTVRYGPKVVARSIKRIKSGEEITISYIDLMQPT 269 >ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Cucumis sativus] Length = 659 Score = 112 bits (281), Expect = 1e-22 Identities = 76/173 (43%), Positives = 97/173 (56%), Gaps = 4/173 (2%) Frame = -3 Query: 510 FGLLTNRDALLS---DLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLV 340 +GLLTNR L++ D F +++ GA IA R RK + G LEEAVLCLV Sbjct: 
143 YGLLTNRHKLMTPQNDSEVFLKLREGANAIAALR-----RKNYADIPPGTALEEAVLCLV 197 Query: 339 ITNAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFA 163 +TNAV++ + G+ +GI VY FSWINHSCSPN+C+RF E P S++ A Sbjct: 198 LTNAVDVQDSIGQTIGIAVYASTFSWINHSCSPNACYRF-------ETPSDSVTTRFRIA 250 Query: 162 SSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 S D G+ GPRV+VRSIK I++ E VTI Y DLLQP Sbjct: 251 PSCTDFMSDEGN---------FQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQP 294 >emb|CBI18219.3| unnamed protein product [Vitis vinifera] Length = 533 Score = 112 bits (279), Expect = 2e-22 Identities = 67/133 (50%), Positives = 85/133 (63%), Gaps = 4/133 (3%) Frame = -3 Query: 390 TESGGGFELEEAVLCLVITNAVEINING-ERLGIGVYDWRFSWINHSCSPNSCFRFIPSF 214 TE G +LEEA+LCLV+TNAVE+ +NG LGI VYDW FSWINHSCSPN+C+RF+ Sbjct: 5 TEFSGDSKLEEALLCLVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFL--L 62 Query: 213 VVSEGPDCS-LSLLRIF--ASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRC 43 E P S S L+I + + + R E G + +GPR+IVRSIK I++ Sbjct: 63 RSPETPQFSGESRLQIIPGGNDEIEVKKNRSLFLNSEF-KGCNIHGPRIIVRSIKAIKKG 121 Query: 42 EEVTITYTDLLQP 4 EEV + Y DLLQP Sbjct: 122 EEVWVAYIDLLQP 134 >gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis] Length = 661 Score = 111 bits (278), Expect = 3e-22 Identities = 78/180 (43%), Positives = 101/180 (56%), Gaps = 12/180 (6%) Frame = -3 Query: 507 GLLTNRDALLSDLLQ--FSRIKVGARLIAMARAMRDQRKFDTESGGGFE-LEEAVLCLVI 337 GL TN L +D + +RI+ GAR +A AR MRD+ ES G E + A LC V+ Sbjct: 131 GLSTNLHKLANDDEEEVAARIRDGARAMAAARRMRDRDCSGEESEGEEEAMAAAALCAVL 190 Query: 336 TNAVEINI-NGERLGIGVYDWR-FSWINHSCSPNSCFRF-------IPSFVVSEGPDCSL 184 TN VE+ + +G LG+ VY FSWINHSCSPN+C+R SF+ PD Sbjct: 191 TNGVEVQVKSGRTLGVAVYGGGGFSWINHSCSPNACYRISLHSDLQTTSFL----PDHET 246 Query: 183 SLLRIFASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 + +RI + + CG +YGPR+IVRSIK IQ+ EEVT+ YTDLLQP Sbjct: 247 AAMRIVPCCNKET-----QCGC--------SYGPRIIVRSIKRIQKGEEVTVAYTDLLQP 293 >ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X1 [Glycine max] Length = 642 Score = 111 
bits (278), Expect = 3e-22 Identities = 76/174 (43%), Positives = 96/174 (55%), Gaps = 6/174 (3%) Frame = -3 Query: 507 GLLTNRDALLSDLLQ---FSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCL-- 343 GLL+NR L S + RI VGA AMA A+ QR + LEEA + L Sbjct: 106 GLLSNRHILTSLSVHDDVSERISVGAG--AMAEAIAKQRGIPNDDA---VLEEATIALSA 160 Query: 342 VITNAVEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIF 166 V+TNAVE++ N G LGI V+D FSWINHSCSPN+C+RF+ S G ++ Sbjct: 161 VLTNAVEVHDNEGRALGIAVFDQIFSWINHSCSPNACYRFVLSSSSHSGE------AKLG 214 Query: 165 ASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 + + E A G YGPR++VRSIK I + EEVT+ YTDLLQP Sbjct: 215 IAPHLQMNSSGVSISSSEFAKGGLGYGPRLVVRSIKKINKGEEVTVAYTDLLQP 268 >ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum lycopersicum] Length = 677 Score = 111 bits (278), Expect = 3e-22 Identities = 68/159 (42%), Positives = 93/159 (58%), Gaps = 8/159 (5%) Frame = -3 Query: 456 RIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNAVEI-NINGERLGIGVYD 280 RI+ GA+ +A +R MR E+ G + +E AVLCLV+TNAVE+ + +G LG+GVYD Sbjct: 160 RIRDGAKALAASRRMR----VGLETNGEYTVEAAVLCLVLTNAVEVYDKDGRSLGVGVYD 215 Query: 279 WRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASSDADKGDGRGDCG------- 121 FSW+NHSCSPN+ +RF S+ S + A+ G G Sbjct: 216 VPFSWVNHSCSPNASYRFC---TASDSGGILESRICPAATETGAAGIGHESISSNTELQK 272 Query: 120 TCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 + + G + GP++I+RSIK IQR EEV I+YTDLLQP Sbjct: 273 SMSVIGGSEACGPKIILRSIKGIQRSEEVLISYTDLLQP 311 >ref|XP_002893944.1| hypothetical protein ARALYDRAFT_314093 [Arabidopsis lyrata subsp. lyrata] gi|297339786|gb|EFH70203.1| hypothetical protein ARALYDRAFT_314093 [Arabidopsis lyrata subsp. 
lyrata] Length = 567 Score = 107 bits (268), Expect = 4e-21 Identities = 73/172 (42%), Positives = 95/172 (55%), Gaps = 3/172 (1%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNA 328 GLLTN L++D I A IA +R RK ELEEA +C V+TNA Sbjct: 103 GLLTNHHLLMADSSFSLAIHHAASFIATV--LRSNRK-------NTELEEAAICSVLTNA 153 Query: 327 VEI-NINGERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLRIFASSDA 151 VE+ + NG LGI +YD RFSWINHSCSPNSC+RF+ + S D + + ++ Sbjct: 154 VEVQDSNGLVLGIALYDSRFSWINHSCSPNSCYRFVNN-TTSYHDDLAYPITIPHVNNTE 212 Query: 150 DKGDGRGDCGTCELADGVD--NYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1 EL + V YGP+VI R+IK I+ EE+T++Y DLLQPT Sbjct: 213 -------TLSNLELQEQVRTMGYGPKVIARNIKRIKSGEEITVSYIDLLQPT 257 >ref|XP_006303699.1| hypothetical protein CARUB_v10011796mg [Capsella rubella] gi|482572410|gb|EOA36597.1| hypothetical protein CARUB_v10011796mg [Capsella rubella] Length = 572 Score = 107 bits (267), Expect = 5e-21 Identities = 70/173 (40%), Positives = 97/173 (56%), Gaps = 4/173 (2%) Frame = -3 Query: 507 GLLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNA 328 GLLTN +++D I+ A I+ +R R+ ELEEAV+C V+TNA Sbjct: 103 GLLTNHHRIMADSSLSVAIQTAASFISTV--LRSNRE-------NTELEEAVICSVLTNA 153 Query: 327 VEININ-GERLGIGVYDWRFSWINHSCSPNSCFRFIPSFVVSEGPDCSLSLLR---IFAS 160 VE+ + G LGI +YD RFSWINHSCSPNSC+RF+ S D +L+ I + Sbjct: 154 VEVQDSAGLALGIALYDSRFSWINHSCSPNSCYRFVTK-TTSFHDDLALAKTIPHIIITN 212 Query: 159 SDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1 ++ + + YGP+VIVRSIK I+ EE+T++Y +LLQPT Sbjct: 213 TETSSNLESKALSSLQEQGRRVGYGPKVIVRSIKRIKSGEEITVSYMNLLQPT 265 >ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum tuberosum] Length = 681 Score = 105 bits (263), Expect = 2e-20 Identities = 68/159 (42%), Positives = 95/159 (59%), Gaps = 8/159 (5%) Frame = -3 Query: 456 RIKVGARLIAMARAMRDQRKFDTESGGGFE---LEEAVLCLVITNAVEINI-NGERLGIG 289 RI+ GA+ +A +R MR DT +E +E AVLCLV+TNAVE++ +G LG+G Sbjct: 164 RIRHGAKALAASRRMR--LGLDTNRELLYEEYTVEAAVLCLVLTNAVEVHDKDGRSLGVG 221 Query: 288 
VYDWRFSWINHSCSPNSCFRFI---PSFVVSEGPDCSLSLLRIFASSDADKGDGRGDC-G 121 VYD FSW+NHSCSPN+ +RF S +SE C + A +++ + Sbjct: 222 VYDVPFSWVNHSCSPNASYRFCTASDSGGISECRICPAATETGAAGIESESISSNPELQK 281 Query: 120 TCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQP 4 + + G + GP++I+RSIK I + EEV ITYTDLLQP Sbjct: 282 SMSVIGGSETCGPKIILRSIKGINKSEEVLITYTDLLQP 320 >ref|NP_683372.2| SET domain-containing protein [Arabidopsis thaliana] gi|97190651|sp|Q3ECY6.1|SDG41_ARATH RecName: Full=Protein SET DOMAIN GROUP 41 gi|332193843|gb|AEE31964.1| SET domain-containing protein [Arabidopsis thaliana] Length = 558 Score = 105 bits (261), Expect = 3e-20 Identities = 70/176 (39%), Positives = 93/176 (52%), Gaps = 8/176 (4%) Frame = -3 Query: 504 LLTNRDALLSDLLQFSRIKVGARLIAMARAMRDQRKFDTESGGGFELEEAVLCLVITNAV 325 LLTN L++D I A IA +R RK ELEEA +C V+TNAV Sbjct: 104 LLTNHHLLMADPSISVAIHHAANFIATV--IRSNRK-------NTELEEAAICAVLTNAV 154 Query: 324 EIN-INGERLGIGVYDWRFSWINHSCSPNSCFRFIPS-------FVVSEGPDCSLSLLRI 169 E++ NG LGI +Y+ FSWINHSCSPNSC+RF+ + V + +L L Sbjct: 155 EVHDSNGLALGIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQ 214 Query: 168 FASSDADKGDGRGDCGTCELADGVDNYGPRVIVRSIKDIQRCEEVTITYTDLLQPT 1 + + G+G GP++IVRSIK I+ EE+T++Y DLLQPT Sbjct: 215 VCGTSLNSGNGN---------------GPKLIVRSIKRIKSGEEITVSYIDLLQPT 255