BLASTX nr result
ID: Jatropha_contig00039583
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00039583
         (611 letters)

Database: NCBI-nr (updated 2014/02/11)
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   205   7e-51
gb|EEE85750.2| hypothetical protein POPTR_0004s07950g [Populus t...   163   3e-38
ref|XP_002305239.1| SET domain protein [Populus trichocarpa]          163   3e-38
ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   155   6e-36
gb|ACU19071.1| unknown [Glycine max]                                  155   6e-36
ref|XP_002871756.1| SET domain-containing protein [Arabidopsis l...   155   8e-36
gb|EOY03097.1| SET domain group 40, putative isoform 1 [Theobrom...   153   3e-35
ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Caps...   153   3e-35
gb|ESR43640.1| hypothetical protein CICLE_v10011537mg [Citrus cl...   151   1e-34
ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thalia...   151   1e-34
ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   150   2e-34
ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like ...   148   1e-33
emb|CBI27360.3| unnamed protein product [Vitis vinifera]              148   1e-33
gb|ESQ41709.1| hypothetical protein EUTSA_v10015946mg [Eutrema s...   147   2e-33
gb|ESW13964.1| hypothetical protein PHAVU_008G241400g [Phaseolus...   145   8e-33
gb|EMJ16490.1| hypothetical protein PRUPE_ppa004975mg [Prunus pe...   143   3e-32
ref|XP_003518360.1| PREDICTED: LOW QUALITY PROTEIN: protein SET ...   140   3e-31
gb|ERN17050.1| hypothetical protein AMTR_s00044p00046290 [Ambore...   139   4e-31
ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...
138 1e-30 ref|XP_004232670.1| PREDICTED: protein SET DOMAIN GROUP 40-like ... 134 2e-29 >ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis] gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis] Length = 510 Score = 205 bits (522), Expect = 7e-51 Identities = 116/195 (59%), Positives = 125/195 (64%), Gaps = 2/195 (1%) Frame = +3 Query: 33 MAEEAEHERLEGFLEWAA-ELGISDSPYNFQS-RNPNSCFGNSLTLSHFPXXXXXXXXXX 206 M E+AEHERLEGFL+WAA ELGISDS + QS PNSC G SLT+SHFP Sbjct: 1 MMEQAEHERLEGFLKWAAAELGISDSSNSSQSLEEPNSCLGISLTVSHFPDAGGRGLGAA 60 Query: 207 XXXWKGELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSF 386 KGELVLRVPK ALLT+DS LKDGLL S +N H +LSPTQ LTVCLLYEM KG+SSF Sbjct: 61 RDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKGQSSF 120 Query: 387 WYPYLMHLPRSYETLATFSEFEKQAFQVQXXXXXXXXXXXXXXXXXXXXHKNDV*KVWSY 566 WYPYLMHLPRSYE LATFSEFEKQA Q Sbjct: 121 WYPYLMHLPRSYEILATFSEFEKQALQ--------------------------------- 147 Query: 567 W*VDDAVWTTEKAIS 611 VDDA+WT EKAIS Sbjct: 148 --VDDAIWTAEKAIS 160 >gb|EEE85750.2| hypothetical protein POPTR_0004s07950g [Populus trichocarpa] Length = 518 Score = 163 bits (413), Expect = 3e-38 Identities = 87/157 (55%), Positives = 105/157 (66%), Gaps = 4/157 (2%) Frame = +3 Query: 3 RLDPKSERRGMAEEAEHERLEGFLEWAAELGISDSPYNFQ--SRNPNSCFGNSLTLSHFP 176 R + + ++ M + + E E FL+WAA LGISD N ++P SC G+SLT+SHFP Sbjct: 18 RRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLGHSLTVSHFP 77 Query: 177 XXXXXXXXXXXXXWKGELVLRVPKPALLTRDSLLKDGLLSSFVNG--HPSLSPTQILTVC 350 KGELVLRVPK L+TRDSLLKD L SFVN + SLSPTQIL VC Sbjct: 78 DAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSLSPTQILAVC 137 Query: 351 LLYEMGKGKSSFWYPYLMHLPRSYETLATFSEFEKQA 461 LLYEMGKGKSS+WYPYLMHLPRSY+ LA+F + +A Sbjct: 138 LLYEMGKGKSSWWYPYLMHLPRSYDVLASFKKAVSKA 174 >ref|XP_002305239.1| SET domain protein [Populus trichocarpa] Length = 518 Score = 163 bits (413), Expect = 3e-38 Identities = 87/157 (55%), Positives = 105/157 (66%), Gaps = 4/157 (2%) Frame = +3 Query: 3 
RLDPKSERRGMAEEAEHERLEGFLEWAAELGISDSPYNFQ--SRNPNSCFGNSLTLSHFP 176 R + + ++ M + + E E FL+WAA LGISD N ++P SC G+SLT+SHFP Sbjct: 18 RRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLGHSLTVSHFP 77 Query: 177 XXXXXXXXXXXXXWKGELVLRVPKPALLTRDSLLKDGLLSSFVNG--HPSLSPTQILTVC 350 KGELVLRVPK L+TRDSLLKD L SFVN + SLSPTQIL VC Sbjct: 78 DAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSLSPTQILAVC 137 Query: 351 LLYEMGKGKSSFWYPYLMHLPRSYETLATFSEFEKQA 461 LLYEMGKGKSS+WYPYLMHLPRSY+ LA+F + +A Sbjct: 138 LLYEMGKGKSSWWYPYLMHLPRSYDVLASFKKAVSKA 174 >ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max] Length = 475 Score = 155 bits (393), Expect = 6e-36 Identities = 80/145 (55%), Positives = 98/145 (67%), Gaps = 2/145 (1%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPY--NFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXX 215 E EH LE FL WAA+LGISDS N + +SC G+SL++SHFP Sbjct: 2 EQEHPNLESFLSWAAQLGISDSTTRTNQPQHSLSSCLGSSLSVSHFPHSGGRGLGAVRDL 61 Query: 216 WKGELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYP 395 +GE+VLRVPK AL+TR+++++D L VN H SLS QIL VCLLYEMGKGK+S W+P Sbjct: 62 RRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHP 121 Query: 396 YLMHLPRSYETLATFSEFEKQAFQV 470 YLMHLP +Y+ LA F EFEK A QV Sbjct: 122 YLMHLPHTYDVLAMFGEFEKHALQV 146 >gb|ACU19071.1| unknown [Glycine max] Length = 497 Score = 155 bits (393), Expect = 6e-36 Identities = 80/145 (55%), Positives = 98/145 (67%), Gaps = 2/145 (1%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPY--NFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXX 215 E EH LE FL WAA+LGISDS N + +SC G+SL++SHFP Sbjct: 2 EQEHPNLESFLSWAAQLGISDSTTRTNQPQHSLSSCLGSSLSVSHFPHSGGRGLGAVRDL 61 Query: 216 WKGELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYP 395 +GE+VLRVPK AL+TR+++++D L VN H SLS QIL VCLLYEMGKGK+S W+P Sbjct: 62 RRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHP 121 Query: 396 YLMHLPRSYETLATFSEFEKQAFQV 470 YLMHLP +Y+ LA F EFEK A QV Sbjct: 122 YLMHLPHTYDVLAMFGEFEKHALQV 146 >ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. 
lyrata] gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 493 Score = 155 bits (392), Expect = 8e-36 Identities = 91/191 (47%), Positives = 113/191 (59%), Gaps = 1/191 (0%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWK 221 + EH+ +E FL WAAE+GISDS + SR +SC G+SL+++ FP K Sbjct: 5 DLEHQTMETFLRWAAEIGISDSIDS--SRYRDSCLGHSLSVADFPHAGGRGLGAVRELKK 62 Query: 222 GELVLRVPKPALLTRDSLL-KDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPY 398 GELVL+VP+ AL+T +S++ KD L+ V H SLS TQIL+VCLLYEMGKGK SFWYPY Sbjct: 63 GELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGKGKRSFWYPY 122 Query: 399 LMHLPRSYETLATFSEFEKQAFQVQXXXXXXXXXXXXXXXXXXXXHKNDV*KVWSYW*VD 578 L+HLPR Y+ LATF EFEKQA Q V+ Sbjct: 123 LVHLPRDYDLLATFGEFEKQALQ-----------------------------------VE 147 Query: 579 DAVWTTEKAIS 611 DAVW TEKAI+ Sbjct: 148 DAVWATEKAIA 158 >gb|EOY03097.1| SET domain group 40, putative isoform 1 [Theobroma cacao] Length = 498 Score = 153 bits (387), Expect = 3e-35 Identities = 80/146 (54%), Positives = 98/146 (67%), Gaps = 2/146 (1%) Frame = +3 Query: 39 EEAEHERLEGFLEWAAELGISDSPYNFQSRNPNSC--FGNSLTLSHFPXXXXXXXXXXXX 212 EE E L+ FL+WAA LG+SDSP NP+SC G+SL +S+FP Sbjct: 22 EEEERGSLDSFLKWAAGLGVSDSP------NPDSCSCLGHSLGVSYFPDAGGRGLGAVRD 75 Query: 213 XWKGELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWY 392 +GEL+L+VPK AL+T SLL D LS+ + HPSLSP Q+LT+C LYEM KGK+S W+ Sbjct: 76 ITRGELLLKVPKSALITTHSLLNDERLSTALKAHPSLSPAQVLTICFLYEMSKGKASPWH 135 Query: 393 PYLMHLPRSYETLATFSEFEKQAFQV 470 PYL+HLPRSY LA F EFEKQA QV Sbjct: 136 PYLLHLPRSYGILAAFGEFEKQALQV 161 >ref|XP_006289442.1| hypothetical protein CARUB_v10002957mg [Capsella rubella] gi|482558148|gb|EOA22340.1| hypothetical protein CARUB_v10002957mg [Capsella rubella] Length = 503 Score = 153 bits (387), Expect = 3e-35 Identities = 81/145 (55%), Positives = 102/145 (70%), Gaps = 1/145 (0%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWK 221 E EH+ +E FL WAA++GISDS + SR 
+SC G+SL+++ FP K Sbjct: 2 ELEHQTMETFLRWAADIGISDSIDS--SRCSDSCLGHSLSVADFPLAGGRGLRAVRELRK 59 Query: 222 GELVLRVPKPALLTRDSLL-KDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPY 398 GELVL+VP+ AL+T +S++ D L+ VN H SLS TQIL+VCLLYEM KGK SFWYPY Sbjct: 60 GELVLKVPRNALMTTESMVANDQKLNDAVNLHGSLSSTQILSVCLLYEMSKGKKSFWYPY 119 Query: 399 LMHLPRSYETLATFSEFEKQAFQVQ 473 L+HLPR Y+ LATF EFEKQA QV+ Sbjct: 120 LVHLPRDYDLLATFGEFEKQALQVE 144 >gb|ESR43640.1| hypothetical protein CICLE_v10011537mg [Citrus clementina] Length = 503 Score = 151 bits (381), Expect = 1e-34 Identities = 93/191 (48%), Positives = 105/191 (54%), Gaps = 1/191 (0%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWK 221 E E E LE L+WAAE+GI+DS SR+ N C G+SLT+SHFP K Sbjct: 2 EEEDESLEKLLKWAAEMGITDSTIQNPSRSRN-CLGHSLTVSHFPEAGGRGLAAARDLTK 60 Query: 222 GELVLRVPKPALLTRDSLLK-DGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPY 398 GEL+LRVPK AL T + LLK D S VN H LSP+QIL VCLLYE+GKGKSS WY Y Sbjct: 61 GELILRVPKTALFTTECLLKSDQKRSLAVNRHLFLSPSQILIVCLLYEVGKGKSSRWYTY 120 Query: 399 LMHLPRSYETLATFSEFEKQAFQVQXXXXXXXXXXXXXXXXXXXXHKNDV*KVWSYW*VD 578 LM LPR YE LATF FEKQA Q VD Sbjct: 121 LMLLPRCYEILATFGPFEKQALQ-----------------------------------VD 145 Query: 579 DAVWTTEKAIS 611 DA+W EKA+S Sbjct: 146 DAIWAAEKAVS 156 >ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana] gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40 gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana] gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana] gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana] Length = 491 Score = 151 bits (381), Expect = 1e-34 Identities = 81/145 (55%), Positives = 100/145 (68%), Gaps = 1/145 (0%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWK 221 + EH+ +E FL WAAE+GISDS + SR +SC G+SL++S FP K Sbjct: 2 DLEHQTMETFLRWAAEIGISDSIDS--SRFRDSCLGHSLSVSDFPDAGGRGLGAARELKK 59 Query: 222 
GELVLRVPKPALLTRDSLL-KDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPY 398 GELVL+VP+ AL+T +S++ KD LS VN H SLS TQIL+VCLLYEM K K SFWYPY Sbjct: 60 GELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPY 119 Query: 399 LMHLPRSYETLATFSEFEKQAFQVQ 473 L H+PR Y+ LATF FEKQA QV+ Sbjct: 120 LFHIPRDYDLLATFGNFEKQALQVE 144 >ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cicer arietinum] Length = 494 Score = 150 bits (380), Expect = 2e-34 Identities = 86/189 (45%), Positives = 109/189 (57%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWK 221 E E LE FL WA+++GISDS + S++ SC G+SL +S FP + Sbjct: 2 EQEQGNLESFLTWASQIGISDSTNH--SQHFFSCLGHSLCVSIFPHSGGRGLGAVRDLRR 59 Query: 222 GELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPYL 401 GE+VLRVPK AL+TR+S+++D L VN HPSLS QILTVCLLYE+GKGK+S W+PYL Sbjct: 60 GEIVLRVPKSALMTRESVMEDKKLCIAVNKHPSLSSVQILTVCLLYEVGKGKTSRWHPYL 119 Query: 402 MHLPRSYETLATFSEFEKQAFQVQXXXXXXXXXXXXXXXXXXXXHKNDV*KVWSYW*VDD 581 MHLP+SY+ LA F EFEK A Q VD+ Sbjct: 120 MHLPQSYDVLAMFGEFEKNALQ-----------------------------------VDE 144 Query: 582 AVWTTEKAI 608 A+W TEKA+ Sbjct: 145 AIWITEKAV 153 >ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera] Length = 504 Score = 148 bits (374), Expect = 1e-33 Identities = 89/188 (47%), Positives = 105/188 (55%), Gaps = 5/188 (2%) Frame = +3 Query: 60 LEGFLEWAAELGISD---SPYNFQSRN--PNSCFGNSLTLSHFPXXXXXXXXXXXXXWKG 224 +E FL+WA ELGISD +P SR P+ C G+SL +SHFP +G Sbjct: 1 MERFLKWATELGISDFTTTPTTVPSRLQIPHCCVGHSLCVSHFPHAGGRGLAAARDLSQG 60 Query: 225 ELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPYLM 404 EL+L VPK AL+T SLLKD LS V H SLS QILT+CLL EM KGKSS+W+PYLM Sbjct: 61 ELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKGKSSWWHPYLM 120 Query: 405 HLPRSYETLATFSEFEKQAFQVQXXXXXXXXXXXXXXXXXXXXHKNDV*KVWSYW*VDDA 584 LPRSY+TLA FS+FEKQA Q VDDA Sbjct: 121 QLPRSYDTLANFSQFEKQALQ-----------------------------------VDDA 145 Query: 585 VWTTEKAI 608 +W TE+AI Sbjct: 146 IWVTERAI 153 
>emb|CBI27360.3| unnamed protein product [Vitis vinifera] Length = 449 Score = 148 bits (374), Expect = 1e-33 Identities = 89/188 (47%), Positives = 105/188 (55%), Gaps = 5/188 (2%) Frame = +3 Query: 60 LEGFLEWAAELGISD---SPYNFQSRN--PNSCFGNSLTLSHFPXXXXXXXXXXXXXWKG 224 +E FL+WA ELGISD +P SR P+ C G+SL +SHFP +G Sbjct: 1 MERFLKWATELGISDFTTTPTTVPSRLQIPHCCVGHSLCVSHFPHAGGRGLAAARDLSQG 60 Query: 225 ELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPYLM 404 EL+L VPK AL+T SLLKD LS V H SLS QILT+CLL EM KGKSS+W+PYLM Sbjct: 61 ELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKGKSSWWHPYLM 120 Query: 405 HLPRSYETLATFSEFEKQAFQVQXXXXXXXXXXXXXXXXXXXXHKNDV*KVWSYW*VDDA 584 LPRSY+TLA FS+FEKQA Q VDDA Sbjct: 121 QLPRSYDTLANFSQFEKQALQ-----------------------------------VDDA 145 Query: 585 VWTTEKAI 608 +W TE+AI Sbjct: 146 IWVTERAI 153 >gb|ESQ41709.1| hypothetical protein EUTSA_v10015946mg [Eutrema salsugineum] Length = 506 Score = 147 bits (371), Expect = 2e-33 Identities = 78/145 (53%), Positives = 100/145 (68%), Gaps = 1/145 (0%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWK 221 + EH+ +E FL WAAELG+SDS + SR+ +SC G+SL+++ FP K Sbjct: 2 DLEHQTMEMFLRWAAELGLSDSIDS--SRSLDSCLGHSLSVADFPLAGGRGLGAVRELRK 59 Query: 222 GELVLRVPKPALLTRDSLL-KDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPY 398 GELVL+VP+ ALLT +S++ KD L +N H S+S TQ L VCLLYEM KGK SFWYPY Sbjct: 60 GELVLKVPRNALLTTESMVAKDQKLRDAINLHGSISSTQRLGVCLLYEMSKGKKSFWYPY 119 Query: 399 LMHLPRSYETLATFSEFEKQAFQVQ 473 L+HLPR Y+ +TF EFEKQA QV+ Sbjct: 120 LVHLPRDYDLSSTFGEFEKQALQVE 144 >gb|ESW13964.1| hypothetical protein PHAVU_008G241400g [Phaseolus vulgaris] Length = 497 Score = 145 bits (366), Expect = 8e-33 Identities = 83/191 (43%), Positives = 105/191 (54%), Gaps = 2/191 (1%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQS--RNPNSCFGNSLTLSHFPXXXXXXXXXXXXX 215 E E + LE FL WAA+LGISDS +P+SC G+SL ++HFP Sbjct: 2 EQEQQNLESFLTWAAQLGISDSTTRTDQPQHSPSSCLGSSLCVAHFPHSGGRGLGAVRDL 61 Query: 216 
WKGELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYP 395 +GE+VL VPK AL+TR+++++D L VN H LS QIL VCLLYE+ KGK+S W+P Sbjct: 62 RRGEIVLSVPKSALMTRENVMEDKKLCFAVNRHSCLSSAQILIVCLLYEVCKGKTSRWHP 121 Query: 396 YLMHLPRSYETLATFSEFEKQAFQVQXXXXXXXXXXXXXXXXXXXXHKNDV*KVWSYW*V 575 YLMHLP +Y+ LA F EFEK+A Q V Sbjct: 122 YLMHLPHTYDILAMFDEFEKRALQ-----------------------------------V 146 Query: 576 DDAVWTTEKAI 608 D+AVW TEKAI Sbjct: 147 DEAVWVTEKAI 157 >gb|EMJ16490.1| hypothetical protein PRUPE_ppa004975mg [Prunus persica] Length = 483 Score = 143 bits (361), Expect = 3e-32 Identities = 81/140 (57%), Positives = 98/140 (70%), Gaps = 3/140 (2%) Frame = +3 Query: 60 LEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWKGELVLR 239 LE L+WAAE+GISDS + +SC G+SL +S+FP +GEL+L+ Sbjct: 8 LERLLKWAAEIGISDS-----TCCGDSCLGHSLDVSYFPSAGGRGLGAARDLREGELLLK 62 Query: 240 VPKPALLTRDSLL-KDGLLSSFVN--GHPSLSPTQILTVCLLYEMGKGKSSFWYPYLMHL 410 VPK L+T++SLL KD LS VN H SLSPTQIL VCLLYEMGKGK S+W+PYLM+L Sbjct: 63 VPKSVLMTKESLLLKDEKLSLSVNDYAHHSLSPTQILAVCLLYEMGKGKISWWHPYLMNL 122 Query: 411 PRSYETLATFSEFEKQAFQV 470 PRSY+ LATF EFEKQA QV Sbjct: 123 PRSYDILATFGEFEKQALQV 142 >ref|XP_003518360.1| PREDICTED: LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 40-like [Glycine max] Length = 497 Score = 140 bits (353), Expect = 3e-31 Identities = 73/152 (48%), Positives = 102/152 (67%), Gaps = 2/152 (1%) Frame = +3 Query: 21 ERRGMAEEAEHERLEGFLEWAAELGISDSPY--NFQSRNPNSCFGNSLTLSHFPXXXXXX 194 +++ + +E +++ LE +L WAA LGISDS N + +SC G+SL +S FP Sbjct: 8 KKKKIEQEHQNQNLESYLSWAAXLGISDSRTGTNQPQHSLSSCLGSSLCVSRFPHSGRRG 67 Query: 195 XXXXXXXWKGELVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKG 374 +GE+VLRVPK AL+TR+S+++D L VN H SLSP Q+L VCLLYEMGK Sbjct: 68 LGAARDLGRGEIVLRVPKSALMTRESVMEDEKLCDAVNRHSSLSPAQMLIVCLLYEMGK- 126 Query: 375 KSSFWYPYLMHLPRSYETLATFSEFEKQAFQV 470 +S W+PYL+H+P++Y+ LA F EFEK+A QV Sbjct: 127 XTSRWHPYLVHMPQTYDILAMFGEFEKRALQV 158 >gb|ERN17050.1| hypothetical protein AMTR_s00044p00046290 [Amborella trichopoda] Length = 305 Score = 
139 bits (351), Expect = 4e-31 Identities = 72/141 (51%), Positives = 96/141 (68%) Frame = +3 Query: 48 EHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWKGE 227 + + LE L W AE+GISDSP++ S P SC G+SL++S+FP GE Sbjct: 3 DQKGLEALLRWGAEVGISDSPHSVTS--PISCLGHSLSISNFPEAGGRGLAAARELRCGE 60 Query: 228 LVLRVPKPALLTRDSLLKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPYLMH 407 L+LRVP+ AL+ R+SL KDG L+ +P L+ TQ+LTV LL E+GKG SS+WYPYL+ Sbjct: 61 LILRVPRKALMNRESLRKDGKLTPGFQRYPHLTSTQVLTVYLLAEVGKGSSSWWYPYLVQ 120 Query: 408 LPRSYETLATFSEFEKQAFQV 470 LPR+Y+ LATF++FE QA QV Sbjct: 121 LPRTYDILATFNQFEIQALQV 141 >ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Fragaria vesca subsp. vesca] Length = 511 Score = 138 bits (347), Expect = 1e-30 Identities = 78/145 (53%), Positives = 92/145 (63%), Gaps = 1/145 (0%) Frame = +3 Query: 42 EAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXWK 221 E E LE L+WAA GISDS SL +S+F K Sbjct: 24 EEEEGNLESLLKWAAVFGISDS--------------KSLVVSYFHGAGGRGLGAARDLEK 69 Query: 222 GELVLRVPKPALLTRDSLL-KDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPY 398 GELVL+VPK AL+TR++LL KD LS VN H SLSP Q L VCLLYEMGKGK+S+WYPY Sbjct: 70 GELVLKVPKSALITRETLLLKDDHLSLAVNAHTSLSPIQTLCVCLLYEMGKGKTSWWYPY 129 Query: 399 LMHLPRSYETLATFSEFEKQAFQVQ 473 L++LPRSY+ +ATF EFEKQA QV+ Sbjct: 130 LINLPRSYDIIATFGEFEKQALQVE 154 >ref|XP_004232670.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Solanum lycopersicum] Length = 488 Score = 134 bits (337), Expect = 2e-29 Identities = 75/145 (51%), Positives = 93/145 (64%), Gaps = 1/145 (0%) Frame = +3 Query: 39 EEAEHERLEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPXXXXXXXXXXXXXW 218 EEAE L+ FL+WAAELGISDSP +++ +SC G +L +++FP Sbjct: 2 EEAEELNLKSFLKWAAELGISDSPSTCTTQS-DSCLGKTLCVANFPKAGGRGLAAVRDIK 60 Query: 219 KGELVLRVPKPALLTRDSLL-KDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYP 395 KGEL+LRVPK AL+T +L+ D S V HPSLS QIL V LL E+ KGKSS W+P Sbjct: 61 KGELILRVPKGALMTSQNLMMNDVAFSIAVKNHPSLSSAQILAVGLLNEVNKGKSSRWWP 120 Query: 396 YLMHLPRSYETLATFSEFEKQAFQV 470 YL PRSYETLA F +FE QA Q+ 
Sbjct: 121 YLKQFPRSYETLADFGKFEIQALQI 145