BLASTX nr result
ID: Catharanthus22_contig00000839
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00000839
         (2689 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  207   4e-82
gb|EOY16758.1| SET domain protein, putative isoform 1 [Theobroma...  202   1e-77
ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  189   9e-77
ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Popu...  197   6e-75
ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  205   1e-49
ref|XP_006599490.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  204   2e-49
ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  204   2e-49
gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis]         202   8e-49
ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like ...  200   3e-48
emb|CBI18219.3| unnamed protein product [Vitis vinifera]             194   2e-46
ref|XP_006434476.1| hypothetical protein CICLE_v10000601mg [Citr...  192   7e-46
ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  189   4e-45
ref|XP_003595407.1| Protein SET DOMAIN GROUP [Medicago truncatul...  186   6e-44
ref|XP_004516217.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  176   6e-41
ref|XP_004516216.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  176   6e-41
gb|EMJ00499.1| hypothetical protein PRUPE_ppa023162mg, partial [...  169   6e-39
ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like ...  160   2e-36
ref|XP_004166625.1| PREDICTED: LOW QUALITY PROTEIN: protein SET ...  160   3e-36
gb|EOY16760.1| SET domain-containing protein, putative isoform 3... 
155 1e-34 gb|ESW24006.1| hypothetical protein PHAVU_004G094200g, partial [... 144 3e-31 >ref|XP_004238489.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum lycopersicum] Length = 677 Score = 207 bits (527), Expect(2) = 4e-82 Identities = 139/385 (36%), Positives = 200/385 (51%), Gaps = 16/385 (4%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE------------SPDHDKEE--IEE 955 K RQSELWSKYRFSCCCKRC ++P TY+D LQE + D+ EE +E+ Sbjct: 312 KVMRQSELWSKYRFSCCCKRCRSMPMTYMDHCLQEILILNLDSSNMATGDNFYEEHVMEK 371 Query: 956 LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 L DDAI +++S NN ++CC+KLE LL D H+ L+P ++ + Sbjct: 372 LIDCLDDAIDDFLSFNNPKNCCEKLEILLTQD-HVNVLLKPDGEKLHQLFRLHPLHHVSL 430 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 A LAS YKV +LLAL + Q +AF R SESSLI Sbjct: 431 HAILTLASAYKVSVSELLALDPEGHEHQTKAFSLSRKSAAYSLLLAGATQHLLESESSLI 490 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 V+N+W +AGE+LLS+ RSS W ++L+ E + + + F+S Sbjct: 491 VPVSNFWMTAGETLLSLVRSSTW--------------NLLSMERH----VEEFSFSSH-- 530 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 Q + T L+R+ D+ AEF V QF + + D+ K+W FLT G +L+ + D Sbjct: 531 QICGKCTLLDRFRDKFADCHDENAEFADVTSQFLSCVTDTTSKIWDFLTKEGGYLKVVED 590 Query: 1676 -LDFRRFLTKEAPFDSEATLTIETSKGKRNISGSESQPS-NQLRGILFQLGVHCLLYGAC 1849 ++FR ++ F AT S K + SG E++ + N++R LF LG+HCL+YGA Sbjct: 591 PINFRWLGSRMPSFSQFATHATSPSADKTD-SGLEAEDNHNEIRVNLFLLGIHCLIYGAF 649 Query: 1850 LSRICYGQHSELASDAMNFLHSQGI 1924 LS +C+G +S L S + L +GI Sbjct: 650 LSTVCFGPNSPLMSKVESLLSVEGI 674 Score = 127 bits (319), Expect(2) = 4e-82 Identities = 68/151 (45%), Positives = 100/151 (66%), Gaps = 3/151 (1%) Frame = +1 Query: 367 KGNKEVVSLERIGGLITNYRKLMMAEE---EDEVSRMIKDGAEAMVVARRMGEGSDSVVE 537 + N +++LERIGGL+TN+RK+M EE ++++S I+DGA+A+ +RRM G ++ E Sbjct: 124 ESNGSLLNLERIGGLMTNFRKVMFLEEHCNDNDLSGRIRDGAKALAASRRMRVGLETNGE 183 Query: 538 DGGLLEEAILCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSEIG 
717 +E A+LCLV+ NAVEV +K+GR +GV +YD FSW+NHSCSPNA YRF TA S+ G Sbjct: 184 Y--TVEAAVLCLVLTNAVEVYDKDGRSLGVGVYDVPFSWVNHSCSPNASYRFCTA-SDSG 240 Query: 718 GEPRFLIFPATMGNGGGAQSLNNTNANSKFE 810 G I PA G + ++N++ + Sbjct: 241 GILESRICPAATETGAAGIGHESISSNTELQ 271 >gb|EOY16758.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|508724862|gb|EOY16759.1| SET domain protein, putative isoform 1 [Theobroma cacao] gi|508724864|gb|EOY16761.1| SET domain protein, putative isoform 1 [Theobroma cacao] Length = 658 Score = 202 bits (513), Expect(2) = 1e-77 Identities = 131/383 (34%), Positives = 201/383 (52%), Gaps = 17/383 (4%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE-----------SPDHD---KEEIEE 955 K RQSELWSKY+F+C C RCSA P TYVDR L+E S DH+ E + Sbjct: 290 KAMRQSELWSKYQFTCSCSRCSASPTTYVDRALEEISTCNLSFSSSSFDHNLYRDEASKR 349 Query: 956 LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 + + D+ I E +S + ESCC+KLES+L HI EQ++ K+ + Sbjct: 350 VYSYMDETITEVLSDGDPESCCEKLESILNLGLHI-EQVESKDGKSLLNFKLHPFHHLAL 408 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 +AYT L S Y++ + DLLAL D+ Q++AFD R SESSLI Sbjct: 409 NAYTTLTSAYRICSSDLLALHPDVDECQLKAFDMNRTSAAYSLLLAGATHRLFCSESSLI 468 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 AS AN+W +AGESL+++ARSS+W + +SE S+I H+C Sbjct: 469 ASAANFWTNAGESLVTLARSSLWNLFVKWGFPISEVSTIAKHKC---------------- 512 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 ++ + ++ + S SQ Q F+ ++ F + +++ K+W FL G +LE D Sbjct: 513 ---SKCSLMDIFDTKSILSQAQRVNFENISSDFLDCVSNMTAKIWRFLVRGCHYLEVFED 569 Query: 1676 LDFRRFLTKEAPFDSEATLTIETSK--GKRNISGSESQ-PSNQLRGILFQLGVHCLLYGA 1846 +L F + A E SK + +I ++Q +N+ R ++++G+HCLLYG Sbjct: 570 PFDFGWLVHTWDFHARANRNDEDSKFITEGSIYKHQAQWYTNERRIHVYEVGIHCLLYGG 629 Query: 1847 CLSRICYGQHSELASDAMNFLHS 1915 L+ ICYGQ+S+L++ ++ L++ Sbjct: 630 ILAHICYGQNSQLSTHVLSILYN 652 Score = 117 bits (294), Expect(2) = 1e-77 Identities = 67/147 (45%), Positives = 87/147 
(59%), Gaps = 7/147 (4%) Frame = +1 Query: 391 LERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILC 570 L RI GL+TN+ M+ EV+ I+ GA AM AR+ + DG LLEEA+L Sbjct: 116 LHRIDGLLTNHH--MLTSSSPEVAAKIRQGAIAMAAARKSRNRDNEGQSDGFLLEEAVLS 173 Query: 571 LVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFS------TASSEIGGEPRF 732 LV+ NAVEVQ+K+GR +G+A+YD +FSWINHSCSPNACYRFS T S Sbjct: 174 LVITNAVEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSSSTL 233 Query: 733 LIFPATMGNGGGAQS-LNNTNANSKFE 810 I P+ +G A S + +T N +E Sbjct: 234 RIVPSVLGEECDACSCVEHTKGNKGYE 260 >ref|XP_006359805.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Solanum tuberosum] Length = 681 Score = 189 bits (481), Expect(2) = 9e-77 Identities = 131/384 (34%), Positives = 190/384 (49%), Gaps = 16/384 (4%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE------------SPD--HDKEEIEE 955 K RQSELWSKYRFSCCCKRC A+P TY+D LQE S D ++ +E+ Sbjct: 321 KVMRQSELWSKYRFSCCCKRCRAMPTTYMDHCLQEILILNLDCSNMASGDNFYENHVMEK 380 Query: 956 LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 L +DAI +++S NN ++CC+KLE LL D H L+P ++ + Sbjct: 381 LMDCLNDAINDFLSFNNPKNCCEKLEILLTQD-HANILLKPDGEQLHQLFRLHPLHHVSL 439 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 AY LAS Y+V +LLAL D+ Q +AF+ R SESSLI Sbjct: 440 HAYMTLASAYQVSVGELLALDPEGDEHQTKAFNMSRKSAAYSLLLAGATQHLLESESSLI 499 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 V+N+W +AGE+LLS R S W F + + S C + L+R+ Sbjct: 500 VPVSNFWMTAGETLLSFVRRSAW-NLFSRGWHIEDFSFSSCQICGKCTLLDRF------- 551 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 R F F ++ AEF V QF + + D K+W FL +L+ + D Sbjct: 552 ----------RDKFTDFHYEN--AEFADVTSQFLSCVTDITPKIWGFLREEDGYLKVVED 599 Query: 1676 -LDFRRFLTKEAPFDSEATLTIETSKGKRNISGSESQPS-NQLRGILFQLGVHCLLYGAC 1849 ++FR ++ AT + ++ SG E++ + N++R LF LG+HCL+YGA Sbjct: 600 PINFRWLGSR------MATHATSPNASEKTGSGLEAEDNHNEIRVKLFLLGIHCLIYGAF 653 Query: 1850 LSRICYGQHSELASDAMNFLHSQG 1921 LS +C+G +S+L S + L +G Sbjct: 
654 LSTVCFGPNSQLMSKVESLLSVKG 677 Score = 127 bits (319), Expect(2) = 9e-77 Identities = 68/157 (43%), Positives = 99/157 (63%), Gaps = 9/157 (5%) Frame = +1 Query: 367 KGNKEVVSLERIGGLITNYRKLMMAEE------EDEVSRMIKDGAEAMVVARRMGEGSDS 528 + N ++LERIGGL+TN+RK+M EE +D++S I+ GA+A+ +RRM G D+ Sbjct: 125 ESNGSFLNLERIGGLVTNFRKVMFLEEHCNDNDDDDLSGRIRHGAKALAASRRMRLGLDT 184 Query: 529 ---VVEDGGLLEEAILCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFST 699 ++ + +E A+LCLV+ NAVEV +K+GR +GV +YD FSW+NHSCSPNA YRF T Sbjct: 185 NRELLYEEYTVEAAVLCLVLTNAVEVHDKDGRSLGVGVYDVPFSWVNHSCSPNASYRFCT 244 Query: 700 ASSEIGGEPRFLIFPATMGNGGGAQSLNNTNANSKFE 810 A S+ GG I PA G + ++N + + Sbjct: 245 A-SDSGGISECRICPAATETGAAGIESESISSNPELQ 280 >ref|XP_002306703.2| hypothetical protein POPTR_0005s21560g [Populus trichocarpa] gi|550339461|gb|EEE93699.2| hypothetical protein POPTR_0005s21560g [Populus trichocarpa] Length = 626 Score = 197 bits (500), Expect(2) = 6e-75 Identities = 135/388 (34%), Positives = 188/388 (48%), Gaps = 20/388 (5%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQESPD--------------HDKEEIEE 955 K R+SELW+KYRF CCC RC A P +YVD VLQE + E + Sbjct: 256 KEIRRSELWAKYRFICCCTRCIASPPSYVDHVLQEISASNLASSSLSSELSFYRDEATRK 315 Query: 956 LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 LT + D+ AEY+++ + ESCCKKLE++LI G + EQL+ + + Q Sbjct: 316 LTDYVDEVTAEYLAVGDPESCCKKLENMLI-TGLLDEQLEVREGKSQLNFRLHALHHLAL 374 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 + YT+LAS YK+ A DL +L S EA R ESSL+ Sbjct: 375 NTYTVLASAYKIRASDLFSLHSEVGGLPWEALSMSRISAAYSLLLATATYHLFCFESSLL 434 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 SVAN+W SAGESLL++A+SS W+ + V S + H+C + L + N G Sbjct: 435 VSVANFWTSAGESLLALAKSSAWDSLGKCGFPVLNLSPLAKHKCSKCSLLESFEVNLSFG 494 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 QD R A FD V+ +F + I L ++W FL G +L+ D Sbjct: 495 QDHIRK-----------------AGFDSVSSRFLDCIGSLLQEVWGFLIQGDRYLKMFKD 537 Query: 1676 
LDFRRFLTKEAP-FDSEATLTIE----TSKGKRNISGSESQP-SNQLRGILFQLGVHCLL 1837 +L K +D +A LT +++SG E+ ++ R FQLGVHCLL Sbjct: 538 PTDFSWLGKSLDIWDFDAELTHNDVDFNCWTNKSVSGIEALGYTDHWRINTFQLGVHCLL 597 Query: 1838 YGACLSRICYGQHSELASDAMNFLHSQG 1921 YG L+ ICYG HS +S + L+ +G Sbjct: 598 YGGFLAGICYGPHSHWSSHIRSALNYEG 625 Score = 114 bits (284), Expect(2) = 6e-75 Identities = 87/243 (35%), Positives = 112/243 (46%), Gaps = 5/243 (2%) Frame = +1 Query: 34 MEMRAAE-DIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNYSHVP 210 MEMRA E DI + ED+TP ++PL ++L+D + HVP Sbjct: 1 MEMRAGEEDIEIGEDITPSVIPLSYALHDSFIHSHCSSCFSRLPSANFT-----QHHHVP 55 Query: 211 TXXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNKEVVS 390 T HFS AE HLL S Sbjct: 56 TLLYCSSICSSS-----HFSPAELHLLHSPPSSDLRAALRLLPLSLPSS----------S 100 Query: 391 LERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILC 570 RI GL+TN KLM ++E+S ++ GA+A+ ARR+ E ++ D LLE A LC Sbjct: 101 TNRICGLLTNREKLMA---DEEISAHVRYGAKAIAAARRI-EMVENEKNDAVLLEAA-LC 155 Query: 571 LVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSE----IGGEPRFLI 738 LV+ NAVEV + GR IG+A+Y FSWINHSCSPNACYR + + E R I Sbjct: 156 LVLTNAVEVHDNEGRSIGIAVYGPNFSWINHSCSPNACYRSIISPPDNVLPFSDESRLRI 215 Query: 739 FPA 747 PA Sbjct: 216 LPA 218 >ref|XP_004290505.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Fragaria vesca subsp. 
vesca] Length = 645 Score = 205 bits (521), Expect = 1e-49 Identities = 139/387 (35%), Positives = 199/387 (51%), Gaps = 19/387 (4%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE---------------SPDHDKEEIE 952 K R+SELWS+YRF C CKRCSA P TYVDR L++ S D DK E Sbjct: 270 KAVRRSELWSRYRFMCSCKRCSASPLTYVDRALEDISAVNYNSSRFSSDISFDRDK-ATE 328 Query: 953 ELTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXX 1132 LT + DDAIA+Y+S+ N ESCC++LE +L +G +Q + + + Sbjct: 329 RLTDYIDDAIADYLSIGNPESCCERLEQVL-TEGLSDKQPEGNEEKSELTYWLNPLHHLS 387 Query: 1133 XDAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSL 1312 +AYT LAS YK+LA DLL +SS D + AF R SESSL Sbjct: 388 LNAYTTLASAYKILADDLLTMSSEIDNHVLGAFGMSRTGAAYSLLLAGAAHHLFNSESSL 447 Query: 1313 IASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFC 1492 + VAN+W SAG+SLL++A+SS+W E + VS++ + + Y+ P Sbjct: 448 VVYVANFWTSAGDSLLNLAKSSIWSEIVRWDLPVSDNLELYHIAKYKCP----------- 496 Query: 1493 GQDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQI- 1669 R + +++ S ++F + +F + + + K+W FL G +L Sbjct: 497 -----RCSLIDKLETYSLHDPVTHSDFGHASREFVDCVTNLTQKVWYFLVQGCRYLGLCK 551 Query: 1670 NDLDFRRFLTKEAPFDSEA-TLTIETSKGK-RNISGSESQP-SNQLRGILFQLGVHCLLY 1840 N +DF T E + E T + T+ G R+ISGSE++ +N LR + +LGVHCLLY Sbjct: 552 NPIDFIWLDTSECSSEGEVFTHSTGTNCGNDRSISGSEAEENTNLLRMYILKLGVHCLLY 611 Query: 1841 GACLSRICYGQHSELASDAMNFLHSQG 1921 G L+R CYG++S L + N L QG Sbjct: 612 GEYLARTCYGRYSHLICHSHNILDRQG 638 Score = 125 bits (315), Expect = 7e-26 Identities = 85/227 (37%), Positives = 109/227 (48%), Gaps = 3/227 (1%) Frame = +1 Query: 34 MEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNYSHVPT 213 MEMRA E+I + DLTPPL PL +L+D N SH Sbjct: 1 MEMRAGEEIELGRDLTPPLSPLYSALHDSLLSSHCSSCFSPLPTPPSP-----NNSH--- 52 Query: 214 XXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNKEVVSL 393 S+AEP LLR Sbjct: 53 --PVLLFCSSLCSSSASVSTAEPRLLRLLHSHPSTYPHGDSSDLRAALRLLHSLPASSPA 110 Query: 394 ERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVE---DGGLLEEAI 564 RI GL+TN RKL +D++ I+DGA AM 
+AR M + +D+V++ D + EEA Sbjct: 111 PRISGLLTNRRKL-----DDDLR--IRDGARAMFLARTMPDDNDAVLDVAHDDAVSEEAA 163 Query: 565 LCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTAS 705 LCLV+ NAVEVQ+ GR +G+A+YD+ FSWINHSCSPNACYRF +S Sbjct: 164 LCLVLTNAVEVQDHTGRTLGIAVYDSCFSWINHSCSPNACYRFLLSS 210 >ref|XP_006599490.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X2 [Glycine max] Length = 593 Score = 204 bits (519), Expect = 2e-49 Identities = 139/394 (35%), Positives = 189/394 (47%), Gaps = 23/394 (5%) Frame = +2 Query: 815 LKTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQESPDHDKEEI------------EEL 958 L+ RQSELWSKYRF CCCKRCSA+P++YVD LQE E L Sbjct: 219 LQAMRQSELWSKYRFVCCCKRCSALPSSYVDHALQEISAITCESSGSCSKFLKDMADRRL 278 Query: 959 TVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXXD 1138 T DD I EY+S+ + ESCC+KLE +L +KE L+ + Sbjct: 279 TECIDDVILEYLSVGDPESCCEKLEEILTQG--LKEHLEVIEVKPDCIFMLHPLHHHSIK 336 Query: 1139 AYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLIA 1318 AYT LAS YKV A DLL++ S D Q++AFD R SESSLIA Sbjct: 337 AYTTLASAYKVCACDLLSVDSETDINQLKAFDMSRISAAYSLVLAGATHHLFNSESSLIA 396 Query: 1319 SVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCGQ 1498 SVAN+W AGESLLS+++SS W L + +S + +C + ++R+ Sbjct: 397 SVANFWTGAGESLLSLSKSSGWSMCVNLGLVIPNLASAMKFKCTKCSLMDRF-------- 448 Query: 1499 DLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQINDL 1678 R+ LN Q + A+F+ V+ +F + ++D K+W FL FL+ D Sbjct: 449 ---RAGMLN--------GQIKSADFENVSNEFLHCVSDITQKVWGFLISDCQFLQSCKDP 497 Query: 1679 DFRRFLTKEAPFDSEATLTIETSKGKRNIS-GSESQPS----------NQLRGILFQLGV 1825 +L S +T+ +E K N+ +ES+ S + +FQLGV Sbjct: 498 IISSWLMST---KSSSTVDVEVCVNKTNMCYTNESENSVSMCHEQTLADHAVACIFQLGV 554 Query: 1826 HCLLYGACLSRICYGQHSELASDAMNFLHSQGIF 1927 HCL YG L+ ICYG HS L N L + F Sbjct: 555 HCLAYGGLLASICYGPHSHLVCHVQNVLEHEKNF 588 Score = 102 bits (254), Expect = 9e-19 Identities = 81/262 (30%), Positives = 107/262 (40%), Gaps = 2/262 (0%) Frame = +1 Query: 34 MEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNYSHVPT 213 MEMR+ E+I + D+T L PL F 
L+ N + P Sbjct: 1 MEMRSKEEIEIGRDITATLTPLSFCLHTFYLHTHCSACFSSLPIP--------NPNPNPN 52 Query: 214 XXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNKEVVSL 393 H SSAE HL + S Sbjct: 53 SLFYCSPPCSAALSPLHHSSAERHLPPSAHSSHLCTALRLLL-----------SHRPTSS 101 Query: 394 ERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILCL 573 R+ GL++N L D+VS I GA AM A G + D +LEEA + L Sbjct: 102 SRLAGLLSNRHILTSLSVHDDVSERISVGAGAMAEAIAKQRG---IPNDDAVLEEATIAL 158 Query: 574 --VMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSEIGGEPRFLIFPA 747 V+ NAVEV + GR +G+A++D FSWINHSCSPNACYRF +SS GE A Sbjct: 159 SAVLTNAVEVHDNEGRALGIAVFDQIFSWINHSCSPNACYRFVLSSSSHSGE-------A 211 Query: 748 TMGNGGGAQSLNNTNANSKFEF 813 +G Q++ + SK+ F Sbjct: 212 KLGIAPHLQAMRQSELWSKYRF 233 >ref|XP_006599489.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X1 [Glycine max] Length = 642 Score = 204 bits (519), Expect = 2e-49 Identities = 139/393 (35%), Positives = 188/393 (47%), Gaps = 23/393 (5%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQESPDHDKEEI------------EELT 961 K RQSELWSKYRF CCCKRCSA+P++YVD LQE E LT Sbjct: 269 KAMRQSELWSKYRFVCCCKRCSALPSSYVDHALQEISAITCESSGSCSKFLKDMADRRLT 328 Query: 962 VFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXXDA 1141 DD I EY+S+ + ESCC+KLE +L +KE L+ + A Sbjct: 329 ECIDDVILEYLSVGDPESCCEKLEEILTQG--LKEHLEVIEVKPDCIFMLHPLHHHSIKA 386 Query: 1142 YTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLIAS 1321 YT LAS YKV A DLL++ S D Q++AFD R SESSLIAS Sbjct: 387 YTTLASAYKVCACDLLSVDSETDINQLKAFDMSRISAAYSLVLAGATHHLFNSESSLIAS 446 Query: 1322 VANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCGQD 1501 VAN+W AGESLLS+++SS W L + +S + +C + ++R+ Sbjct: 447 VANFWTGAGESLLSLSKSSGWSMCVNLGLVIPNLASAMKFKCTKCSLMDRF--------- 497 Query: 1502 LNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQINDLD 1681 R+ LN Q + A+F+ V+ +F + ++D K+W FL FL+ D Sbjct: 498 --RAGMLN--------GQIKSADFENVSNEFLHCVSDITQKVWGFLISDCQFLQSCKDPI 547 Query: 1682 FRRFLTKEAPFDSEATLTIETSKGKRNIS-GSESQPS----------NQLRGILFQLGVH 1828 
+L S +T+ +E K N+ +ES+ S + +FQLGVH Sbjct: 548 ISSWLMST---KSSSTVDVEVCVNKTNMCYTNESENSVSMCHEQTLADHAVACIFQLGVH 604 Query: 1829 CLLYGACLSRICYGQHSELASDAMNFLHSQGIF 1927 CL YG L+ ICYG HS L N L + F Sbjct: 605 CLAYGGLLASICYGPHSHLVCHVQNVLEHEKNF 637 Score = 104 bits (259), Expect = 2e-19 Identities = 79/247 (31%), Positives = 101/247 (40%), Gaps = 2/247 (0%) Frame = +1 Query: 34 MEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNYSHVPT 213 MEMR+ E+I + D+T L PL F L+ N + P Sbjct: 1 MEMRSKEEIEIGRDITATLTPLSFCLHTFYLHTHCSACFSSLPIP--------NPNPNPN 52 Query: 214 XXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNKEVVSL 393 H SSAE HL + S Sbjct: 53 SLFYCSPPCSAALSPLHHSSAERHLPPSAHSSHLCTALRLLL-----------SHRPTSS 101 Query: 394 ERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILCL 573 R+ GL++N L D+VS I GA AM A G + D +LEEA + L Sbjct: 102 SRLAGLLSNRHILTSLSVHDDVSERISVGAGAMAEAIAKQRG---IPNDDAVLEEATIAL 158 Query: 574 --VMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSEIGGEPRFLIFPA 747 V+ NAVEV + GR +G+A++D FSWINHSCSPNACYRF +SS GE + I P Sbjct: 159 SAVLTNAVEVHDNEGRALGIAVFDQIFSWINHSCSPNACYRFVLSSSSHSGEAKLGIAPH 218 Query: 748 TMGNGGG 768 N G Sbjct: 219 LQMNSSG 225 >gb|EXC28030.1| Protein SET DOMAIN GROUP 41 [Morus notabilis] Length = 661 Score = 202 bits (513), Expect = 8e-49 Identities = 146/388 (37%), Positives = 192/388 (49%), Gaps = 24/388 (6%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE------------SPDHDKEEIEELT 961 K+ RQS+LWSKYRF CCC RC +VP TY+DRVL+E S + + + LT Sbjct: 294 KSVRQSDLWSKYRFICCCSRCGSVPPTYMDRVLEEISVVNGNSSSSDSGFYRDKATQMLT 353 Query: 962 VFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREI--QRXXXXXXXXXXXX 1135 + DDAI++Y+S+ + +SCC+KL+ +L G EQL+ Sbjct: 354 QYIDDAISDYLSIGDAQSCCEKLDHVL-TRGLPDEQLERNEGTSLPTYTYWLHPLHHLSL 412 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 +AYT LAS YK + D+LAL S ++ AFD R E SLI Sbjct: 413 NAYTTLASAYKTCSNDMLALFSEANENLCVAFDMSRTSVAYSLLLAGATNHLFQFEPSLI 472 Query: 1316 
ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 ASVANYW SAGESL + ARSS+W E L S SSI+ H C + C Sbjct: 473 ASVANYWVSAGESLSTFARSSMWRELIPL----SSLSSIIRHNCLK------------CS 516 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 N+Y SF SQ Q +F V+ +F + + D + K+W L +G + L D Sbjct: 517 LG-------NKYETGSFHSQVQYEDFAHVSSKFLDCVTDYMQKVWHLLVHGCNHLRVFKD 569 Query: 1676 -LDFRRFLTKEAPFDSEATLTIETSKGK--------RNISGSESQP-SNQLRGILFQLGV 1825 LDF +T A + S + S NI E+Q + Q+R LFQLGV Sbjct: 570 PLDFSWLVT--AKYSSMWEICSHCSSNNIGSNSDIYENIPLCEAQGCTTQVRIHLFQLGV 627 Query: 1826 HCLLYGACLSRICYGQHSELASDAMNFL 1909 HCLLYGA LS IC+G+HS L A N L Sbjct: 628 HCLLYGAYLSSICFGKHSYLTCHAQNIL 655 Score = 120 bits (302), Expect = 2e-24 Identities = 88/233 (37%), Positives = 107/233 (45%), Gaps = 6/233 (2%) Frame = +1 Query: 25 EMEMEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNY-- 198 EMEM MR E+I M EDLT PL PL FSL+ + Sbjct: 3 EMEMMMRGREEIEMGEDLTRPLPPLSFSLHHSLLLSHCSSCFSPLPSSPLPPIFPPRFPP 62 Query: 199 SHVPTXXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNK 378 S+ HFSSAE HLL ++ + Sbjct: 63 SNSNPKILYCSSQCSFSDSPLHFSSAEHHLL-CLLPSAAAADSSDLRAALRLLESNPATR 121 Query: 379 EVVSLERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGG---L 549 S+ RI GL TN KL ++E+EV+ I+DGA AM ARRM + S E G Sbjct: 122 RSSSVSRIAGLSTNLHKLAN-DDEEEVAARIRDGARAMAAARRMRDRDCSGEESEGEEEA 180 Query: 550 LEEAILCLVMMNAVEVQEKNGRPIGVAIYDAA-FSWINHSCSPNACYRFSTAS 705 + A LC V+ N VEVQ K+GR +GVA+Y FSWINHSCSPNACYR S S Sbjct: 181 MAAAALCAVLTNGVEVQVKSGRTLGVAVYGGGGFSWINHSCSPNACYRISLHS 233 >ref|XP_002265243.2| PREDICTED: protein SET DOMAIN GROUP 41-like [Vitis vinifera] Length = 660 Score = 200 bits (508), Expect = 3e-48 Identities = 140/400 (35%), Positives = 190/400 (47%), Gaps = 33/400 (8%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQESPD--------------HDKEEIEE 955 K R +ELW KY FSCCC RC+A P TYVD VLQE + + +EEI + Sbjct: 280 KEIRHAELWVKYWFSCCCNRCNASPPTYVDLVLQEKSESSLEDSFLSNELLFYREEEIRK 339 Query: 956 
LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 LT + DDAIA+Y+S+ N E+CC+KLE+ +I G EQL+P + Q Sbjct: 340 LTDYVDDAIADYLSVGNPEACCEKLEN-VIAQGLPDEQLEPIEGKSQANFKLHPLHHLSL 398 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 AYT LAS Y+V A LL L S D ++EA I+ S+SSLI Sbjct: 399 AAYTTLASAYRVRASQLLDLHSEMDGDELEALSLIKTSAAYSLLLAGATHRIFLSDSSLI 458 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 AS+AN+W +AGESLLS+ARSS+ + V SS+ +H+C Sbjct: 459 ASIANFWMNAGESLLSLARSSLLNSFVKGRLPVLNLSSLQSHKC---------------- 502 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 N + + + N F SQ + ++ QF N ++ K+WSFL G ++ D Sbjct: 503 ---NECSLADEFEANFFGSQAHNGGLENISKQFLNCVSSITPKVWSFLIQGHHLCKKFKD 559 Query: 1676 LDFRRFLTKEAPFDSEATLTIETSK------------------GKRNISGSESQ-PSNQL 1798 P DS +ETSK + + G E+Q +NQ Sbjct: 560 -----------PIDSNWLQKMETSKIWGFQAHSGCTAMDSSSWDEESTGGYEAQRDTNQE 608 Query: 1799 RGILFQLGVHCLLYGACLSRICYGQHSELASDAMNFLHSQ 1918 R LF+LG+HCLLYG LS ICYG S L N + + Sbjct: 609 RKNLFKLGIHCLLYGGFLSSICYGPSSYLTRYIRNLVDGE 648 Score = 122 bits (305), Expect = 1e-24 Identities = 97/280 (34%), Positives = 120/280 (42%), Gaps = 8/280 (2%) Frame = +1 Query: 34 MEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNYSHVPT 213 MEMR ED M DLT PL PL SL+D N + + Sbjct: 1 MEMRMREDTEMGLDLTHPLPPLASSLHDSHLRSHCSACFSPLPPTVLV-----NTNPSSS 55 Query: 214 XXXXXXXXXXXXXXXXHFSSAEPHL--LRXXXXXXXXXXANXXXXXXXXXXFAKGNKEVV 387 HFSSAE HL L ++ Sbjct: 56 FLCYCSPPCSASDSPLHFSSAEHHLFLLLRHSHPSTAHSSDLRAALRLLHILHLPPLHTQ 115 Query: 388 SLERIGGLITNYRKLMMAE---EEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEE 558 L RI GL+TN L+ E DE I+DG +AM VAR M +G++ LEE Sbjct: 116 PLHRICGLLTNLHHLISPSHNSESDETLTRIRDGGKAMAVARCMRDGTE--FSGDSKLEE 173 Query: 559 AILCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRF---STASSEIGGEPR 729 A+LCLV+ NAVEVQ G +G+A+YD FSWINHSCSPNACYRF S + + GE R Sbjct: 174 ALLCLVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFLLRSPETPQFSGESR 233 Query: 730 
FLIFPATMGNGGGAQSLNNTNANSKFEFFKDNKAIRIMVK 849 I P GG N + E K+ RI+V+ Sbjct: 234 LQIIP-----GG----------NDEIEVKKNRSGPRIIVR 258 >emb|CBI18219.3| unnamed protein product [Vitis vinifera] Length = 533 Score = 194 bits (492), Expect = 2e-46 Identities = 141/408 (34%), Positives = 190/408 (46%), Gaps = 41/408 (10%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQ------------ESPDHD-------- 937 K R +ELW KY FSCCC RC+A P TYVD VLQ E+ H Sbjct: 135 KEIRHAELWVKYWFSCCCNRCNASPPTYVDLVLQVRLLWNKLHPESETLAHSLNYIDDNM 194 Query: 938 --KEEIEELTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXX 1111 +EEI +LT + DDAIA+Y+S+ N E+CC+KLE+ +I G EQL+P + Q Sbjct: 195 CREEEIRKLTDYVDDAIADYLSVGNPEACCEKLEN-VIAQGLPDEQLEPIEGKSQANFKL 253 Query: 1112 XXXXXXXXDAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXX 1291 AYT LAS Y+V A LL L S D ++EA I+ Sbjct: 254 HPLHHLSLAAYTTLASAYRVRASQLLDLHSEMDGDELEALSLIKTSAAYSLLLAGATHRI 313 Query: 1292 XXSESSLIASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNR 1471 S+SSLIAS+AN+W +AGESLLS+ARSS+ + V SS+ +H+C Sbjct: 314 FLSDSSLIASIANFWMNAGESLLSLARSSLLNSFVKGRLPVLNLSSLQSHKC-------- 365 Query: 1472 YLFNSFCGQDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGG 1651 N + + + N F SQ + ++ QF N ++ K+WSFL G Sbjct: 366 -----------NECSLADEFEANFFGSQAHNGGLENISKQFLNCVSSITPKVWSFLIQGH 414 Query: 1652 SFLEQINDLDFRRFLTKEAPFDSEATLTIETSK------------------GKRNISGSE 1777 ++ D P DS +ETSK + + G E Sbjct: 415 HLCKKFKD-----------PIDSNWLQKMETSKIWGFQAHSGCTAMDSSSWDEESTGGYE 463 Query: 1778 SQ-PSNQLRGILFQLGVHCLLYGACLSRICYGQHSELASDAMNFLHSQ 1918 +Q +NQ R LF+LG+HCLLYG LS ICYG S L N + + Sbjct: 464 AQRDTNQERKNLFKLGIHCLLYGGFLSSICYGPSSYLTRYIRNLVDGE 511 Score = 87.0 bits (214), Expect = 4e-14 Identities = 41/68 (60%), Positives = 48/68 (70%), Gaps = 3/68 (4%) Frame = +1 Query: 550 LEEAILCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRF---STASSEIGG 720 LEEA+LCLV+ NAVEVQ G +G+A+YD FSWINHSCSPNACYRF S + + G Sbjct: 13 LEEALLCLVLTNAVEVQVNGGSALGIAVYDWCFSWINHSCSPNACYRFLLRSPETPQFSG 72 Query: 721 EPRFLIFP 744 E R I P Sbjct: 73 
ESRLQIIP 80 >ref|XP_006434476.1| hypothetical protein CICLE_v10000601mg [Citrus clementina] gi|557536598|gb|ESR47716.1| hypothetical protein CICLE_v10000601mg [Citrus clementina] Length = 619 Score = 192 bits (488), Expect = 7e-46 Identities = 132/386 (34%), Positives = 191/386 (49%), Gaps = 19/386 (4%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE-----------SPDHD---KEEIEE 955 K RQSELWSKY+F C C+RCSA P +YVD L+E S D++ E ++ Sbjct: 249 KGMRQSELWSKYQFVCHCRRCSASPPSYVDMALEETFSSNPEFSSLSSDYNFLKDEANQK 308 Query: 956 LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 LT + D+ +EY+ + + ESCC+KLE++L G E L+ + +IQ Sbjct: 309 LTDWMDEVTSEYLLVGDPESCCQKLENIL-TQGLQGELLESEKVKIQLNLRLHPLHHLSL 367 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 +AYT LAS YK+ + DLLAL+S D Q++AFD R SESSLI Sbjct: 368 NAYTTLASAYKIRSIDLLALNSDIDGQQLDAFDMSRTSAAYSFLLAGATDHLFRSESSLI 427 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 A+ AN+W SAGESLL+++RS W + F + +SS NHEC Sbjct: 428 AASANFWASAGESLLTLSRSPGW-KLFVKPESPMSTSSPENHEC---------------- 470 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 + +Q++R+L N F SQ Q +F + +F I + K+W FL G +L+ + D Sbjct: 471 ---SNCSQVDRFLVNPFLSQSQNVDFQIICNEFLACITNMTRKVWGFLISGCGYLQMLKD 527 Query: 1676 -LDFRRFLTK----EAPFDSEATLTIETSKGKRNISGSESQPSNQLRGILFQLGVHCLLY 1840 +DF P S+ ET + + + R +FQLGVHC+ Y Sbjct: 528 PIDFSWLRQSSNLCHTPCCSDEESNKETEYQENICRRVMQRCDGKERITIFQLGVHCIAY 587 Query: 1841 GACLSRICYGQHSELASDAMNFLHSQ 1918 G L+ ICYG +S N + ++ Sbjct: 588 GGYLANICYGPNSHWPCKIKNVVQNE 613 Score = 101 bits (252), Expect = 2e-18 Identities = 58/101 (57%), Positives = 69/101 (68%), Gaps = 1/101 (0%) Frame = +1 Query: 397 RIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILCLV 576 R+ GL+TN KLM + + D S+ I++GA M AR G SD V EEA LCLV Sbjct: 79 RLFGLLTNRDKLMSSSDSDVASK-IREGAREM--ARARGNLSDDVA-----WEEAALCLV 130 Query: 577 MMNAVEVQE-KNGRPIGVAIYDAAFSWINHSCSPNACYRFS 696 M NAVEVQ+ K GR +G+A+YD 
FSWINHSCSPNACYRFS Sbjct: 131 MTNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFS 171 >ref|XP_006473070.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Citrus sinensis] Length = 619 Score = 189 bits (481), Expect = 4e-45 Identities = 132/386 (34%), Positives = 191/386 (49%), Gaps = 19/386 (4%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE-----------SPDHD---KEEIEE 955 K RQSELWSKY+F C C+RCSA P +YVD L+E S D++ E ++ Sbjct: 249 KGMRQSELWSKYQFVCHCRRCSASPPSYVDMALEETFSSNPEFLSLSSDYNFLKDEANQK 308 Query: 956 LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 LT + D+ +EY+ + + ESCC+KLE++L G E L+ + +IQ Sbjct: 309 LTDWMDEGTSEYLLVGDPESCCQKLENIL-TQGLQGELLESEKVKIQLNLRLHPLHHLSL 367 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 +AYT LAS YK+ + DLLAL+S D Q+EAFD R SESSLI Sbjct: 368 NAYTTLASAYKIRSIDLLALNSDIDGQQLEAFDMSRTSAAYSLLLASTTDHLFRSESSLI 427 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 A+ AN+W SAGESLL++ARS W + +S SS + HEC Sbjct: 428 AASANFWASAGESLLTLARSPGWNLFVKPELPISTSSPEI-HEC---------------- 470 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQIND 1675 ++ + ++R N F SQ + A+F + +F I + K+W FLT+G +L+ + D Sbjct: 471 ---SKCSLVDRLQVNPFLSQSRNADFQIICNEFLACITNMTRKVWGFLTHGCGYLQMLKD 527 Query: 1676 -LDFRRFLTK----EAPFDSEATLTIETSKGKRNISGSESQPSNQLRGILFQLGVHCLLY 1840 +DF P S+ ET + + + R +FQLGVHC+ Y Sbjct: 528 PIDFSWLRQSSNLCHTPCCSDEESNKETGYQESICRRVMQRCDGEERITIFQLGVHCIAY 587 Query: 1841 GACLSRICYGQHSELASDAMNFLHSQ 1918 G L+ ICYG +S N + ++ Sbjct: 588 GGYLANICYGPNSHWPCKIKNVVQNE 613 Score = 101 bits (252), Expect = 2e-18 Identities = 58/101 (57%), Positives = 69/101 (68%), Gaps = 1/101 (0%) Frame = +1 Query: 397 RIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILCLV 576 R+ GL+TN KLM + + D S+ I++GA M AR G SD V EEA LCLV Sbjct: 79 RLFGLLTNRDKLMSSSDSDVASK-IREGAREM--ARARGNLSDDVA-----WEEAALCLV 130 Query: 577 MMNAVEVQE-KNGRPIGVAIYDAAFSWINHSCSPNACYRFS 696 M NAVEVQ+ K GR +G+A+YD FSWINHSCSPNACYRFS Sbjct: 131 
MTNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFS 171 >ref|XP_003595407.1| Protein SET DOMAIN GROUP [Medicago truncatula] gi|355484455|gb|AES65658.1| Protein SET DOMAIN GROUP [Medicago truncatula] Length = 683 Score = 186 bits (471), Expect = 6e-44 Identities = 142/418 (33%), Positives = 198/418 (47%), Gaps = 24/418 (5%) Frame = +2 Query: 746 QPWETVVELSRLIILMQTPSSNFLKT---TRQSELWSKYRFSCCCKRCSAVPATYVDRVL 916 QP + L +++ M SN L TRQSELWSKY+F CCC+RCS++ TYVD +L Sbjct: 288 QPKMISLSLEWMLMFMVMCRSNGLVLVLGTRQSELWSKYQFICCCQRCSSLLFTYVDHIL 347 Query: 917 QE---------------SPDHDKEEIEELTVFFDDAIAEYISMNNTESCCKKLESLLIDD 1051 QE D + LT +D I+EY+S+ ++ SCC+KLE +LI+ Sbjct: 348 QEICVVCGDLSGLRSNYKFFRDMTD-RRLTDSIEDVISEYLSVGDSVSCCEKLEKILIEG 406 Query: 1052 GHIKEQLQPKNREIQRXXXXXXXXXXXXDAYTILASGYKVLAFDLLALSSGNDKFQMEAF 1231 + EQL+ K + Y LAS YKV A DLL+ S D Q +AF Sbjct: 407 --VDEQLEGK---AHSQLTLHPLHHLSLNCYMTLASAYKVRASDLLSGDSEIDFNQSKAF 461 Query: 1232 DKIRXXXXXXXXXXXXXXXXXXSESSLIASVANYWGSAGESLLSVARSSVWEEAFQLAPA 1411 D R SESSLIASVAN+W AGESLL++ RSS W + + Sbjct: 462 DMSRTSAAYFLLLAGAAHHLFNSESSLIASVANFWIGAGESLLTLTRSSGWSKFLNVDLV 521 Query: 1412 VSESSSILNHECYRSPQLNRYLFNSFCGQDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQ 1591 +S +S +C + ++ + R+ LN Q +F+ V+ + Sbjct: 522 LSNLASDTKFKCCKWSLMDTF-----------RACMLN--------GQINSQDFENVSNE 562 Query: 1592 FQNFIADSLVKMWSFLTYGGSFLEQIND-LDFRRFLTKEAPFDSEA----TLTIETSKGK 1756 F + ++D +WSFL YG FL+ D ++F ++K+ D A T T + Sbjct: 563 FIHSVSDITRNVWSFLVYGCQFLKSCKDPINFGWVMSKQNSLDVRAHDIKTGMCYTHEPV 622 Query: 1757 RNISGSESQPSNQLRGI-LFQLGVHCLLYGACLSRICYGQHSELASDAMNFLHSQGIF 1927 +I Q N +FQLGVHCL YG L+ ICYG HS L S N L + F Sbjct: 623 NSIGFRGEQDYNDHTVTHIFQLGVHCLTYGGLLACICYGPHSHLVSQVQNILDHKNDF 680 Score = 104 bits (260), Expect = 2e-19 Identities = 60/148 (40%), Positives = 88/148 (59%), Gaps = 2/148 (1%) Frame = +1 Query: 397 RIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAI--LC 570 R+ L+TN R L+ ++ +D+V+ ++ GA M A G +DGG LEEA LC Sbjct: 118 RLNHLLTN-RHLLTSQNDDDVAETVRLGALTMATAIEKQNGCS---KDGGTLEEATVALC 173 
Query: 571 LVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSEIGGEPRFLIFPAT 750 V+ NAVEV + G +G+A+++ AFSWINHSCSPNACYRFS ++S + E + I P T Sbjct: 174 AVLTNAVEVHDNEGCALGIAVFEHAFSWINHSCSPNACYRFSFSNSLLSRESKLRIAPFT 233 Query: 751 MGNGGGAQSLNNTNANSKFEFFKDNKAI 834 N Q +++ S EF ++ + I Sbjct: 234 Q-NSKQPQQIDSGVFGSSSEFAQEGREI 260 >ref|XP_004516217.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X2 [Cicer arietinum] Length = 659 Score = 176 bits (445), Expect = 6e-41 Identities = 133/401 (33%), Positives = 194/401 (48%), Gaps = 27/401 (6%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQESP---------------DHDKEEIE 952 K RQSELWSKYRF CCCKRC+++P TYVD LQE D + Sbjct: 284 KALRQSELWSKYRFLCCCKRCTSLPFTYVDHALQEISVLYGDSSGLRTNYKFFRDMAD-R 342 Query: 953 ELTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXX 1132 LT +DAI+EY+S+ ++ SCC+KLE +L + + EQL+ + Sbjct: 343 RLTDSIEDAISEYLSVGDSLSCCEKLEKILTEG--LDEQLEENEEKSHYKFILHPLHHLS 400 Query: 1133 XDAYTILASGYKVLAFDLLALSSG-----NDKFQMEAFDKIRXXXXXXXXXXXXXXXXXX 1297 ++YT LAS YKV A D LSSG +++ + +AFD R Sbjct: 401 LNSYTTLASAYKVRACD---LSSGDFEIDSNQSESKAFDLSRTSTAYFLLLASGVHHLFN 457 Query: 1298 SESSLIASVANYWGSAGESLLSVARSSVWEEAF-QLAPAVSESSSILNHECYRSPQLNRY 1474 SESSLIASVAN+W AGESLL++ +SS W F +S +S EC + ++R+ Sbjct: 458 SESSLIASVANFWVGAGESLLTLTKSSGWSSKFVNFDLVLSNIASDTKFECSKCSLMDRF 517 Query: 1475 LFNSFCGQDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGS 1654 R + LN + + +F+ V+ +F + ++D K+W+FL YG Sbjct: 518 -----------RDSILN--------GKIKSEDFENVSNEFIHCVSDITHKVWNFLVYGCH 558 Query: 1655 FLEQIND-LDFRRFLTKEAPFDSEA----TLTIETSKGKRNISGS-ESQPSNQLRGILFQ 1816 FL+ D + F ++ + D A T T + + +I S E ++ + Q Sbjct: 559 FLKSCKDPISFSWLMSIKNSVDVGANDIKTDMCYTHEPENSIGVSDELAYTDHTVAHILQ 618 Query: 1817 LGVHCLLYGACLSRICYGQHSELASDAMNFLHSQGIFAESV 1939 LG HCL YG L+ +CYG +S L S N L + F S+ Sbjct: 619 LGRHCLTYGGLLAFVCYGPNSHLVSHVQNILARENNFLFSL 659 Score = 112 bits (281), Expect = 7e-22 Identities = 85/248 (34%), Positives = 113/248 (45%), Gaps = 7/248 (2%) Frame = +1 Query: 28 
MEMEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNYSHV 207 MEMEMR+ D + D+TPPL P FSL++ N+SH Sbjct: 1 MEMEMRSISDRDIGTDITPPLTPFSFSLHNTHLHTHCSSCFSLITPIIPTT----NHSH- 55 Query: 208 PTXXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNKEVV 387 H SSAE HL F Sbjct: 56 --STFYCSPHCSTSHSPIHLSSAERHLPSSINSSLLRTALRLLLLHHTTSLFP------- 106 Query: 388 SLERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVA----RRMGEGSDSVVEDGGLLE 555 RI L+TN +L++ + D+V+ I+ GA AM A R G G S D +LE Sbjct: 107 ---RINHLLTN--RLLLTCQNDDVNETIRLGAHAMATAIANHRGGGSGGFSEPYDNAVLE 161 Query: 556 EAI--LCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFS-TASSEIGGEP 726 ++ LC V+ NAVEV + G +G+A+++ AFSWINHSCSPNACYRFS ++SS + E Sbjct: 162 KSTDALCAVLTNAVEVHDNEGCAVGIAVFEPAFSWINHSCSPNACYRFSFSSSSLLSQES 221 Query: 727 RFLIFPAT 750 +FLI P T Sbjct: 222 KFLIAPFT 229 >ref|XP_004516216.1| PREDICTED: protein SET DOMAIN GROUP 41-like isoform X1 [Cicer arietinum] Length = 660 Score = 176 bits (445), Expect = 6e-41 Identities = 133/401 (33%), Positives = 194/401 (48%), Gaps = 27/401 (6%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQESP---------------DHDKEEIE 952 K RQSELWSKYRF CCCKRC+++P TYVD LQE D + Sbjct: 285 KALRQSELWSKYRFLCCCKRCTSLPFTYVDHALQEISVLYGDSSGLRTNYKFFRDMAD-R 343 Query: 953 ELTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXX 1132 LT +DAI+EY+S+ ++ SCC+KLE +L + + EQL+ + Sbjct: 344 RLTDSIEDAISEYLSVGDSLSCCEKLEKILTEG--LDEQLEENEEKSHYKFILHPLHHLS 401 Query: 1133 XDAYTILASGYKVLAFDLLALSSG-----NDKFQMEAFDKIRXXXXXXXXXXXXXXXXXX 1297 ++YT LAS YKV A D LSSG +++ + +AFD R Sbjct: 402 LNSYTTLASAYKVRACD---LSSGDFEIDSNQSESKAFDLSRTSTAYFLLLASGVHHLFN 458 Query: 1298 SESSLIASVANYWGSAGESLLSVARSSVWEEAF-QLAPAVSESSSILNHECYRSPQLNRY 1474 SESSLIASVAN+W AGESLL++ +SS W F +S +S EC + ++R+ Sbjct: 459 SESSLIASVANFWVGAGESLLTLTKSSGWSSKFVNFDLVLSNIASDTKFECSKCSLMDRF 518 Query: 1475 LFNSFCGQDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGS 1654 R + LN + + +F+ V+ +F + ++D K+W+FL YG Sbjct: 519 -----------RDSILN--------GKIKSEDFENVSNEFIHCVSDITHKVWNFLVYGCH 559 Query: 1655 
FLEQIND-LDFRRFLTKEAPFDSEA----TLTIETSKGKRNISGS-ESQPSNQLRGILFQ 1816 FL+ D + F ++ + D A T T + + +I S E ++ + Q Sbjct: 560 FLKSCKDPISFSWLMSIKNSVDVGANDIKTDMCYTHEPENSIGVSDELAYTDHTVAHILQ 619 Query: 1817 LGVHCLLYGACLSRICYGQHSELASDAMNFLHSQGIFAESV 1939 LG HCL YG L+ +CYG +S L S N L + F S+ Sbjct: 620 LGRHCLTYGGLLAFVCYGPNSHLVSHVQNILARENNFLFSL 660 Score = 112 bits (281), Expect = 7e-22 Identities = 85/248 (34%), Positives = 113/248 (45%), Gaps = 7/248 (2%) Frame = +1 Query: 28 MEMEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXXNYSHV 207 MEMEMR+ D + D+TPPL P FSL++ N+SH Sbjct: 1 MEMEMRSISDRDIGTDITPPLTPFSFSLHNTHLHTHCSSCFSLITPIIPTT----NHSH- 55 Query: 208 PTXXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNKEVV 387 H SSAE HL F Sbjct: 56 --STFYCSPHCSTSHSPIHLSSAERHLPSSINSSLLRTALRLLLLHHTTSLFP------- 106 Query: 388 SLERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVA----RRMGEGSDSVVEDGGLLE 555 RI L+TN +L++ + D+V+ I+ GA AM A R G G S D +LE Sbjct: 107 ---RINHLLTN--RLLLTCQNDDVNETIRLGAHAMATAIANHRGGGSGGFSEPYDNAVLE 161 Query: 556 EAI--LCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFS-TASSEIGGEP 726 ++ LC V+ NAVEV + G +G+A+++ AFSWINHSCSPNACYRFS ++SS + E Sbjct: 162 KSTDALCAVLTNAVEVHDNEGCAVGIAVFEPAFSWINHSCSPNACYRFSFSSSSLLSQES 221 Query: 727 RFLIFPAT 750 +FLI P T Sbjct: 222 KFLIAPFT 229 >gb|EMJ00499.1| hypothetical protein PRUPE_ppa023162mg, partial [Prunus persica] Length = 635 Score = 169 bits (428), Expect = 6e-39 Identities = 132/372 (35%), Positives = 175/372 (47%), Gaps = 22/372 (5%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE-----------SPD---HDKEEIEE 955 K RQSELWS+YRF C C RCSA P TYVD+VL+E S D + + + Sbjct: 294 KAMRQSELWSRYRFICSCTRCSASPLTYVDQVLEEISAANFNSSSLSSDINFNRDKATQR 353 Query: 956 LTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXX 1135 LT + DDAI +Y+S+ + ES +LE +L G +Q + K Q Sbjct: 354 LTNYIDDAIDDYLSIGDPESSSVRLEHVLTQ-GLSDKQSECKEETSQLTYWLHPLHHLSL 412 Query: 1136 DAYTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLI 1315 +AYT LA L S D + A D R SESSLI Sbjct: 413 
NAYTTLAQ----------PLYSKMDDHLLNALDLSRTSTAYSLLLAGATHHLFRSESSLI 462 Query: 1316 ASVANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCG 1495 SVAN+W SAGESLL++ARSSVW + Q VS SS + C +++ +SF G Sbjct: 463 VSVANFWSSAGESLLTLARSSVWSQFVQRDLPVSNPSSTGKYRCPNCSLADKFETDSFHG 522 Query: 1496 QDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQI-N 1672 Q RY A+FD V+ +F + + + +W+FL G +L + N Sbjct: 523 Q--------VRY-----------ADFDYVSNEFVDCVTNFTQNVWNFLGLGCQYLRLVKN 563 Query: 1673 DLDF------RRFLTKEAPFDSEATLTIETSKGKRNISGSESQP-SNQLRGILFQLGVHC 1831 +DF R E S T R ISGSE++ +NQ+R LF+LGVHC Sbjct: 564 PIDFSWLGTVRYSSVGEDIVRSSGTEVASKCGAGRRISGSEAEGYNNQVRICLFKLGVHC 623 Query: 1832 LLYGACLSRICY 1867 LLYG L+ ICY Sbjct: 624 LLYGGYLASICY 635 Score = 127 bits (320), Expect = 2e-26 Identities = 81/225 (36%), Positives = 106/225 (47%), Gaps = 5/225 (2%) Frame = +1 Query: 34 MEMRAAEDIAMAEDLTPPLLPLVFSLYDXXXXXXXXXXXXXXXXXXXXXXXXX-----NY 198 MEMRA EDI + ED+TPPL PL F+L+D N Sbjct: 1 MEMRAEEDIEIGEDITPPLTPLGFALHDSLLSSHCSSCFSLLPPHPFPPLHFTPPFPHNP 60 Query: 199 SHVPTXXXXXXXXXXXXXXXXHFSSAEPHLLRXXXXXXXXXXANXXXXXXXXXXFAKGNK 378 HV + H SSAE HLL Sbjct: 61 HHVLSSSSYCSPLCSTSDSPLHVSSAELHLLHLLQSHPSTYPHGDSSDLRAALRLLHSLP 120 Query: 379 EVVSLERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEE 558 RI GL+TN+ K + ++ I+DGA AM +AR+M + + +V + +LEE Sbjct: 121 ATGPSARIAGLLTNHHKFLHHDDHHR----IRDGARAMFLARKMRDEAPNVYD--AVLEE 174 Query: 559 AILCLVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRF 693 A LCLV+ NAVEVQ+K GR +G+++Y +F WINHSCSPNACYRF Sbjct: 175 AALCLVLTNAVEVQDKTGRTLGISVYGPSFCWINHSCSPNACYRF 219 >ref|XP_004138545.1| PREDICTED: protein SET DOMAIN GROUP 41-like [Cucumis sativus] Length = 659 Score = 160 bits (406), Expect = 2e-36 Identities = 132/397 (33%), Positives = 173/397 (43%), Gaps = 32/397 (8%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE---------------SPDHDKEEIE 952 K RQSELWS+Y+F C C+RCSAVP TYVD LQE + DHD + Sbjct: 295 KVLRQSELWSRYQFVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHD-TAVR 353 Query: 953 
ELTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXX 1132 + + D+AI EY+S ++ ESCC+KL++LL H EQ++ + Sbjct: 354 RIDEYVDNAITEYLSTSSPESCCEKLQNLLTFGFH-DEQVEDGEGKQHVSLRLHPLHFLL 412 Query: 1133 XDAYTILASGYKVLAFDLLALSSGNDK---FQMEAFDKIRXXXXXXXXXXXXXXXXXXSE 1303 +AYT L S YKV + DL+ALSS DK + A + E Sbjct: 413 LNAYTALTSAYKVRSCDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFE 472 Query: 1304 SSLIASVANYWGSAGESLLSVAR-SSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLF 1480 SL+AS AN W AGESLL +AR SS+W ++ N + P R + Sbjct: 473 PSLVASAANCWVVAGESLLILARHSSLW-------------ATTTNTSNWVFPLGKRMCY 519 Query: 1481 NSFCGQDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFL 1660 N + N S Q A+F + + N IA K WS LT+G +L Sbjct: 520 NCSWVDEFNAS---------RIHGQPVQADFREFSIGISNCIASISQKCWSSLTHGCPYL 570 Query: 1661 EQINDLDFRRFLTKEAPFDSEATLTIETSKGKRNISGS-------------ESQPSNQLR 1801 + PFD T E R I S + Q SNQ R Sbjct: 571 KAFT-----------GPFDFSWPKTNEQDICGRGIDHSCACSKTQDVCLECKPQDSNQER 619 Query: 1802 GILFQLGVHCLLYGACLSRICYGQHSELASDAMNFLH 1912 + LG+HCL YG L+ ICYG HS LAS N L+ Sbjct: 620 ESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILN 656 Score = 112 bits (280), Expect = 9e-22 Identities = 57/118 (48%), Positives = 76/118 (64%) Frame = +1 Query: 394 ERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILCL 573 +RI GL+TN KLM + + EV +++GA A+ RR + G LEEA+LCL Sbjct: 140 DRIYGLLTNRHKLMTPQNDSEVFLKLREGANAIAALRRKNYAD---IPPGTALEEAVLCL 196 Query: 574 VMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSEIGGEPRFLIFPA 747 V+ NAV+VQ+ G+ IG+A+Y + FSWINHSCSPNACYRF T S + RF I P+ Sbjct: 197 VLTNAVDVQDSIGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSV--TTRFRIAPS 252 >ref|XP_004166625.1| PREDICTED: LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 41-like [Cucumis sativus] Length = 596 Score = 160 bits (405), Expect = 3e-36 Identities = 132/397 (33%), Positives = 175/397 (44%), Gaps = 32/397 (8%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQE---------------SPDHDKEEIE 952 K RQSELWS+Y+F C C+RCSAVP TYVD LQE + DHD + Sbjct: 232 
KVLRQSELWSRYQFVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHD-TAVR 290 Query: 953 ELTVFFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXX 1132 + + D+AI EY+S ++ ESCC+KL++LL G EQ++ + + Sbjct: 291 RIDEYVDNAITEYLSTSSPESCCEKLQNLL-TFGFRDEQVEDEEGKQHVSLRLHPLHFLL 349 Query: 1133 XDAYTILASGYKVLAFDLLALSSGNDK---FQMEAFDKIRXXXXXXXXXXXXXXXXXXSE 1303 +AYT L S YKV + DL+ALSS DK + A + E Sbjct: 350 LNAYTALTSAYKVRSCDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFE 409 Query: 1304 SSLIASVANYWGSAGESLLSVAR-SSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLF 1480 SL+AS AN W AGESLL +AR SS+W ++ N + P R + Sbjct: 410 PSLVASAANCWVVAGESLLILARHSSLW-------------ATTTNTSNWVFPLGKRMCY 456 Query: 1481 NSFCGQDLNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFL 1660 N + N S R + A+F + + N IA K WS LT+G +L Sbjct: 457 NCSWVDEFNASRIHGRPV---------QADFREFSIGISNCIASISQKCWSSLTHGCPYL 507 Query: 1661 EQINDLDFRRFLTKEAPFDSEATLTIETSKGKRNISGS-------------ESQPSNQLR 1801 + PFD T E R I S + Q SNQ R Sbjct: 508 KAFT-----------GPFDFSWPKTNEQDICGRGIDHSCACSKTQDVCLECKPQDSNQER 556 Query: 1802 GILFQLGVHCLLYGACLSRICYGQHSELASDAMNFLH 1912 + LG+HCL YG L+ ICYG HS LAS N L+ Sbjct: 557 ESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILN 593 Score = 91.3 bits (225), Expect = 2e-15 Identities = 52/118 (44%), Positives = 66/118 (55%) Frame = +1 Query: 394 ERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILCL 573 +RI GL+TN KLM + RR + G LEEA+LCL Sbjct: 93 DRIYGLLTNRHKLMTPK----------------TTPRRKNYAD---IPPGTALEEAVLCL 133 Query: 574 VMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSEIGGEPRFLIFPA 747 V+ NAV+VQ+ G+ IG+A+Y + FSWINHSCSPNACYRF T S + RF I P+ Sbjct: 134 VLTNAVDVQDSIGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSV--TTRFRIAPS 189 >gb|EOY16760.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724865|gb|EOY16762.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724866|gb|EOY16763.1| SET domain-containing protein, putative isoform 3 [Theobroma cacao] gi|508724867|gb|EOY16764.1| SET domain-containing protein, putative isoform 3 
[Theobroma cacao] Length = 625 Score = 155 bits (391), Expect = 1e-34 Identities = 102/320 (31%), Positives = 166/320 (51%), Gaps = 3/320 (0%) Frame = +2 Query: 965 FFDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXXDAY 1144 + D+ I E +S + ESCC+KLES+L HI EQ++ K+ + +AY Sbjct: 320 YMDETITEVLSDGDPESCCEKLESILNLGLHI-EQVESKDGKSLLNFKLHPFHHLALNAY 378 Query: 1145 TILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLIASV 1324 T L S Y++ + DLLAL D+ Q++AFD R SESSLIAS Sbjct: 379 TTLTSAYRICSSDLLALHPDVDECQLKAFDMNRTSAAYSLLLAGATHRLFCSESSLIASA 438 Query: 1325 ANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCGQDL 1504 AN+W +AGESL+++ARSS+W + +SE S+I H+C Sbjct: 439 ANFWTNAGESLVTLARSSLWNLFVKWGFPISEVSTIAKHKC------------------- 479 Query: 1505 NRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMWSFLTYGGSFLEQINDLDF 1684 ++ + ++ + S SQ Q F+ ++ F + +++ K+W FL G +LE D Sbjct: 480 SKCSLMDIFDTKSILSQAQRVNFENISSDFLDCVSNMTAKIWRFLVRGCHYLEVFEDPFD 539 Query: 1685 RRFLTKEAPFDSEATLTIETSK--GKRNISGSESQ-PSNQLRGILFQLGVHCLLYGACLS 1855 +L F + A E SK + +I ++Q +N+ R ++++G+HCLLYG L+ Sbjct: 540 FGWLVHTWDFHARANRNDEDSKFITEGSIYKHQAQWYTNERRIHVYEVGIHCLLYGGILA 599 Query: 1856 RICYGQHSELASDAMNFLHS 1915 ICYGQ+S+L++ ++ L++ Sbjct: 600 HICYGQNSQLSTHVLSILYN 619 Score = 117 bits (294), Expect = 2e-23 Identities = 67/147 (45%), Positives = 87/147 (59%), Gaps = 7/147 (4%) Frame = +1 Query: 391 LERIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAILC 570 L RI GL+TN+ M+ EV+ I+ GA AM AR+ + DG LLEEA+L Sbjct: 116 LHRIDGLLTNHH--MLTSSSPEVAAKIRQGAIAMAAARKSRNRDNEGQSDGFLLEEAVLS 173 Query: 571 LVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFS------TASSEIGGEPRF 732 LV+ NAVEVQ+K+GR +G+A+YD +FSWINHSCSPNACYRFS T S Sbjct: 174 LVITNAVEVQDKSGRSLGIAVYDLSFSWINHSCSPNACYRFSISSPHATLSFREDSSSTL 233 Query: 733 LIFPATMGNGGGAQS-LNNTNANSKFE 810 I P+ +G A S + +T N +E Sbjct: 234 RIVPSVLGEECDACSCVEHTKGNKGYE 260 >gb|ESW24006.1| hypothetical protein PHAVU_004G094200g, partial [Phaseolus vulgaris] Length = 530 Score = 144 bits (362), Expect = 3e-31 Identities = 
97/283 (34%), Positives = 135/283 (47%), Gaps = 12/283 (4%) Frame = +2 Query: 818 KTTRQSELWSKYRFSCCCKRCSAVPATYVDRVLQESPDHDKEEIEELTVF---------- 967 K TRQ ELWSKYRF CCCKRCS +P +YVD LQE + ++F Sbjct: 269 KATRQWELWSKYRFVCCCKRCSDLPLSYVDHALQEISAFSYDSTSSYSMFLKDMADRRLT 328 Query: 968 --FDDAIAEYISMNNTESCCKKLESLLIDDGHIKEQLQPKNREIQRXXXXXXXXXXXXDA 1141 DD I+EY+S+ + ESC KLE +L + EQL+ + A Sbjct: 329 ECIDDVISEYLSVGDPESCRDKLEKILTQG--LNEQLEDIKEKSDSKFMLHPLNHHSLTA 386 Query: 1142 YTILASGYKVLAFDLLALSSGNDKFQMEAFDKIRXXXXXXXXXXXXXXXXXXSESSLIAS 1321 YT LAS YKV A DLL++ S D Q++AFD R SESSLIAS Sbjct: 387 YTTLASAYKVCASDLLSVDSDIDINQLKAFDMSRTSAAYSLLLAGATHHLFNSESSLIAS 446 Query: 1322 VANYWGSAGESLLSVARSSVWEEAFQLAPAVSESSSILNHECYRSPQLNRYLFNSFCGQD 1501 VAN+W AGESLL ++RSS W L V +S + + + ++R Sbjct: 447 VANFWIGAGESLLFLSRSSGWSMRVNLGLMVPNLASAIKFKLSKCSLIDRI--------- 497 Query: 1502 LNRSTQLNRYLFNSFCSQDQIAEFDKVNWQFQNFIADSLVKMW 1630 T ++ NS ++F+ V+ +F ++++ K+W Sbjct: 498 ---RTCISNGQINS-------SDFENVSSEFIYYVSNITQKVW 530 Score = 95.9 bits (237), Expect = 8e-17 Identities = 59/127 (46%), Positives = 79/127 (62%), Gaps = 4/127 (3%) Frame = +1 Query: 397 RIGGLITNYRKLMMAEEEDEVSRMIKDGAEAMVVARRMGEGSDSVVEDGGLLEEAI--LC 570 R+ GL++N R+++ + D VS I+ +A V+A + E +V D +LEEA LC Sbjct: 104 RLAGLLSN-RRILTSHHHDHVSERIR--LDATVMAEAIAE-QRAVPHDDAVLEEATIALC 159 Query: 571 LVMMNAVEVQEKNGRPIGVAIYDAAFSWINHSCSPNACYRFSTASSEIGGEPRFL-IFP- 744 V+ NAVEV + GR +G+A++D FSWINHSCSPNACYRF SS EP L I P Sbjct: 160 AVLTNAVEVHDNEGRALGIAVFDPTFSWINHSCSPNACYRF-ILSSFPSNEPELLRIAPH 218 Query: 745 ATMGNGG 765 MG+GG Sbjct: 219 PQMGSGG 225