BLASTX nr result
ID: Akebia23_contig00016634
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00016634 (3158 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266... 527 e-146 ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citr... 496 e-137 ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Popu... 494 e-136 emb|CBI40243.3| unnamed protein product [Vitis vinifera] 494 e-136 ref|XP_002532013.1| conserved hypothetical protein [Ricinus comm... 483 e-133 ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Popu... 472 e-130 ref|XP_007010267.1| Enhancer of polycomb-like transcription fact... 469 e-129 ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597... 459 e-126 ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597... 459 e-126 ref|XP_007010268.1| Enhancer of polycomb-like transcription fact... 459 e-126 ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prun... 456 e-125 ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263... 451 e-124 gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis] 437 e-119 ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cuc... 417 e-113 ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207... 417 e-113 gb|EYU39775.1| hypothetical protein MIMGU_mgv1a001436mg [Mimulus... 414 e-112 ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutr... 414 e-112 ref|NP_196087.1| Enhancer of polycomb-like transcription factor ... 409 e-111 ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prun... 408 e-111 ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phas... 402 e-109 >ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266152 [Vitis vinifera] Length = 791 Score = 527 bits (1357), Expect = e-146 Identities = 321/800 (40%), Positives = 452/800 (56%), Gaps = 36/800 (4%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDT-DGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDTA 2513 MP V MRR+TRVFVPK+ K GARVLRSG+R D GE K R +WF ++ ++ Sbjct: 1 MPSVGMRRTTRVFVPKTAAKGAAGGARVLRSGRRLWPDSGEGKLTRDA--DWFRLLHNSG 58 Query: 2512 DVPR-------CKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH-GIVY 2357 K W+EV+ ++++D D + ++ D GIVY Sbjct: 59 GGGGGAGGGGGLKENGWHEVNSKQEVDDVDAEVAVSESRNVAGKCGDDQGSDYSRWGIVY 118 Query: 2356 NRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXX 2192 +R+ +R +S + D+ +GI F RKQRRKR Sbjct: 119 SRRTKRSDSKSLLSPEKKRGFEDKRFGIRFSRKQRRKRMEESE----------------- 161 Query: 2191 XXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMC 2012 G ++++ +V++SS + RFT FL SIL +++ SR+ L F+ Sbjct: 162 -----EGGYVCVEMV-----TVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLT 211 Query: 2011 SEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMR 1832 EP++ F+ HGV FL + S+ GIC+IF AR+FIP+FS+DF A P FM Sbjct: 212 WEPMMDAFSSHGVRFLRDPPCARSF-------GICKIFGARRFIPLFSVDFSAVPSCFMY 264 Query: 1831 LHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS-SSV 1655 LH S++LR LP VL+ + E D+++ L CIP++ GS S+ + +S Sbjct: 265 LHSSMLLRFGCLPFVLVNNSMSVCSNGEEPIDSEENLLCIPSKKDHFGSKSITLENDNSG 324 Query: 1654 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHHD------- 1496 K+R + + + F+ R+ R+ NSR++Q++R+S RS R RN S +G H Sbjct: 325 KRRMLQPTIGTSRFSGRNAQWRNGVNSRSIQKRRSSQRSRRVRNPSLVGIHKSNGALVSD 384 Query: 1495 ----------LFRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIES 1346 Y + R+ A+ + +N++ELKST V +K+ +DSVCCSANIL++ES Sbjct: 385 FITNRNKGIPFSSVVYNQELRRSARHASATNIRELKSTSVVVKEEIDSVCCSANILIVES 444 Query: 1345 DRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENG 1166 DRCFRE GA VMLE A +W++ K G +Y +KAE MR ++ NR THAMIW GE+G Sbjct: 445 DRCFRENGANVMLEVSASKEWFIAVKKDGSMKYSHKAEKDMRYAS-NRHTHAMIWNGEDG 503 Query: 1165 WKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYI 986 WKLEF NR+DW+IFKEL+ C DRN++ S + IPVPGV+EV Y D PF RP YI Sbjct: 504 WKLEFPNRQDWMIFKELYKECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYI 563 Query: 985 TMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEK 806 ++DEV+RA+ + A+YDMDS DEEWL+KLN+ F + + ++ E FE M+DAFEK Sbjct: 564 AFKNDEVSRAMAKTTASYDMDSEDEEWLKKLNS-EFHAENDLHGHVSEEDFELMVDAFEK 622 Query: 805 AAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQL 626 A Y +PD+ D + A C L ++ +A VY YWMKKRK+ +L+RVFQ R +QL Sbjct: 623 AVYCSPDDYPDANGAADLCVDLGSREAIACVYGYWMKKRKRKRGSLVRVFQGHHLRKAQL 682 Query: 625 MQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMES 458 + KPV RKKRSF RQ+ + GRGKQQ A A + E A + Q+A+ D S + Sbjct: 683 IPKPVLRKKRSFSRQVGKFGRGKQQNVMQALAAQRKAIDETSAKLKAQEARVSLDRSEKL 742 Query: 457 VLLKRRRAQILMDNADLATY 398 + KR RAQ LM+NADLATY Sbjct: 743 AIRKRVRAQSLMENADLATY 762 >ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citrus clementina] gi|568878428|ref|XP_006492195.1| PREDICTED: uncharacterized protein LOC102612244 [Citrus sinensis] gi|557538852|gb|ESR49896.1| hypothetical protein CICLE_v10030776mg [Citrus clementina] Length = 758 Score = 496 bits (1276), Expect = e-137 Identities = 317/796 (39%), Positives = 437/796 (54%), Gaps = 32/796 (4%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFC--IIDD 2519 MP V MRR+TRVF VVK DGARVLRSG+R D G+ K R N GD+W+ +I+ Sbjct: 1 MPSVGMRRTTRVF---GVVKGVDGARVLRSGRRLWPDSGDGKLRRTNYGDDWYHHPVINK 57 Query: 2518 T---ADVPRCKSIDWY-EVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNR 2351 P+CK W +D D+ V N D M+GIVY+R Sbjct: 58 KNGGPGGPKCKPNGWAAHLD---DLKVYANNDEKKEVKMCKKVKEELKGADLMYGIVYSR 114 Query: 2350 KRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSG 2171 KR+R G K + + YGI F R+QRRK+S Sbjct: 115 KRKRNDGEKSKILEKKK-YGIQFSRRQRRKKSE--------------------------- 146 Query: 2170 HESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 1991 ++ ++ + +ESS S+ FL S+L ++ + + L A+F+ SE + V Sbjct: 147 -----KIVPFSVFGVGLESS--SSGFLVSFLSSVLGCMRRATVELPRLASFLLSETISGV 199 Query: 1990 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 1811 F+ G+ F SW + G+C+IF Q IPMFSLDF A P FM +H +++ Sbjct: 200 FSLRGIRF--------SWDPPIARTGMCRIFGTMQLIPMFSLDFSAVPSCFMYIHHCMLV 251 Query: 1810 RSLYLPDVLIRYLSGLIKKARE---ITDNKKCLPCIPTEMGFPGSNSMASWSSSVKKRKV 1640 R + P V + + + ++K P + +SV K + Sbjct: 252 RFMRPPSVNSSASEDDSSEEEDVDYVCESKTVTPVV---------------DNSVNKVAL 296 Query: 1639 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFH-------HDL---- 1493 VR++ A R+V R S NSR +Q++R+SLR RARN S +G DL Sbjct: 297 HPSVRSSKLAARNVQYRSSLNSRAIQKRRSSLRRRRARNPSLIGSQKASGALVSDLTSCR 356 Query: 1492 ------FRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1331 A K K R Q S ++KE+ ST+ L ++D CC +ILV+ESDRC R Sbjct: 357 KSSIPSSSAVSKSKLRSSLQHSSVLSIKEVSSTVDSLMLDLDRSCCCVSILVMESDRCCR 416 Query: 1330 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1151 EGA V+LE +W+LV K G +RY +KA+ +MRPS+ NRFTHA++W G++ WKLEF Sbjct: 417 VEGANVILEMSHSKEWHLVVKKDGETRYSFKAQRIMRPSSFNRFTHAILWAGDDNWKLEF 476 Query: 1150 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 971 SNR+DWL FK+L+ C DRN QV ++ IP+PGVYEV Y+DS VPF RP +YI++ D Sbjct: 477 SNRQDWLNFKDLYKECSDRNAQVSVSKVIPIPGVYEVLGYEDSNTVPFCRPDSYISVNVD 536 Query: 970 EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 791 EV+RAL ++ ANYDMDS DEEWL+K NN F + + ++ + FE ++DAFEKA + + Sbjct: 537 EVSRALAKRTANYDMDSEDEEWLKKFNN-EFVTENELHEHVSEDTFELIVDAFEKAYFCS 595 Query: 790 PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 611 PD+ S+E A + C L RK+ V AVY +W +KRKQ ALLRVFQ P+ L+ KP Sbjct: 596 PDDYSNEEAAVNLCLELGRKEVVLAVYNHWKQKRKQKRAALLRVFQGRQPKKPSLIPKPA 655 Query: 610 FRKKRSFKRQMRQSGRGKQQIFFHASAVE-----PEQDAMQRVQKAKSLADISMESVLLK 446 RK+RSFKRQ Q GRGK + V EQ+AM+RV++AK+ A S+E +LK Sbjct: 656 LRKRRSFKRQASQPGRGKPPVVLLPEVVTQQDALEEQNAMRRVEEAKASAKRSLEEAVLK 715 Query: 445 RRRAQILMDNADLATY 398 R+RAQ+LM NADLATY Sbjct: 716 RQRAQLLMQNADLATY 731 >ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Populus trichocarpa] gi|550330500|gb|EEF01621.2| hypothetical protein POPTR_0010s24240g [Populus trichocarpa] Length = 777 Score = 494 bits (1272), Expect = e-136 Identities = 312/802 (38%), Positives = 440/802 (54%), Gaps = 38/802 (4%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCII---- 2525 MP V +RR+TRVF V+K DGARVLRSG+R + G+ K R N GDEW+ I Sbjct: 1 MPSVGLRRTTRVF---GVIKGVDGARVLRSGRRLWQESGDGKLRRSNDGDEWYHTIIKND 57 Query: 2524 -------DDTADVPRCKSIDWYEVDP-ERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH 2369 + +D+ ++ W D ++D+ V +K Sbjct: 58 NYQTKNQNKNSDLKYKENSGWAHDDKLKKDLGVV--------IAIAAPKRIKRVKSEKKF 109 Query: 2368 GIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXX 2189 GIVY RKR+RL G K S D+ +GI F R+QRR Sbjct: 110 GIVYRRKRKRLGGEKSEDS-EDKKFGIQFSRRQRRSLDD--------------------- 147 Query: 2188 XXXXSGHESSIDVIHGAILDIVVES-SCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMC 2012 ESS ++ L ++VE S +S++ +CFL S+L+++K +SL+E A F+ Sbjct: 148 -------ESSESLVCTPELVVLVEDFSSSSSNGLSCFLSSVLRYIKRVNLSLSELADFLL 200 Query: 2011 SEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMR 1832 SEP+ VFA +G+HF +LS D I GIC+ F RQ +PMFS+DF + P F+ Sbjct: 201 SEPISSVFASNGLHFARDLSA------DRI--GICKFFGTRQLLPMFSVDFSSIPSCFVH 252 Query: 1831 LHFSVVLRSLYLPDVLIRYLSGLIKKAREI--TDNKKCLPCIPTEMGFPGS-NSMASWSS 1661 +H S+ +R +L + + + ++ + +K C + F ++ + Sbjct: 253 MHLSLFVRFKFLSPIPVNNSLDEDDEDDDVMMSGSKVDQSCTTMKTDFALKITAVPEIDN 312 Query: 1660 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH------ 1499 S K V VRA+ A RS R+ NSR +Q++R+SLR R RNS+ G H Sbjct: 313 SGSKAVVHPSVRASKLAGRSTQYRNGLNSRGIQKRRSSLRRGRPRNSAIAGLHKASGALV 372 Query: 1498 -DLF---RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVI 1352 DL R G K+K R+ + SP +N+KE+ S V +K++M+ CSANILV Sbjct: 373 SDLISSRRKGIPFSSVVSKNKLRRSVRSSPAANIKEMNSAAVGVKKDMNMSSCSANILVS 432 Query: 1351 ESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGE 1172 ESDRC+R EGA VM E +W LV K GL+RY + A+ MR NRFTH +IWTG+ Sbjct: 433 ESDRCYRIEGATVMFEFTGSREWVLVVKKDGLTRYTHLAQKSMRTCASNRFTHDIIWTGD 492 Query: 1171 NGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIA 992 + WKLEF NR+DW IFKEL+ C D N+ ++ I VPGV EV Y++ G PF+RP A Sbjct: 493 DNWKLEFPNRQDWFIFKELYKECSDCNVPASVSKVISVPGVREVLGYENGGGAPFLRPYA 552 Query: 991 YITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAF 812 YI+ +DEVARAL R A+YDMDS DEEWL+K NN + +++ + FE ++DA Sbjct: 553 YISSENDEVARALARSTASYDMDSEDEEWLKKYNNDF----LAESDHLSEDNFELLIDAL 608 Query: 811 EKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSS 632 EK+ Y PD+ +DE+ A +C R++ AVY YWMKKRKQ LLRVFQ + + Sbjct: 609 EKSYYCNPDDFTDENAAAKYCKDFGRREVAEAVYSYWMKKRKQKCSPLLRVFQGHQAKKT 668 Query: 631 QLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE----QDAMQRVQKAKSLADISM 464 ++ KPV RK+RSFKR Q GRGKQ A + + +AM ++++A++ S+ Sbjct: 669 PVIPKPVLRKRRSFKRPPSQFGRGKQPSLLPVMAADQDALEGYNAMHKIEEAENSVKRSL 728 Query: 463 ESVLLKRRRAQILMDNADLATY 398 E+ +LKRRRAQ+LM NADLATY Sbjct: 729 EAAILKRRRAQLLMKNADLATY 750 >emb|CBI40243.3| unnamed protein product [Vitis vinifera] Length = 734 Score = 494 bits (1272), Expect = e-136 Identities = 292/730 (40%), Positives = 416/730 (56%), Gaps = 28/730 (3%) Frame = -2 Query: 2503 RCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH-GIVYNRKRRRLPGN 2327 RC+ W+EV+ ++++D D + ++ D GIVY+R+ +R Sbjct: 12 RCRLNGWHEVNSKQEVDDVDAEVAVSESRNVAGKCGDDQGSDYSRWGIVYSRRTKRSDSK 71 Query: 2326 KFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGHES 2162 +S + D+ +GI F RKQRRKR G Sbjct: 72 SLLSPEKKRGFEDKRFGIRFSRKQRRKRMEESE----------------------EGGYV 109 Query: 2161 SIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVFAQ 1982 ++++ +V++SS + RFT FL SIL +++ SR+ L F+ EP++ F+ Sbjct: 110 CVEMV-----TVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLTWEPMMDAFSS 164 Query: 1981 HGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVLRSL 1802 HGV FL + S+ GIC+IF AR+FIP+FS+DF A P FM LH S++LR Sbjct: 165 HGVRFLRDPPCARSF-------GICKIFGARRFIPLFSVDFSAVPSCFMYLHSSMLLRFG 217 Query: 1801 YLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS-SSVKKRKVDFIVR 1625 LP VL+ + E D+++ L CIP++ GS S+ + +S K+R + + Sbjct: 218 CLPFVLVNNSMSVCSNGEEPIDSEENLLCIPSKKDHFGSKSITLENDNSGKRRMLQPTIG 277 Query: 1624 ATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHHD----------------- 1496 + F+ R+ R+ NSR++Q++R+S RS R RN S +G H Sbjct: 278 TSRFSGRNAQWRNGVNSRSIQKRRSSQRSRRVRNPSLVGIHKSNGALVSDFITNRNKGIP 337 Query: 1495 LFRAGYKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFREEGAK 1316 Y + R+ A+ + +N++ELKST V +K+ +DSVCCSANIL++ESDRCFRE GA Sbjct: 338 FSSVVYNQELRRSARHASATNIRELKSTSVVVKEEIDSVCCSANILIVESDRCFRENGAN 397 Query: 1315 VMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFSNRRD 1136 VMLE A +W++ K G +Y +KAE MR ++ NR THAMIW GE+GWKLEF NR+D Sbjct: 398 VMLEVSASKEWFIAVKKDGSMKYSHKAEKDMRYAS-NRHTHAMIWNGEDGWKLEFPNRQD 456 Query: 1135 WLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDEVARA 956 W+IFKEL+ C DRN++ S + IPVPGV+EV Y D PF RP YI ++DEV+RA Sbjct: 457 WMIFKELYKECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYIAFKNDEVSRA 516 Query: 955 LVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTPDEVS 776 + + A+YDMDS DEEWL+KLN+ F + + ++ E FE M+DAFEKA Y +PD+ Sbjct: 517 MAKTTASYDMDSEDEEWLKKLNS-EFHAENDLHGHVSEEDFELMVDAFEKAVYCSPDDYP 575 Query: 775 DESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVFRKKR 596 D + A C L ++ +A VY YWMKKRK+ +L+RVFQ R +QL+ KPV RKKR Sbjct: 576 DANGAADLCVDLGSREAIACVYGYWMKKRKRKRGSLVRVFQGHHLRKAQLIPKPVLRKKR 635 Query: 595 SFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKRRRAQI 428 SF RQ+ + GRGKQQ A A + E A + Q+A+ D S + + KR RAQ Sbjct: 636 SFSRQVGKFGRGKQQNVMQALAAQRKAIDETSAKLKAQEARVSLDRSEKLAIRKRVRAQS 695 Query: 427 LMDNADLATY 398 LM+NADLATY Sbjct: 696 LMENADLATY 705 >ref|XP_002532013.1| conserved hypothetical protein [Ricinus communis] gi|223528325|gb|EEF30368.1| conserved hypothetical protein [Ricinus communis] Length = 781 Score = 483 bits (1242), Expect = e-133 Identities = 308/802 (38%), Positives = 432/802 (53%), Gaps = 38/802 (4%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCII---- 2525 MP V MRRSTRVF VVK DGARVLRSG+R + GE K R N GDEW + Sbjct: 1 MPSVGMRRSTRVF---GVVKGVDGARVLRSGRRLLIGAGENKFKRANDGDEWLHTMIKNH 57 Query: 2524 ---DDTADVPRC-KSIDWYEVDP-------ERDIDVTDFNLNLAXXXXXXXXXXXXXSRD 2378 + + + +C K W + ER V L + S + Sbjct: 58 HHNHNNSPIMKCNKENGWTQTQTHVSKLKKERPSPVA---LGVGAGAGNEVAKKVNDSGN 114 Query: 2377 KMHGIVYNRKRRRLPG-NKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2201 KM GIVY+RKRRR+ G +K R++ +GI F R+QRR+ Sbjct: 115 KMWGIVYSRKRRRMSGIDKLEILGRNKKFGIQFSRRQRRRVLK----------------- 157 Query: 2200 XXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2021 ++ ++ A+L I+V+ SC+S+ FL +L +++ + +S+ E Sbjct: 158 -----------DNEVESFEPALLGIIVDGSCSSSGLAASFLHLVLGYIRRTNLSIAELVP 206 Query: 2020 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 1841 F+ SE + FA G+ FL + + + GIC+IF +P+FSLDF A PF Sbjct: 207 FLLSESVKCAFASDGLRFLQDTTANRN--------GICKIFGGMSTVPIFSLDFSAVPFC 258 Query: 1840 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWSS 1661 F+ +H + R L + I+++++ C G +++ + Sbjct: 259 FLCMHLRLAFRVKCLSFEPVNNSLDEDSSQEVISESEEDHSC-----GLVRTDTFLLTDN 313 Query: 1660 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFH------- 1502 S K + + A+ A R R+ NSR +Q++R++ R RARN S +G H Sbjct: 314 SGGKVSLHPSLIASKLAGRHSQYRNVLNSRGIQKRRSAFRRRRARNPSGVGIHKANGALV 373 Query: 1501 HDLFRAG----------YKHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVI 1352 DL + K K R+ + +P +N+KE+ T V+ + MDS CSAN+LVI Sbjct: 374 SDLISSRKNGIPFSTVVSKDKLRRSLRLTPAANLKEVNPTAVQTSRVMDSSSCSANLLVI 433 Query: 1351 ESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGE 1172 ESDRC+R GA V LE L +W LV K GL+R + A+ MRP + NR TH +IWTG+ Sbjct: 434 ESDRCYRMVGATVALEISDLKEWVLVVKKDGLTRCTHLAQKSMRPCSSNRITHDVIWTGD 493 Query: 1171 NGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIA 992 + WKLEF NR+DWLIFK+L+ C DRN+ ++ IPVPGV EV Y+DS +PF R A Sbjct: 494 DSWKLEFPNRQDWLIFKDLYKECYDRNVPAPISKAIPVPGVREVLGYEDSSSLPFSRQDA 553 Query: 991 YITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAF 812 YI+ +DEV RAL ++ ANYDMD DEEWL+K N+ F + ++ EKFE M+D Sbjct: 554 YISFNNDEVVRALTKRTANYDMDCEDEEWLKKFNSEFF-VESEEQEHLSEEKFELMIDTL 612 Query: 811 EKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSS 632 E+A Y +PD+ D A +FC L R++ V AVY YWMKK+KQ ALLRVFQ + + Sbjct: 613 ERAFYSSPDDFVDGRAAVNFCIDLGRREVVEAVYGYWMKKQKQRRSALLRVFQLHQGKKA 672 Query: 631 QLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISM 464 L+ KP RK+RSFKRQ Q GRGK+ A A E EQ+AM+ ++ AK+ A S+ Sbjct: 673 SLIPKPGLRKRRSFKRQASQFGRGKKPSLLQAMAAEHDALEEQNAMRNLEAAKASAKSSV 732 Query: 463 ESVLLKRRRAQILMDNADLATY 398 ES +LKRRRAQ+LM+NADLA Y Sbjct: 733 ESAILKRRRAQMLMENADLAVY 754 >ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Populus trichocarpa] gi|550332250|gb|EEE88401.2| hypothetical protein POPTR_0008s02470g [Populus trichocarpa] Length = 774 Score = 472 bits (1215), Expect = e-130 Identities = 311/792 (39%), Positives = 429/792 (54%), Gaps = 28/792 (3%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWF-CIIDDT 2516 MP V +RR+TRVF SVVK DGARVLRSG+R + G+ K R + GDE + II +T Sbjct: 1 MPSVGLRRTTRVF---SVVKGVDGARVLRSGRRLWPESGDGKLRRSSDGDELYQTIIKNT 57 Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH----GIVYNRK 2348 + + ++ + E + D L R K GIVY+RK Sbjct: 58 NNHIKNQNSNSNLKYKENNGWTHDVKLKKDRGIVIAIAAPKKIKRVKSEKEKFGIVYSRK 117 Query: 2347 RRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGH 2168 R+RL G K +P D+ +GI F R+QRR+ G Sbjct: 118 RKRLGGEKS-ENPEDKKFGIQFSRRQRRRE----------------------------GS 148 Query: 2167 ESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVF 1988 ES ++ L +VE +S +CFL S+L +SL+E A F+ S+P+ VF Sbjct: 149 ESQESLVCTPQLVALVEGCSSSNGWLSCFLSSVLGHAMRVSLSLSELADFLLSDPISSVF 208 Query: 1987 AQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASP--FTFMRLHFSVV 1814 A +G+HF+ +L +D I GIC+ FE RQ +PMFS+DF A P F FM L V Sbjct: 209 ASNGLHFVRDLP------SDRI--GICKFFETRQLLPMFSVDFSAIPSCFAFMHLSLFVK 260 Query: 1813 LRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWSSSVKKRKVDF 1634 R L L V + G ++++K C T+ F ++ + S R V Sbjct: 261 FRCLSLIPVN-NSVDGDDDDDEIMSESKGDQSCTSTKTDFTQKITVVPKTDSYGCRVVLH 319 Query: 1633 -IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLFRAGY 1478 VRA+ R+ R+ NSR +Q++R+SLR R RNSS G H DL + Sbjct: 320 PSVRASKLTGRNTQHRNGLNSRGIQKRRSSLRRGRPRNSSIGGLHKANGALVSDLISSRK 379 Query: 1477 ----------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFRE 1328 K K R+ Q SP +++KEL V +K+ M+ CSANIL+ E+DRC+R Sbjct: 380 IGIPFSSVVSKEKLRRSIQSSPAASIKELNCAAVGVKKGMNLSSCSANILITETDRCYRI 439 Query: 1327 EGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFS 1148 EGA VMLE +W LV K +GL+RY + A+ +MR NRFTH +IW G++ WKLEF Sbjct: 440 EGATVMLEFTDSKEWVLVVKKNGLTRYSHLAQKIMRTCVSNRFTHDIIWNGDDNWKLEFP 499 Query: 1147 NRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDE 968 NR+DW IFKEL+ C D N+ ++ IPVPGV V D G PF RP AYI+ +DE Sbjct: 500 NRQDWFIFKELYKECSDHNVPASVSKAIPVPGVRGVLDNGDCGSAPFSRPYAYISSNNDE 559 Query: 967 VARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTP 788 VARAL R A+YDMDS DEEWL+K N + +++ + FE M+DA E++ + P Sbjct: 560 VARALSRSTASYDMDSEDEEWLKKYNKEF----LAESDHLSEDNFELMIDALERSYFCDP 615 Query: 787 DEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVF 608 D+ +DES A +C R++ AVY YWMKKRKQ LLRVFQ + + L+ KPV Sbjct: 616 DDFTDESAAAKYCKDFGRRELAKAVYGYWMKKRKQKRSPLLRVFQGHQAKKTPLIPKPVL 675 Query: 607 RKKRSFKRQMRQSGRGKQQIFFHASAVEPE--QDAMQRVQKAKSLADISMESVLLKRRRA 434 RK+RSFKR Q GRGKQ A A E + A+++V++A++ S+E+ +LKR++A Sbjct: 676 RKRRSFKRPPSQFGRGKQPSLLQAMAAEKDALHSALRKVEEARNSVKRSVEAAMLKRQKA 735 Query: 433 QILMDNADLATY 398 Q+LM NADLAT+ Sbjct: 736 QLLMKNADLATF 747 >ref|XP_007010267.1| Enhancer of polycomb-like transcription factor protein, putative isoform 1 [Theobroma cacao] gi|508727180|gb|EOY19077.1| Enhancer of polycomb-like transcription factor protein, putative isoform 1 [Theobroma cacao] Length = 767 Score = 469 bits (1208), Expect = e-129 Identities = 307/795 (38%), Positives = 444/795 (55%), Gaps = 31/795 (3%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVR--GNGDEWFCIIDDT 2516 MP V MRR+TRVF +VK ++ ARVLRSG+R D GE KP R GDE + ++ Sbjct: 1 MPSVGMRRTTRVF---RMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKA 57 Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKR--R 2342 P+ +++ ++ R K G N ++ R Sbjct: 58 ---------------PKSEVNGVAAEVS---------------GRPKRLGNEENPRKQSR 87 Query: 2341 RLPGNKF-VSSPRDRMYGISFVRKQRRKR-SSGHSINELPRKDCQXXXXXXXXXXXXSGH 2168 ++ F S D+M+GI + RK++R +GH + + + + Sbjct: 88 KMKAGAFNTSGSVDKMFGIVYTRKRKRNGVQNGHLSGNSGQGNYGKKISRRQAIENRNTN 147 Query: 2167 ESSIDVIHGAILDIVVESS-CNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 1991 E DV + VVE+ CN F+ FL+ +L +VK + + L+E AAF+ S+P+ V Sbjct: 148 E---DVEEPKMFSFVVENGDCNGC--FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSV 202 Query: 1990 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 1811 ++ +GV+F R GIC+ F A+ IP+FSLDF A P F+ +H+S VL Sbjct: 203 YSSNGVNFFWGPRNRT---------GICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVL 253 Query: 1810 RSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS---NSMASWSSSVKKRKV 1640 R L R + ++D+++ PC+ + + S N+ + K + Sbjct: 254 R-------LKRIQIVPVNSDEIVSDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVL 306 Query: 1639 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLF--- 1490 VRA+ R+ R+ +SR++Q++R+SLR RARN S +G H DL Sbjct: 307 HPSVRASKLTGRNAQCRNGLSSRSIQKRRSSLRRRRARNPSIVGIHKANGALMSDLISSR 366 Query: 1489 RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1331 R G K+K R + S +NV ++ S++ +L QN+DS CSANILVIE+DRC+R Sbjct: 367 RNGIPFSSVVSKNKLRSSVRNSSVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCYR 426 Query: 1330 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1151 EEGA V LE A +W LV K +++ KA+ MRPS+ NRFTHA+IWTG++ WKLEF Sbjct: 427 EEGAIVTLELSASREWLLVVKKGSSTKFACKADKFMRPSSCNRFTHAIIWTGDDNWKLEF 486 Query: 1150 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 971 NR+DW+IFK+L+ C +RN+ + + IPVPGV+EVP Y+D VPF RP YI++ D Sbjct: 487 PNRQDWIIFKDLYKECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGD 546 Query: 970 EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 791 EV+RAL ++ ANYDMDS DEEWL+K NN F G+ G ++ + FE M+DAFEKA + + Sbjct: 547 EVSRALAKRTANYDMDSEDEEWLKKFNNEFFSGN-GHCEHLSEDCFELMVDAFEKAYFCS 605 Query: 790 PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 611 PD+ S+E+ A C L + V AV+ YW++KRKQ ALLRVFQ + + L+ KP Sbjct: 606 PDDYSNENAAAHLCLDLGTRGLVEAVHTYWLRKRKQRRSALLRVFQGHQVKKAPLVPKPF 665 Query: 610 FRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKR 443 RK+RSFKRQ GRGKQ A A E EQ+AM ++++A+ A S+E +LKR Sbjct: 666 LRKRRSFKRQ-ASHGRGKQPYLLQALAAERDSMAEQNAMLKLEEARVSASRSVELAVLKR 724 Query: 442 RRAQILMDNADLATY 398 +R Q+LM+NADLATY Sbjct: 725 QRTQLLMENADLATY 739 >ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597035 isoform X1 [Solanum tuberosum] Length = 781 Score = 459 bits (1182), Expect = e-126 Identities = 299/822 (36%), Positives = 433/822 (52%), Gaps = 37/822 (4%) Frame = -2 Query: 2674 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT-----AD 2510 MRR+TR+F G RVLRSG+R S GE K + +GDEW ++D+ AD Sbjct: 7 MRRTTRIF----------GTRVLRSGRRLSTP-GEAKRAK-HGDEWIGLLDNVGGGGAAD 54 Query: 2509 VPRCKSIDWYEVD-------PERDIDVTDFNLNLAXXXXXXXXXXXXXSR--DKMHGIVY 2357 RCK W + + E DIDV +++ + D+M G+VY Sbjct: 55 ATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMWGLVY 114 Query: 2356 NRKRRRLPGNKFVSSPRD-RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2180 RKR+R+ + D R YG FVRK++ + + + Sbjct: 115 TRKRKRVADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDL-------------------- 154 Query: 2179 XSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2000 G V G ++V +S S + +C L IL +++ S +SL + F+ S+PL Sbjct: 155 --GKSEDGQVSSGI---VIVNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPL 209 Query: 1999 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 1820 V + G+ + + R I G C I R +P+F+LDF P F+ LH S Sbjct: 210 RDVNSLQGILLFKDQTPR------KIKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSS 263 Query: 1819 VVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS----SSV 1655 ++LR + + L+ + I + +T++K+ + C+ P N+ + + Sbjct: 264 LLLRFVPMSYALVMQPTVAIDEV-TVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAY 322 Query: 1654 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS-------------S 1514 +K++ + + + NSRN+Q++R+SLRS R R+SS Sbjct: 323 DSKKIEVVNPTVGLPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNATGVLTSDR 382 Query: 1513 MGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESD 1343 + F D R + ++ R QK+ +VKELKS LV L QN++S CSAN+LVIE D Sbjct: 383 LRFRRDGLRFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPD 442 Query: 1342 RCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGW 1163 +C+REEGA + +E A W L K G+ R+ E VMRP + NR TH +IW G+NGW Sbjct: 443 KCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDIIWVGDNGW 502 Query: 1162 KLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYIT 983 KLEF R+DWLIFKEL+ C DRN+Q + IPVPGV EV Y +S F RP++YIT Sbjct: 503 KLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYIT 562 Query: 982 MRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKA 803 ++DDE+ARAL R ANYDMD DEEWL N D +++ + FE ++D FEK Sbjct: 563 VKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSADSFELLIDNFEKG 618 Query: 802 AYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLM 623 Y PD+ SDE A S C + E+K+ V AVY YW+KKRKQN +L+++FQC PR +Q++ Sbjct: 619 FYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYNYWLKKRKQNRSSLIKIFQCYQPRRTQVI 678 Query: 622 QKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE-QDAMQRVQKAKSLADISMESVLLK 446 K +FRKKRSFKRQ ++GRGK + F A E E Q+A+ +V++AK+ A+ S + + Sbjct: 679 PKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAEKEQQNAVLKVKEAKAAANKSEDLAVRM 738 Query: 445 RRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 320 R++AQ LM+NADLATY +S EA P L Sbjct: 739 RQKAQQLMENADLATYKAMMALKIAEAAKIAKSKEAVGPIFL 780 >ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597035 isoform X2 [Solanum tuberosum] Length = 779 Score = 459 bits (1180), Expect = e-126 Identities = 300/822 (36%), Positives = 434/822 (52%), Gaps = 37/822 (4%) Frame = -2 Query: 2674 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT-----AD 2510 MRR+TR+F G RVLRSG+R S GE K + +GDEW ++D+ AD Sbjct: 7 MRRTTRIF----------GTRVLRSGRRLSTP-GEAKRAK-HGDEWIGLLDNVGGGGAAD 54 Query: 2509 VPRCKSIDWYEVD-------PERDIDVTDFNLNLAXXXXXXXXXXXXXSR--DKMHGIVY 2357 RCK W + + E DIDV +++ + D+M G+VY Sbjct: 55 ATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMWGLVY 114 Query: 2356 NRKRRRLPGNKFVSSPRD-RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2180 RKR+R+ + D R YG FVRK++ + + + Sbjct: 115 TRKRKRVADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDL-------------------- 154 Query: 2179 XSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2000 G V G ++V +S S + +C L IL +++ S +SL + F+ S+PL Sbjct: 155 --GKSEDGQVSSGI---VIVNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPL 209 Query: 1999 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 1820 V + G+ L ++ K I G C I R +P+F+LDF P F+ LH S Sbjct: 210 RDVNSLQGI-----LLFKTPRK---IKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSS 261 Query: 1819 VVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS----SSV 1655 ++LR + + L+ + I + +T++K+ + C+ P N+ + + Sbjct: 262 LLLRFVPMSYALVMQPTVAIDEV-TVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAY 320 Query: 1654 KKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS-------------S 1514 +K++ + + + NSRN+Q++R+SLRS R R+SS Sbjct: 321 DSKKIEVVNPTVGLPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNATGVLTSDR 380 Query: 1513 MGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESD 1343 + F D R + ++ R QK+ +VKELKS LV L QN++S CSAN+LVIE D Sbjct: 381 LRFRRDGLRFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPD 440 Query: 1342 RCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGW 1163 +C+REEGA + +E A W L K G+ R+ E VMRP + NR TH +IW G+NGW Sbjct: 441 KCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDIIWVGDNGW 500 Query: 1162 KLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYIT 983 KLEF R+DWLIFKEL+ C DRN+Q + IPVPGV EV Y +S F RP++YIT Sbjct: 501 KLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYIT 560 Query: 982 MRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKA 803 ++DDE+ARAL R ANYDMD DEEWL N D +++ + FE ++D FEK Sbjct: 561 VKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSADSFELLIDNFEKG 616 Query: 802 AYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLM 623 Y PD+ SDE A S C + E+K+ V AVY YW+KKRKQN +L+++FQC PR +Q++ Sbjct: 617 FYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYNYWLKKRKQNRSSLIKIFQCYQPRRTQVI 676 Query: 622 QKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEPE-QDAMQRVQKAKSLADISMESVLLK 446 K +FRKKRSFKRQ ++GRGK + F A E E Q+A+ +V++AK+ A+ S + + Sbjct: 677 PKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAEKEQQNAVLKVKEAKAAANKSEDLAVRM 736 Query: 445 RRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 320 R++AQ LM+NADLATY +S EA P L Sbjct: 737 RQKAQQLMENADLATYKAMMALKIAEAAKIAKSKEAVGPIFL 778 >ref|XP_007010268.1| Enhancer of polycomb-like transcription factor protein, putative isoform 2 [Theobroma cacao] gi|508727181|gb|EOY19078.1| Enhancer of polycomb-like transcription factor protein, putative isoform 2 [Theobroma cacao] Length = 784 Score = 459 bits (1180), Expect = e-126 Identities = 307/812 (37%), Positives = 444/812 (54%), Gaps = 48/812 (5%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVR--GNGDEWFCIIDDT 2516 MP V MRR+TRVF +VK ++ ARVLRSG+R D GE KP R GDE + ++ Sbjct: 1 MPSVGMRRTTRVF---RMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKA 57 Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKR--R 2342 P+ +++ ++ R K G N ++ R Sbjct: 58 ---------------PKSEVNGVAAEVS---------------GRPKRLGNEENPRKQSR 87 Query: 2341 RLPGNKF-VSSPRDRMYGISFVRKQRRKR-SSGHSINELPRKDCQXXXXXXXXXXXXSGH 2168 ++ F S D+M+GI + RK++R +GH + + + + Sbjct: 88 KMKAGAFNTSGSVDKMFGIVYTRKRKRNGVQNGHLSGNSGQGNYGKKISRRQAIENRNTN 147 Query: 2167 ESSIDVIHGAILDIVVESS-CNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRV 1991 E DV + VVE+ CN F+ FL+ +L +VK + + L+E AAF+ S+P+ V Sbjct: 148 E---DVEEPKMFSFVVENGDCNGC--FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSV 202 Query: 1990 FAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFSVVL 1811 ++ +GV+F R GIC+ F A+ IP+FSLDF A P F+ +H+S VL Sbjct: 203 YSSNGVNFFWGPRNRT---------GICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVL 253 Query: 1810 RSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS---NSMASWSSSVKKRKV 1640 R L R + ++D+++ PC+ + + S N+ + K + Sbjct: 254 R-------LKRIQIVPVNSDEIVSDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVL 306 Query: 1639 DFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH-------DLF--- 1490 VRA+ R+ R+ +SR++Q++R+SLR RARN S +G H DL Sbjct: 307 HPSVRASKLTGRNAQCRNGLSSRSIQKRRSSLRRRRARNPSIVGIHKANGALMSDLISSR 366 Query: 1489 RAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFR 1331 R G K+K R + S +NV ++ S++ +L QN+DS CSANILVIE+DRC+R Sbjct: 367 RNGIPFSSVVSKNKLRSSVRNSSVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCYR 426 Query: 1330 EEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEF 1151 EEGA V LE A +W LV K +++ KA+ MRPS+ NRFTHA+IWTG++ WKLEF Sbjct: 427 EEGAIVTLELSASREWLLVVKKGSSTKFACKADKFMRPSSCNRFTHAIIWTGDDNWKLEF 486 Query: 1150 SNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDD 971 NR+DW+IFK+L+ C +RN+ + + IPVPGV+EVP Y+D VPF RP YI++ D Sbjct: 487 PNRQDWIIFKDLYKECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGD 546 Query: 970 EVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHT 791 EV+RAL ++ ANYDMDS DEEWL+K NN F G+ G ++ + FE M+DAFEKA + + Sbjct: 547 EVSRALAKRTANYDMDSEDEEWLKKFNNEFFSGN-GHCEHLSEDCFELMVDAFEKAYFCS 605 Query: 790 PDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPV 611 PD+ S+E+ A C L + V AV+ YW++KRKQ ALLRVFQ + + L+ KP Sbjct: 606 PDDYSNENAAAHLCLDLGTRGLVEAVHTYWLRKRKQRRSALLRVFQGHQVKKAPLVPKPF 665 Query: 610 FRKKRSFKRQMRQSGRGKQQIFFH-----------------ASAVE----PEQDAMQRVQ 494 RK+RSFKRQ GRGKQ A A E EQ+AM +++ Sbjct: 666 LRKRRSFKRQ-ASHGRGKQPYLLQGPRFRYNAETSIICNCAALAAERDSMAEQNAMLKLE 724 Query: 493 KAKSLADISMESVLLKRRRAQILMDNADLATY 398 +A+ A S+E +LKR+R Q+LM+NADLATY Sbjct: 725 EARVSASRSVELAVLKRQRTQLLMENADLATY 756 >ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica] gi|462418131|gb|EMJ22618.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica] Length = 832 Score = 456 bits (1173), Expect = e-125 Identities = 314/835 (37%), Positives = 433/835 (51%), Gaps = 42/835 (5%) Frame = -2 Query: 2695 TLMPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDE-WFCIID 2522 T MP VEMRR+TRVF V DGARVLRSG+R + E K R NGDE W ++ Sbjct: 52 TEMPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMK 111 Query: 2521 DTA--DVPRCKSIDWYEVD----PERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIV 2360 A V W + P R+ V +L K +GIV Sbjct: 112 SHAGESVVGLNHKKWAGANQVGSPRRNTPV--LKTSLVKKPQSNELLADLLKEHKRYGIV 169 Query: 2359 YNRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXX 2195 Y RKR+R + + + DRMYG F R+QR K+S EL Sbjct: 170 YTRKRKRASASFLGNVEKENGSDDRMYGRRFARRQRMKKSK-----EL------------ 212 Query: 2194 XXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFM 2015 +S + +L VESS + FL S+L ++ + + LTEF+ F+ Sbjct: 213 ---------DSHPGFVCPEVLCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFL 263 Query: 2014 CSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFM 1835 EP+ +FA +G+ F + S G+C++F A QFIP+FS+DF A P FM Sbjct: 264 ALEPIGSIFASYGIQFSRDRSCTRR-------SGVCKLFGAEQFIPLFSVDFSAVPGCFM 316 Query: 1834 RLHFSVVLR---SLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS 1664 + S+ LR L + +++ + +G + D+ + + I N A S Sbjct: 317 FMQTSMHLRFRCHLTVNNLIDGHENGEFIDQGDDDDDGEKVDFI--------ENRHALHS 368 Query: 1663 SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH----- 1499 S VR A RS R+ SR +Q++R+SLR R+RN S + Sbjct: 369 S----------VRVPKLACRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSLRKPNGAL 418 Query: 1498 -----DLFRAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILV 1355 + + G KH RK S N+K T+ K+++DS CSANIL Sbjct: 419 VSELISIRKNGLPFSSVESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILF 478 Query: 1354 IESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWT- 1178 E D+C+RE+GA VMLE + +W LV K +GL+RY +KAE VMRP + NR T A+IW+ Sbjct: 479 TELDKCYREDGATVMLEMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCSKNRITQAIIWSA 538 Query: 1177 ---GENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPF 1007 G+N WKLEF NR DW IFK+L+ C DR + + + IPVPGV EVP Y DS F Sbjct: 539 DSNGDNNWKLEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLF 598 Query: 1006 VRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEE 827 RP +YI + DDEV+RA+ ++ ANYDMDS DEEWL+K N+ F + + +++ + FE Sbjct: 599 DRPESYIYLNDDEVSRAMAKRTANYDMDSDDEEWLKKFNSDFF-AENELHDHVSEDNFEL 657 Query: 826 MMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNH-VALLRVFQC 650 M+DAFEKA Y P + +DE+ A + C + R++ V A+Y YWM KRKQ +LLRVFQ Sbjct: 658 MVDAFEKAFYCRPYDFADENAAANLCLDMGRREVVEAIYSYWMNKRKQKRSSSLLRVFQG 717 Query: 649 PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKS 482 + + L KPV RK+RSFKRQ Q GRGKQ F A A E EQ+A+ +V++AK+ Sbjct: 718 HQSKRALLDPKPVLRKRRSFKRQPSQFGRGKQPSFLQAMAAEQDALQEQNAIHKVEEAKA 777 Query: 481 LADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFILD 317 AD S+E + KR+RAQ+LM NADL TY SP+A ++LD Sbjct: 778 EADRSVELAIRKRKRAQLLMQNADLVTYKATMAFRIAEAAQVLGSPDAAAAYVLD 832 >ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263728 [Solanum lycopersicum] Length = 790 Score = 451 bits (1161), Expect = e-124 Identities = 292/832 (35%), Positives = 438/832 (52%), Gaps = 47/832 (5%) Frame = -2 Query: 2674 MRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDT------- 2516 MRR+TR+F G RVLRSG+R S E K + +GDEW ++D+ Sbjct: 7 MRRTTRIF----------GTRVLRSGRRLSTSF-EAKRAK-HGDEWIGLLDNVGGGGGAA 54 Query: 2515 ADVPRCKSIDWYEVDPERDIDVTDFNLNL---------AXXXXXXXXXXXXXSRDKMHGI 2363 AD RCK W + + +++ + N+++ D+M G+ Sbjct: 55 ADATRCKKKGWLKKEVALNLEADEMNIDVDSKSMDEQETVEAPVVDTVSPKSYIDRMWGL 114 Query: 2362 VYNRKRRRLPGNKFVSSPRDRM------YGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2201 VY RKR+R+ + S R ++ YG F+RK +K S ++ + +D Q Sbjct: 115 VYTRKRKRVDLKRH-DSVRGKVLTDVMRYGKQFIRK--KKHRSAYAKDSDKSEDGQF--- 168 Query: 2200 XXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2021 S D++ +V +S S + +C L +L +++ S +SL + Sbjct: 169 -------------SSDIV-------IVNTSYGSGYWVSCLLNCMLMYLRRSTVSLQQIFG 208 Query: 2020 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 1841 F+ S+PL V++ G+ L + + R I G C I R +P+F+LDF P Sbjct: 209 FINSKPLRDVWSLQGILLLKDQTSR------KIKTGACVISGVRCSVPVFTLDFSTVPCF 262 Query: 1840 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCI-PTEMGFPGSNSMASWS 1664 F+ LH S++LR + + L+ + I + +T++ + + C+ P + N+ + Sbjct: 263 FLYLHSSLLLRFVPMSYALVMQPTVAIDEVT-VTNDMELVSCLTPVTLSELDVNTQSGHD 321 Query: 1663 ----SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS------- 1517 + +K++ + + + NSRN+Q++R+SLRS R R+SS Sbjct: 322 VVAPGAYDSKKIEVVNTTVGLPKSTARHLQPRNSRNIQKRRSSLRSMRGRHSSFGTQNAS 381 Query: 1516 ------SMGFHHDLFRAGYK---HKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSAN 1364 + F D R + ++ R QK+ +VKELKS LV L QN+++ CSAN Sbjct: 382 GVLTSDRLRFRRDGLRFSSRTPHYELRSSRQKTSMPSVKELKSALVRLTQNIETASCSAN 441 Query: 1363 ILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMI 1184 ILV E D+C+REEGA + +E A W L K G+ R+ E VMRP + NR TH +I Sbjct: 442 ILVTEPDKCYREEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSSNRVTHDLI 501 Query: 1183 WTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPFV 1004 W G++GWKLEF +R+DWLIFKEL+ C DRN+Q + IPVPGV EV Y +S F Sbjct: 502 WVGDSGWKLEFPDRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVSEVSGYAESNPPFFA 561 Query: 1003 RPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEM 824 RP++YIT++DDE+ARAL R ANYDMD DEEWL N D +++ + FE + Sbjct: 562 RPVSYITVKDDELARALARSTANYDMDGDDEEWLRNFN----DQPSLENDHLSTDSFELL 617 Query: 823 MDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPP 644 +D FEK Y PD+ SDE A S C + E+K+ V AVY YW KKRKQN +L+++FQC Sbjct: 618 IDHFEKGFYCNPDDYSDEKAAVSSCPNKEKKEIVEAVYSYWSKKRKQNRSSLIKIFQCYQ 677 Query: 643 PRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLA 476 PR +Q++ K +FRKKRSFKRQ ++GRGK + F A E +Q+A+ +V++AK+ A Sbjct: 678 PRRTQVIPKSIFRKKRSFKRQGSKAGRGKHRPFLPAVVAENDAVQQQNAVLKVKEAKAAA 737 Query: 475 DISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEAYVPFIL 320 + S + + R++AQ LM+NADLATY +S EA P L Sbjct: 738 NKSEDLAVRMRQKAQQLMENADLATYKAMMALRIAEAAKIAKSKEAVAPIFL 789 >gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis] Length = 795 Score = 437 bits (1123), Expect = e-119 Identities = 302/819 (36%), Positives = 435/819 (53%), Gaps = 55/819 (6%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGD--EWFCI---- 2528 MP V MRR+TRVF VVK DGARVLRSG+R D GE K +R + D +WF I Sbjct: 1 MPSVGMRRTTRVF---GVVKGVDGARVLRSGRRLWPDSGEVK-LRRHSDVYDWFKIGKGD 56 Query: 2527 ----------IDDTADVPRCKSIDWYEVD-PERDIDVTDFNLNLAXXXXXXXXXXXXXSR 2381 +T P+ K+ E+ P+ + + ++LA Sbjct: 57 GGLGYDSNGWAHNTNSKPK-KTPPVAEIKAPKPEDNNRGVGVDLAHGGRRP--------- 106 Query: 2380 DKMHGIVYNRKRRRLP----GNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELP 2228 D+M G+VY+RKR+ L GN V+S + YG FVR+QRRK +SG S Sbjct: 107 DRMFGLVYSRKRKNLAVRSSGNASVNSETLGGSVGKRYGRRFVRRQRRKLNSGESFAVAD 166 Query: 2227 RKDCQXXXXXXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSS 2048 D S ++ ++ +V SS + L SIL ++ + Sbjct: 167 DSD------------------SRLEFTPSEVVSVVFGSSMDRNFYAVGVLCSILVYLTRA 208 Query: 2047 RISLTEFAAFMCSEPLVRVFAQHGVH-FLANLSYRISWKNDLISPGICQIFEARQFIPMF 1871 R+ LT+ AF+ SEP+ RV + G++ FL + S + C++F A +F+P+F Sbjct: 209 RLRLTDLFAFLVSEPISRVNSSCGINIFLDHPSIKRF--------ASCKLFGAPEFVPLF 260 Query: 1870 SLDFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFP 1691 +DF A P FM +H + R P L+G + I+D+++ ++ P Sbjct: 261 CVDFSAIPLCFMHMHSCMFFRYKRQPS-----LAGNNEIDEMISDDEE------DQLSSP 309 Query: 1690 GSNSM-------ASWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTR 1532 G +++ A + S + + +A+ FA RS R+ SR +Q++R+SLR + Sbjct: 310 GKDALESKPLLSAEANHSENRLASNPSFKASKFACRSNQYRNGLISRGIQKRRSSLRRRK 369 Query: 1531 ARNSSSMGFHH-------DL--FRAGY-------KHKKRKLAQKSPCSNVKELKSTLVEL 1400 ARN S G DL FR +K R+ + + +KE+ ST+ + Sbjct: 370 ARNPSLCGVQKPNNALLSDLVSFRKNSVSLSLTSNNKLRRSLRSNSARKLKEVSSTVADS 429 Query: 1399 KQNMDSVCCSANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMR 1220 Q+MDS C AN+L+IE ++C+RE G ++LE L W + K G +++ +KAE VMR Sbjct: 430 TQDMDSTSCCANVLIIEPEKCYREGGFSIVLESSPLGGWLIAVKKDGSTKFTHKAEKVMR 489 Query: 1219 PSTPNRFTHAMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEV 1040 P + NRFTH ++WT ++GWKLEF NR+DWLIFK+L+ C DRN+ + +P+PGV EV Sbjct: 490 PCSSNRFTHDIMWTADDGWKLEFPNRKDWLIFKDLYQECSDRNMLAPGVKVVPIPGVNEV 549 Query: 1039 PCYDDSGYVPFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGV 860 DS F RP +YI+++DDE+ RAL RK +NYDMD DEEWL KLNN F + Sbjct: 550 SQKGDSHCTLFRRPDSYISVKDDELCRALKRKTSNYDMDLEDEEWLNKLNN-EFSVENET 608 Query: 859 PNYILPEKFEEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYW-MKKRKQ 683 + +KFE M+DAFEKA + +P + SD T CS L + A+Y YW MKKRKQ Sbjct: 609 YECVSDDKFESMIDAFEKAFFCSPYDNSDVKSLTDLCSHLGGDKAIEAIYVYWTMKKRKQ 668 Query: 682 NHVALLRVFQCPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVE----PEQ 515 +L+R+FQ R + L+ KP RKKRSF RQ Q GRGKQ F A E EQ Sbjct: 669 KRPSLIRIFQLYQGRRT-LVPKPAIRKKRSFNRQPSQVGRGKQSSFLQAMVAERDAAEEQ 727 Query: 514 DAMQRVQKAKSLADISMESVLLKRRRAQILMDNADLATY 398 +AM RV++AK+ A+ +E + R+RAQ+LM+NADLATY Sbjct: 728 NAMHRVEEAKASANRCVELAVESRQRAQLLMNNADLATY 766 >ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cucumis sativus] Length = 819 Score = 417 bits (1073), Expect = e-113 Identities = 293/833 (35%), Positives = 422/833 (50%), Gaps = 56/833 (6%) Frame = -2 Query: 2668 RSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDEWFCIIDDTADVPRC-- 2498 R TRVF +VK +DGARVLRSG+R + GE K + + +W+ IID + Sbjct: 6 RRTRVF---GLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGGGSGH 62 Query: 2497 -----KSIDWYEVDPERDIDVT---DFNLNLAXXXXXXXXXXXXXSRDKMHGI------V 2360 K V P+R + V D + + + DK G+ V Sbjct: 63 GRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRMFGKV 122 Query: 2359 YNRKRRR-----------LPGNKFVSSPRDRMYGISFVRKQR-RKRSSGHSINELPRKDC 2216 Y+RKR+R + + +S DRM+G+ F+R+QR RK H + + Sbjct: 123 YSRKRKRGRLEDGEVFDEMESDNVLSG--DRMFGLRFIRRQRSRKTDVEHWESTAGGRTS 180 Query: 2215 QXXXXXXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISL 2036 H I L I SS + F+ F++++L+ KS +S+ Sbjct: 181 NLHF-----------HRQRILHPRDCALTIFAGSSVDGGC-FSDFILTVLRHFKSPGLSV 228 Query: 2035 TEFAAFMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGIC---QIFEARQFIPMFSL 1865 +F+AF+ S P+ VFA G+ FL G C IF +RQ IPMF L Sbjct: 229 AKFSAFLLSNPINEVFALKGMRFLQGYP----------PTGCCGMFAIFGSRQSIPMFHL 278 Query: 1864 DFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGL-IKKAREITDNKKCLPCIPTEMGFPG 1688 DF A P FM L+ + LR + L+ + L + + + ++ +P+ + Sbjct: 279 DFSAIPLPFMFLYSEMFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLE 338 Query: 1687 SNSMASWSSSVKKRKVDF-IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSM 1511 MA K R V VRAT R++ R+ +SR ++++R+SLR R R+ S Sbjct: 339 RKPMAFLFDRPKTRSVSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLS 398 Query: 1510 GFHHDL-------------FRAGYKHKKRKL-AQKSPCSNVKELKSTLVELKQNMDSVCC 1373 + F +G + K A + ++E ST + ++DS CC Sbjct: 399 AMQKSIGPLAVDDVKLGVSFPSGASCNRHKSSAVRDSAGRIRETNSTALRSAMDVDSSCC 458 Query: 1372 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1193 ANIL++E+D+C REEGA ++LE A +W LV K G +RY +KAE VM+PS+ NRFTH Sbjct: 459 KANILIVEADKCLREEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTH 518 Query: 1192 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1013 A++W+ +NGWKLEF NRRDW IFK+L+ C DRN+ + A+ IPVP V EVP Y DS Sbjct: 519 AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578 Query: 1012 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 833 F RP YI++ DDEV RA+ + ANYDMDS DEEWL + N+G+ D + F Sbjct: 579 SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWLVEFNDGLIATDKH-QECFSEDNF 637 Query: 832 EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 653 E M+DAFEK Y PD SDE C+ L V ++Y YW KKRKQ +L+RVFQ Sbjct: 638 ESMVDAFEKGFYCNPDAFSDEKVPADICTPLASPSIVESLYTYWTKKRKQRKSSLIRVFQ 697 Query: 652 C-PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGK-------QQIFFHASAVEPEQDAMQRV 497 R L+ KP+ R+KRS KRQ QSG G+ + I + AVE +Q+AMQ+ Sbjct: 698 AYQSKRKPPLVPKPMMRRKRSLKRQPSQSGSGRTPQPSILEAILWRRDAVE-DQNAMQKY 756 Query: 496 QKAKSLADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEA 338 +++K+ + +E+ + KR+RAQ+L++NADLA Y +SPEA Sbjct: 757 EESKAAVEKCIENAVNKRQRAQLLLENADLAVYKAMSALRIAEAIETSDSPEA 809 >ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207239 [Cucumis sativus] Length = 819 Score = 417 bits (1073), Expect = e-113 Identities = 293/833 (35%), Positives = 422/833 (50%), Gaps = 56/833 (6%) Frame = -2 Query: 2668 RSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDEWFCIIDDTADVPRC-- 2498 R TRVF +VK +DGARVLRSG+R + GE K + + +W+ IID + Sbjct: 6 RRTRVF---GLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGGGSGH 62 Query: 2497 -----KSIDWYEVDPERDIDVT---DFNLNLAXXXXXXXXXXXXXSRDKMHGI------V 2360 K V P+R + V D + + + DK G+ V Sbjct: 63 GRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRMFGKV 122 Query: 2359 YNRKRRR-----------LPGNKFVSSPRDRMYGISFVRKQR-RKRSSGHSINELPRKDC 2216 Y+RKR+R + + +S DRM+G+ F+R+QR RK H + + Sbjct: 123 YSRKRKRGRLEDGEVFDEMESDNVLSG--DRMFGLRFIRRQRSRKTDVEHWESTAGGRTS 180 Query: 2215 QXXXXXXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISL 2036 H I L I SS + F+ F++++L+ KS +S+ Sbjct: 181 NLHF-----------HRQRILHPRDCALTIFAGSSVDGGC-FSDFILTVLRHFKSPGLSV 228 Query: 2035 TEFAAFMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGIC---QIFEARQFIPMFSL 1865 +F+AF+ S P+ VFA G+ FL G C IF +RQ IPMF L Sbjct: 229 AKFSAFLLSNPINEVFALKGMRFLQGYP----------PTGCCGMFAIFGSRQSIPMFHL 278 Query: 1864 DFCASPFTFMRLHFSVVLRSLYLPDVLIRYLSGL-IKKAREITDNKKCLPCIPTEMGFPG 1688 DF A P FM L+ + LR + L+ + L + + + ++ +P+ + Sbjct: 279 DFSAIPLPFMFLYSEMFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLE 338 Query: 1687 SNSMASWSSSVKKRKVDF-IVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSM 1511 MA K R V VRAT R++ R+ +SR ++++R+SLR R R+ S Sbjct: 339 RKPMAFLFDRPKTRSVSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLA 398 Query: 1510 GFHHDL-------------FRAGYKHKKRKL-AQKSPCSNVKELKSTLVELKQNMDSVCC 1373 + F +G + K A + ++E ST + ++DS CC Sbjct: 399 AMQKSIGPLAVDDVKLGVSFPSGASCNRHKSSAVRDSAGRIRETNSTALGSAMDVDSSCC 458 Query: 1372 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1193 ANIL++E+D+C REEGA ++LE A +W LV K G +RY +KAE VM+PS+ NRFTH Sbjct: 459 KANILIVEADKCLREEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTH 518 Query: 1192 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1013 A++W+ +NGWKLEF NRRDW IFK+L+ C DRN+ + A+ IPVP V EVP Y DS Sbjct: 519 AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578 Query: 1012 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 833 F RP YI++ DDEV RA+ + ANYDMDS DEEWL + N+G+ D + F Sbjct: 579 SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWLIEFNDGLIATDKH-QECFSEDNF 637 Query: 832 EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 653 E M+DAFEK Y PD SDE C+ L V ++Y YW KKRKQ +L+RVFQ Sbjct: 638 ESMVDAFEKGFYCNPDAFSDEKAPADICTPLASPSIVESLYTYWTKKRKQRKSSLIRVFQ 697 Query: 652 C-PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGK-------QQIFFHASAVEPEQDAMQRV 497 R L+ KP+ R+KRS KRQ QSG G+ + I + AVE +Q+AMQ+ Sbjct: 698 AYQSKRKPPLVPKPMMRRKRSLKRQPSQSGSGRTPQPSILEAILWRRDAVE-DQNAMQKY 756 Query: 496 QKAKSLADISMESVLLKRRRAQILMDNADLATYXXXXXXXXXXXXXXXESPEA 338 +++K+ + +E+ + KR+RAQ+L++NADLA Y +SPEA Sbjct: 757 EESKAAVEKCIENAVSKRQRAQLLLENADLAVYKAMSALRIAEAIETSDSPEA 809 >gb|EYU39775.1| hypothetical protein MIMGU_mgv1a001436mg [Mimulus guttatus] Length = 820 Score = 414 bits (1063), Expect = e-112 Identities = 298/836 (35%), Positives = 420/836 (50%), Gaps = 72/836 (8%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGE---EKPVRGNGDE--WFCII 2525 MP V MRR+TRVF G RVLRSG+R + + K R + E W I Sbjct: 1 MPSVGMRRNTRVF----------GTRVLRSGRRLWTEPSKGSNNKNARASHAENKWTDIP 50 Query: 2524 DDTADVPRCKSIDWYEVDPERDIDVTDFNL----NLAXXXXXXXXXXXXXSRDKMHGIVY 2357 D + D P D + ++ + RD+M GIVY Sbjct: 51 DGGGGGGGDAASDRLNHTPREDKNSASSDMIVDPTIEERAPEGGGAVEVKDRDRMCGIVY 110 Query: 2356 NRKRRR-LPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXX 2180 RKR+R L D+ YG FVR++ RKR E K Sbjct: 111 RRKRKRKLVELGKTGLTEDKRYGKKFVRERWRKRFGATESFESCAK----------FGGS 160 Query: 2179 XSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPL 2000 G + V++ ESS + CFL +L ++ RI + +AFM S+P+ Sbjct: 161 VRGRRELVVVVN--------ESSNWCGYWVACFLSCVLSYMTKVRIGMRRMSAFMLSKPI 212 Query: 1999 VRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLHFS 1820 V++ HGV F+ + + N + PG+C I +R +P+FS+DF A P F+ + S Sbjct: 213 FDVYSSHGVLFVQDAI--TARNNGIKKPGLCIISGSRSLVPIFSVDFSAIPSVFVHMQTS 270 Query: 1819 VVLRSLYLPDVLI-RYLSGLIKKAREIT--DNKKCLPCIPTEMGFPG-SNSMASWSSSVK 1652 + LRS +L +L+ R ++ E+T D + L FP + S S ++ Sbjct: 271 LYLRSEHLAFLLVARSTDDDYEEDEEVTAMDEEPYL--------FPSCEQNQDSLDSPIR 322 Query: 1651 KRKV-DFIVRATDFARRSV-STRHSA---------------NSRNVQRKRTSLRSTRARN 1523 D + D +R + S+ HS NSRN++++R+SLR R R Sbjct: 323 DVSCSDVLAFGNDDSRGKIESSSHSPLGLPKSSALRSLQLRNSRNIKKRRSSLRRKRGRP 382 Query: 1522 SSSM-------GFHHDLFR-------------------AGYKHKKRKLA----------- 1454 SS D FR + K+ +K + Sbjct: 383 PSSFRTQKSSGALASDFFRIRNDAVQFSALSPTRLLRSSDKKNSDKKKSDKNSSDKKSSD 442 Query: 1453 QKSPCSNVKELKSTLVELKQNMDSVCCSANILVIESDRCFREEGAKVMLECLALNDWYLV 1274 +KS SN+KE K Q++ CSANIL+ E+D+C+REEGA V LE W+LV Sbjct: 443 KKSSTSNIKETKPAT----QDIYPSTCSANILITETDKCYREEGATVALELSPSKQWFLV 498 Query: 1273 AKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGENGWKLEFSNRRDWLIFKELHMACMDR 1094 G RY AE VMRPS NRF+HA+IW+G+ +KLEFSN++DW +FKEL+ C +R Sbjct: 499 IGKDGTKRYNLTAEKVMRPSCSNRFSHAVIWSGDCNFKLEFSNKQDWFVFKELYKQCSER 558 Query: 1093 NLQVVSARTIPVPGVYEVPCYDDSGYVPFVRPIAYITMRDDEVARALVRKPANYDMDSGD 914 N+Q S IPVPGV EV + ++P+VRP YIT++DDE+ RALV+K ANYDMDS D Sbjct: 559 NMQSPSVSVIPVPGVQEVSMPFYNNFMPYVRPDNYITVKDDELIRALVKKGANYDMDSDD 618 Query: 913 EEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDAFEKAAYHTPDEVSDESRATSFCSSLER 734 EEWL + N+ + G + + + PE FE ++DA EK + PDE +E A FC LER Sbjct: 619 EEWLSEFNDELC-GGMELQEPVSPECFELVIDALEKGVHCNPDENFEELAAYDFCMHLER 677 Query: 733 KDTVAAVYQYWMKKRKQNHVALLRVFQCPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQ 554 ++ + A+ YW+KKRKQ AL+R+FQ PR Q++ K VFRKKRSFKRQ Q GRGKQ Sbjct: 678 REVIEAIRNYWVKKRKQKRSALVRIFQLYQPRRIQVIPKSVFRKKRSFKRQASQGGRGKQ 737 Query: 553 QIFFHASAVE----PEQDAMQRVQKAKSLADISMESVLLKRRRAQILMDNADLATY 398 + A A E +Q+ Q++Q+AK+ A+ + KR+RAQ+LM+NADLATY Sbjct: 738 RPILQAIAAERDALEQQNNAQKLQEAKAAAERFEALAVEKRQRAQMLMENADLATY 793 >ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutrema salsugineum] gi|557100012|gb|ESQ40375.1| hypothetical protein EUTSA_v10012741mg [Eutrema salsugineum] Length = 777 Score = 414 bits (1063), Expect = e-112 Identities = 281/806 (34%), Positives = 418/806 (51%), Gaps = 42/806 (5%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNG---DEWFC---- 2531 MP V MRR+TRVF VVK DGARVLRSG+R + E K R + +W C Sbjct: 1 MPSVGMRRTTRVF---GVVKAADGARVLRSGRRIWPNVDEPKVKRAHDVVDRDWNCLNPS 57 Query: 2530 ------IIDDTADVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMH 2369 + ++ + E+ E+D DF + + DK+ Sbjct: 58 KGKGNKVSGGRSNGAGSRPCSPREISSEKDDKEIDFPVRKRRKVATAEAVGDEKTVDKLF 117 Query: 2368 GIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXX 2189 G+VY+RKR+RL G + + + + F RRKR S ++ PR+ Sbjct: 118 GVVYSRKRKRLSGQSSDNRSEEPLRSLKFYC--RRKRLSDRVVS--PRR----------- 162 Query: 2188 XXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCS 2009 ++G ++ + V++SC + T F++ ++++V+ ++ L+ A+F S Sbjct: 163 -------------LYGPVITLTVDASCEESWFSTVFVL-VMRYVRRGQLGLSSLASFFLS 208 Query: 2008 EPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRL 1829 +P+ VFA HGV FLA + L S G+C+ F A +P+FS DF A P FM + Sbjct: 209 QPINDVFADHGVRFLA--------EPPLSSRGVCKFFGALNCLPLFSADFNAIPRCFMDM 260 Query: 1828 HFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCL----PCIPTEMGFPGSNSMASWSS 1661 HF++ LR + ++ L+ E +D++ + PC P G + Sbjct: 261 HFTLFLRVVPRSFAFVKKSLYLLNNPVEESDSESEIVLSEPCNPRNGVVVGLHPS----- 315 Query: 1660 SVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH------ 1499 V A+ + R S ++Q++R+SLR RARN S G H Sbjct: 316 ----------VTASKLTGGNAQYRGSLGFHSIQKRRSSLRRRRARNLSH-GVHKPHNGTP 364 Query: 1498 -DLFRAGYKHKKRKLAQKSPCSNVKELKS---------TLVELKQNMDSVCCSANILVIE 1349 +K++ ++ + S+V S + K+ +DS+CCSANILVI Sbjct: 365 VSELSGNWKNRTTSVSSRKLRSSVLNNSSPSSNGISTISKPRTKEELDSLCCSANILVIG 424 Query: 1348 SDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWTGEN 1169 SDRC REEG VMLE + +W++V K G RY ++A MRP + NRFT +++W G+N Sbjct: 425 SDRCTREEGCGVMLEFSSSKEWFVVIKKDGAIRYRHRARKTMRPCSCNRFTQSIVWLGDN 484 Query: 1168 GWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCY--DDSGYVPFVRPI 995 WKLEF +++DWL FKE++ C +RN+ +A+ IP+PGV EV Y D + + FV P+ Sbjct: 485 DWKLEFCDKQDWLGFKEIYNECYERNILEQNAKVIPIPGVREVSGYSEDIADFPSFVMPV 544 Query: 994 AYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEEMMDA 815 YI++++DEV RA+ R A YDMDS DEEWLE+ N + + + + FE M+D Sbjct: 545 PYISVKEDEVTRAMARNIAIYDMDSEDEEWLERQNEEMLGEEHEQSQRLEQDAFELMIDG 604 Query: 814 FEKAAYHTP-DEVSDESRAT-SFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQCPPP 641 FEK + +P D++ +E AT + S L R++ V AV+ YW +KRKQ LLRVFQ Sbjct: 605 FEKCFFQSPADDLLNEKAATVASLSYLGRQEVVEAVHDYWARKRKQRKAPLLRVFQGHQA 664 Query: 640 RSSQLMQKPVFRKKRSFKRQMRQ-SGRGKQQIFFHASAVE----PEQDAMQRVQKAKSLA 476 + S L+ K VFRK+RSFKRQ Q G+ KQ A E EQ+ RV++AK+LA Sbjct: 665 KKSPLLFKHVFRKRRSFKRQGSQLHGKSKQLSLVGVKAAEQEASEEQNDYLRVEEAKALA 724 Query: 475 DISMESVLLKRRRAQILMDNADLATY 398 D +ME + KRRRAQ+L +NADLA Y Sbjct: 725 DRAMEIAIAKRRRAQVLAENADLAVY 750 >ref|NP_196087.1| Enhancer of polycomb-like transcription factor protein [Arabidopsis thaliana] gi|7413529|emb|CAB86009.1| putative protein [Arabidopsis thaliana] gi|332003387|gb|AED90770.1| Enhancer of polycomb-like transcription factor protein [Arabidopsis thaliana] Length = 766 Score = 409 bits (1052), Expect = e-111 Identities = 291/810 (35%), Positives = 411/810 (50%), Gaps = 46/810 (5%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGNGDEWFCIIDDTAD 2510 MP V MRR+TRVF VVK DGARVLRSG+R + GE K R + ++D D Sbjct: 1 MPSVGMRRTTRVF---GVVKAADGARVLRSGRRIWPNVGEPKVRRAHD-----VVDRDCD 52 Query: 2509 -------------VPRCKSIDW----YEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSR 2381 V KS +V E++ V DF + Sbjct: 53 SVLKNQNKSKGNKVSSGKSNSQPCSPKQVSSEKEDKVDDFPVTKRRKVRNEGVGDEKTV- 111 Query: 2380 DKMHGIVYNRKRRRLPGNKFVSSPRDRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXX 2201 DKM GIVY+RKR+RL + + + F R+ RRK S S Sbjct: 112 DKMFGIVYSRKRKRLCEPSSSDRSEEPLRSLKFYRR-RRKLSQRVS-------------- 156 Query: 2200 XXXXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAA 2021 ++L + V+ SC T F ++ +++++ + L+ A+ Sbjct: 157 --------------------SVLTLTVDWSCEDCWFLTVFGLA-MRYIRREELRLSSLAS 195 Query: 2020 FMCSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFT 1841 F S+P+ +VFA HGV FL ++ L S G+C+ F A +P+FS DF P Sbjct: 196 FFLSQPINQVFADHGVRFLV--------RSPLSSRGVCKFFGAMSCLPLFSADFAVIPRW 247 Query: 1840 FMRLHFSVVLRSLYLPDVLIRYLSGLIKKAREITDNKKCL----PCIPTEMGFPGSNSMA 1673 FM +HF++ +R L + L+ E +D++ L PC P G + Sbjct: 248 FMDMHFTLFVRVLPRSFFFVEKSLYLLNNPIEESDSESELALPEPCTPRNGVVVGLHPS- 306 Query: 1672 SWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSS----SMGF 1505 VRA+ + R + S + Q++R+SLR RARN S + Sbjct: 307 --------------VRASKLTGGNAQYRGNLGSHSFQKRRSSLRRRRARNLSHNAHKLNN 352 Query: 1504 HHDLFRAGYKHKKRK------------LAQKSPCSNVKELKSTLVELKQNMDSVCCSANI 1361 +F K R L+ SP SN + + + K+ +DS+CCSANI Sbjct: 353 GTPVFDISGSRKNRTAAVSSKKLRSSVLSNSSPVSNGISI-IPMTKTKEELDSICCSANI 411 Query: 1360 LVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIW 1181 L+I SDRC REEG VMLE + +W+LV K G RY + A+ MRP + NR THA +W Sbjct: 412 LMIHSDRCTREEGFSVMLEASSSKEWFLVIKKDGAIRYSHMAQRTMRPFSSNRITHATVW 471 Query: 1180 TGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDD--SGYVPF 1007 G + WKLEF +R+DWL FK+++ C +RNL S + IP+PGV EV Y + + F Sbjct: 472 MGGDNWKLEFCDRQDWLGFKDIYKECYERNLLEQSVKVIPIPGVREVCGYAEYIDNFPSF 531 Query: 1006 VR-PIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFE 830 R P++YI++ +DEV+RA+ R A YDMDS DEEWLE+ N + + + + E FE Sbjct: 532 SRPPVSYISVNEDEVSRAMARSIALYDMDSEDEEWLERQNQKMLNEEDDQYLQLQREAFE 591 Query: 829 EMMDAFEKAAYHTP-DEVSDESRAT-SFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVF 656 M+D FEK +H+P D++ DE AT S L R++ V AV+ YW+KKRKQ LLR+F Sbjct: 592 LMIDGFEKYHFHSPADDLLDEKAATIGSISYLGRQEVVEAVHDYWLKKRKQRKAPLLRIF 651 Query: 655 QCPPPRSSQLMQKPVFRKKRSFKRQMRQ-SGRGKQQI--FFHASAVEP-EQDAMQRVQKA 488 Q + +QL+ KPVFRK+RSFKRQ Q G+ KQ A EP E+D + R+++A Sbjct: 652 QGHQVKKTQLLSKPVFRKRRSFKRQGSQLHGKAKQTSPWMVAVKAAEPEEEDDILRMEEA 711 Query: 487 KSLADISMESVLLKRRRAQILMDNADLATY 398 K LAD +ME+ + KRRRAQIL +NADLA Y Sbjct: 712 KVLADKTMETAIAKRRRAQILAENADLAVY 741 >ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica] gi|462418130|gb|EMJ22617.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica] Length = 768 Score = 408 bits (1048), Expect = e-111 Identities = 283/759 (37%), Positives = 390/759 (51%), Gaps = 38/759 (5%) Frame = -2 Query: 2695 TLMPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRG-NGDE-WFCIID 2522 T MP VEMRR+TRVF V DGARVLRSG+R + E K R NGDE W ++ Sbjct: 52 TEMPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMK 111 Query: 2521 DTA--DVPRCKSIDWYEVD----PERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIV 2360 A V W + P R+ V +L K +GIV Sbjct: 112 SHAGESVVGLNHKKWAGANQVGSPRRNTPV--LKTSLVKKPQSNELLADLLKEHKRYGIV 169 Query: 2359 YNRKRRRLPGNKFVSSPR-----DRMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXX 2195 Y RKR+R + + + DRMYG F R+QR K+S EL Sbjct: 170 YTRKRKRASASFLGNVEKENGSDDRMYGRRFARRQRMKKSK-----EL------------ 212 Query: 2194 XXXXXXSGHESSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFM 2015 +S + +L VESS + FL S+L ++ + + LTEF+ F+ Sbjct: 213 ---------DSHPGFVCPEVLCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFL 263 Query: 2014 CSEPLVRVFAQHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFM 1835 EP+ +FA +G+ F + S G+C++F A QFIP+FS+DF A P FM Sbjct: 264 ALEPIGSIFASYGIQFSRDRSCTRR-------SGVCKLFGAEQFIPLFSVDFSAVPGCFM 316 Query: 1834 RLHFSVVLR---SLYLPDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGSNSMASWS 1664 + S+ LR L + +++ + +G + D+ + + I N A S Sbjct: 317 FMQTSMHLRFRCHLTVNNLIDGHENGEFIDQGDDDDDGEKVDFI--------ENRHALHS 368 Query: 1663 SSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGFHH----- 1499 S VR A RS R+ SR +Q++R+SLR R+RN S + Sbjct: 369 S----------VRVPKLACRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSLRKPNGAL 418 Query: 1498 -----DLFRAGY-------KHKKRKLAQKSPCSNVKELKSTLVELKQNMDSVCCSANILV 1355 + + G KH RK S N+K T+ K+++DS CSANIL Sbjct: 419 VSELISIRKNGLPFSSVESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILF 478 Query: 1354 IESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTHAMIWT- 1178 E D+C+RE+GA VMLE + +W LV K +GL+RY +KAE VMRP + NR T A+IW+ Sbjct: 479 TELDKCYREDGATVMLEMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCSKNRITQAIIWSA 538 Query: 1177 ---GENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYVPF 1007 G+N WKLEF NR DW IFK+L+ C DR + + + IPVPGV EVP Y DS F Sbjct: 539 DSNGDNNWKLEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLF 598 Query: 1006 VRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKFEE 827 RP +YI + DDEV+RA+ ++ ANYDMDS DEEWL+K N+ F + + +++ + FE Sbjct: 599 DRPESYIYLNDDEVSRAMAKRTANYDMDSDDEEWLKKFNSDFF-AENELHDHVSEDNFEL 657 Query: 826 MMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNH-VALLRVFQC 650 M+DAFEKA Y P + +DE+ A + C + R++ V A+Y YWM KRKQ +LLRVFQ Sbjct: 658 MVDAFEKAFYCRPYDFADENAAANLCLDMGRREVVEAIYSYWMNKRKQKRSSSLLRVFQG 717 Query: 649 PPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHAS 533 + + L KPV RK+RSFKRQ Q GRGKQ F + Sbjct: 718 HQSKRALLDPKPVLRKRRSFKRQPSQFGRGKQPSFLQGT 756 >ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phaseolus vulgaris] gi|561018732|gb|ESW17536.1| hypothetical protein PHAVU_007G247300g [Phaseolus vulgaris] Length = 734 Score = 402 bits (1032), Expect = e-109 Identities = 284/808 (35%), Positives = 393/808 (48%), Gaps = 44/808 (5%) Frame = -2 Query: 2689 MPLVEMRRSTRVFVPKSVVKDTDGARVLRSGKRFSLDCGEEKPVRGN-GDEWFCIIDDTA 2513 MP MRR+TRVF +K D ARVLRSG+R D GE K R + GDE Sbjct: 1 MPAAGMRRTTRVFG----MKGADTARVLRSGRRLWPDSGEVKTKRSSDGDE--------- 47 Query: 2512 DVPRCKSIDWYEVDPERDIDVTDFNLNLAXXXXXXXXXXXXXSRDKMHGIVYNRKRRRLP 2333 + V P + KM ++ R + Sbjct: 48 ----------WAVTPAKAA--------------------------KMDAVMTPRGTAKGK 71 Query: 2332 GNKFVSSPRD----RMYGISFVRKQRRKRSSGHSINELPRKDCQXXXXXXXXXXXXSGHE 2165 + V RD R +GI +VR+++ + G Sbjct: 72 RQEAVVDARDSTVDRRFGIVYVRRRKGLKKEGS--------------------------R 105 Query: 2164 SSIDVIHGAILDIVVESSCNSAHRFTCFLVSILKFVKSSRISLTEFAAFMCSEPLVRVFA 1985 S++V +L +VV + F L S++++ K R+S + + F S + VFA Sbjct: 106 RSVEVSR-CVLSVVVSRCAGKSALFLRLLASVVRYAKRVRVSPRKLSGFFMSGAVNGVFA 164 Query: 1984 QHGVHFLANLSYRISWKNDLISPGICQIFEARQFIPMFSLDFCASPFTFMRLH----FSV 1817 G+ F+ ++ GICQ F +F+P+FS+DF A P F LH F Sbjct: 165 SQGMQFVKG--------PPAVNSGICQFFGVTEFVPLFSVDFSAVPLCFEYLHSAMFFKS 216 Query: 1816 VLRSLYL----------------PDVLIRYLSGLIKKAREITDNKKCLPCIPTEMGFPGS 1685 +LRSL+L D L+ Y + K+ T + + Sbjct: 217 MLRSLFLVCNPINVRSDVEDMESDDDLLEYQNE--KQISSNTFKGELSETVTVTSDVIEI 274 Query: 1684 NSMASWSSSVKKRKVDFIVRATDFARRSVSTRHSANSRNVQRKRTSLRSTRARNSSSMGF 1505 N + S SSVK T A R+ R+ NSR +Q++R+SLR +ARN S G Sbjct: 275 NDVLSLQSSVKS--------TTRAAGRNGQYRNMLNSRGIQKRRSSLRKRKARNPSMGGL 326 Query: 1504 H---------------HDLFRAGYKHKK-RKLAQKSPCSNVKELKSTLVELKQNMDSVCC 1373 ++ F K+ R LA S ++KE S +V+ K+ + C Sbjct: 327 RRNGAVAFELTGGRKGNNQFSGVTSSKRLRSLANGSTTGSLKEASSAIVDSKERLGLSSC 386 Query: 1372 SANILVIESDRCFREEGAKVMLECLALNDWYLVAKTHGLSRYMYKAENVMRPSTPNRFTH 1193 SAN+LV E +C R EGA V LE A +W L K L+R +KAE VMRP + NRFTH Sbjct: 387 SANLLVSEIHQCHRVEGAIVTLEMSASKEWLLTVKKDELTRSTFKAEKVMRPCSSNRFTH 446 Query: 1192 AMIWTGENGWKLEFSNRRDWLIFKELHMACMDRNLQVVSARTIPVPGVYEVPCYDDSGYV 1013 A++++ +NGWKLEF+NR+DW +FK+L+ C DRN+ +A+ IPVPGV EV Y +S Sbjct: 447 AIMYSLDNGWKLEFTNRQDWNVFKDLYKKCSDRNIPSTAAKFIPVPGVREVSSYAESNSF 506 Query: 1012 PFVRPIAYITMRDDEVARALVRKPANYDMDSGDEEWLEKLNNGVFDGDVGVPNYILPEKF 833 PF RP YI++ DE+ RA+ R ANYDMDS DEEWL+K NN N + + F Sbjct: 507 PFHRPDTYISVFGDELTRAMARTTANYDMDSEDEEWLKKFNN-------ECQNPVSDDNF 559 Query: 832 EEMMDAFEKAAYHTPDEVSDESRATSFCSSLERKDTVAAVYQYWMKKRKQNHVALLRVFQ 653 E ++D EK Y PDE+ DE AT+ C L K+ V AVY YWM+KRKQ L+RVFQ Sbjct: 560 ELIIDTLEKVYYCNPDELFDEKSATNGCQDLGSKEVVEAVYNYWMRKRKQKRSLLIRVFQ 619 Query: 652 CPPPRSSQLMQKPVFRKKRSFKRQMRQSGRGKQQIFFHASAVEP---EQDAMQRVQKAKS 482 + + L+ KP+ RK+RSFKRQ Q GR Q A A E E++AM R+++AK+ Sbjct: 620 GHQSKRAPLIPKPLLRKRRSFKRQPSQFGRSNQPSVLKAFAAEQDAMEENAMLRIEEAKA 679 Query: 481 LADISMESVLLKRRRAQILMDNADLATY 398 A++SME + KRRRAQ L NADLATY Sbjct: 680 NANMSMELAIHKRRRAQSLAQNADLATY 707