BLASTX nr result
ID: Zingiber25_contig00022257
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00022257
         (2739 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CBI20940.3| unnamed protein product [Vitis vinifera]               347   1e-92
ref|XP_002281922.1| PREDICTED: uncharacterized protein LOC100264...    347   1e-92
gb|EMJ26677.1| hypothetical protein PRUPE_ppa000151mg [Prunus pe...    341   1e-90
ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus c...    338   7e-90
gb|EOY31350.1| Enhancer of polycomb-like transcription factor pr...    315   7e-83
gb|EOY31349.1| Enhancer of polycomb-like transcription factor pr...    315   7e-83
gb|EOY31346.1| Enhancer of polycomb-like transcription factor pr...    315   7e-83
gb|ESW09082.1| hypothetical protein PHAVU_009G098700g [Phaseolus...    309   4e-81
gb|EOY31348.1| Enhancer of polycomb-like transcription factor pr...    306   2e-80
ref|XP_004498624.1| PREDICTED: uncharacterized protein LOC101499...    306   2e-80
ref|XP_004292962.1| PREDICTED: uncharacterized protein LOC101313...    306   4e-80
ref|XP_006476180.1| PREDICTED: uncharacterized protein LOC102626...    305   9e-80
ref|XP_006476179.1| PREDICTED: uncharacterized protein LOC102626...    305   9e-80
ref|XP_006450576.1| hypothetical protein CICLE_v100072352mg, par...    303   2e-79
ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Popu...    302   5e-79
ref|XP_002309585.2| hypothetical protein POPTR_0006s26240g [Popu...    301   1e-78
gb|EXC20799.1| hypothetical protein L484_007381 [Morus notabilis]      298   1e-77
ref|XP_004162065.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...    294   1e-76
ref|XP_006601120.1| PREDICTED: uncharacterized protein LOC100789...
285 7e-74 ref|XP_006601123.1| PREDICTED: uncharacterized protein LOC100792... 283 2e-73 >emb|CBI20940.3| unnamed protein product [Vitis vinifera] Length = 1634 Score = 347 bits (891), Expect = 1e-92 Identities = 279/878 (31%), Positives = 400/878 (45%), Gaps = 83/878 (9%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME VE S+I KK RSLDLQSIY NK ++K S Sbjct: 1 MEHSVENSGGSEISKKSRSLDLQSIYRSKVSQEGD--------------NKILKRKHSS- 45 Query: 434 FEEDGVVLP---KSRKKNKKEVFLSSLVPTSKRQP--------------NSSNVPKSNKD 562 E DG V K + ++K V LSSL K +SS +P S K Sbjct: 46 -ENDGEVESGQGKKKSNSRKAVSLSSLKSLLKNSHKSLDEVYADGLGSGSSSGLPDSKKK 104 Query: 563 HISSVNRSNDTSNRSVTTDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSD 742 + + +D S G N+ +++L N++ Sbjct: 105 ELGLSQKLDDNS----------GLNSISRNLDNNVIR----------------------- 131 Query: 743 SFLAPKRRRGISNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXX 922 PKR RG R+ L+ S+P +K Q + D L Sbjct: 132 ---IPKRPRGFVRRRRFDGNHMLQPGRSSPASSKDVFVDQITKLSDDSATRVVPLKIKRK 188 Query: 923 XXXXXVEECDMSGSHKDDSAPS-TSAEHLSFIKDENRKAGNRKDHVKMSEQMNLRNRARS 1099 +E SGS SAP + + + + N R K ++ NL + +S Sbjct: 189 KGFDDFKENRSSGS---SSAPHYKEGDEIKVVDNGNSSLRKRMPRKKQVKRKNLSSEGKS 245 Query: 1100 ---------ADNSVNSVPVSQYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSAND 1252 ADN + + + DEENLEENAARMLSSRFDPNCT F + S N Sbjct: 246 IVKEEAVPLADNPIKNC--DEEDEENLEENAARMLSSRFDPNCTGFSSNGKASTPQSTNG 303 Query: 1253 IYFSQSNTE-----RMKGSQGKVCSVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDM 1414 + F S + RM G + R LRPR+ H K +RKRRHFYE+F R++ Sbjct: 304 LSFLLSPDQDCMIHRMNSLVGSESASVDTAGRVLRPRKQHKQKGLSRKRRHFYEIFSRNL 363 Query: 1415 DPHCLVKQRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFP 1594 D + ++ +RI+VFWPLDQ+WYFGL+K YDP +LHHVKYDDRDEEWI+L+ ERFKLLL P Sbjct: 364 DAYWVLNRRIKVFWPLDQSWYFGLVKDYDPERKLHHVKYDDRDEEWIDLRHERFKLLLLP 423 Query: 1595 SEV-----SSKFNFGKPGLEKENKEAE--------EVDTEESSYLGSLAESEPIISWLAR 1735 SEV K G + EN+E + ++ E+ S +G +SEPIISWLAR Sbjct: 424 SEVPGKADRKKMEMGDKCPDDENEERKHRKRGGKRDLPMEDDSCIGGYMDSEPIISWLAR 483 
Query: 1736 TTRRATSCPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLD----MRSNNLLPNCKK 1903 ++RR S P + K P + PSLL + N + LD R + L N Sbjct: 484 SSRRIKSSPFHVMKKQKTSYPSSNAVPSLLSDNTDSNAQ-GCLDGSSLKRDKDRLNNSAM 542 Query: 1904 SDQLYDWNINGISERKRSIDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLAS 2083 D+ D S +I +++K+P VYFR+R + + H V S Sbjct: 543 PDEFTDAEKIEKSVPGSTICYKDEKVPIVYFRRRLKRFQ-------GLHYVSEVHNVCGS 595 Query: 2084 ISSTVNSMAPV-----------------------WE------VKEKLTLVNFQQVIFELN 2176 S V S PV W +K + ++N + FE + Sbjct: 596 ASELVPSPVPVIDRLGTLEEFLLSLRQSDQFALLWSSDGAGLLKLSIPMINSRHFRFEFS 655 Query: 2177 LPLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLK 2356 LP LN ++F LF ++ + +G ++ WP V +E+ FVD++ GLRF+LFEGCLK Sbjct: 656 LPALPVLNCAFGAENFWLFHTVLLHQYGVVMPKWPKVRLEMLFVDNLVGLRFLLFEGCLK 715 Query: 2357 RSISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSR 2533 ++++ CL++ I NQ E+ + D + P +SI +S + + +++L+F FSKV+ S+ Sbjct: 716 QAVAFVCLVLTIFNQPNEQGRYVDLQFPVTSIKFKLSCVQDLQKQLVFAFYNFSKVKDSK 775 Query: 2534 WKRFEGKLKKLC--TKEASIS-TPYIRVNTLYFSTNEI 2638 W + KLK+ C TK+ +S Y + L TN + Sbjct: 776 WFYLDCKLKRYCLLTKQLPLSECTYDNIMALQSGTNPL 813 >ref|XP_002281922.1| PREDICTED: uncharacterized protein LOC100264575 [Vitis vinifera] Length = 1679 Score = 347 bits (891), Expect = 1e-92 Identities = 279/878 (31%), Positives = 400/878 (45%), Gaps = 83/878 (9%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME VE S+I KK RSLDLQSIY NK ++K S Sbjct: 1 MEHSVENSGGSEISKKSRSLDLQSIYRSKVSQEGD--------------NKILKRKHSS- 45 Query: 434 FEEDGVVLP---KSRKKNKKEVFLSSLVPTSKRQP--------------NSSNVPKSNKD 562 E DG V K + ++K V LSSL K +SS +P S K Sbjct: 46 -ENDGEVESGQGKKKSNSRKAVSLSSLKSLLKNSHKSLDEVYADGLGSGSSSGLPDSKKK 104 Query: 563 HISSVNRSNDTSNRSVTTDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSD 742 + + +D S G N+ +++L N++ Sbjct: 105 ELGLSQKLDDNS----------GLNSISRNLDNNVIR----------------------- 131 Query: 743 SFLAPKRRRGISNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXX 922 PKR RG R+ L+ S+P +K Q + D L Sbjct: 132 
---IPKRPRGFVRRRRFDGNHMLQPGRSSPASSKDVFVDQITKLSDDSATRVVPLKIKRK 188 Query: 923 XXXXXVEECDMSGSHKDDSAPS-TSAEHLSFIKDENRKAGNRKDHVKMSEQMNLRNRARS 1099 +E SGS SAP + + + + N R K ++ NL + +S Sbjct: 189 KGFDDFKENRSSGS---SSAPHYKEGDEIKVVDNGNSSLRKRMPRKKQVKRKNLSSEGKS 245 Query: 1100 ---------ADNSVNSVPVSQYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSAND 1252 ADN + + + DEENLEENAARMLSSRFDPNCT F + S N Sbjct: 246 IVKEEAVPLADNPIKNC--DEEDEENLEENAARMLSSRFDPNCTGFSSNGKASTPQSTNG 303 Query: 1253 IYFSQSNTE-----RMKGSQGKVCSVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDM 1414 + F S + RM G + R LRPR+ H K +RKRRHFYE+F R++ Sbjct: 304 LSFLLSPDQDCMIHRMNSLVGSESASVDTAGRVLRPRKQHKQKGLSRKRRHFYEIFSRNL 363 Query: 1415 DPHCLVKQRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFP 1594 D + ++ +RI+VFWPLDQ+WYFGL+K YDP +LHHVKYDDRDEEWI+L+ ERFKLLL P Sbjct: 364 DAYWVLNRRIKVFWPLDQSWYFGLVKDYDPERKLHHVKYDDRDEEWIDLRHERFKLLLLP 423 Query: 1595 SEV-----SSKFNFGKPGLEKENKEAE--------EVDTEESSYLGSLAESEPIISWLAR 1735 SEV K G + EN+E + ++ E+ S +G +SEPIISWLAR Sbjct: 424 SEVPGKADRKKMEMGDKCPDDENEERKHRKRGGKRDLPMEDDSCIGGYMDSEPIISWLAR 483 Query: 1736 TTRRATSCPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLD----MRSNNLLPNCKK 1903 ++RR S P + K P + PSLL + N + LD R + L N Sbjct: 484 SSRRIKSSPFHVMKKQKTSYPSSNAVPSLLSDNTDSNAQ-GCLDGSSLKRDKDRLNNSAM 542 Query: 1904 SDQLYDWNINGISERKRSIDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLAS 2083 D+ D S +I +++K+P VYFR+R + + H V S Sbjct: 543 PDEFTDAEKIEKSVPGSTICYKDEKVPIVYFRRRLKRFQ-------GLHYVSEVHNVCGS 595 Query: 2084 ISSTVNSMAPV-----------------------WE------VKEKLTLVNFQQVIFELN 2176 S V S PV W +K + ++N + FE + Sbjct: 596 ASELVPSPVPVIDRLGTLEEFLLSLRQSDQFALLWSSDGAGLLKLSIPMINSRHFRFEFS 655 Query: 2177 LPLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLK 2356 LP LN ++F LF ++ + +G ++ WP V +E+ FVD++ GLRF+LFEGCLK Sbjct: 656 LPALPVLNCAFGAENFWLFHTVLLHQYGVVMPKWPKVRLEMLFVDNLVGLRFLLFEGCLK 715 Query: 2357 RSISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSR 2533 ++++ CL++ I NQ E+ + D + P +SI +S + + +++L+F FSKV+ S+ Sbjct: 716 
QAVAFVCLVLTIFNQPNEQGRYVDLQFPVTSIKFKLSCVQDLQKQLVFAFYNFSKVKDSK 775 Query: 2534 WKRFEGKLKKLC--TKEASIS-TPYIRVNTLYFSTNEI 2638 W + KLK+ C TK+ +S Y + L TN + Sbjct: 776 WFYLDCKLKRYCLLTKQLPLSECTYDNIMALQSGTNPL 813 >gb|EMJ26677.1| hypothetical protein PRUPE_ppa000151mg [Prunus persica] Length = 1617 Score = 341 bits (874), Expect = 1e-90 Identities = 273/867 (31%), Positives = 402/867 (46%), Gaps = 73/867 (8%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKL--- 424 ME R+E ++IP+K RSLDL+S+Y + P K L Sbjct: 1 MENRIENSHGTEIPRKSRSLDLKSLYKSRTT------------------KEVPTKSLKRK 42 Query: 425 GSPFEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNR 604 GS EDG +KK++KEV LSSL NV S+K + V S S Sbjct: 43 GSA--EDGDENRDKKKKSRKEVSLSSL----------KNVNTSSKKSLDEVYHSGLNSGS 90 Query: 605 SVTTDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNS 784 + +A + +G + N + + P+R+RG Sbjct: 91 H---------DPEAVKCGSSQILDSGSGFNGVSSLSLGNNVIQI------PRRKRGFVGR 135 Query: 785 RKCK--------DPLSLETEISNPCH--AKTNTD---VQDNESIDKDDLNQKALNDXXXX 925 +K + D + + + + H AK N D QD K + + Sbjct: 136 KKFEGGQVLKLPDQSAGKVGLVDQNHQIAKLNVDDLGTQDELLNVKRKKGRDDFKENIDS 195 Query: 926 XXXXVEECDMSGSHKDDSAPSTSAEHLSFIKDENRKAGNRKDHVKMSE-QMNLRNRARSA 1102 D G H S S L + NR+ K + ++ A+ A Sbjct: 196 ELNSAPHADKEGVHTSHSVVSNGDSSLKKSRRNQDNEENRRSRRKRKDLACGSKSAAKEA 255 Query: 1103 DNSVNSVPVSQYD-----EENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYFSQ 1267 D V+S S +D EENLEENAARMLSSRFDP+CT F + A +SAN + F Sbjct: 256 DPLVDSSTKSCHDLQEDDEENLEENAARMLSSRFDPSCTGFSSNNKASALESANGLSFLL 315 Query: 1268 SNTERMKGSQGKVCSVGSAGS-----RTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCL 1429 S+ + + K S + S R LRPR+ H K +RKRRHFYEVFL ++D + + Sbjct: 316 SSGQDFDSRRSKSISGSESPSVDNSGRVLRPRKQHKEKGHSRKRRHFYEVFLGNLDAYWV 375 Query: 1430 VKQRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSS 1609 +RI+VFWPLDQ WY+GL+ YD +LHHVKYDDRDEEWI+LQ ERFKLLL PSEV Sbjct: 376 TNRRIKVFWPLDQTWYYGLVNDYDKEKKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPG 435 Query: 1610 
KFNFGKPGLE-------------KENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRA 1750 K K ++ K+ E+ +E+ S +GS ++EPIISWLAR+ RR Sbjct: 436 KIERKKSTQRNRSSVERKGNLKPRKEKKKRELTSEDDSCMGSYMDTEPIISWLARSNRRV 495 Query: 1751 TSCPSNTTKIHLRVSPLKDTGPSLLLEPP-------KGNIRMSQLDMRSNNLLPNCKKSD 1909 S PS K K L L+PP + IR S RS+++L K + Sbjct: 496 KS-PSCAVK--------KQKTSGLSLKPPLSDEDVIRDKIRTSHNSGRSSDVLRQEKPTS 546 Query: 1910 QLYDWNINGISERKRSIDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASIS 2089 Q S + K+P VYFR+R + T HA + G + S Sbjct: 547 Q-------------GSTCPRDSKMPIVYFRRRRKTGSVLSHTSKGNHAYVSELGSITSFV 593 Query: 2090 ST---------------VNSMAPVWEVKE----KLTLVNFQ--QVIFELNLPLQCTLNFT 2206 +++ P+W + + KLTL + +V FEL +P+ T+N Sbjct: 594 PVKEIGDLEEPYDFVRRLDANGPLWYIDDAGLLKLTLPRTEAGKVTFELGVPMHSTIN-D 652 Query: 2207 SQCQSFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLII 2386 S F LF + + +G +V WP V++E+ FVD++ GLRF+LFEGCL+++++ L++ Sbjct: 653 SFGVEFSLFHAAMLHRYGTVVITWPKVYLEMLFVDNVVGLRFLLFEGCLEQAVAFVFLVL 712 Query: 2387 KILNQHVEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKK 2563 + + +E+ F D ++P +SI S + R++L+F + FS+V+ S+WK + K++ Sbjct: 713 ALFHHPIEQGKFLDFQLPVTSIRFKFSCVQLLRKQLVFAVYNFSQVKKSKWKYLDSKVRS 772 Query: 2564 LC--TKEASIS-TPYIRVNTLYFSTNE 2635 C TK+ +S Y + L TN+ Sbjct: 773 HCLLTKKLPLSECTYDSIQALQNGTNQ 799 >ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus communis] gi|223544424|gb|EEF45945.1| hypothetical protein RCOM_0804080 [Ricinus communis] Length = 1705 Score = 338 bits (867), Expect = 7e-90 Identities = 270/882 (30%), Positives = 410/882 (46%), Gaps = 83/882 (9%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R+ E++IPKK RSLDL+S+Y K ++K GS Sbjct: 1 MENRIGNSHEAEIPKKSRSLDLRSLYQSSEGSKEAQI-------------KNLKRKGGSD 47 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVT 613 + G + RKK++K V +SS VN + S V Sbjct: 48 VDNSGF---EKRKKSRKAVSISSF---------------------RKVNGNGSKSLEEVY 83 Query: 614 TDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNSRKC 793 +L + D + 
+ + +Q + V+ +S+N D P+R+RG +K Sbjct: 84 NGSLSSGSHDTKEIKSGSLNQQRVNNSNSGVSKISQNLEGSFDKI--PRRKRGFVGRKKV 141 Query: 794 KDPLSLETEISNPCHA---KTNTD------VQDN-ESIDKDDLNQKALNDXXXXXXXXVE 943 + ++++ P K TD V+D + ++ + QK ++D Sbjct: 142 EK----DSQVLKPAEESRDKLETDQISKLTVKDTGKVVESSKVKQKKVSDDFKENRISER 197 Query: 944 ECDMSGSHKDDSA---------------PSTSAEHLSFIKDENRKAGNRKDHVK----MS 1066 SG H ++ S + + D ++K RK K +S Sbjct: 198 S---SGRHCEEDGHTGHSVARSVVLSLWKSQTGHSVEIDDDSSKKKSLRKRSRKRKNLIS 254 Query: 1067 EQMNLRNRARSADNSVNSVPVSQYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSA 1246 E ++ A + ++ S + DEENLEENAARMLSSRFD +CT F + S Sbjct: 255 EDKSVAKEAEPSVDAEVSCDLHDDDEENLEENAARMLSSRFDTSCTGFSSNSKASPVPST 314 Query: 1247 NDIYFSQSNTERMKGS-----QGKVCSVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLR 1408 N + F S+ + G + A +R LRPR+ H K +RKRRH+YE+F Sbjct: 315 NGLSFLLSSGQEFATHGPNYISGSESASLDAAARILRPRKQHKEKGSSRKRRHYYEIFSG 374 Query: 1409 DMDPHCLVKQRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLL 1588 D+D + ++ +RI+VFWPLDQ+WY+GL+ YD V +LHHVKYDDRDEEWINLQ ERFKLLL Sbjct: 375 DLDAYWVLNRRIKVFWPLDQSWYYGLVNDYDNVRKLHHVKYDDRDEEWINLQDERFKLLL 434 Query: 1589 FPSEV-----------SSKFNFGKPGLEKENKEAEEVDTEESSYLGSLAESEPIISWLAR 1735 PSEV K + G G K +KE + E+ SY+G+ +SEPIISWLAR Sbjct: 435 LPSEVPGKPQRKRSRTKEKISKGGKGKLKPSKEKRDSTIEDDSYVGNYMDSEPIISWLAR 494 Query: 1736 TTRRATSCPSNTTKIHLRVSPLKDT-GPSLLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQ 1912 +T R S P K +VS + T PSLL E S+ D+ S + N + Sbjct: 495 STHRVKSSPLRALK-KQKVSGISLTSAPSLLPEEAVCRNECSEGDLLSRD-KSNLSGNSA 552 Query: 1913 LYDWNINGISERKRSIDSEEKKLPYVYFRKRF-------RNKKEDFKTKL---------- 2041 L G + I ++ KLP VY+R+RF R+ ED + Sbjct: 553 LPGRFTAGGRDEVPDISPKDNKLPVVYYRRRFRCANSMPRHASEDNHVSIGVPESDTSLV 612 Query: 2042 ----TQHAVDCDQGGLASIS-----STVNSMAPVW--EVKEKL----TLVNFQQVIFELN 2176 A + LA + +++ +W +V+ L LV +Q F L Sbjct: 613 PAVYVSRAFEKQDISLARVDPDSDLGRLDTAEALWLSDVRGLLRLNTELVEPRQFRFGLR 672 Query: 2177 LPLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLK 2356 +P+ NF+ +L + HG L+ WP VH+E+ FVD+I GLRF+LFEGCLK Sbjct: 673 
IPVLSVHNFSFISGHTWFCNALLLLQHGRLMTTWPRVHLEMLFVDNIVGLRFLLFEGCLK 732 Query: 2357 RSISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSR 2533 ++I+ ++ + +Q E F D ++P +SI S + + R++L+F FS++++S+ Sbjct: 733 QAIAFVLQVLTVFHQPTEHGKFVDLQLPVTSIKFKFSCIQDFRKQLVFAFYNFSELKNSK 792 Query: 2534 WKRFEGKLKKLC--TKEASIS-TPYIRVNTLYFSTNEISHSS 2650 W + +LK+ C TK+ +S Y V L T+++ SS Sbjct: 793 WMHLDSRLKRHCLLTKQLPLSECTYDNVKALQNGTSQLLDSS 834 >gb|EOY31350.1| Enhancer of polycomb-like transcription factor protein, putative isoform 5 [Theobroma cacao] Length = 1522 Score = 315 bits (807), Expect = 7e-83 Identities = 253/859 (29%), Positives = 396/859 (46%), Gaps = 61/859 (7%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R+ ++IP+K RSLDL+S+Y NK+ ++K S Sbjct: 1 MENRIGNSHGAEIPRKSRSLDLKSLYKSGDSKESSK-------------NKSLKRKDSS- 46 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKS-----NKDHISSVNRSNDTS 598 ++G +S NK++ +L +S R + SN KS N S ++ S Sbjct: 47 --QEGDDEKRSSNNNKRKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGLHDSESLK 104 Query: 599 NRSVTTDNLQGTNADAQHLA-NDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGI 775 N ++ G A+ L+ D +++ + G V +NK F +R + Sbjct: 105 NLGLSQKLKNGCGANGISLSLGDSETRIPRRKRGF----VGRNK------FEGGQRLKLA 154 Query: 776 SNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXXVEECDM 955 S + E ++++ N + + DD + ++ E+ Sbjct: 155 GRSSSTVGDVKEEVKLTSEDSGTQNESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVA 214 Query: 956 SGSHKDDSAPSTSAEHLSFIKDENRKAGNRKDHVKMSEQMNLRNRARSADNSVNSVPVSQ 1135 + +D S +K R RKD VK + + + + + Sbjct: 215 AYLAVNDGD--------SLLKKSQRNPRKRKDSVKGGKSVAKKAEILVGSSVKTCDDFKE 266 Query: 1136 YDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYF----SQSNTERMKGSQGK 1303 DEENLEENAARMLSSRFDP+CT F + + S N F Q+ + K G Sbjct: 267 DDEENLEENAARMLSSRFDPSCTGFSSNSKVSVSPSENGFSFLLSSGQNASSGSKTFSGS 326 Query: 1304 VCSVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYF 1480 + A R LRPR+ H KS +RKRRHFYE++ D+D ++ +RI+VFWPLD++WY+ Sbjct: 327 ESASVDASGRVLRPRKSHKEKSNSRKRRHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYY 386 Query: 
1481 GLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSKFNFGKP---------- 1630 GL+ YD +LHHVKYDDRDEEWINLQ ERFKLLLFPSEV SK + Sbjct: 387 GLVNEYDKERKLHHVKYDDRDEEWINLQNERFKLLLFPSEVPSKSERKRSRRKRCSDDRI 446 Query: 1631 -GLEKENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKD 1807 L+ +E V TE+ S GS +SEPIISWLAR++ R SCP K + S Sbjct: 447 RNLKPNREEKRNVVTEDDSGNGSYMDSEPIISWLARSSHRVKSCPLRAVK-RQKTSASSH 505 Query: 1808 TGPS---LLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDS---- 1966 + P L E N + ++ +R + + + L D ++GI S+ S Sbjct: 506 SSPGQPLLCDEAVDENSCLYRVSLRVDKI--ELSGASALSDRPVDGIRVEDSSLGSTSCL 563 Query: 1967 EEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPVWEVKE----- 2131 ++ K P VYFR+RFR ++ + V +S+S ++ S+A V E ++ Sbjct: 564 KDSKHPIVYFRRRFRRTEKALCQASEGNCV------ASSVSESITSLASVDEFQDLGELD 617 Query: 2132 -----------------------KLTLVNFQQVIFELNLPLQCTLNFTSQCQSFGLFRSL 2242 ++L+ +Q F L+ P+ N +SF L +L Sbjct: 618 VCLGRLDPEGDLLFSDNAGQLRLNISLLRTKQFRFGLSFPVFSVSNNLFGTKSFSLVHTL 677 Query: 2243 YIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNF 2422 + G ++ WP VH+E+ FVD+ GLRF+LFEG LK++++ ++ + E+ F Sbjct: 678 LLLQCGTVMTIWPMVHLEILFVDNEVGLRFLLFEGSLKQAVAFVFRVLTVFYLPTEQGKF 737 Query: 2423 TDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TKEASIS- 2590 D ++P +SI S + R++++F F +V+ S+W + KLK+ C T++ +S Sbjct: 738 ADLQLPVTSIRFKFSCSQDFRKQIVFAFYNFHEVKHSKWVFLDSKLKRQCLITRQLPLSE 797 Query: 2591 TPYIRVNTLYFSTNEISHS 2647 Y + L TN++ S Sbjct: 798 CTYDNIKALQNGTNQLLSS 816 >gb|EOY31349.1| Enhancer of polycomb-like transcription factor protein, putative isoform 4 [Theobroma cacao] Length = 1721 Score = 315 bits (807), Expect = 7e-83 Identities = 253/859 (29%), Positives = 396/859 (46%), Gaps = 61/859 (7%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R+ ++IP+K RSLDL+S+Y NK+ ++K S Sbjct: 1 MENRIGNSHGAEIPRKSRSLDLKSLYKSGDSKESSK-------------NKSLKRKDSS- 46 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKS-----NKDHISSVNRSNDTS 598 ++G +S NK++ +L +S R + SN KS N S ++ S Sbjct: 47 
--QEGDDEKRSSNNNKRKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGLHDSESLK 104 Query: 599 NRSVTTDNLQGTNADAQHLA-NDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGI 775 N ++ G A+ L+ D +++ + G V +NK F +R + Sbjct: 105 NLGLSQKLKNGCGANGISLSLGDSETRIPRRKRGF----VGRNK------FEGGQRLKLA 154 Query: 776 SNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXXVEECDM 955 S + E ++++ N + + DD + ++ E+ Sbjct: 155 GRSSSTVGDVKEEVKLTSEDSGTQNESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVA 214 Query: 956 SGSHKDDSAPSTSAEHLSFIKDENRKAGNRKDHVKMSEQMNLRNRARSADNSVNSVPVSQ 1135 + +D S +K R RKD VK + + + + + Sbjct: 215 AYLAVNDGD--------SLLKKSQRNPRKRKDSVKGGKSVAKKAEILVGSSVKTCDDFKE 266 Query: 1136 YDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYF----SQSNTERMKGSQGK 1303 DEENLEENAARMLSSRFDP+CT F + + S N F Q+ + K G Sbjct: 267 DDEENLEENAARMLSSRFDPSCTGFSSNSKVSVSPSENGFSFLLSSGQNASSGSKTFSGS 326 Query: 1304 VCSVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYF 1480 + A R LRPR+ H KS +RKRRHFYE++ D+D ++ +RI+VFWPLD++WY+ Sbjct: 327 ESASVDASGRVLRPRKSHKEKSNSRKRRHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYY 386 Query: 1481 GLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSKFNFGKP---------- 1630 GL+ YD +LHHVKYDDRDEEWINLQ ERFKLLLFPSEV SK + Sbjct: 387 GLVNEYDKERKLHHVKYDDRDEEWINLQNERFKLLLFPSEVPSKSERKRSRRKRCSDDRI 446 Query: 1631 -GLEKENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKD 1807 L+ +E V TE+ S GS +SEPIISWLAR++ R SCP K + S Sbjct: 447 RNLKPNREEKRNVVTEDDSGNGSYMDSEPIISWLARSSHRVKSCPLRAVK-RQKTSASSH 505 Query: 1808 TGPS---LLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDS---- 1966 + P L E N + ++ +R + + + L D ++GI S+ S Sbjct: 506 SSPGQPLLCDEAVDENSCLYRVSLRVDKI--ELSGASALSDRPVDGIRVEDSSLGSTSCL 563 Query: 1967 EEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPVWEVKE----- 2131 ++ K P VYFR+RFR ++ + V +S+S ++ S+A V E ++ Sbjct: 564 KDSKHPIVYFRRRFRRTEKALCQASEGNCV------ASSVSESITSLASVDEFQDLGELD 617 Query: 2132 -----------------------KLTLVNFQQVIFELNLPLQCTLNFTSQCQSFGLFRSL 2242 ++L+ +Q F L+ P+ N +SF L +L Sbjct: 618 VCLGRLDPEGDLLFSDNAGQLRLNISLLRTKQFRFGLSFPVFSVSNNLFGTKSFSLVHTL 677 Query: 2243 
YIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNF 2422 + G ++ WP VH+E+ FVD+ GLRF+LFEG LK++++ ++ + E+ F Sbjct: 678 LLLQCGTVMTIWPMVHLEILFVDNEVGLRFLLFEGSLKQAVAFVFRVLTVFYLPTEQGKF 737 Query: 2423 TDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TKEASIS- 2590 D ++P +SI S + R++++F F +V+ S+W + KLK+ C T++ +S Sbjct: 738 ADLQLPVTSIRFKFSCSQDFRKQIVFAFYNFHEVKHSKWVFLDSKLKRQCLITRQLPLSE 797 Query: 2591 TPYIRVNTLYFSTNEISHS 2647 Y + L TN++ S Sbjct: 798 CTYDNIKALQNGTNQLLSS 816 >gb|EOY31346.1| Enhancer of polycomb-like transcription factor protein, putative isoform 1 [Theobroma cacao] gi|508784091|gb|EOY31347.1| Enhancer of polycomb-like transcription factor protein, putative isoform 1 [Theobroma cacao] Length = 1693 Score = 315 bits (807), Expect = 7e-83 Identities = 253/859 (29%), Positives = 396/859 (46%), Gaps = 61/859 (7%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R+ ++IP+K RSLDL+S+Y NK+ ++K S Sbjct: 1 MENRIGNSHGAEIPRKSRSLDLKSLYKSGDSKESSK-------------NKSLKRKDSS- 46 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKS-----NKDHISSVNRSNDTS 598 ++G +S NK++ +L +S R + SN KS N S ++ S Sbjct: 47 --QEGDDEKRSSNNNKRKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGLHDSESLK 104 Query: 599 NRSVTTDNLQGTNADAQHLA-NDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGI 775 N ++ G A+ L+ D +++ + G V +NK F +R + Sbjct: 105 NLGLSQKLKNGCGANGISLSLGDSETRIPRRKRGF----VGRNK------FEGGQRLKLA 154 Query: 776 SNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXXVEECDM 955 S + E ++++ N + + DD + ++ E+ Sbjct: 155 GRSSSTVGDVKEEVKLTSEDSGTQNESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVA 214 Query: 956 SGSHKDDSAPSTSAEHLSFIKDENRKAGNRKDHVKMSEQMNLRNRARSADNSVNSVPVSQ 1135 + +D S +K R RKD VK + + + + + Sbjct: 215 AYLAVNDGD--------SLLKKSQRNPRKRKDSVKGGKSVAKKAEILVGSSVKTCDDFKE 266 Query: 1136 YDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYF----SQSNTERMKGSQGK 1303 DEENLEENAARMLSSRFDP+CT F + + S N F Q+ + K G Sbjct: 267 DDEENLEENAARMLSSRFDPSCTGFSSNSKVSVSPSENGFSFLLSSGQNASSGSKTFSGS 326 Query: 1304 
VCSVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYF 1480 + A R LRPR+ H KS +RKRRHFYE++ D+D ++ +RI+VFWPLD++WY+ Sbjct: 327 ESASVDASGRVLRPRKSHKEKSNSRKRRHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYY 386 Query: 1481 GLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSKFNFGKP---------- 1630 GL+ YD +LHHVKYDDRDEEWINLQ ERFKLLLFPSEV SK + Sbjct: 387 GLVNEYDKERKLHHVKYDDRDEEWINLQNERFKLLLFPSEVPSKSERKRSRRKRCSDDRI 446 Query: 1631 -GLEKENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKD 1807 L+ +E V TE+ S GS +SEPIISWLAR++ R SCP K + S Sbjct: 447 RNLKPNREEKRNVVTEDDSGNGSYMDSEPIISWLARSSHRVKSCPLRAVK-RQKTSASSH 505 Query: 1808 TGPS---LLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDS---- 1966 + P L E N + ++ +R + + + L D ++GI S+ S Sbjct: 506 SSPGQPLLCDEAVDENSCLYRVSLRVDKI--ELSGASALSDRPVDGIRVEDSSLGSTSCL 563 Query: 1967 EEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPVWEVKE----- 2131 ++ K P VYFR+RFR ++ + V +S+S ++ S+A V E ++ Sbjct: 564 KDSKHPIVYFRRRFRRTEKALCQASEGNCV------ASSVSESITSLASVDEFQDLGELD 617 Query: 2132 -----------------------KLTLVNFQQVIFELNLPLQCTLNFTSQCQSFGLFRSL 2242 ++L+ +Q F L+ P+ N +SF L +L Sbjct: 618 VCLGRLDPEGDLLFSDNAGQLRLNISLLRTKQFRFGLSFPVFSVSNNLFGTKSFSLVHTL 677 Query: 2243 YIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNF 2422 + G ++ WP VH+E+ FVD+ GLRF+LFEG LK++++ ++ + E+ F Sbjct: 678 LLLQCGTVMTIWPMVHLEILFVDNEVGLRFLLFEGSLKQAVAFVFRVLTVFYLPTEQGKF 737 Query: 2423 TDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TKEASIS- 2590 D ++P +SI S + R++++F F +V+ S+W + KLK+ C T++ +S Sbjct: 738 ADLQLPVTSIRFKFSCSQDFRKQIVFAFYNFHEVKHSKWVFLDSKLKRQCLITRQLPLSE 797 Query: 2591 TPYIRVNTLYFSTNEISHS 2647 Y + L TN++ S Sbjct: 798 CTYDNIKALQNGTNQLLSS 816 >gb|ESW09082.1| hypothetical protein PHAVU_009G098700g [Phaseolus vulgaris] Length = 1699 Score = 309 bits (792), Expect = 4e-81 Identities = 258/868 (29%), Positives = 390/868 (44%), Gaps = 96/868 (11%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R E + IPKK RSLDL+S+Y ++P+K L Sbjct: 1 
MEDREESTHGTAIPKKSRSLDLKSLYKPKVR------------------KESPEKGLKRK 42 Query: 434 FEEDGVVLPKSRKKNK--KEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRS 607 G V + KK K KEV LSSL + D N+ Sbjct: 43 GSHLGGVHENTNKKKKTRKEVSLSSL-------------------------ENADVGNKK 77 Query: 608 VTTDNLQ-GTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNS 784 V + Q G + Q L +L K T +++ + ++ PKRRR Sbjct: 78 VVDEECQKGLGSGWQDLCEQ---KLEPKQGSGSNTVLNRGSLCFDENVHIPKRRRDFVGR 134 Query: 785 RKCK--DPLSLETEISNPCH-----AKTNTDVQDNESIDKDDLNQKALNDXXXXXXXXVE 943 RK + L E SN K +++V D I+ + K D + Sbjct: 135 RKIEVGPAPRLAGESSNTGGHGEQILKLSSNVLDR-GIESSKIKHK--RDFDECKGTKSK 191 Query: 944 ECDMSGSHKDDSAPSTSAEHLSFIKDENRKAGNRKDHVKMSEQMNLRNRARSADNS---- 1111 SG + + +F D NR A K + S+ + + +A + D Sbjct: 192 SAVKSGDSSSKKSLKKDRKQKAFAPDRNRVATEVKPPIDSSKASDYKQKAVAPDRRRVAK 251 Query: 1112 ---------------------------------VNSVPVSQY----DEENLEENAARMLS 1180 ++ +S Y +EENLEENAARMLS Sbjct: 252 EVQPLIDDTKTSDYKQKSLAPDRNKVAKEVKPLIDDNKISDYLREDEEENLEENAARMLS 311 Query: 1181 SRFDPNCTDFCAKRLKGAADSANDIYFSQSNTERMKG------SQGKVCSVGSAGSRTLR 1342 SRFDPN FC+ S+N + F S++ + S + SV +AG R LR Sbjct: 312 SRFDPNYAGFCSSSKPSTLPSSNGLSFLLSSSRNIDSWASKSQSGSESASVDTAG-RVLR 370 Query: 1343 PRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYFGLIKGYDPVTRLH 1519 PR+ +N K +R+RRHFYE+ L D+D H ++ QRI+VFWPLDQ WY GL+ Y+ T+ H Sbjct: 371 PRKQYNEKGRSRRRRHFYEISLGDLDKHWILNQRIKVFWPLDQIWYHGLVDDYNKETKCH 430 Query: 1520 HVKYDDRDEEWINLQKERFKLLLFPSEVSSK------------FNFGKPGLEKENKEAEE 1663 H+KYDDR+EEWINL+ ERFKLLL PSEV K K L + ++ + Sbjct: 431 HIKYDDREEEWINLETERFKLLLLPSEVPGKAGKKRAVRKNKSSGQQKRSLSSKERKIRD 490 Query: 1664 VDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKDTGPSLLLEPPKG 1843 V TE++S S ++EPIISWLAR++ R S N K L T SL E K Sbjct: 491 VITEDNSCGESCMDTEPIISWLARSSHRFRSSALNGVKRKKNPITLPSTASSLWNEAVKT 550 Query: 1844 NIRMSQLDMR--SNNLLPNCKKSDQLYDWNINGISERKRSIDSEEKKLPYVYFRKRFRNK 2017 +++ R ++L + D+L D N S + ++ K P VY+R+RFR Sbjct: 551 RRCLAESSPRDGKSSLSRDSVSDDKLGD-NFGRKSPLQSFSCPKDDKRPIVYYRRRFRK- 608 Query: 2018 
KEDFKTKLTQH-AVDCDQGGLASISSTVNSMAPVWEVKEKLT----------------LV 2146 T ++ H + D AS S + + +A + +VKE + Sbjct: 609 ----PTPMSPHISEDKHVNTTASCSISFDPVAQLMDVKESNDGRGEIEGPLCYLHNGGVF 664 Query: 2147 NF------QQVIFELNLPLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDVHMEVFFV 2308 NF F+L P+Q +N + + ++ LFR++ + +G +V WP VH+E+ FV Sbjct: 665 NFFLETGSATFKFDLKYPIQSVMNDSFKLENLWLFRAILLLQYGTVVTLWPRVHLEMLFV 724 Query: 2309 DDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCISRLDNSRE 2488 D++ GLRF+LFEGCL + + ++++ +Q E+ + D ++P +SI S + +R+ Sbjct: 725 DNVAGLRFLLFEGCLMMAAAFIFCVLRLFHQPGEQGKYIDLQLPATSIRFRFSSVYGTRK 784 Query: 2489 ELLF-LNLFSKVESSRWKRFEGKLKKLC 2569 L+F FS+V++S+W + KL++ C Sbjct: 785 PLVFTFYNFSRVKNSKWMYLDSKLQRHC 812 >gb|EOY31348.1| Enhancer of polycomb-like transcription factor protein, putative isoform 3 [Theobroma cacao] Length = 1674 Score = 306 bits (785), Expect = 2e-80 Identities = 246/797 (30%), Positives = 374/797 (46%), Gaps = 69/797 (8%) Frame = +2 Query: 464 SRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVT-TDNLQGTNA 640 S+ K+ K S KR N++ KS K S R+ D SN S + T+ G + Sbjct: 16 SKNKSLKRKDSSQEGDDEKRSSNNNKRKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFS 75 Query: 641 DAQHLANDLYSQ-LSGKT-DGAYVTDVSKNKVFMSDSFLA-PKRRRGISNSRKCKDPLSL 811 H + L + LS K +G +S + + DS P+R+RG K + L Sbjct: 76 SGLHDSESLKNLGLSQKLKNGCGANGISLS---LGDSETRIPRRKRGFVGRNKFEGGQRL 132 Query: 812 ETEISNPCHAKTNTDVQDNESIDKDD---------LNQKALNDXXXXXXXXVEECDMSGS 964 + + + T DV++ + +D + QK D Sbjct: 133 KLAGRS---SSTVGDVKEEVKLTSEDSGTQNESSKVKQKKFIDDFKENRNSESSLVQHLK 189 Query: 965 HKDDSAPSTSA-EHLSFIKDENRKAGNRKDHVKMSEQMNLRNRARSADNSVNSVPVSQYD 1141 +D A + + S +K R RKD VK + + + + + D Sbjct: 190 EEDGVAAYLAVNDGDSLLKKSQRNPRKRKDSVKGGKSVAKKAEILVGSSVKTCDDFKEDD 249 Query: 1142 EENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYF----SQSNTERMKGSQGKVC 1309 EENLEENAARMLSSRFDP+CT F + + S N F Q+ + K G Sbjct: 250 EENLEENAARMLSSRFDPSCTGFSSNSKVSVSPSENGFSFLLSSGQNASSGSKTFSGSES 309 Query: 1310 SVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYFGL 1486 + A R LRPR+ H KS +RKRRHFYE++ D+D ++ 
+RI+VFWPLD++WY+GL Sbjct: 310 ASVDASGRVLRPRKSHKEKSNSRKRRHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYYGL 369 Query: 1487 IKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSKFNFGKP-----------G 1633 + YD +LHHVKYDDRDEEWINLQ ERFKLLLFPSEV SK + Sbjct: 370 VNEYDKERKLHHVKYDDRDEEWINLQNERFKLLLFPSEVPSKSERKRSRRKRCSDDRIRN 429 Query: 1634 LEKENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKDTG 1813 L+ +E V TE+ S GS +SEPIISWLAR++ R SCP K + S + Sbjct: 430 LKPNREEKRNVVTEDDSGNGSYMDSEPIISWLARSSHRVKSCPLRAVK-RQKTSASSHSS 488 Query: 1814 PS---LLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDS----EE 1972 P L E N + ++ +R + + + L D ++GI S+ S ++ Sbjct: 489 PGQPLLCDEAVDENSCLYRVSLRVDKI--ELSGASALSDRPVDGIRVEDSSLGSTSCLKD 546 Query: 1973 KKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPVWEVKE------- 2131 K P VYFR+RFR ++ + V +S+S ++ S+A V E ++ Sbjct: 547 SKHPIVYFRRRFRRTEKALCQASEGNCV------ASSVSESITSLASVDEFQDLGELDVC 600 Query: 2132 ---------------------KLTLVNFQQVIFELNLPLQCTLNFTSQCQSFGLFRSLYI 2248 ++L+ +Q F L+ P+ N +SF L +L + Sbjct: 601 LGRLDPEGDLLFSDNAGQLRLNISLLRTKQFRFGLSFPVFSVSNNLFGTKSFSLVHTLLL 660 Query: 2249 ADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNFTD 2428 G ++ WP VH+E+ FVD+ GLRF+LFEG LK++++ ++ + E+ F D Sbjct: 661 LQCGTVMTIWPMVHLEILFVDNEVGLRFLLFEGSLKQAVAFVFRVLTVFYLPTEQGKFAD 720 Query: 2429 TEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TKEASIS-TP 2596 ++P +SI S + R++++F F +V+ S+W + KLK+ C T++ +S Sbjct: 721 LQLPVTSIRFKFSCSQDFRKQIVFAFYNFHEVKHSKWVFLDSKLKRQCLITRQLPLSECT 780 Query: 2597 YIRVNTLYFSTNEISHS 2647 Y + L TN++ S Sbjct: 781 YDNIKALQNGTNQLLSS 797 >ref|XP_004498624.1| PREDICTED: uncharacterized protein LOC101499788 [Cicer arietinum] Length = 1658 Score = 306 bits (785), Expect = 2e-80 Identities = 255/844 (30%), Positives = 389/844 (46%), Gaps = 59/844 (6%) Frame = +2 Query: 296 KKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSPFEEDGVVLPKSRKK 475 KK RSLDL+S+Y +K K+ GS G RKK Sbjct: 16 KKSRSLDLKSLYKSKLTEEV---------------SKKNSKRKGSGSPGGGEEKKNKRKK 60 Query: 476 
NKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVTTDNL-QGTNADAQH 652 +KEV LSSL + + S + VT + QG ++ Sbjct: 61 ARKEVSLSSL-------------------------ENGEGSGKKVTDEECKQGPSSGGDD 95 Query: 653 LANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNSRKCKDPLSLETEISNP 832 L G T + S+ + PKR+R + +K + S + +P Sbjct: 96 LVELKLGVSKGVTSSS---GPSRVLLGAGGDVCIPKRKRTLVGRKKSEIGQSSNL-VRHP 151 Query: 833 CHAKTNTD-------------VQDNESIDKDDLNQKALNDXXXXXXXXVEECDMSGSHKD 973 + + D VQ ++ K LN+ N V+ +G H Sbjct: 152 SPSIGHDDQVPKLGSDDSGRAVQSSKINLKKHLNEFKENRNSDSNSISVKHVKENGDHAP 211 Query: 974 DSAPSTSAEHLSFIKDENRKAGN-RKDHVKMSEQMNLRNRARSADNSVNSVPVSQYDEEN 1150 S ++ L K ++RK D ++S++ N +R SV + + DEEN Sbjct: 212 HSVVNSDHSSLKKSKKKDRKRKTLASDKPRVSKEAEPLNDSRKI-----SVELQEDDEEN 266 Query: 1151 LEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYFSQSNTERMKG------SQGKVCS 1312 LEENAARMLSSRFDP+CT F + SAN + F S++ + S + S Sbjct: 267 LEENAARMLSSRFDPSCTGFSSSGKSSPLPSANGLSFLLSSSRNIVNHGSKSRSGSESAS 326 Query: 1313 VGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYFGLI 1489 V +AG R LRPR+ + K +RKRRHFYE+ D+D + ++ +RI+VFWPLDQ+WY+GL+ Sbjct: 327 VDTAG-RNLRPRQQYKDKEKSRKRRHFYEILPGDVDAYWVLNRRIKVFWPLDQSWYYGLV 385 Query: 1490 KGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSKFNFGKP------------G 1633 YD RLHH+KYDDRDEEWI+LQ ERFKLLL +EV + G+ Sbjct: 386 NDYDEQQRLHHIKYDDRDEEWIDLQTERFKLLLLRNEVPGRAKGGRALTKSRRSDQQNGS 445 Query: 1634 LEKENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKDTG 1813 ++ ++ EV E+ S S +SEPIISWLAR++ R S + K T Sbjct: 446 KSRKERQKREVIAEDDSCGESSMDSEPIISWLARSSHRFKSSSFHGIKKQKTSVTHPSTT 505 Query: 1814 PSLLLEPP---KGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDSEEKKLP 1984 SLL + P KGN S +N+L D L D N S + + +++K P Sbjct: 506 SSLLYDEPVSVKGNTTKSSSRDVTNDLSSGSISQDNLGD-NFGEKSSLQSATHIKDRKQP 564 Query: 1985 YVYFRKRFRNKKE-DFKTKLTQHAV---------DCDQGGLASIS--STVNSMAPVW--- 2119 VY+RKRFR + +H V D GG+ ++ S P+W Sbjct: 565 AVYYRKRFRRSAAMSLPVLVEKHIVVSTPCSVSFDHVVGGIQNVKKPSDRRFEGPLWFNY 624 Query: 2120 --EVKEKLTLVNFQQVIFELNLPLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDVHM 2293 V + + + F+LN P++ LN Q 
++ ++ + +G +V WP V + Sbjct: 625 DEGVSKLVWDMESASFKFDLNFPIRLILNEAFQSENLWFLYAVLLFRYGTIVTKWPRVCL 684 Query: 2294 EVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNF-TDTEIPCSSIVLCISR 2470 E+ FVD++ GLRF+LFEGCLK + + ++K+ Q + N+ ++P +SI +S Sbjct: 685 EMLFVDNVVGLRFLLFEGCLKMAATFVFFVLKVFRQPAPRGNYDLHLQLPFTSIGFKLSS 744 Query: 2471 LDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TKEASIS-TPYIRVNTLYFSTNEI 2638 L +++ L+F L FSK+++S W + KLK+ C +K+ +S Y + L ++E Sbjct: 745 LHVTKQPLVFALYNFSKLKNSNWVYLDSKLKRHCLFSKQLHLSECTYDNIQALQHGSSEF 804 Query: 2639 SHSS 2650 + +S Sbjct: 805 TTAS 808 >ref|XP_004292962.1| PREDICTED: uncharacterized protein LOC101313578 [Fragaria vesca subsp. vesca] Length = 1673 Score = 306 bits (783), Expect = 4e-80 Identities = 258/845 (30%), Positives = 406/845 (48%), Gaps = 66/845 (7%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME RVE ++IP++ RSLD++S+Y K+ GS Sbjct: 1 MENRVEISHGTEIPRRSRSLDVKSLYRSRSTKEA---------------ENQSLKRNGSE 45 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVT 613 + DG + +KK++KEV LSSL NV S+ ++++ D S + Sbjct: 46 GDGDG----EKKKKSRKEVSLSSL----------KNVNSSSSSSWKNIDKEYDRGLESGS 91 Query: 614 TDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNSRKC 793 D + +Q L + G+ + VS+ + S P+R+RG +K Sbjct: 92 HDPEASNSGSSQKLDS-----------GSRLNSVSQLSLDNS-GIQIPRRKRGFVGRKKF 139 Query: 794 KDPLSLETEISNPCHAKTNTDVQDNE--SIDKDDLNQKALNDXXXXXXXXVEECDMSGSH 967 + +L+ +S+ K + Q+++ + ++L+ +A ++EC + + Sbjct: 140 EGGQALK--LSDESAGKASIADQNHQVAKLSGEELDSQA-EGWKAERNKGLDECKENLNS 196 Query: 968 KDDSA----PSTSAEHLSFIKDENR--KAGNRKDHVKMSEQMNLRNRARSADNSVNSVPV 1129 + + A + E S + + N K RK + R A+ A+ VNS Sbjct: 197 ELNGALHAKKENALESRSVVSNGNSSLKKSRRKSRKSKDLSSDSRTDAKKAEPLVNSSTK 256 Query: 1130 S-----QYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIY---FSQSNTERM 1285 + + +EENLEENAA MLSSRFDP+CT F A S+N + F ++ + Sbjct: 257 ACQASHEDEEENLEENAAMMLSSRFDPSCTGFSLNAKACAMQSSNGLSGQDFDGHMSKSL 316 Query: 1286 KGSQGKVCSVGSAGSRTLRPR---RHNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFW 1456 GS+ 
S+ +AG RTLRPR H K RKRRHFYE+F D+D +V +RI+VFW Sbjct: 317 SGSESP--SIDNAG-RTLRPRPRKHHKEKKGTRKRRHFYEIFFGDLDACWVVNRRIKVFW 373 Query: 1457 PLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSKFN------ 1618 PLDQ+WY+GL+ YD +LHH++YDDR+EEWI+LQ ERFKLLL P+EV K Sbjct: 374 PLDQSWYYGLVNDYDKDKKLHHIRYDDREEEWIDLQHERFKLLLLPTEVPGKAKKRSFIR 433 Query: 1619 -FGKPGLE-----KENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKI 1780 G E ++ K+ ++ +E+ S +GS +SEPIISWLAR+TRR S PS+ K Sbjct: 434 ITGSEEREENLKPRKEKKKRDLMSEDDSCIGSCMDSEPIISWLARSTRRIKS-PSHAVK- 491 Query: 1781 HLRVSPLKDTGPSLLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRS- 1957 + S L L + + + + R + KS + + E KR+ Sbjct: 492 KQKTSGLSPKSLPTLSDSAGTHGCLGDVSSRRDT-----SKSSSNSGRYSDALREEKRAP 546 Query: 1958 ---IDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPV---W 2119 I E+ ++P VY+RKR R KT + D+ S+ PV W Sbjct: 547 EGDIYPEDSRMPIVYYRKRLR------KTGSVLSQIYKDEHASMYGHRCCTSVTPVEEIW 600 Query: 2120 EVKE-----------------------KLTL--VNFQQVIFELNLPLQCTLNFTSQCQSF 2224 +++E KLTL V +VIF+ L L +N + + Sbjct: 601 DLEEPDDHVVILDRSWPLWYSDGAGLLKLTLPWVESGKVIFKC-LQLHSLINDSLGVELL 659 Query: 2225 GLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQH 2404 + + HG +V WP +H+E+ FVD++ GLRF+LFEGCLK+++ L LI+ + +Q Sbjct: 660 RFCHAAMLLRHGIVVITWPKIHLEMLFVDNVVGLRFLLFEGCLKQAVVLVFLILTLFHQP 719 Query: 2405 VEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TK 2575 ++ TD ++P +SI S + + +EL+F F +V++S+W + KL + C TK Sbjct: 720 NDQGKLTDFQLPATSIRFKFSCVQHLGKELVFAFYNFCRVKNSKWMHLDNKLGRHCLLTK 779 Query: 2576 EASIS 2590 + +S Sbjct: 780 KLPLS 784 >ref|XP_006476180.1| PREDICTED: uncharacterized protein LOC102626885 isoform X2 [Citrus sinensis] Length = 1813 Score = 305 bits (780), Expect = 9e-80 Identities = 210/658 (31%), Positives = 319/658 (48%), Gaps = 54/658 (8%) Frame = +2 Query: 758 KRRRGISNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXX 937 K+ G++NS + L+ E H+ N N S+ + N D Sbjct: 265 KKPNGVTNSNSGQ---CLKEENEGASHSVLNNS---NSSLKESRRNNSKRKDSARHKKSV 318 Query: 938 
VEECD----MSGSH---KDDSAPSTSAEHLSFIKDENRKAGNRKDH------VKMSEQMN 1078 +E + SG+ KD + + + D + K RKD V + Sbjct: 319 AKEAEHVINASGNVSNIKDSDRDRSVGKEAEPLVDASAKVSKRKDFSQDKISVAKEADIL 378 Query: 1079 LRNRARSADNSVNSVPVSQYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIY 1258 + ++ DN + DEENLEENAA MLSSRFDP+CT F + + S N + Sbjct: 379 IDTSGKACDNLLE-------DEENLEENAAMMLSSRFDPSCTGFSSNGK--SIVSPNGLS 429 Query: 1259 FSQSNTERMKGSQGKVCSVGSAGSRTLRPRRHNG-KSFARKRRHFYEVFLRDMDPHCLVK 1435 F S+ + G S+ A R LRPR H+ K +RKRRH+YE+F D+D ++K Sbjct: 430 FLLSSGQ---GPGSHDSSLLDAAGRALRPRTHHREKGHSRKRRHYYEIFSGDLDGFWVLK 486 Query: 1436 QRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSK- 1612 +RI+VFWPLDQ WY+GL+ YD +LHHVKYDDRDEEWINL+ ERFKLLL PSEV K Sbjct: 487 RRIKVFWPLDQCWYYGLVDDYDKGKKLHHVKYDDRDEEWINLENERFKLLLLPSEVPGKA 546 Query: 1613 -----------FNFGKPGLE-KENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATS 1756 + GK L+ + KE ++TEE + +GS ESEPIISWLAR+T R S Sbjct: 547 ARRRSRKRVNSVDEGKLSLKSSKEKEKRNLNTEEENCMGSYMESEPIISWLARSTHRVKS 606 Query: 1757 CPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNING 1936 P+ K ++S L T L GN D +++ N K D+ D Sbjct: 607 SPTPAMK-KQKISDLYPTSGPPFLANKVGNAHGLDADSKTSKFSSNSKLPDRFTDGGRGE 665 Query: 1937 ISERKRSIDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPV 2116 S + S++ LP VY+R+RFR K T + AS++ +S+ Sbjct: 666 ESTSENPTCSKDSGLPIVYYRRRFR--KTGSSLCSTSSGNNISSSTPASVTLLSSSIGEF 723 Query: 2117 WEVKE--------------------------KLTLVNFQQVIFELNLPLQCTLNFTSQCQ 2218 W+ +E + L++ +Q F+ + P+ LN+ + + Sbjct: 724 WDFEEHDTFCKREVSNGASWSTTTGSGRVGLTIPLIDPKQARFKFSFPVLSILNYAFEAE 783 Query: 2219 SFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILN 2398 + L +++ +G+L+ WP V +E+ FVD++ GLR+ LFE CLK+++ L++ + + Sbjct: 784 NLWLVHEVFLLHYGKLITMWPSVQLEMLFVDNVVGLRYFLFEDCLKQAVGYVFLVLSLFH 843 Query: 2399 QHVEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC 2569 Q +D ++P +SI S N ++ +F F++V++S W + KLK+ C Sbjct: 844 QPNVLGKCSDRQLPVTSIRFKFSCFQNLSKQFVFAFYNFAEVKNSTWMYMDSKLKRHC 901 >ref|XP_006476179.1| PREDICTED: uncharacterized protein 
LOC102626885 isoform X1 [Citrus sinensis] Length = 1816 Score = 305 bits (780), Expect = 9e-80 Identities = 210/658 (31%), Positives = 319/658 (48%), Gaps = 54/658 (8%) Frame = +2 Query: 758 KRRRGISNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXX 937 K+ G++NS + L+ E H+ N N S+ + N D Sbjct: 265 KKPNGVTNSNSGQ---CLKEENEGASHSVLNNS---NSSLKESRRNNSKRKDSARHKKSV 318 Query: 938 VEECD----MSGSH---KDDSAPSTSAEHLSFIKDENRKAGNRKDH------VKMSEQMN 1078 +E + SG+ KD + + + D + K RKD V + Sbjct: 319 AKEAEHVINASGNVSNIKDSDRDRSVGKEAEPLVDASAKVSKRKDFSQDKISVAKEADIL 378 Query: 1079 LRNRARSADNSVNSVPVSQYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIY 1258 + ++ DN + DEENLEENAA MLSSRFDP+CT F + + S N + Sbjct: 379 IDTSGKACDNLLE-------DEENLEENAAMMLSSRFDPSCTGFSSNGK--SIVSPNGLS 429 Query: 1259 FSQSNTERMKGSQGKVCSVGSAGSRTLRPRRHNG-KSFARKRRHFYEVFLRDMDPHCLVK 1435 F S+ + G S+ A R LRPR H+ K +RKRRH+YE+F D+D ++K Sbjct: 430 FLLSSGQ---GPGSHDSSLLDAAGRALRPRTHHREKGHSRKRRHYYEIFSGDLDGFWVLK 486 Query: 1436 QRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSK- 1612 +RI+VFWPLDQ WY+GL+ YD +LHHVKYDDRDEEWINL+ ERFKLLL PSEV K Sbjct: 487 RRIKVFWPLDQCWYYGLVDDYDKGKKLHHVKYDDRDEEWINLENERFKLLLLPSEVPGKA 546 Query: 1613 -----------FNFGKPGLE-KENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATS 1756 + GK L+ + KE ++TEE + +GS ESEPIISWLAR+T R S Sbjct: 547 ARRRSRKRVNSVDEGKLSLKSSKEKEKRNLNTEEENCMGSYMESEPIISWLARSTHRVKS 606 Query: 1757 CPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNING 1936 P+ K ++S L T L GN D +++ N K D+ D Sbjct: 607 SPTPAMK-KQKISDLYPTSGPPFLANKVGNAHGLDADSKTSKFSSNSKLPDRFTDGGRGE 665 Query: 1937 ISERKRSIDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPV 2116 S + S++ LP VY+R+RFR K T + AS++ +S+ Sbjct: 666 ESTSENPTCSKDSGLPIVYYRRRFR--KTGSSLCSTSSGNNISSSTPASVTLLSSSIGEF 723 Query: 2117 WEVKE--------------------------KLTLVNFQQVIFELNLPLQCTLNFTSQCQ 2218 W+ +E + L++ +Q F+ + P+ LN+ + + Sbjct: 724 WDFEEHDTFCKREVSNGASWSTTTGSGRVGLTIPLIDPKQARFKFSFPVLSILNYAFEAE 783 Query: 2219 SFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILN 2398 + L 
+++ +G+L+ WP V +E+ FVD++ GLR+ LFE CLK+++ L++ + + Sbjct: 784 NLWLVHEVFLLHYGKLITMWPSVQLEMLFVDNVVGLRYFLFEDCLKQAVGYVFLVLSLFH 843 Query: 2399 QHVEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC 2569 Q +D ++P +SI S N ++ +F F++V++S W + KLK+ C Sbjct: 844 QPNVLGKCSDRQLPVTSIRFKFSCFQNLSKQFVFAFYNFAEVKNSTWMYMDSKLKRHC 901 >ref|XP_006450576.1| hypothetical protein CICLE_v100072352mg, partial [Citrus clementina] gi|557553802|gb|ESR63816.1| hypothetical protein CICLE_v100072352mg, partial [Citrus clementina] Length = 940 Score = 303 bits (777), Expect = 2e-79 Identities = 210/658 (31%), Positives = 319/658 (48%), Gaps = 54/658 (8%) Frame = +2 Query: 758 KRRRGISNSRKCKDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXX 937 K+ G++NS + L+ E H+ N N S+ + N D Sbjct: 265 KKPNGVTNSNSGQ---CLKEENEGASHSVLNNS---NSSLKESRRNNSKGKDSARHKKSV 318 Query: 938 VEECD----MSGSH---KDDSAPSTSAEHLSFIKDENRKAGNRKDH------VKMSEQMN 1078 +E + SG+ KD + + + D + K RKD V + Sbjct: 319 AKEAEHVINASGNVSNIKDSDRDRSVGKEAEPLVDASAKLSKRKDFSQDKISVAKEADIL 378 Query: 1079 LRNRARSADNSVNSVPVSQYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIY 1258 + ++ DN + DEENLEENAA MLSSRFDP+CT F + + S N + Sbjct: 379 IDTSGKACDNLLE-------DEENLEENAAMMLSSRFDPSCTGFSSNGK--SIVSPNGLS 429 Query: 1259 FSQSNTERMKGSQGKVCSVGSAGSRTLRPRRHNG-KSFARKRRHFYEVFLRDMDPHCLVK 1435 F S+ + G S+ A R LRPR H+ K +RKRRH+YE+F D+D ++K Sbjct: 430 FLLSSGQ---GPGSHDSSLLDAAGRALRPRTHHREKGHSRKRRHYYEIFSGDLDGFWVLK 486 Query: 1436 QRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSK- 1612 +RI+VFWPLDQ WY+GL+ YD +LHHVKYDDRDEEWINL+ ERFKLLL PSEV K Sbjct: 487 RRIKVFWPLDQCWYYGLVDDYDKGKKLHHVKYDDRDEEWINLENERFKLLLLPSEVPGKA 546 Query: 1613 -----------FNFGKPGLE-KENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATS 1756 + GK L+ + KE ++TEE + +GS ESEPIISWLAR+T R S Sbjct: 547 ARRRSRKRVNSVDEGKLSLKSSKEKEKRNLNTEEENCMGSYMESEPIISWLARSTHRVKS 606 Query: 1757 CPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNING 1936 P+ K ++S L T L GN D +++ N K D+ D Sbjct: 607 SPTPAMK-KQKISDLYPTSGPPFLANKVGNAHGLDADSKTSKFSSNSKLPDRFTDGGRGE 
665 Query: 1937 ISERKRSIDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSMAPV 2116 S + S++ LP VY+R+RFR K T + AS++ +S+ Sbjct: 666 ESTSENPTCSKDSGLPIVYYRRRFR--KTGSSLCSTSSGNNISSSTPASVTLLSSSIGEF 723 Query: 2117 WEVKE--------------------------KLTLVNFQQVIFELNLPLQCTLNFTSQCQ 2218 W+ +E + L++ +Q F+ + P+ LN+ + + Sbjct: 724 WDFEEHDTFCKREVSNGASWSTTTGSGRVGLTIPLIDPKQARFKFSFPVLSILNYAFEAE 783 Query: 2219 SFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILN 2398 + L +++ +G+L+ WP V +E+ FVD++ GLR+ LFE CLK+++ L++ + + Sbjct: 784 NLWLVHEVFLLHYGKLITMWPSVQLEMLFVDNVVGLRYFLFEDCLKQAVGYVFLVLSLFH 843 Query: 2399 QHVEKYNFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC 2569 Q +D ++P +SI S N ++ +F F++V++S W + KLK+ C Sbjct: 844 QPNVLGKCSDRQLPVTSIRFKFSCFQNLSKQFVFAFYNFAEVKNSTWMYMDSKLKRHC 901 >ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Populus trichocarpa] gi|550317762|gb|EEF03395.2| hypothetical protein POPTR_0018s01030g [Populus trichocarpa] Length = 1722 Score = 302 bits (774), Expect = 5e-79 Identities = 261/875 (29%), Positives = 396/875 (45%), Gaps = 96/875 (10%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME RV K +IPKK RSLD +S+Y N +K G+ Sbjct: 1 MENRVGKSHGVEIPKKSRSLDHKSLYESKNPKGDQNS------------NNLKRKGGGAG 48 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRS-NDTSNRSV 610 +E G +KK++KEV +SS NK+ SS ++S + NRS+ Sbjct: 49 DDEKG----HEKKKSRKEVSISSF---------------KNKNVNSSYSKSLKEVYNRSL 89 Query: 611 TTDNLQGTNADAQHLANDL-YSQLSGKTDGAYVTDVSKNKVFM----------------- 736 ++ + + Q LA+ +S +S DG + + F+ Sbjct: 90 SSGLKESKSGLIQRLADSNGFSGVSLPLDGGVFKIPRRKRGFVGRKKVDNGSEGSKLTGG 149 Query: 737 ----------SDSFLAPKRRRGISNS-RKCK---------DPLSLETEIS--------NP 832 +D + + N R+ K D + ++++ P Sbjct: 150 FGREAGNVDQADKLTGEDESKWVENGGRELKAVGISGGEVDDVDQASKLTVEDKGKQVEP 209 Query: 833 CHAKTNTDVQDNESIDKDDLN-QKALNDXXXXXXXXVEECDMSGSHKDDSAPSTSAEHLS 1009 AK D + D+LN + L + V S S + + P Sbjct: 210 LKAKQKKGSDDLKENRNDELNASRNLEEEDGHEGHSVATKRDSSSKRPHNGPLVDNNGDL 269 
Query: 1010 FIKDENRKAGNRKDHVKMSEQMNLRNRARSADNSVNSVPVSQYDEE-NLEENAARMLSSR 1186 +K RK +K V S++ + + D S+ V DEE NLEENAA MLSSR Sbjct: 270 SLKKSLRKRSRKKGMV--SDKKRTKEDDPTVDTSMKMSGVFHDDEEENLEENAAMMLSSR 327 Query: 1187 FDPNCTDFCAKRLKGAADSANDIY-FSQSNTERMKGSQGKVCSVGSAGSRTLRPRRHNG- 1360 FDP+CT F + A+ S ND F + + GS+ SV + G R LRPR+ N Sbjct: 328 FDPSCTGFSSNSKASASPSKNDFQEFVAHGSSYVSGSESS--SVDTDG-RVLRPRKQNKE 384 Query: 1361 KSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDR 1540 K RKRRH+YEVF D+D H ++ +RI+VFWPLDQ WY GL+ YD +LHH+KYDDR Sbjct: 385 KGSTRKRRHYYEVFSGDLDAHWVLNRRIKVFWPLDQRWYHGLVGDYDKERKLHHIKYDDR 444 Query: 1541 DEEWINLQKERFKLLLFPSEVSSKF--------NFGKPGLEKE---NKEAEEVDTEESSY 1687 DEEWI+LQ ERFKLLL PSEV K N G +++ KE ++ TE+ SY Sbjct: 445 DEEWIDLQNERFKLLLLPSEVPGKMRRKRSITSNKRSDGWKEKLTSRKEKRDLMTEDDSY 504 Query: 1688 LGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLD 1867 G+ ESEPIISWLAR+T R S P + K + S L T + P +S L Sbjct: 505 EGAYMESEPIISWLARSTHRVKSSPLHALK-KQKTSYLSST-----MTP------LSSLK 552 Query: 1868 MRSNNLLPNCKKSDQLYDWNINGISERKRSIDSEEKKLPYVYFRKRFRNKK----EDFKT 2035 L N SD + + + + + ++ KLP VY+RKRFR + K Sbjct: 553 RDKCKLSYNSASSDSVATDGRSDLPVMESPVFPKDSKLPIVYYRKRFRKTSNVLCHESKG 612 Query: 2036 KLTQHAVDCDQGGLASI----------------------SSTVNSMAPVWE------VKE 2131 +V L + S+ ++S P+W ++ Sbjct: 613 ICVSASVPETDSSLVPLTVAFWALQEHYTSLGRLDRDLDSNRLDSSDPLWSTGNAGLLRL 672 Query: 2132 KLTLVNFQQVIFELNLPLQCTLNFTS-QCQSFGLFRSLYIADHGELVCGWPDVHMEVFFV 2308 ++ + + F+L+ L LN+ S ++ L ++ + +G L+ WP +H+E+ FV Sbjct: 673 NISATEPRWLRFKLSFQLPSFLNYYSFGSENVWLIHAVLLLQYGMLMTTWPRIHLEMLFV 732 Query: 2309 DDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCISRLDNSRE 2488 D++ GLRF+LFEGCL ++++ L++ + +Q E+ D ++P +SI S + + R+ Sbjct: 733 DNMVGLRFLLFEGCLMQAVAFVFLVLTVFHQPREQEKSADFQLPITSIRYRFSCIRDLRK 792 Query: 2489 ELLF-LNLFSKVESSRWKRFEGKLKKLCTKEASIS 2590 F FS+VE+S+WK + KLK+ C +S Sbjct: 793 HFAFSFYNFSEVENSKWKYLDHKLKRHCLAYRQLS 827 >ref|XP_002309585.2| hypothetical protein POPTR_0006s26240g [Populus trichocarpa] 
gi|550337121|gb|EEE93108.2| hypothetical protein POPTR_0006s26240g [Populus trichocarpa] Length = 1685 Score = 301 bits (770), Expect = 1e-78 Identities = 269/892 (30%), Positives = 400/892 (44%), Gaps = 88/892 (9%) Frame = +2 Query: 158 LDLKLRRQSARHV*----WVIKFGSYSIFL*IWQSLMEIRVEKIEESDIPKKPRSLDLQS 325 L + L + ++R + W + FG LME RV K IPKK RSLDL+S Sbjct: 7 LHISLEKSNSRSISIQGFWFLVFG-----------LMENRVGKSHGVGIPKKSRSLDLKS 55 Query: 326 IYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSPFEEDGVVLPKSRKKNKKEVFLSSL 505 +Y N +K G +E G KK++KEV +SS Sbjct: 56 LYETKNSKWYQNS------------NNLKRKGGGIGDDEKG----HKNKKSRKEVCISSF 99 Query: 506 VPTSKRQPNSSNVPKSNKDHISSVNRSNDTS--NRSVTTDNLQGTNADAQHLANDLYSQL 679 + S ++ + +SS + T R ++ G + + A + + Sbjct: 100 KNVNSSY--SKSLKEVYNGSLSSGLKDPRTGLIQRLADSNGFSGASLPLEDGAVKIPRRK 157 Query: 680 SGKTDGAYVTDVSKN------------KVFMSDSFLAPKRRRGISN-SRKCKDPLSLETE 820 G V + S+ +D +G+ N S++ K + L + Sbjct: 158 RGFVGRRKVDNGSEGGKLARGFGREVGNADQADKLTGEDEGKGVENGSQESKAVVILVSV 217 Query: 821 ISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXXVEECDMSGSHKDDSAP---ST 991 + + A T + ++ QK +D E D S K++ S Sbjct: 218 VGDVDQASKLTGEGKAKQVEHSKAKQKKGSDDLKENRNG--ELDASRHLKEEDGHDDHSV 275 Query: 992 SAEHLSFIK------------DENRKAGNRKDHVKMSEQMNLRNRARSADNSVN-SVPVS 1132 + + S +K D + K RK K + ++ + R + AD SV+ S+ +S Sbjct: 276 ATKRDSSLKKSDNCPLVVNNGDSSLKKSLRKRSRKKKDMVSNKKRTKEADPSVDASIKIS 335 Query: 1133 QY----DEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIY-FSQSNTERMKGSQ 1297 DEENLEENAA MLSSRFDP+CT F + A+ S + F+ + + GS+ Sbjct: 336 DVLHDEDEENLEENAAMMLSSRFDPSCTGFSSNSKASASPSKDGFQEFAARESSYVSGSE 395 Query: 1298 GKVCSVGSAGSRTLRPRRHNG-KSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNW 1474 SV + G R LRPR+ N K RKRRH+YE+F D+D H ++ +RI+VFWPLDQ+W Sbjct: 396 SS--SVDTDG-RVLRPRKQNKEKGNTRKRRHYYEIFSGDLDAHWVLNRRIKVFWPLDQSW 452 Query: 1475 YFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSK------------FN 1618 Y GL+ YD +LHHVKYDDRDEEWINLQ ERFKLL+ P EV +K N Sbjct: 453 YHGLVGDYDKDRKLHHVKYDDRDEEWINLQNERFKLLMLPCEVPAKTRRKRSVTRNKCSN 512 Query: 1619 
FGKPGLEKENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSP 1798 GK L KE ++ TE+ SY G+ +SEPIISWLAR+T R S P K + S Sbjct: 513 GGKEKL-MSRKEKRDLMTEDDSYEGAYMDSEPIISWLARSTHRVKSSPLCALK-KQKTSY 570 Query: 1799 LKDTGPSLLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDSEEKK 1978 L T L S L+ L N S+ + +G+ ++ + + K Sbjct: 571 LSSTRTPL-----------SSLNRDRGKLCSNSASSESVATDGRSGLPVMEKPVYPKGSK 619 Query: 1979 LPYVYFRKRFRNKKEDF--KTKLTQHAVDCDQGGLASISSTVNSMA-------------- 2110 LP VY+RKRFR ++K + + + + TVNS A Sbjct: 620 LPIVYYRKRFRETSNVLCHESKGVHISASVAESVRSLVHHTVNSGALEEHDTSLGRLNPD 679 Query: 2111 ----------PVWEV-KEKLTLVNFQQVIFE-LNLPLQCTLNFTSQCQSFG-----LFRS 2239 P+W K L +N + L L + SFG L + Sbjct: 680 EDLDRLDAFDPLWSTNKAGLLRLNISAIEPRWFRFKLSFLLPSVPRHYSFGSEIVWLIHA 739 Query: 2240 LYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKY- 2416 + + +G L+ WP +H+E+ FVD+ GLRF+LFEGCLK +++ L++ I Q E+ Sbjct: 740 MALLQYGMLMTTWPRIHLEMLFVDNGVGLRFLLFEGCLKEAVAFVFLVLTIFYQPNEQQG 799 Query: 2417 NFTDTEIPCSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC 2569 D ++P +SI S + + R++ F + FS+VE+S+W + KLKK C Sbjct: 800 KCADFQLPITSIRFKFSCIQDFRKQFAFAFHNFSEVENSKWIYLDHKLKKHC 851 >gb|EXC20799.1| hypothetical protein L484_007381 [Morus notabilis] Length = 1690 Score = 298 bits (762), Expect = 1e-77 Identities = 256/875 (29%), Positives = 387/875 (44%), Gaps = 78/875 (8%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R+E + +++P+K RSLDL+S+Y NK ++K + Sbjct: 1 MENRIESSDGAEVPRKSRSLDLKSLYKHRVTKDVQ--------------NKKLKRKASAD 46 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVT 613 ++ K +KK+ KEV LSSL TS + NV K +SS Sbjct: 47 DGDENS--EKKKKKSVKEVSLSSLKNTSSS--SKKNVDKDCHKGLSS------------- 89 Query: 614 TDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNSRKC 793 G + D++ L + +L+G ++ +S N D P+R+RG +K Sbjct: 90 -----GLH-DSKDLKLEAKQKLNGSIGFKSISSLSLN----DDVIQIPRRKRGFVGRKKG 139 Query: 794 KD------------PLSLETEISNPCHAKTNTDVQD---NESIDKDDLNQKALNDXXXXX 928 + L L +IS + + V+ + DD + +++ Sbjct: 
140 EGGHVPRRQGLSCGKLDLVDQISKLSGDDSGSQVESVKVKRTKGFDDFKENRISE----- 194 Query: 929 XXXVEECDMSGSHKDDSAPSTSAEHL------SFIKDENRKAGNRKDHVKMSEQMNLRNR 1090 S S + HL S K RK K+ + +++ + Sbjct: 195 ---------SNSARHAEEEHERVNHLVVSNGDSLFKKSRRKRSKTKN-LSPDDKVGAKEA 244 Query: 1091 ARSADNSVNSVPVSQYD-EENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYFSQ 1267 ADNS SQ D EENLEENAA MLSSRFDPNCT F + + A + + + F Sbjct: 245 EPLADNSTMMCNDSQEDDEENLEENAAMMLSSRFDPNCTGFSSNKASAFA-TVDGLSFLL 303 Query: 1268 SN-----TERMKGSQGKVCSVGSAGSRTLRPR-RHNGKSFARKRRHFYEVFLRDMDPHCL 1429 S+ + R + G A R LRPR +H K +RKRRHFYEVF D+D + Sbjct: 304 SSGRDFVSRRSRSLSGSESPSVDAAGRVLRPRIQHKEKGHSRKRRHFYEVFFGDLDADWV 363 Query: 1430 VKQRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSS 1609 + +RI+VFWPLDQ+WY+GL+ YD +LHHVKYDDRDEEWI+LQ ERFKLLL PSEV Sbjct: 364 LNRRIKVFWPLDQSWYYGLVNDYDREKKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPG 423 Query: 1610 KFNFGKPGLE-------------KENKEAEEVDTEESSYLGS-LAESEPIISWLARTTRR 1747 K + + K+ K+ ++ ++ S +GS +SEPIISWLAR+ RR Sbjct: 424 KAACRRSRIRDRSSVQRKSSSKPKKEKKKGDISMQDDSCIGSNYMDSEPIISWLARSRRR 483 Query: 1748 ATSCPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLDMRSNNLLPNCKK---SDQLY 1918 S P + L+ D +L P N S S + + +K + L Sbjct: 484 VKS-PFHA----LKKQKPSDLSVKPVLPPFSNNAVNSNRCFESGTVRRDKRKFSRNSNLS 538 Query: 1919 DWNINGISERKRSIDS----EEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASI 2086 N + + + +S ++ K+P VYFR+RFR KT L D + Sbjct: 539 GRFANDAMKEESTSESISCPKDSKMPIVYFRRRFR------KTGLELSRGCEDNHACRNT 592 Query: 2087 SSTVNSMAP-----------------------VWEVKE----KLTLVNFQ--QVIFELNL 2179 V S AP +W V + KL L + + F+++ Sbjct: 593 LDPVTSFAPAVDDTRDWVKWDVLLGRLDLGGLLWSVDDAGLLKLMLPGLESGKFKFDVDF 652 Query: 2180 PLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKR 2359 P+ L ++ L S + +G ++ WP VH+E+ FVD++ GLRF+LFEGCL + Sbjct: 653 PILSGLYDIFGVENLWLSHSAVLLHYGTVMIRWPQVHLEMLFVDNVFGLRFLLFEGCLNQ 712 Query: 2360 SISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCISRLDNSREELLFLNLFSKVESSRWK 2539 +++L L+++ +Q E+ F D + L + E F N FS VE+S+W Sbjct: 713 
ALALVFLVVRTFHQPTERVKFVDMPVTSIRFKLTCFQHHKKHLEFAFCN-FSTVENSKWI 771 Query: 2540 RFEGKLKKLCTKEASISTPYIRVNTLYFSTNEISH 2644 + KL++ C + P + + N H Sbjct: 772 YLDRKLRRHCLVTKQLPLPECTYDNIKMLQNRTVH 806 >ref|XP_004162065.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101228859 [Cucumis sativus] Length = 1466 Score = 294 bits (753), Expect = 1e-76 Identities = 249/845 (29%), Positives = 383/845 (45%), Gaps = 43/845 (5%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME +E +DIPKK RSLDL+S+Y NK ++K + Sbjct: 17 MENSLENSHGTDIPKKSRSLDLKSLYESKVSKEVQ--------------NKRLKRKGRA- 61 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVT 613 EDG V K+ ++N+K+V LS+ R S + + + + + S S +++ Sbjct: 62 --EDGDV-QKNERRNRKKVSLSNFSSIYSRSRKSLD-----EVYDAGLGSSGHDSKKALK 113 Query: 614 TDNLQGTNADAQH-----LANDLYSQLSGKTDGAYVTDVSKN--KVFMSDSFLAPKRRRG 772 +++ N+ ++ + ++ + + G +V + ++ L K Sbjct: 114 SESKDKLNSSSEFNEVPLILDENVMHIPKRKRGGFVRRKKSHDGQILKPSGQLDAKAGSL 173 Query: 773 ISNSRKCKDPLSLETEISNPCHAKTNTDVQ---DNESIDKDDLNQKALNDXXXXXXXXVE 943 + + D +I+ ++ V+ N + DL +K + Sbjct: 174 DAKAGSLDDKAGTVDQIAKSSVKDSSDQVECCKTNRKLAFKDLKEKEPKELRLHLKKEDG 233 Query: 944 ECDM---------------SGSHKDDSAPSTSAEHLSFIKDENRKAGNRKDHVKMSEQMN 1078 + D G H D S + K RK RK S+ + Sbjct: 234 QADQLTRENELNPASRLKEEGEHIDHSVVKPVSPSSKKSKKNVRK---RKISASGSKSNS 290 Query: 1079 LRNRARSADNSVNSVPVSQYDEENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIY 1258 A + ++ + DEENLEENAARMLSSRFDPNCT F + KG+ N + Sbjct: 291 KEGEASISQSTKRRDGFPEDDEENLEENAARMLSSRFDPNCTGFXSSNTKGSLPPTNGLS 350 Query: 1259 FSQS----NTERMKGSQGKVCSVGSAGSRTLRPRRHNG-KSFARKRRHFYEVFLRDMDPH 1423 F S N R + SV +AG R LRPR+ K +RKRRHFY++ D+D Sbjct: 351 FLLSSGHDNVSRGLKPGLESASVDAAG-RVLRPRKQRKEKKXSRKRRHFYDILFGDIDAA 409 Query: 1424 CLVKQRIRVFWPLDQNWYFGLIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEV 1603 ++ +RI+VFWPLDQ WY+GL+ YD +LHHVKYDDRDEEWI+LQ ERFKLLL PSEV Sbjct: 410 WVLNRRIKVFWPLDQIWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEV 469 Query: 1604 
SSKFNFGK------PGLEK------ENKEAEEVDTEESSYLGSLAESEPIISWLARTTRR 1747 + K P EK + KE + V E+ +GS +SEPIISWLAR+T R Sbjct: 470 PGREERRKSAVGNDPANEKGRSGSRKGKETDAVILEDDCNIGSYMDSEPIISWLARSTHR 529 Query: 1748 ATSCPSNTTKIHLRVSPLKDTGPSLLLEPPKGNIRMSQLDMRSNNLLPNCKKSDQLYDWN 1927 S PS+ +K + S L S E P +N L+ + ++L D + Sbjct: 530 NKSSPSHNSK-RQKTSSLSSKSGSQANEKP------------ANLLVKSSGMPERLADVD 576 Query: 1928 INGISERKRSIDSEEKKLPYVYFRKRFRNKKEDFKTKLTQHAVDCDQGGLASISSTVNSM 2107 S + + S +KLP VYFRKRFRN + H + D S +S S Sbjct: 577 GPEKSASETTTCSTTRKLPIVYFRKRFRNIGTEMP-----HKRETDFASRRSHASLSFSF 631 Query: 2108 APVWEVKEKLTLVNFQQVIFELNLPLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDV 2287 + K ++ L Q+ I + + + + HG L WP V Sbjct: 632 LILMMWKNQIFLPEGQKRIGYYGVLMMLAM----------------LIQHGTLTLLWPKV 675 Query: 2288 HMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCIS 2467 +E+ FVD++ GLRF+LFEGCL ++++ L++K+ ++ + D + P +SI S Sbjct: 676 QLEMLFVDNVVGLRFLLFEGCLMQAVAFIFLVLKMFQSPGKQGRYADFQFPVTSIRFKFS 735 Query: 2468 RLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLCTKEASISTPYIRVNTLYFSTNEISH 2644 L + ++L+F + FS+++ S+W + +LKK C IS Y + ++ + Sbjct: 736 CLQDIGKQLVFAFHNFSEIKYSKWVHLD-RLKKYCL----ISKQLPLTECTYDNIKKLQN 790 Query: 2645 SSKQF 2659 S QF Sbjct: 791 SKTQF 795 >ref|XP_006601120.1| PREDICTED: uncharacterized protein LOC100789801 isoform X1 [Glycine max] gi|571538233|ref|XP_006601121.1| PREDICTED: uncharacterized protein LOC100789801 isoform X2 [Glycine max] Length = 1602 Score = 285 bits (729), Expect = 7e-74 Identities = 245/859 (28%), Positives = 397/859 (46%), Gaps = 60/859 (6%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R + ++ IPKK RSLDL+S+Y K K++G+ Sbjct: 1 MEGRAQNSNDTTIPKKSRSLDLKSLYKSKLTENTA---------------KKNLKRIGN- 44 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVT 613 G + +KK +KEV LSSL + SS + +SS + + + + SV Sbjct: 45 -SSGGGDEKRKKKKARKEVSLSSL----ENGDGSSELKLGVSQKLSSSSSTLNRVSFSVG 99 Query: 614 TDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNSRKC 793 D++Q + + S 
+ V + S K+ +D PK Sbjct: 100 DDDVQIPKRKRSFVGR----KKSELGLASKVVEQSGLKIGYNDQ--VPKLG--------- 144 Query: 794 KDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXXVEECDMSGSHKD 973 D L E K + ++N + D + V+ +G Sbjct: 145 SDDLGSGVESFKIKRKKEFDEFKENRNSDSNS----------------VQHAKENGDCAS 188 Query: 974 DSAPSTSAEHLSFIKDENRKA-GNRKDHVKMSEQMNLRNRARSADNSVNSVPVS---QYD 1141 S ++ LS + ++RK + D K+S++ A+ V+S +S Q + Sbjct: 189 HSVVNSGDSSLSKSRRQHRKRKASAIDSTKVSKE---------AEPLVSSSKISDDLQDE 239 Query: 1142 EENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYFSQSNTER-----MKGSQGKV 1306 EENLEENAARMLSSRFDP+CT F K +N + F QS+++ +K G Sbjct: 240 EENLEENAARMLSSRFDPSCTGFSMK-------GSNGLSFFQSSSQSIVNHSLKSPLGSE 292 Query: 1307 CSVGSAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYFG 1483 + R LRPR+ + KS +RKRRHFYE+ L D+D + ++ +RI++FWPLDQ+WY+G Sbjct: 293 STSADTAGRVLRPRKQYKNKSNSRKRRHFYEILLGDVDAYWVLNRRIKIFWPLDQSWYYG 352 Query: 1484 LIKGYDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEV------------SSKFNFGK 1627 L+ YD ++L+H+KYDDRD +W+NLQ ERFKLLL SEV S F+ K Sbjct: 353 LVDNYDEGSKLYHIKYDDRDVKWVNLQTERFKLLLLRSEVPGNAKGERALMKRSSFDHQK 412 Query: 1628 PGLEKENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKD 1807 ++ ++ E + + S +SEPIISWLAR++ R S K V+ Sbjct: 413 GSKSRKERQRTEENAGDDRCGESSMDSEPIISWLARSSHRLRSI-QGIKKQKTSVTVPST 471 Query: 1808 TGPSLLLEP--PKGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDSEEKKL 1981 T L EP KG++ S + N D+ + + S + +++ K Sbjct: 472 TSSFLYDEPVTAKGHLAKSSVRDVEKNFSTGSVSQDK-FSEDFKDKSSLQSVTCAKDGKQ 530 Query: 1982 PYVYFRKRFRNKKEDFKTKLTQH-----------AVDCDQGGLASISSTVNSMAPV---- 2116 P VYFR+R+ +K +++ A+D GG+ ++ + ++S V Sbjct: 531 PIVYFRRRWVHKPAPISPHISEENHAIISASGSVALDHMFGGVENVKNPIDSRVEVGGPL 590 Query: 2117 ------------WEVKEKLTLVNFQQVIFELNLPLQCTLNFTSQCQSFGLFRSLYIADHG 2260 W++K +F+ F LN P++ LN Q ++ L ++ + G Sbjct: 591 FFTYKAGVPKVFWDMKS----ASFK---FGLNFPMRLVLNDFFQSENLWLLYTVLLLRFG 643 Query: 2261 ELVCGWPDVHMEVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNFTDTEIP 2440 ++ WP V++E+ FVD++ GLRF+LFEGCL + + ++++ +Q + + D + P Sbjct: 644 
TVMAKWPRVYLEMLFVDNVVGLRFLLFEGCLNTAAAFVFFVLRVFHQPDCQGKYVDLQFP 703 Query: 2441 CSSIVLCISRLDNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TKEASIS-TPYIRV 2608 C+SI S + ++ L+F FS+V++S+W + KLK+ C +K+ +S Y + Sbjct: 704 CTSIGFKFSSVHVIKKPLVFEFYNFSEVKNSKWMHLDSKLKEHCLLSKQLHLSECTYDNI 763 Query: 2609 NTLY-----FSTNEISHSS 2650 L FS IS SS Sbjct: 764 QALQNGSRRFSITSISGSS 782 >ref|XP_006601123.1| PREDICTED: uncharacterized protein LOC100792436 isoform X2 [Glycine max] Length = 1469 Score = 283 bits (725), Expect = 2e-73 Identities = 250/848 (29%), Positives = 393/848 (46%), Gaps = 49/848 (5%) Frame = +2 Query: 254 MEIRVEKIEESDIPKKPRSLDLQSIYXXXXXXXXXXXXXXXXXXXXXXXNKAPQKKLGSP 433 ME R E ++ I KK RSLDL+S+Y K K++G+ Sbjct: 1 MEGRAENTNDTAILKKSRSLDLKSLYKSKLTENTA---------------KKNLKRIGN- 44 Query: 434 FEEDGVVLPKSRKKNKKEVFLSSLVPTSKRQPNSSNVPKSNKDHISSVNRSNDTSNRSVT 613 G + +KK +K+VFLSSL + SS + +SS + + + + SV Sbjct: 45 -SSGGGDEKRKKKKARKKVFLSSL----ENGDGSSELKLGVSQRLSSSSSTLNRISFSVG 99 Query: 614 TDNLQGTNADAQHLANDLYSQLSGKTDGAYVTDVSKNKVFMSDSFLAPKRRRGISNSRKC 793 D++Q + + S + V + S K+ D PK Sbjct: 100 DDDVQIPKRKRSFVGR----KKSELVQASKVVEQSGLKIGYGDQ--VPKLG--------- 144 Query: 794 KDPLSLETEISNPCHAKTNTDVQDNESIDKDDLNQKALNDXXXXXXXXVEECDMSGSHKD 973 D L E H K + ++N + D + + V+E SH Sbjct: 145 SDDLGSGVESFKIKHTKEFDEFKENRNSDSNSVQH-------------VKEDGDCASH-- 189 Query: 974 DSAPSTSAEHLSFIKDENRK-AGNRKDHVKMSEQMNLRNRARSADNSVNSVPVS---QYD 1141 S ++ LS + +NRK + D K+S++ A+ V+S + Q + Sbjct: 190 -SVVNSGDSSLSKSRRKNRKRKASALDRTKVSKE---------AEPLVSSCKIPGDLQDE 239 Query: 1142 EENLEENAARMLSSRFDPNCTDFCAKRLKGAADSANDIYFSQSNTER-MKGSQGKVCSVG 1318 EENLEENAARMLSSRFDP+CT F K L G + SQS R +K G + Sbjct: 240 EENLEENAARMLSSRFDPSCTGFSMKGLNGLPFFGSS---SQSIVNRGLKSQSGSESASA 296 Query: 1319 SAGSRTLRPRR-HNGKSFARKRRHFYEVFLRDMDPHCLVKQRIRVFWPLDQNWYFGLIKG 1495 R LRPR+ + K +RKRRHFY++ L D++ + ++ +RI++FWPLDQ+WY+G + Sbjct: 297 DTAGRILRPRKQYKNKGDSRKRRHFYKILLGDVNAYWVLNRRIKIFWPLDQSWYYGFVDN 356 Query: 1496 YDPVTRLHHVKYDDRDEEWINLQKERFKLLLFPSEVSSKFNFGKPGL-----------EK 1642 YD 
++L+H+KYDDRD EW+NL ERFKLLL SEV G+ L K Sbjct: 357 YDEGSKLYHIKYDDRDVEWVNLHTERFKLLLLRSEVPGNAK-GERALTKRRSSDHQKGSK 415 Query: 1643 ENKEAEEVDTEESSYLGSLAESEPIISWLARTTRRATSCPSNTTKIHLRVSPLKDTGPSL 1822 +KE + ++ S S+ +SEPIISWLAR++ R S K + T S Sbjct: 416 SSKERQRTTEDDRSGESSM-DSEPIISWLARSSHRLRSSFQGIKK-QKTSGTIPSTMSSF 473 Query: 1823 LLEPP---KGNIRMSQLDMRSNNLLPNCKKSDQLYDWNINGISERKRSIDSEEKKLPYVY 1993 L + P KG++ L NN + D+L D + S + +++ K P VY Sbjct: 474 LYDEPVTAKGHLAKISLRGVKNNFSSDSVSQDKLSD-DFRDKSSLLSATATKDGKQPIVY 532 Query: 1994 FRKRFRNKKEDFKTKLTQ--HAVDCDQG---------GLASISSTVNSMAPV-----WEV 2125 FR+R R K +++ +A+ G G+ + + N A V + + Sbjct: 533 FRRRIR-KPAPISPHISEENYAITGASGSVAFNHMFCGVEKMKNPSNGRAEVGGPLCFTL 591 Query: 2126 KEKLTLVNFQ----QVIFELNLPLQCTLNFTSQCQSFGLFRSLYIADHGELVCGWPDVHM 2293 K ++ + + F LN P++ LN Q ++ L S+ + G ++ WP V + Sbjct: 592 KAGVSKIFWDMESASFKFGLNFPMRLVLNDFFQSENLWLLYSVLLLRFGTVMTKWPRVCL 651 Query: 2294 EVFFVDDIPGLRFILFEGCLKRSISLFCLIIKILNQHVEKYNFTDTEIPCSSIVLCISRL 2473 E+ FVD++ GLRF+LFEGCL + + F ++++ +Q + + D + PC+SI S + Sbjct: 652 EMLFVDNVVGLRFLLFEGCLNMAAAFFFFVLRVFHQPAYRGKYVDLQFPCTSIGFKFSSV 711 Query: 2474 DNSREELLF-LNLFSKVESSRWKRFEGKLKKLC--TKEASIS-TPYIRVNTLY-----FS 2626 ++ L+F FS+V++S+W + KLK+ C +K+ +S Y + L FS Sbjct: 712 HVIKKPLVFEFYNFSEVKNSKWMCLDSKLKRHCLLSKQLHLSECTYDNIQALQNGSCRFS 771 Query: 2627 TNEISHSS 2650 +S SS Sbjct: 772 ITSVSGSS 779