BLASTX nr result
ID: Mentha25_contig00053110
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00053110
         (1074 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU35582.1| hypothetical protein MIMGU_mgv1a000247mg [Mimulus...   394   e-107
ref|XP_006484533.1| PREDICTED: ENHANCER OF AG-4 protein 2-like i...   229   2e-57
ref|XP_006437587.1| hypothetical protein CICLE_v100305712mg, par...   229   2e-57
ref|XP_007225468.1| hypothetical protein PRUPE_ppa000196mg [Prun...   218   3e-54
ref|XP_002277110.2| PREDICTED: uncharacterized protein LOC100255...   214   4e-53
gb|EXB55170.1| hypothetical protein L484_018096 [Morus notabilis]     205   2e-50
ref|XP_006590799.1| PREDICTED: ENHANCER OF AG-4 protein 2-like i...   204   5e-50
ref|XP_007131630.1| hypothetical protein PHAVU_011G029400g, part...   201   4e-49
ref|XP_004297740.1| PREDICTED: ENHANCER OF AG-4 protein 2-like [...   200   1e-48
ref|XP_006286897.1| hypothetical protein CARUB_v10000040mg [Caps...   198   4e-48
ref|XP_007034335.1| Tudor/PWWP/MBT domain-containing protein, pu...   197   6e-48
ref|XP_007034332.1| Tudor/PWWP/MBT domain-containing protein, pu...   197   6e-48
ref|XP_007034330.1| Tudor/PWWP/MBT domain-containing protein, pu...   197   6e-48
ref|XP_007034329.1| Tudor/PWWP/MBT domain-containing protein, pu...   197   6e-48
ref|XP_002520919.1| conserved hypothetical protein [Ricinus comm...   196   1e-47
ref|XP_002872040.1| enhancer of ag-4 2 [Arabidopsis lyrata subsp...   195   3e-47
ref|XP_002310078.2| hypothetical protein POPTR_0007s07750g [Popu...   194   7e-47
ref|NP_197706.1| ENHANCER OF AG-4 protein 2 [Arabidopsis thalian...
194 7e-47 emb|CBI27142.3| unnamed protein product [Vitis vinifera] 194 7e-47 ref|XP_006394583.1| hypothetical protein EUTSA_v10003519mg [Eutr... 193 1e-46 >gb|EYU35582.1| hypothetical protein MIMGU_mgv1a000247mg [Mimulus guttatus] Length = 1370 Score = 394 bits (1012), Expect = e-107 Identities = 216/372 (58%), Positives = 256/372 (68%), Gaps = 35/372 (9%) Frame = +1 Query: 46 TKSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQA 225 TKSELSLGDLVLAKVKGFPAWPAKI RPEDW+ +PDPKKYFVQFFGT EIAFVAPADIQA Sbjct: 14 TKSELSLGDLVLAKVKGFPAWPAKIGRPEDWERSPDPKKYFVQFFGTAEIAFVAPADIQA 73 Query: 226 FTSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHS-- 399 FTSESKNKLT RCQGKTV++FA+AVKEIC+EFE LQRKNL G+RD +NA+NLASETHS Sbjct: 74 FTSESKNKLTTRCQGKTVRFFAKAVKEICEEFEVLQRKNLGGVRDDNNAQNLASETHSVD 133 Query: 400 ----EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMN 567 EA EVS+ NG D+E P+CKLE KGL+D S L+H + EM+ Q+VKPC SD MN Sbjct: 134 PLVDEALEVSINNGIDNEGPSCKLEVKGLTDQGSELEHSSQRQDEMECQDVKPCLSDVMN 193 Query: 568 RCSSPHVSSRERNKSCTESTNLGK---------------------------ETNGDQSAS 666 SPH+SS ++NK T +N K + Q Sbjct: 194 HGLSPHLSSGKKNKLSTNPSNQMKGAELRSSPSKQAFVKEEGSRGVKVKERHPDAGQGEL 253 Query: 667 TNGRQPKLATEIKRKHDGAKRRNCDAVVSRDHNEDVVQMKHASGGNIKVSSADNSRSDLG 846 TNG QPKL T KRKH+G R+ ++ S + D Q + GGNIK+SSADNS+S Sbjct: 254 TNGHQPKLVTGTKRKHEGTMHRDIGSIKSPKYIGDGGQKPYVLGGNIKLSSADNSKSGAS 313 Query: 847 IGSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQEKQASRTNEAS 1026 IGSER GKK LK KK S AV D +GD+E+ +E++SE+I RKKMK +HD +KQ SR +EAS Sbjct: 314 IGSERKGKKLLKEKKPSEAVDDIQGDSEIMAEEHSEIISRKKMKIRHDHQKQTSRRDEAS 373 Query: 1027 DP--PKMDDMED 1056 P PK D D Sbjct: 374 LPKMPKGADNAD 385 >ref|XP_006484533.1| PREDICTED: ENHANCER OF AG-4 protein 2-like isoform X1 [Citrus sinensis] Length = 1446 Score = 229 bits (583), Expect = 2e-57 Identities = 152/371 (40%), Positives = 205/371 (55%), Gaps = 33/371 (8%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 KS+LSLGDLVLAKVKGFPAWPAKISRPEDW 
APDPKKYFVQFFGT+EIAFVAP DIQAF Sbjct: 15 KSQLSLGDLVLAKVKGFPAWPAKISRPEDWDRAPDPKKYFVQFFGTQEIAFVAPVDIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHS-EA 405 TSESK+KL+ARCQGKTVKYFAQAVKEIC FEELQ+K S R ++ L E S + Sbjct: 75 TSESKSKLSARCQGKTVKYFAQAVKEICVAFEELQKKKSSESRLDNDRSALGFEAASVDG 134 Query: 406 SEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVK---PCSSDDMNRCS 576 +V + +G + PN + +T+ + D ++L+ C LGE + +++K C +DD+ Sbjct: 135 EDVDLKDGTCAVIPNGETKTEDICDFGTKLEPCSNSLGETESEDIKRSISCHADDI---L 191 Query: 577 SPHVSSRERNK-----------SCTESTNLGKETNGDQSASTNGRQ-PKLATEIKRKHDG 720 SP +SS + K S ++ K + Q A NG + K+A+ K+ DG Sbjct: 192 SPVLSSEKNMKVSNGSQSKDEASSDNKEDINKHPDKGQKAFPNGHKLKKMASGSKKAFDG 251 Query: 721 A---KRRNCDAVVSRD--------------HNEDVVQMKHASGGNIKVSSADNSRSDLGI 849 + ++ N D +D ++D+ K AS G++ S D +SD I Sbjct: 252 SVGGQKGNLDVTSLKDDSSGQCVNIPDSDKQHKDISDGKIASNGSMAELSQDGLKSDSDI 311 Query: 850 GSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQEKQASRTNEASD 1029 G+ + K L+ K+ + G + + EV KK K TN + Sbjct: 312 GTGKT-KDLLRAKRG----FKGSDVEDTIASSKGEVSGNKKSAQAGTTGKLRLGTNGNLN 366 Query: 1030 PPKMDDMEDSR 1062 P K DS+ Sbjct: 367 PVKKSKCIDSK 377 >ref|XP_006437587.1| hypothetical protein CICLE_v100305712mg, partial [Citrus clementina] gi|557539783|gb|ESR50827.1| hypothetical protein CICLE_v100305712mg, partial [Citrus clementina] Length = 700 Score = 229 bits (583), Expect = 2e-57 Identities = 152/371 (40%), Positives = 205/371 (55%), Gaps = 33/371 (8%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 KS+LSLGDLVLAKVKGFPAWPAKISRPEDW APDPKKYFVQFFGT+EIAFVAP DIQAF Sbjct: 15 KSQLSLGDLVLAKVKGFPAWPAKISRPEDWDRAPDPKKYFVQFFGTQEIAFVAPVDIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHS-EA 405 TSESK+KL+ARCQGKTVKYFAQAVKEIC FEELQ+K S R ++ L E S + Sbjct: 75 TSESKSKLSARCQGKTVKYFAQAVKEICVAFEELQKKKSSESRLDNDRSALGFEAASVDG 134 Query: 406 SEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVK---PCSSDDMNRCS 576 +V + +G + PN + +T+ + D ++L+ C LGE + +++K C +DD+ 
Sbjct: 135 EDVDLKDGTCAVIPNGETKTEDICDFGTKLEPCSNSLGETESEDIKRSISCHADDI---L 191 Query: 577 SPHVSSRERNK-----------SCTESTNLGKETNGDQSASTNGRQ-PKLATEIKRKHDG 720 SP +SS + K S ++ K + Q A NG + K+A+ K+ DG Sbjct: 192 SPVLSSEKNMKVSNGSQSKDEASSDNKEDINKHPDKGQKAFPNGHKLKKMASGSKKAFDG 251 Query: 721 A---KRRNCDAVVSRD--------------HNEDVVQMKHASGGNIKVSSADNSRSDLGI 849 + ++ N D +D ++D+ K AS G++ S D +SD I Sbjct: 252 SVGGQKGNLDVTSLKDDSSGQCVNIPDSDKQHKDISDGKVASNGSMAELSQDGLKSDSDI 311 Query: 850 GSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQEKQASRTNEASD 1029 G+ + K L+ K+ + G + + EV KK K TN + Sbjct: 312 GTGKT-KDLLRAKRG----FKGSDVEDTIASSKGEVSGNKKSAQAGTTGKLRLGTNGNLN 366 Query: 1030 PPKMDDMEDSR 1062 P K DS+ Sbjct: 367 PVKKSKCIDSK 377 >ref|XP_007225468.1| hypothetical protein PRUPE_ppa000196mg [Prunus persica] gi|596285528|ref|XP_007225469.1| hypothetical protein PRUPE_ppa000196mg [Prunus persica] gi|462422404|gb|EMJ26667.1| hypothetical protein PRUPE_ppa000196mg [Prunus persica] gi|462422405|gb|EMJ26668.1| hypothetical protein PRUPE_ppa000196mg [Prunus persica] Length = 1480 Score = 218 bits (555), Expect = 3e-54 Identities = 151/390 (38%), Positives = 196/390 (50%), Gaps = 53/390 (13%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 KS+LSLGDLVLAKVKGFP WPAKISRPEDWK PDPKKYFVQFFGTEEIAFVAPADIQAF Sbjct: 15 KSQLSLGDLVLAKVKGFPYWPAKISRPEDWKKVPDPKKYFVQFFGTEEIAFVAPADIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSN---AENLASETHS 399 TSE K KLT R GKT K F+QAVK+IC+EF+ELQ+K + +RD ++ + + Sbjct: 75 TSELKVKLTGRLPGKT-KNFSQAVKDICEEFDELQKKKSNDLRDDTDPGCEVPSVNGVEN 133 Query: 400 EASEVSVMNG----RDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMN 567 EV + +G +DS K E +G+ D S+L+ C +I GE ++V P +S N Sbjct: 134 NGVEVELKDGGEGTQDSNGETLK-EEEGIGDFGSKLERCSQIRGENGIEDVNPSTSCGAN 192 Query: 568 RCSSPHVSSRERNKSCT---------------ESTNLGKETNGD---------------Q 657 SSP +SS +NK S N+ ++ +G Q Sbjct: 193 ESSSPIISSETKNKMSAVSQPKKEVLKKSNPDNSCNMKEDVSGSKHEEDGVRTKKHSERQ 252 Query: 658 
SASTNGRQPKLATEIKRKHDGAKRRN----------------CDAVVSRDHNEDVVQMKH 789 + NG + T KRKHDG + D S + D + K Sbjct: 253 RSLANGHKSMKITGSKRKHDGTVEGHKNSFSVTSLKEDGSVFLDRPKSGERLRDGTKGKL 312 Query: 790 ASGGNIKVSSADNSRSDLGIGSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRK 969 SGG + S D +SD GI + K LK K AV D + + + + + + Sbjct: 313 GSGGRKREFSPDARKSDSGIRGGKKAKDLLKAKNQIEAVDDMKDSVDDPVDQAKDKLSGR 372 Query: 970 KMKFQHDQEKQASRTNEASDPPKMDDMEDS 1059 K Q K +N+ S P K DS Sbjct: 373 TKKVQLGLGKLNLESNDISHPAKKSKHVDS 402 >ref|XP_002277110.2| PREDICTED: uncharacterized protein LOC100255898 [Vitis vinifera] Length = 1565 Score = 214 bits (546), Expect = 4e-53 Identities = 117/210 (55%), Positives = 139/210 (66%), Gaps = 6/210 (2%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 KSEL LGDLVLAKVKGFPAWPAKI +PEDW PDPKKYFVQFFGTEEIAFVAP DI+AF Sbjct: 15 KSELRLGDLVLAKVKGFPAWPAKIGKPEDWDRTPDPKKYFVQFFGTEEIAFVAPGDIEAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHS--- 399 TSE KNKL+ARC+GKTVK+FAQAVKEIC +EELQ+KN SG RD + SE S Sbjct: 75 TSEVKNKLSARCRGKTVKFFAQAVKEICDAYEELQQKNTSGSRDDRDRTAPESEAPSVDG 134 Query: 400 ---EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNR 570 + E + +G + N + +GL D S L+HC GE D Q+VKP +S N Sbjct: 135 VGDDRVEDDLKDGIGTVRLNGETVIEGLGDCGSGLEHCFHKQGEPDDQDVKPATSAHAND 194 Query: 571 CSSPHVSSRERNKSCTESTNLGKETNGDQS 660 SP + S ++NK+ + KET S Sbjct: 195 NLSPAIFSEKKNKA-SNGARTPKETESTSS 223 >gb|EXB55170.1| hypothetical protein L484_018096 [Morus notabilis] Length = 1409 Score = 205 bits (522), Expect = 2e-50 Identities = 139/335 (41%), Positives = 184/335 (54%), Gaps = 33/335 (9%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 K +LSLGDLVLAKVKGFP WPAKISRPEDWK DPKKYFVQFFGTEEIAFVAPADIQAF Sbjct: 15 KGQLSLGDLVLAKVKGFPFWPAKISRPEDWKKPHDPKKYFVQFFGTEEIAFVAPADIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHS--- 399 TSE+K KL+ARCQGK K F QAVK+IC+ F+ELQ+ S +RD ++ L E S Sbjct: 75 
TSEAKAKLSARCQGK-AKPFTQAVKQICEAFDELQKNKSSDLRDDTDRSELGCEVRSIDG 133 Query: 400 --------EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSS 555 + + S M G D E N + + D S+L+ C + GE D Q++KP Sbjct: 134 VENNEADADTKDGSGMIGSDEETMN-----EEIGDSSSKLERCSQRRGESDNQDLKPF-- 186 Query: 556 DDMNRCSSPHVSSRERNKSCTESTNLGK-------------------ETNGDQSASTNGR 678 ++ CSS VSS ++ E + K +G ++ S + Sbjct: 187 --VDACSSGGVSSALSSEKKGEILEVAKSKEVIVKSEPDSSNPEEVLSDDGQRAVSNGHK 244 Query: 679 QPKLATEIKRKHDGAKRRNCDAVVSRDHNEDVVQMKHASGGNIKVSSADNSRSDLGIGSE 858 K+ +E KRK +G + D S + +D ++ K+A+GG+ K +N R GSE Sbjct: 245 LKKMGSESKRKSEGGLEVHKDP-KSCEQLKDGMKKKNATGGSRKEYFLENKR-----GSE 298 Query: 859 RIGKKWLKGK---KHSAAVYDGRGDAEVFSEDNSE 954 G K KG+ K+ V + + V E+ SE Sbjct: 299 TCGGKKAKGEAKTKNHLKVPNDTHRSSVDPEEQSE 333 >ref|XP_006590799.1| PREDICTED: ENHANCER OF AG-4 protein 2-like isoform X1 [Glycine max] Length = 1453 Score = 204 bits (519), Expect = 5e-50 Identities = 125/291 (42%), Positives = 164/291 (56%), Gaps = 8/291 (2%) Frame = +1 Query: 58 LSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAFTSE 237 LSLGDLVLAKVKGFPAWPAKISRPEDW PDPKKYFVQFFGT+EIAFVAPADIQAFTSE Sbjct: 18 LSLGDLVLAKVKGFPAWPAKISRPEDWDKVPDPKKYFVQFFGTKEIAFVAPADIQAFTSE 77 Query: 238 SKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHSEASEVS 417 +KNKL+AR QGKT KYFAQAVKEIC F+E+Q++ SG+ D ++ ++ SE S V Sbjct: 78 AKNKLSARLQGKT-KYFAQAVKEICAAFDEMQKQKASGLADDTDDSHIGSEAPSNDGVVG 136 Query: 418 VMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNR---CSSPHV 588 + N + + + ++ S L++C+ +GE D Q+ K S+ N SSP + Sbjct: 137 NLKDAADAVSNAEKDNIDMDNVCSNLEYCVPRIGENDSQDEKLSVSNHPNESSSVSSPVI 196 Query: 589 SSR-----ERNKSCTESTNLGKETNGDQSASTNGRQPKLATEIKRKHDGAKRRNCDAVVS 753 ++ E K+ +S+ G D NG RK D R+ +A Sbjct: 197 KNKLAIGSETKKNANKSSFKGASNVNDFRQDANGHSDLTNGTKTRKLDNGSRKKSEAASG 256 Query: 754 RDHNEDVVQMKHASGGNIKVSSADNSRSDLGIGSERIGKKWLKGKKHSAAV 906 + N K GN R DL E + K +K +K++ +V Sbjct: 257 SNRNGGSSTGKFMKEGNC------TGRGDLSRSGETL--KAVKKRKNAFSV 299 >ref|XP_007131630.1| hypothetical protein 
PHAVU_011G029400g, partial [Phaseolus vulgaris] gi|561004630|gb|ESW03624.1| hypothetical protein PHAVU_011G029400g, partial [Phaseolus vulgaris] Length = 863 Score = 201 bits (511), Expect = 4e-49 Identities = 126/294 (42%), Positives = 171/294 (58%), Gaps = 33/294 (11%) Frame = +1 Query: 58 LSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAFTSE 237 LSLGDLVLAKVKGFPAWPAKISRPEDW+ PDPKKYFVQFFGT+EIAFVAPADIQAFTSE Sbjct: 18 LSLGDLVLAKVKGFPAWPAKISRPEDWEKIPDPKKYFVQFFGTKEIAFVAPADIQAFTSE 77 Query: 238 SKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHS-EASEV 414 +KNKL+AR GKT K+F+QAVKEIC F+E+Q++ SG+ D ++ ++ SE S + V Sbjct: 78 AKNKLSARLHGKT-KHFSQAVKEICAAFDEMQKQKASGLTDDTDDSHIGSEAPSNDGVVV 136 Query: 415 SVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNR---CSSPH 585 ++ + DS N + + + ++ S L+HC + +GE D + K SD N SSP Sbjct: 137 NLKDAIDSVLSNAEKDNIDMDNVGSNLEHCTQRVGENDSLDEKYSVSDHPNESSSVSSPV 196 Query: 586 VSSR-----ERNKSCTESTNLGKETNGD-------QSASTNGRQP-KLATEIKRKHDGA- 723 + S+ E K+ +S+ G D S TNG +P KL ++++ + A Sbjct: 197 IKSKLSMGSEPKKNANKSSLKGASNVNDFGQDDNRHSGITNGTKPRKLVNGLRKRSEAAG 256 Query: 724 -------------KRRNCD--AVVSRDHNEDVVQMKHASGGNIKVSSADNSRSD 840 K NC +SR K + ++K+ S D +SD Sbjct: 257 DRDRNGGSSTGVLKEGNCTGRGDLSRSRETMKAGKKRKTAFDVKLDSPDTLKSD 310 >ref|XP_004297740.1| PREDICTED: ENHANCER OF AG-4 protein 2-like [Fragaria vesca subsp. 
vesca] Length = 1458 Score = 200 bits (508), Expect = 1e-48 Identities = 135/334 (40%), Positives = 177/334 (52%), Gaps = 52/334 (15%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 K++LSLGDLVLAKVKG P WPAKIS+PEDW+ PDPKKYFVQFFGTEEIAFVAP DIQAF Sbjct: 15 KAQLSLGDLVLAKVKGHPFWPAKISKPEDWQKVPDPKKYFVQFFGTEEIAFVAPVDIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIR-DYSNAENLASETHSEA 405 TS+SK+K++ARCQGK+ KYF+QAVKEIC+ F+ELQ+KN + +R D +++ + Sbjct: 75 TSDSKSKISARCQGKS-KYFSQAVKEICEAFDELQKKNSNDLRVDTDRSDHGCDALSVDG 133 Query: 406 SEVSVMN----------GRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSS 555 E + +N G D E K E G D S+L+ C ++ GE D ++V P +S Sbjct: 134 VEDNGVNVEIKDDKGVVGSDGE--TVKEECTG--DFGSKLERCSQLRGENDTEDVDPSTS 189 Query: 556 DDMNRCSSPHVSSRERNKSC-----------------TESTNLGKETNGDQSASTNGRQP 684 SSP SS E++K TE ++L E + S Q Sbjct: 190 CGAKESSSPVFSSEEKDKMSSVVHPKVPKTSNSSHLKTEVSDLKHEDDDIHSKKHGEGQR 249 Query: 685 KLATEIK-RKHDGAKRRN-----------------------CDAVVSRDHNEDVVQMKHA 792 L K K G+K+R+ D S D D K Sbjct: 250 SLVNGHKMTKSSGSKKRSDGMVEVHKGSSLTSLKEDGSIGCVDRPQSHDRLRDGTTGKTV 309 Query: 793 SGGNIKVSSADNSRSDLGIGSERIGKKWLKGKKH 894 SG N + S D+ + + GIG + K LK KK+ Sbjct: 310 SGSNKRKLSQDSLKPETGIGDGKRSKDLLKAKKY 343 >ref|XP_006286897.1| hypothetical protein CARUB_v10000040mg [Capsella rubella] gi|482555603|gb|EOA19795.1| hypothetical protein CARUB_v10000040mg [Capsella rubella] Length = 1402 Score = 198 bits (503), Expect = 4e-48 Identities = 138/347 (39%), Positives = 185/347 (53%), Gaps = 17/347 (4%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 K +L LGDLVLAKVKGFPAWPAKISR EDW APDPKKYFVQFFGTEEIAFVAP DIQAF Sbjct: 15 KGQLILGDLVLAKVKGFPAWPAKISRAEDWNRAPDPKKYFVQFFGTEEIAFVAPPDIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHSEAS 408 TSE+K+KL ARCQGKTVKYFAQAV++IC FE LQ + L +E +A+ Sbjct: 75 TSEAKSKLLARCQGKTVKYFAQAVQDICTAFEALQN---------HKSNILGNEDPLDAA 125 Query: 409 
EVSVMNGRDSEE-PNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRCSSPH 585 E S+ + + E+ G ++D+R+D CL ++D N K ++ R SS Sbjct: 126 EPSLRKAEKVDRTDHIYTESDGTDNVDTRVDPCLP---KVDKNNGKDTKAEKGKRDSSSF 182 Query: 586 VSSR-ERNKSCTESTNLG------KETNGDQSASTN-------GRQPKLAT--EIKRKHD 717 + S+ S +ES G K+ + D+ TN Q KLA +IK+ D Sbjct: 183 LESKITTTSSGSESPQHGSDDPKIKDEDFDKGTDTNACVEQFGNGQKKLANGRKIKKVAD 242 Query: 718 GAKRRNCDAVVSRDHNEDVVQMKHASGGNIKVSSADNSRSDLGIGSERIGKKWLKGKKHS 897 G+ R++ D V + D H GG ++D S+ G+ +E+ K GK + Sbjct: 243 GSDRKDEDTV-----HRDKSNNSHVPGGRAASGNSD-SKKFKGLLTEKSSSKVSAGKNEN 296 Query: 898 AAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQEKQASRTNEASDPPK 1038 + + G + KK + + + K A R +E S K Sbjct: 297 SPGFKGG-------------VSGKKRRLETELGKPALRVDETSRAAK 330 >ref|XP_007034335.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 7 [Theobroma cacao] gi|508713364|gb|EOY05261.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 7 [Theobroma cacao] Length = 1411 Score = 197 bits (501), Expect = 6e-48 Identities = 138/382 (36%), Positives = 194/382 (50%), Gaps = 47/382 (12%) Frame = +1 Query: 58 LSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAFTSE 237 LSLGDLVLAKVKGFP WPAKISRPEDW+ PDPKKYFVQFFGT+EIAFVAP DIQAFTSE Sbjct: 17 LSLGDLVLAKVKGFPPWPAKISRPEDWEREPDPKKYFVQFFGTQEIAFVAPGDIQAFTSE 76 Query: 238 SKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASE------THS 399 +K+KL+A+CQ +T K+F QAVKEIC F+EL + SG+RD ++ E T Sbjct: 77 TKSKLSAKCQVRT-KHFVQAVKEICVAFDELHEEKWSGLRDETDRSTPGCEASSVDGTED 135 Query: 400 EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRCSS 579 + +EV + NG + P + ++G DL S L+ C GE++ +++KP S + CS Sbjct: 136 DGAEVDLKNGTGAVAPGRETTSEGKGDLASNLERC-SCRGEINSEDIKPSISGHADDCSF 194 Query: 580 PHVSSRERNK---------------SCTESTNLGKETNGDQSASTNGRQPKLATEIKRKH 714 +SS ++K S E +++ +E +GD+ A+ N + L + K K Sbjct: 195 LIMSSEVKHKISNGEQPKTEVLFPSSLDEPSHIKEEFSGDKIATVNCTKKTLRDDQKSKK 254 Query: 715 -------------DGAKRRNCDAVVSRDHNEDVVQMKH-------------ASGGNIKVS 816 +G K + A +D +H SG +I+ Sbjct: 255 
MASGFKKGTEVFVEGHKSSSSAATFLKDDKSGGSLDRHDSEEQPKDRVKGKVSGSSIRKF 314 Query: 817 SADNSRSDLGIGSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQE 996 S D + D + K+ LK K + A D + DA S+ + KK + + Sbjct: 315 SPDAPKLDSNYTGGKKAKQLLKTKSNFKATDDVQ-DAVTNSKGET---TGKKKRGEPGIG 370 Query: 997 KQASRTNEASDPPKMDDMEDSR 1062 K T+E P K D + Sbjct: 371 KSKLGTDEILHPAKKSKFVDMK 392 >ref|XP_007034332.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 4 [Theobroma cacao] gi|508713361|gb|EOY05258.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 4 [Theobroma cacao] Length = 1333 Score = 197 bits (501), Expect = 6e-48 Identities = 138/382 (36%), Positives = 194/382 (50%), Gaps = 47/382 (12%) Frame = +1 Query: 58 LSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAFTSE 237 LSLGDLVLAKVKGFP WPAKISRPEDW+ PDPKKYFVQFFGT+EIAFVAP DIQAFTSE Sbjct: 17 LSLGDLVLAKVKGFPPWPAKISRPEDWEREPDPKKYFVQFFGTQEIAFVAPGDIQAFTSE 76 Query: 238 SKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASE------THS 399 +K+KL+A+CQ +T K+F QAVKEIC F+EL + SG+RD ++ E T Sbjct: 77 TKSKLSAKCQVRT-KHFVQAVKEICVAFDELHEEKWSGLRDETDRSTPGCEASSVDGTED 135 Query: 400 EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRCSS 579 + +EV + NG + P + ++G DL S L+ C GE++ +++KP S + CS Sbjct: 136 DGAEVDLKNGTGAVAPGRETTSEGKGDLASNLERC-SCRGEINSEDIKPSISGHADDCSF 194 Query: 580 PHVSSRERNK---------------SCTESTNLGKETNGDQSASTNGRQPKLATEIKRKH 714 +SS ++K S E +++ +E +GD+ A+ N + L + K K Sbjct: 195 LIMSSEVKHKISNGEQPKTEVLFPSSLDEPSHIKEEFSGDKIATVNCTKKTLRDDQKSKK 254 Query: 715 -------------DGAKRRNCDAVVSRDHNEDVVQMKH-------------ASGGNIKVS 816 +G K + A +D +H SG +I+ Sbjct: 255 MASGFKKGTEVFVEGHKSSSSAATFLKDDKSGGSLDRHDSEEQPKDRVKGKVSGSSIRKF 314 Query: 817 SADNSRSDLGIGSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQE 996 S D + D + K+ LK K + A D + DA S+ + KK + + Sbjct: 315 SPDAPKLDSNYTGGKKAKQLLKTKSNFKATDDVQ-DAVTNSKGET---TGKKKRGEPGIG 370 Query: 997 KQASRTNEASDPPKMDDMEDSR 1062 K T+E P K D + Sbjct: 371 KSKLGTDEILHPAKKSKFVDMK 392 >ref|XP_007034330.1| Tudor/PWWP/MBT 
domain-containing protein, putative isoform 2 [Theobroma cacao] gi|590656652|ref|XP_007034331.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 2 [Theobroma cacao] gi|590656659|ref|XP_007034333.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 2 [Theobroma cacao] gi|508713359|gb|EOY05256.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 2 [Theobroma cacao] gi|508713360|gb|EOY05257.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 2 [Theobroma cacao] gi|508713362|gb|EOY05259.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 2 [Theobroma cacao] Length = 1452 Score = 197 bits (501), Expect = 6e-48 Identities = 138/382 (36%), Positives = 194/382 (50%), Gaps = 47/382 (12%) Frame = +1 Query: 58 LSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAFTSE 237 LSLGDLVLAKVKGFP WPAKISRPEDW+ PDPKKYFVQFFGT+EIAFVAP DIQAFTSE Sbjct: 17 LSLGDLVLAKVKGFPPWPAKISRPEDWEREPDPKKYFVQFFGTQEIAFVAPGDIQAFTSE 76 Query: 238 SKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASE------THS 399 +K+KL+A+CQ +T K+F QAVKEIC F+EL + SG+RD ++ E T Sbjct: 77 TKSKLSAKCQVRT-KHFVQAVKEICVAFDELHEEKWSGLRDETDRSTPGCEASSVDGTED 135 Query: 400 EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRCSS 579 + +EV + NG + P + ++G DL S L+ C GE++ +++KP S + CS Sbjct: 136 DGAEVDLKNGTGAVAPGRETTSEGKGDLASNLERC-SCRGEINSEDIKPSISGHADDCSF 194 Query: 580 PHVSSRERNK---------------SCTESTNLGKETNGDQSASTNGRQPKLATEIKRKH 714 +SS ++K S E +++ +E +GD+ A+ N + L + K K Sbjct: 195 LIMSSEVKHKISNGEQPKTEVLFPSSLDEPSHIKEEFSGDKIATVNCTKKTLRDDQKSKK 254 Query: 715 -------------DGAKRRNCDAVVSRDHNEDVVQMKH-------------ASGGNIKVS 816 +G K + A +D +H SG +I+ Sbjct: 255 MASGFKKGTEVFVEGHKSSSSAATFLKDDKSGGSLDRHDSEEQPKDRVKGKVSGSSIRKF 314 Query: 817 SADNSRSDLGIGSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQE 996 S D + D + K+ LK K + A D + DA S+ + KK + + Sbjct: 315 SPDAPKLDSNYTGGKKAKQLLKTKSNFKATDDVQ-DAVTNSKGET---TGKKKRGEPGIG 370 Query: 997 KQASRTNEASDPPKMDDMEDSR 1062 K T+E P K D + Sbjct: 371 KSKLGTDEILHPAKKSKFVDMK 
392 >ref|XP_007034329.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 1 [Theobroma cacao] gi|508713358|gb|EOY05255.1| Tudor/PWWP/MBT domain-containing protein, putative isoform 1 [Theobroma cacao] Length = 1415 Score = 197 bits (501), Expect = 6e-48 Identities = 138/382 (36%), Positives = 194/382 (50%), Gaps = 47/382 (12%) Frame = +1 Query: 58 LSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAFTSE 237 LSLGDLVLAKVKGFP WPAKISRPEDW+ PDPKKYFVQFFGT+EIAFVAP DIQAFTSE Sbjct: 17 LSLGDLVLAKVKGFPPWPAKISRPEDWEREPDPKKYFVQFFGTQEIAFVAPGDIQAFTSE 76 Query: 238 SKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASE------THS 399 +K+KL+A+CQ +T K+F QAVKEIC F+EL + SG+RD ++ E T Sbjct: 77 TKSKLSAKCQVRT-KHFVQAVKEICVAFDELHEEKWSGLRDETDRSTPGCEASSVDGTED 135 Query: 400 EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRCSS 579 + +EV + NG + P + ++G DL S L+ C GE++ +++KP S + CS Sbjct: 136 DGAEVDLKNGTGAVAPGRETTSEGKGDLASNLERC-SCRGEINSEDIKPSISGHADDCSF 194 Query: 580 PHVSSRERNK---------------SCTESTNLGKETNGDQSASTNGRQPKLATEIKRKH 714 +SS ++K S E +++ +E +GD+ A+ N + L + K K Sbjct: 195 LIMSSEVKHKISNGEQPKTEVLFPSSLDEPSHIKEEFSGDKIATVNCTKKTLRDDQKSKK 254 Query: 715 -------------DGAKRRNCDAVVSRDHNEDVVQMKH-------------ASGGNIKVS 816 +G K + A +D +H SG +I+ Sbjct: 255 MASGFKKGTEVFVEGHKSSSSAATFLKDDKSGGSLDRHDSEEQPKDRVKGKVSGSSIRKF 314 Query: 817 SADNSRSDLGIGSERIGKKWLKGKKHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQE 996 S D + D + K+ LK K + A D + DA S+ + KK + + Sbjct: 315 SPDAPKLDSNYTGGKKAKQLLKTKSNFKATDDVQ-DAVTNSKGET---TGKKKRGEPGIG 370 Query: 997 KQASRTNEASDPPKMDDMEDSR 1062 K T+E P K D + Sbjct: 371 KSKLGTDEILHPAKKSKFVDMK 392 >ref|XP_002520919.1| conserved hypothetical protein [Ricinus communis] gi|223539885|gb|EEF41464.1| conserved hypothetical protein [Ricinus communis] Length = 1425 Score = 196 bits (499), Expect = 1e-47 Identities = 135/322 (41%), Positives = 175/322 (54%), Gaps = 41/322 (12%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 KS+L 
LGDLVLAKVKGFPAWPAKISRPEDW+ APDPKKYFVQFFGTEEIAFVAPADIQ F Sbjct: 15 KSQLKLGDLVLAKVKGFPAWPAKISRPEDWERAPDPKKYFVQFFGTEEIAFVAPADIQVF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHS--- 399 T E NKL+ARCQGKT KYFAQAVKEIC F+E+ ++ SG L E S Sbjct: 75 TRELMNKLSARCQGKT-KYFAQAVKEICTAFQEIDKEKSSGA--------LGCEAPSVDG 125 Query: 400 -EASEVSVMNGRDSEEPNCKLET-KGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRC 573 E E+ V + K ET D S+L HC G+ + ++VKP S D+ Sbjct: 126 IEEDEIEVEVNDEMGTGGPKGETWNEEGDSSSKLKHCSHRQGQTEREDVKPTLSCDVKDN 185 Query: 574 SSPHVSSRERNK-------------SCTESTNLGK-ETNGD--------------QSAST 669 SSP +SS ++ K SC + K E +GD ++ ST Sbjct: 186 SSPVMSSEKKVKISSPQQQMVVSSTSCLGDPSYVKDEVSGDVNVDVDCTNNPRNGETTST 245 Query: 670 NGRQPK-LATEIKRK-------HDGAKRRNCDAVVSRDHNEDVVQMKHASGGNIKVSSAD 825 NG + + + E KR+ H+ ++ + + +D V K +SGG + S + Sbjct: 246 NGHKSRTIVIESKREPESSADVHNSSRTNGSLVPDNSEPLKDGVNEKDSSGGTMSKFSLN 305 Query: 826 NSRSDLGIGSERIGKKWLKGKK 891 +SD G + + K+ L K+ Sbjct: 306 AVKSDSGTRTGKKSKELLVAKR 327 >ref|XP_002872040.1| enhancer of ag-4 2 [Arabidopsis lyrata subsp. lyrata] gi|297317877|gb|EFH48299.1| enhancer of ag-4 2 [Arabidopsis lyrata subsp. 
lyrata] Length = 1398 Score = 195 bits (495), Expect = 3e-47 Identities = 140/357 (39%), Positives = 193/357 (54%), Gaps = 19/357 (5%) Frame = +1 Query: 46 TKSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQA 225 TK +L LGDLVLAKVKGFPAWPAKISRPEDW APDPKKYFVQFFGTEEIAFVAP DIQA Sbjct: 14 TKGQLILGDLVLAKVKGFPAWPAKISRPEDWDRAPDPKKYFVQFFGTEEIAFVAPPDIQA 73 Query: 226 FTSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQ--RKNLSGIRDYSNAENLASETHS 399 FTSE+K+KL ARCQGKTVKYFAQAV++IC FEELQ + N+SG D +A Sbjct: 74 FTSEAKSKLLARCQGKTVKYFAQAVEQICTAFEELQNHKSNVSGDEDPLDA------AEP 127 Query: 400 EASEVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRC-S 576 ++ +++G D + +E+ G ++ DSR+D C +N+ + ++ + S Sbjct: 128 GLTKAEIVDGTD----HIVIESDGTNNFDSRVDPCFP-------KNIGEETKAEIGKLDS 176 Query: 577 SPHVSSR-ERNKSCTESTNLG----KETNGDQSASTNGR---------QPKLAT--EIKR 708 SP + S+ S +ES G + GD T+G Q KL+ IK+ Sbjct: 177 SPFLESKITTTFSGSESLEHGSYDPRLKEGDFDKGTDGSACIEHFGNGQKKLSNGKRIKK 236 Query: 709 KHDGAKRRNCDAVVSRDHNEDVVQMKHASGGNIKVSSADNSRSDLGIGSERIGKKWLKGK 888 + G+ + D V RD + + H GG ++D S+ G+ +E+ K GK Sbjct: 237 EAGGSDIKGED-TVHRDRSNN----SHVPGGRTASGNSD-SKKLKGLLTEKSSSKVSAGK 290 Query: 889 KHSAAVYDGRGDAEVFSEDNSEVICRKKMKFQHDQEKQASRTNEASDPPKMDDMEDS 1059 ++ + G + KK + + + K A R +E+S K E + Sbjct: 291 HENSPGFKGG-------------VSGKKRRLESELGKVAPRVDESSRAAKKPRCESA 334 >ref|XP_002310078.2| hypothetical protein POPTR_0007s07750g [Populus trichocarpa] gi|550334362|gb|EEE90528.2| hypothetical protein POPTR_0007s07750g [Populus trichocarpa] Length = 1482 Score = 194 bits (492), Expect = 7e-47 Identities = 140/368 (38%), Positives = 191/368 (51%), Gaps = 32/368 (8%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 K +L LGDLVLAKVKG+P+WPAKISRPEDWK D KK FV FFGT+EIAFVAP+DIQ F Sbjct: 13 KLQLRLGDLVLAKVKGYPSWPAKISRPEDWKRVADAKKVFVYFFGTQEIAFVAPSDIQVF 72 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHSEAS 408 T+E KNKL+ARCQ K ++F+QAVKEIC FEELQ+ SG+ D ++ L SE S S Sbjct: 73 
TNEVKNKLSARCQSKKDRFFSQAVKEICAAFEELQKGKSSGLGDNTDRSALGSEGQSVDS 132 Query: 409 EVSVMNGRDSEEPNCKLETKGL-----SDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRC 573 G D E K+ G+ + S+L+HC GE + +KP S D + Sbjct: 133 MEEDGAGDDLNEGMGKVGQSGVMWDSGREFSSKLEHCSSRRGEAGSEGMKPSVSCDTDDS 192 Query: 574 SSPHVSSRERNK-------------SCTESTNLGKE---TNGDQSAS-----TNGRQPKL 690 SSP +SS + K S ++ + K+ NG+ + NG + + Sbjct: 193 SSPGISSENKVKTFDGEQPQEVLSASSLDNVSFVKDEASCNGNLDVNCMNNLCNGEEART 252 Query: 691 -ATEIKRKHDGAKRR-NCDAVVSRDHNEDVVQMKHASGGNIKVSSADNSRSDLGIGSERI 864 E K GA R+ CD SR+ + + KHAS G I+ S +SD G R Sbjct: 253 NPHESKTVVSGADRKLECD---SREQVKGGEKGKHAS-GRIRDSPPGPPKSDSGANGGR- 307 Query: 865 GKKWLKGKKHSAAVYDGRGDA-EVFSEDNSEVICRKKMKFQHDQEKQASRTNEASDPP-- 1035 A + + + D VF++ + + +KK + + + K T E ++P Sbjct: 308 ----------KAELSEAKKDTIMVFNDIHENKVFQKKRRARPEHGKSELETTETTNPAKK 357 Query: 1036 -KMDDMED 1056 K DMED Sbjct: 358 LKRVDMED 365 >ref|NP_197706.1| ENHANCER OF AG-4 protein 2 [Arabidopsis thaliana] gi|75215223|sp|Q9XER9.1|HUA2_ARATH RecName: Full=ENHANCER OF AG-4 protein 2; AltName: Full=Protein AERIAL ROSETTE 1 gi|4868120|gb|AAD31171.1| putative transcription factor [Arabidopsis thaliana] gi|10177804|dbj|BAB11170.1| transcription factor-like protein [Arabidopsis thaliana] gi|225898925|dbj|BAH30593.1| hypothetical protein [Arabidopsis thaliana] gi|332005744|gb|AED93127.1| ENHANCER OF AG-4 protein 2 [Arabidopsis thaliana] Length = 1392 Score = 194 bits (492), Expect = 7e-47 Identities = 137/351 (39%), Positives = 186/351 (52%), Gaps = 14/351 (3%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 K +L LGDLVLAKVKGFPAWPAKISRPEDW APDPKKYFVQFFGTEEIAFVAP DIQAF Sbjct: 15 KGQLVLGDLVLAKVKGFPAWPAKISRPEDWDRAPDPKKYFVQFFGTEEIAFVAPPDIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHSEAS 408 TSE+K+KL ARCQGKTVKYFAQAV++IC FE LQ + + D E+ T + Sbjct: 75 TSEAKSKLLARCQGKTVKYFAQAVEQICTAFEGLQNHKSNALGD----EDSLDATEPGLT 130 Query: 409 EVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVK------PCSS--DDM 
564 + +++G D + +E++ + + R+D C L E +G+ K SS + Sbjct: 131 KAEIVDGTD----HIVIESERTDNFNFRVDPCFPKLDENNGEERKAEIRKLDSSSFLESK 186 Query: 565 NRCSSPHVSSRERN----KSCTESTNLGKETNGDQSASTNGRQPKLAT--EIKRKHDGAK 726 + +SP S E + K E + G + + NG Q KLA IK++ G+ Sbjct: 187 VKTTSPVSESLEHSSFDPKIKKEDFDKGTDGSACNEHFGNG-QKKLANGKRIKKEAGGSD 245 Query: 727 RRNCDAVVSRDHNEDVVQMKHASGGNIKVSSADNSRSDLGIGSERIGKKWLKGKKHSAAV 906 R+ D V + D H GG ++D+ +S G+ +E+ K + KH Sbjct: 246 RKGEDTV-----HRDKSNNSHVPGGRTASGNSDSKKSK-GLLTEKTSSK-VSADKHEN-- 296 Query: 907 YDGRGDAEVFSEDNSEVICRKKMKFQHDQEKQASRTNEASDPPKMDDMEDS 1059 S + KK + + +Q K A R +E+S K E + Sbjct: 297 ----------SPGIKVGVSGKKRRLESEQGKLAPRVDESSRAAKKPRCESA 337 >emb|CBI27142.3| unnamed protein product [Vitis vinifera] Length = 1240 Score = 194 bits (492), Expect = 7e-47 Identities = 130/293 (44%), Positives = 157/293 (53%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 KSEL LGDLVLAKVKGFPAWPAKI +PEDW PDPKKYFVQFFGTEEIAFVAP DI+AF Sbjct: 15 KSELRLGDLVLAKVKGFPAWPAKIGKPEDWDRTPDPKKYFVQFFGTEEIAFVAPGDIEAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHSEAS 408 TSE KNKL+ARC+GKTVK+FAQAVKEIC +EELQ+KN S + + + + SE ++AS Sbjct: 75 TSEVKNKLSARCRGKTVKFFAQAVKEICDAYEELQQKNTSAHANDNLSPAIFSEKKNKAS 134 Query: 409 EVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKPCSSDDMNRCSSPHV 588 NG + + ET+ S D EI Sbjct: 135 -----NGARTPK-----ETESTSSPDKPFYVKEEIPN----------------------- 161 Query: 589 SSRERNKSCTESTNLGKETNGDQSASTNGRQPKLATEIKRKHDGAKRRNCDAVVSRDHNE 768 +S E + CT T + G S HD N + S ++ Sbjct: 162 NSNEEDIICTGRTQVATPMKGSNSC----------------HD-----NVEGGSSSCWDD 200 Query: 769 DVVQMKHASGGNIKVSSADNSRSDLGIGSERIGKKWLKGKKHSAAVYDGRGDA 927 D Q K ASGG++K SS D +SD I S GK+ LK KK D + DA Sbjct: 201 DGTQSKIASGGSMKESSPDTLKSDSDITS---GKRALKAKKQLKVTVDRQKDA 250 >ref|XP_006394583.1| hypothetical protein EUTSA_v10003519mg [Eutrema salsugineum] gi|557091222|gb|ESQ31869.1| hypothetical protein EUTSA_v10003519mg [Eutrema salsugineum] Length = 1406 Score = 193 bits 
(490), Expect = 1e-46 Identities = 124/300 (41%), Positives = 165/300 (55%), Gaps = 11/300 (3%) Frame = +1 Query: 49 KSELSLGDLVLAKVKGFPAWPAKISRPEDWKHAPDPKKYFVQFFGTEEIAFVAPADIQAF 228 K +L LGDLVLAKVKGFPAWPAK+SRPEDW APDPKKYFVQFFGT+EIAFVAP DIQAF Sbjct: 15 KGQLILGDLVLAKVKGFPAWPAKVSRPEDWDRAPDPKKYFVQFFGTQEIAFVAPPDIQAF 74 Query: 229 TSESKNKLTARCQGKTVKYFAQAVKEICKEFEELQRKNLSGIRDYSNAENLASETHSEAS 408 TSE+K+KL ARCQGKTVK+FAQAV EIC FEEL+ +G+ +E+ + + Sbjct: 75 TSEAKSKLLARCQGKTVKFFAQAVTEICTAFEELKNHKSNGL----GSEDPMDAAEPDLT 130 Query: 409 EVSVMNGRDSEEPNCKLETKGLSDLDSRLDHCLEILGEMDGQNVKP-----CSSDDMNRC 573 + +++G D + E+ G + DSR D C L + G+ K SS + Sbjct: 131 KAEIVDGTD----HIFTESDGTGNFDSRTDPCFPKLDKSSGEETKAEIGTRDSSSFLGSK 186 Query: 574 SSPHVSSRERNKSC---TESTNLGKETNGDQSASTNGR-QPKLATEIKR--KHDGAKRRN 735 + S + SC + + K T+GD G Q LA KR K G+ R Sbjct: 187 ITSSGSESLESGSCDPKIKEKDFAKGTDGDGCIEHFGNGQKNLAANGKRIKKLAGSTDRK 246 Query: 736 CDAVVSRDHNEDVVQMKHASGGNIKVSSADNSRSDLGIGSERIGKKWLKGKKHSAAVYDG 915 + V RD + H G + +D+ +S G+ +E+ K GK ++ + G Sbjct: 247 GEDPVHRDKSTS----SHVPDGRVASGKSDSKKSK-GLLTEKSSSKVSGGKHENSPGFKG 301