BLASTX nr result
ID: Achyranthes22_contig00016389
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00016389
         (1926 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268...   404   e-110
emb|CBI27315.3| unnamed protein product [Vitis vinifera]              402   e-109
ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628...   374   e-100
gb|EMJ28449.1| hypothetical protein PRUPE_ppa017973mg [Prunus pe...   351   6e-94
ref|XP_002312290.1| hypothetical protein POPTR_0008s09730g [Popu...   350   1e-93
gb|EXB44873.1| hypothetical protein L484_026455 [Morus notabilis]     347   8e-93
ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, part...   344   7e-92
ref|XP_002512056.1| conserved hypothetical protein [Ricinus comm...   342   3e-91
gb|EOY01639.1| Histone-lysine N-methyltransferase ATX1, putative...   337   8e-90
gb|EOY01640.1| Histone-lysine N-methyltransferase ATX1, putative...   335   5e-89
ref|XP_006599179.1| PREDICTED: uncharacterized protein LOC100802...   322   3e-85
ref|XP_006599178.1| PREDICTED: uncharacterized protein LOC100802...   322   4e-85
gb|ESW28388.1| hypothetical protein PHAVU_003G282800g [Phaseolus...   322   5e-85
ref|XP_004509752.1| PREDICTED: uncharacterized protein LOC101515...   320   1e-84
ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305...   317   9e-84
ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp....   310   1e-81
ref|XP_006415912.1| hypothetical protein EUTSA_v10006590mg [Eutr...   309   3e-81
gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thal...   306   2e-80
gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis ...
306 2e-80 ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana] g... 306 2e-80 >ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268093 [Vitis vinifera] Length = 1242 Score = 404 bits (1037), Expect = e-110 Identities = 207/479 (43%), Positives = 294/479 (61%), Gaps = 4/479 (0%) Frame = +2 Query: 86 VHVRHSDGYIEKDMTGNQNSDLMYQSVDQGRDS----SRLHIREEKTATSSSCEEKQELN 253 + V YI+K + QN + +SV +G S + + E + T+ K E+ Sbjct: 760 LRVESCQAYIDKKLVEQQNLVKLNRSVQKGGTSFGENNMSNAEEVQAGTNLKAHIKMEVK 819 Query: 254 DEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQG 433 +++G + +GCY HPM + SV+L EI+ICV CG L D+ LF+Y +T KE Sbjct: 820 HDLVGNTELVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDTILFIYKVTIKEPRLQ 879 Query: 434 NPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCP 613 +P VG+T + LP+LKD G EVA+D+ LQ TPDG+ LVL++SI+ PYCRE+ + CLC Sbjct: 880 SPTFVGYTPIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSIKTPYCREQKIPCLCS 939 Query: 614 QCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMN 793 C+ CFE+NA+KIV +KLG++S+V KL T V C+LVCEP HL+A++ESGR+++W+MN Sbjct: 940 ACKLECFEENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHLVAVEESGRLHVWVMN 999 Query: 794 STWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXX 973 STWSVQTE ++IP+Y+ + IV+LKRIPK A +VVGH+G+GEFSLWDI +R Sbjct: 1000 STWSVQTEDFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFSLWDISQRILISRFAM 1059 Query: 974 XXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDV 1153 +F+P+SLFS+ S+ + + K+ AT WFS+H+E Y P+ GE + Sbjct: 1060 PSISIFEFIPISLFSFQSEVPLSSNPDVDLHINKIMAATKMWFSKHNENYTFLPLGGESI 1119 Query: 1154 AVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXX 1333 AVWL + T SDS Q ++ G W L L++K++VILG+ALD Sbjct: 1120 AVWLLVSTLSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGSALDPRAAAIGASAGH 1179 Query: 1334 XXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVYLH 1510 VY+W+L+ G+K +LH +G + IATDD S V A+A DGGQL VYLH Sbjct: 1180 GIIGTHDGLVYMWELSTGTKLGSLHYFKG-GVSCIATDDSRSDVFAVAGDGGQLLVYLH 1237 >emb|CBI27315.3| unnamed protein product [Vitis vinifera] Length = 1177 Score = 
402 bits (1034), Expect = e-109 Identities = 206/471 (43%), Positives = 292/471 (61%), Gaps = 4/471 (0%) Frame = +2 Query: 110 YIEKDMTGNQNSDLMYQSVDQGRDS----SRLHIREEKTATSSSCEEKQELNDEIIGTMK 277 YI+K + QN + +SV +G S + + E + T+ K E+ +++G + Sbjct: 703 YIDKKLVEQQNLVKLNRSVQKGGTSFGENNMSNAEEVQAGTNLKAHIKMEVKHDLVGNTE 762 Query: 278 FIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHT 457 +GCY HPM + SV+L EI+ICV CG L D+ LF+Y +T KE +P VG+T Sbjct: 763 LVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDTILFIYKVTIKEPRLQSPTFVGYT 822 Query: 458 SMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFE 637 + LP+LKD G EVA+D+ LQ TPDG+ LVL++SI+ PYCRE+ + CLC C+ CFE Sbjct: 823 PIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSIKTPYCREQKIPCLCSACKLECFE 882 Query: 638 KNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTE 817 +NA+KIV +KLG++S+V KL T V C+LVCEP HL+A++ESGR+++W+MNSTWSVQTE Sbjct: 883 ENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHLVAVEESGRLHVWVMNSTWSVQTE 942 Query: 818 LYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDF 997 ++IP+Y+ + IV+LKRIPK A +VVGH+G+GEFSLWDI +R +F Sbjct: 943 DFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFSLWDISQRILISRFAMPSISIFEF 1002 Query: 998 LPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRT 1177 +P+SLFS+ S+ + + K+ AT WFS+H+E Y P+ GE +AVWL + T Sbjct: 1003 IPISLFSFQSEVPLSSNPDVDLHINKIMAATKMWFSKHNENYTFLPLGGESIAVWLLVST 1062 Query: 1178 SSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXX 1357 SDS Q ++ G W L L++K++VILG+ALD Sbjct: 1063 LSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGSALDPRAAAIGASAGHGIIGTHDG 1122 Query: 1358 QVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVYLH 1510 VY+W+L+ G+K +LH +G + IATDD S V A+A DGGQL VYLH Sbjct: 1123 LVYMWELSTGTKLGSLHYFKG-GVSCIATDDSRSDVFAVAGDGGQLLVYLH 1172 >ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628159 [Citrus sinensis] Length = 1252 Score = 374 bits (959), Expect = e-100 Identities = 200/461 (43%), Positives = 270/461 (58%), Gaps = 4/461 (0%) Frame = +2 Query: 140 NSDLMYQSVD-QGRDSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISS 
316 NS ++ Q + G + + + +E + ++ ++ E +E+ GT +GCY P+ I S Sbjct: 788 NSSVVSQKQEISGCEYTSSNAKESQVSSDLKLQKNVECINELAGTFDLMGCYFFPLPILS 847 Query: 317 VMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGR 496 V+L +IY+CVSCG L D+KR LF+YT+ +E GNP VGHTS+ LP LKD FGR Sbjct: 848 VLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGNPSCVGHTSVMLPFLKDNFGR 907 Query: 497 EVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGY 676 E+A+++S TPDG+ LVL+DS++ PYCRE CLC C S ++NAVKIV VK GY Sbjct: 908 EIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCSTCTSHRLDENAVKIVKVKPGY 967 Query: 677 VSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSR 856 VS+V KL T V C+LVCEP+HLIA+ ESG++++W MNS+WS Q E +IP + + Sbjct: 968 VSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNSSWSAQVEECIIPINDCIYPC 1027 Query: 857 IVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGL 1036 IV++KRIPK A +VVGHNG+GEF +WDI KR F P++LFSW G Sbjct: 1028 IVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAARASIYQFFPINLFSWQRNG- 1086 Query: 1037 TKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSS 1216 V + + AT S FS+HSE CP GED A+WL + T SDS Q C S Sbjct: 1087 ---SVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLLVSTISDSDAQHNCMSR 1143 Query: 1217 NNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKR 1396 + W L L++K+ VILG+ LD VY W+L+ G+K Sbjct: 1144 DCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGTNDGLVYAWELSSGNKL 1203 Query: 1397 EALHDLEGHTILRIATDDLTSSVVAIA---RDGGQLCVYLH 1510 LH +G T+ IATDD +A+A DGGQL VYLH Sbjct: 1204 GILHHFKGGTVSCIATDDSGLQALAVAGDGPDGGQLLVYLH 1244 >gb|EMJ28449.1| hypothetical protein PRUPE_ppa017973mg [Prunus persica] Length = 1170 Score = 351 bits (901), Expect = 6e-94 Identities = 195/477 (40%), Positives = 277/477 (58%), Gaps = 3/477 (0%) Frame = +2 Query: 92 VRHSDGYIEKDMTGNQNS-DLMYQSVDQGRDSSRLHIREEKTATSSSCEEKQELNDEIIG 268 V + +++KD+ G++N + Q + + +H +S S ELN+E+ G Sbjct: 699 VSRLENHVDKDVVGHENLLEPNDTETSQKQGTGLMHDPNSVPHSSDSKPHSMELNNELTG 758 Query: 269 TMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMV 448 +++F+G Y H + SV+L EIY+CV CG L D+ LF+Y + 
+E G P V Sbjct: 759 SLEFVGRYSHQNPVLSVLLSAKGTEIYVCVLCGPLVDKDGSLFIYKVAIEEPRVGCPSFV 818 Query: 449 GHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESF 628 GHTS++LP KD FGR +A+++S+LQ TPDG+ LVL+DSI+ PYCR+ ++HCLC C S Sbjct: 819 GHTSVTLPIRKDYFGR-IALERSSLQFTPDGQYLVLLDSIKTPYCRQGSIHCLCSTCTSN 877 Query: 629 CFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSV 808 C E+N VKIV V+LGYVS V L + C+LVCEP +L+A+ ESGR+++W+MNSTWS Sbjct: 878 CSEENTVKIVQVRLGYVSKVASLKAVDSLECILVCEPNNLVAVGESGRLHLWVMNSTWSA 937 Query: 809 QTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXX 988 Q E +V+P+ + + IV+LKRIP +VVGHNG+GEFSLWDI K Sbjct: 938 QIENFVLPAEDCISPGIVELKRIPNCTHIVVGHNGFGEFSLWDISKCILVSRFSAASSSI 997 Query: 989 LDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATM-SWFSRHSEAYISCPVEGEDVAVWL 1165 F+PVSLF+W K E + +L AT + FS +EGED+AVWL Sbjct: 998 CQFVPVSLFTWRIKCPVSSYSDIEEHINELVAATSNNQFS----------LEGEDIAVWL 1047 Query: 1166 FIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXX 1345 + +SSDS Q S + G W L LM+K++VI G+ALD Sbjct: 1048 LVSSSSDSDAQQDYVSDDCDSNPMGRWRLALMVKNMVIFGSALDPRAAVIGASAGQGICG 1107 Query: 1346 XXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDG-GQLCVYLHT 1513 VY+W+L+ G+K A+H +G ++ IATDD S A+A G QL V+LH+ Sbjct: 1108 TCDGLVYMWELSTGNKFGAMHHFKGGSVSCIATDDSRPSPGAVAVAGDNQLLVFLHS 1164 >ref|XP_002312290.1| hypothetical protein POPTR_0008s09730g [Populus trichocarpa] gi|222852110|gb|EEE89657.1| hypothetical protein POPTR_0008s09730g [Populus trichocarpa] Length = 1312 Score = 350 bits (898), Expect = 1e-93 Identities = 176/436 (40%), Positives = 255/436 (58%) Frame = +2 Query: 197 IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376 ++E +T + + N+E+ G + +GCY HPM + S+++ EI +C CG L Sbjct: 872 VKEVQTNSDLKLHRNLKHNNELEGNFELVGCYLHPMPVLSLLVVTKGDEINVCALCGHLV 931 Query: 377 DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556 D+ R LFLY L +E+ GNP VGHTS++ P D FGRE A+++S LQLTPDG+ LVL Sbjct: 932 DKNRTLFLYKLAIEETRTGNPSFVGHTSVTFPFSTDIFGRETALERSGLQLTPDGQNLVL 991 Query: 557 
VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736 + S++ PYCRE CLC C C E++ VKIV VK GYVS++VKL+T + C+LVCE Sbjct: 992 LGSMKTPYCREGRTDCLCSTCSLNCSEQSTVKIVQVKTGYVSVLVKLSTFDSMQCILVCE 1051 Query: 737 PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916 P HLIA ESGR+++W MNS WS TE ++I + + + IV+LKR+P AS+VVG+NG+ Sbjct: 1052 PNHLIAAGESGRLHLWTMNSAWSAPTEEFIISANDCISPCIVELKRVPNCASVVVGNNGF 1111 Query: 917 GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096 GEF++WD+ +R F P+S F+W E + + +AT Sbjct: 1112 GEFTVWDVSRRMFMARVSSPSASACQFFPISSFTWQRVVHGFHYSTVEEQIDGIVDATKL 1171 Query: 1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276 WFS +SE Y P++GED+A+WL + T + Q SS+ + G W L L++K+++ Sbjct: 1172 WFSENSEYYSLPPLDGEDIAIWLLVSTIPELDTQEDYISSDCGINPVGWWRLALLVKNML 1231 Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456 ILG ALD VY+W+ G++ LH EG ++ IATD+ Sbjct: 1232 ILGKALDPRAAAIGSSSGNGIIGTFDGLVYMWEFTTGTRLGTLHHFEGESVSCIATDNSK 1291 Query: 1457 SSVVAIARDGGQLCVY 1504 V+++A D GQL VY Sbjct: 1292 PGVISVAGDKGQLLVY 1307 >gb|EXB44873.1| hypothetical protein L484_026455 [Morus notabilis] Length = 1147 Score = 347 bits (891), Expect = 8e-93 Identities = 182/427 (42%), Positives = 251/427 (58%) Frame = +2 Query: 209 KTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKR 388 +T S +EK D +G + IGCY HP+ + S+++ +I+ICV CG ++ R Sbjct: 709 ETVEMGSSDEKSHTKD--LGLGELIGCYLHPLPVLSLLVCTTGEDIHICVLCGLRVNKDR 766 Query: 389 DLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSI 568 LF+Y + T+E G P VGHTS++LPSLKD FG+E+A+++S LQ TP G+ LVL+D I Sbjct: 767 TLFIYKIATQEPRVGYPSFVGHTSVTLPSLKDYFGKEIALERSGLQYTPGGQYLVLLDCI 826 Query: 569 RMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHL 748 R PYCR+ + CLCP C S FE++AVKIV VKLGYVS+VVKL T + CVLVCEP HL Sbjct: 827 RTPYCRQGTIPCLCPACASGSFEEDAVKIVEVKLGYVSVVVKLKTLESLQCVLVCEPNHL 886 Query: 749 IALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFS 928 +A+ ESGR+++W+MN WS QTE +++P+ + + IV+LKRIPK 
+VVGHNG+GEFS Sbjct: 887 VAVGESGRLHLWVMNPAWSAQTEQFILPANDLVSPGIVELKRIPKCVRLVVGHNGFGEFS 946 Query: 929 LWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSR 1108 L +F PV+LF W KG + G + ++ AT WFS Sbjct: 947 L-------------------CEFFPVALFGWKKKGHSFGDCNVHGHVNRMMAATNMWFSE 987 Query: 1109 HSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGA 1288 + S P+ E++AVWL + SDS S + H S G W L L++K++VILG Sbjct: 988 QTND-DSLPLLEEEIAVWLLVSVPSDSDDHHDYTSGDYHTKSVGWWRLALLVKNMVILGG 1046 Query: 1289 ALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVV 1468 ALD VYIW+++ G+K LH G ++ IATDD V Sbjct: 1047 ALDPSAEAIGASAGHGIIGTCDGLVYIWEMSTGTKLGTLHHFRGSSVSCIATDDSKKGAV 1106 Query: 1469 AIARDGG 1489 AI+ G Sbjct: 1107 AISGGEG 1113 >ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, partial [Citrus clementina] gi|557540080|gb|ESR51124.1| hypothetical protein CICLE_v10033741mg, partial [Citrus clementina] Length = 1177 Score = 344 bits (883), Expect = 7e-92 Identities = 181/424 (42%), Positives = 247/424 (58%), Gaps = 1/424 (0%) Frame = +2 Query: 140 NSDLMYQSVD-QGRDSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISS 316 NS ++ Q + G + + + +E + ++ ++ E +E+ GT +GCY P+ I S Sbjct: 755 NSSVVSQKQEISGCEYTSSNAKESQVSSDLKLQKNVECINELAGTFDLMGCYFFPLPILS 814 Query: 317 VMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGR 496 V+L +IY+CVSCG L D+KR LF+YT+ +E GNP VGHTS+ LP LKD FGR Sbjct: 815 VLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGNPSCVGHTSVMLPFLKDNFGR 874 Query: 497 EVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGY 676 E+A+++S TPDG+ LVL+DS++ PYCRE CLC C S ++NAVKIV V GY Sbjct: 875 EIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCSTCTSHRLDENAVKIVKVNPGY 934 Query: 677 VSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSR 856 VS+V KL T V C+LVCEP+HLIA+ ESG++++W MNS+WS Q E +IP + + Sbjct: 935 VSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNSSWSAQVEECIIPINDCIYPC 994 Query: 857 IVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGL 1036 IV++KRIPK A +VVGHNG+GEF +WDI KR F P++LFSW G Sbjct: 995 
IVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAARASIYQFFPINLFSWQRNG- 1053 Query: 1037 TKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSS 1216 V + + AT S FS+HSE CP GED A+WL + T SDS Q C S Sbjct: 1054 ---SVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLLVSTISDSDAQHNCMSR 1110 Query: 1217 NNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKR 1396 + W L L++K+ VILG+ LD VY W+L+ G+K Sbjct: 1111 DCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGTNDGLVYAWELSSGNKL 1170 Query: 1397 EALH 1408 LH Sbjct: 1171 GILH 1174 >ref|XP_002512056.1| conserved hypothetical protein [Ricinus communis] gi|223549236|gb|EEF50725.1| conserved hypothetical protein [Ricinus communis] Length = 1246 Score = 342 bits (877), Expect = 3e-91 Identities = 182/419 (43%), Positives = 252/419 (60%), Gaps = 2/419 (0%) Frame = +2 Query: 245 ELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKES 424 EL +E+ G ++F+GCY HPM + S++++R +EIYICV CG L ++ R LFLY L + Sbjct: 751 ELTNELDGIVEFLGCYFHPMPVLSLLVRRKGNEIYICVLCGLLVEKDRTLFLYKLAIEGP 810 Query: 425 SQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHC 604 G PC +GHTS++ PS FGRE++ ++S LQLTPDG+ LVL+ S R P CRE L C Sbjct: 811 RIGCPCFIGHTSVTWPSSTGIFGREISFERSGLQLTPDGQCLVLLGSTRAPCCREGRLEC 870 Query: 605 LCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIW 784 LC C S CF N VKIV VK GYVS++VKL T + C+LVCEP+HL+A E+ R+++W Sbjct: 871 LCSACASDCFGSNGVKIVQVKAGYVSVLVKLKTNDSLQCILVCEPDHLVAAGENSRLHLW 930 Query: 785 IMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXX 964 MNS WS TE + I S ++ I++LKRIPK S+V+GH+G+GEF+LWDI KR Sbjct: 931 TMNSVWSAPTEEFTIQSNDYTSPCIMELKRIPKCTSLVIGHDGFGEFTLWDISKRIFVSK 990 Query: 965 XXXXXXXXLDFLPVSLFSWSSK--GLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPV 1138 F P+SLF W + GL+ V E + +L +AT FS HS I+ + Sbjct: 991 FSSPSNSVHQFSPISLFHWQREVHGLSYSNV--EAHVNRLMDAT-KMFSGHS---INHSL 1044 Query: 1139 EGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXX 1318 ED+A+W + T+ DS SS++ + G W L L++K+ +ILG+ALD Sbjct: 1045 
PHEDIAIWFLVSTAPDSDALHDYGSSHSQINPVGYWRLALLMKNSLILGSALDPRAAAIG 1104 Query: 1319 XXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQL 1495 VY+W+L G K LH +G + IATDD S V+AIA D G++ Sbjct: 1105 TSAGHGIIGTLDGLVYMWELLTGKKLGTLHKFKGGSASCIATDDSGSGVLAIADDKGEI 1163 >gb|EOY01639.1| Histone-lysine N-methyltransferase ATX1, putative isoform 1 [Theobroma cacao] Length = 1329 Score = 337 bits (865), Expect = 8e-90 Identities = 186/455 (40%), Positives = 258/455 (56%), Gaps = 10/455 (2%) Frame = +2 Query: 179 DSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICV 358 D++R RE + ++ + ELN ++ G + +G Y HP+ ISSV L +EI+ICV Sbjct: 873 DTNRSKAREVQGSSDVNHCRDVELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICV 932 Query: 359 SCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSA------ 520 CG L D+ R LFLY ++ +E S G P VG+TS++L + FG + + SA Sbjct: 933 LCGLLVDKDRTLFLYRVSIEEPSIGCPSFVGYTSVTLTFSEVSFGGRICCNSSAIFIIDS 992 Query: 521 ----LQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIV 688 LQ TPDG+ LVL+D I+ PYCRE + C+C C S C +N VKIV V GYVS+V Sbjct: 993 ERCGLQFTPDGQCLVLLDGIKTPYCREGIIDCICSICSSGCSNENGVKIVQVNHGYVSLV 1052 Query: 689 VKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDL 868 KL T V C+LVCE +L+A SGR+++W+MNSTWS TE +++P+ + L +V+L Sbjct: 1053 AKLETVESVQCILVCENNYLVAAGTSGRLHLWVMNSTWSAWTEEFILPAGDCLSPCVVEL 1112 Query: 869 KRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDK 1048 KRIPK A +V+GHNG GEF +WDI+KR FLP+SLFSW D Sbjct: 1113 KRIPKCARLVIGHNGIGEFVVWDILKRLILSRFSASGNPIKQFLPISLFSWQPVFSYAD- 1171 Query: 1049 VITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHL 1228 G + ++ T FS H + + P+EGED+A+WL + T SD Q SN Sbjct: 1172 --MNGRIDEIFTTTKILFSEHKDCFFP-PLEGEDIALWLLLSTVSDFEDQYERLPSNCQA 1228 Query: 1229 ISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALH 1408 W L L++KD VILG+ LD VY+W+L+ G++ LH Sbjct: 1229 NPARSWRLALLVKDRVILGSTLDPRAAAIGASFDHGIIGRDDGLVYMWELSTGTRLGVLH 1288 Query: 1409 DLEGHTILRIATDDLTSSVVAIARDGGQLCVYLHT 1513 +G ++ IATDDL VVA+A D GQL +YLH+ Sbjct: 1289 
HFKGGSVSCIATDDLRPDVVAVAADDGQLLIYLHS 1323 >gb|EOY01640.1| Histone-lysine N-methyltransferase ATX1, putative isoform 2 [Theobroma cacao] gi|508709744|gb|EOY01641.1| Histone-lysine N-methyltransferase ATX1, putative isoform 2 [Theobroma cacao] Length = 1128 Score = 335 bits (858), Expect = 5e-89 Identities = 183/445 (41%), Positives = 255/445 (57%) Frame = +2 Query: 179 DSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICV 358 D++R RE + ++ + ELN ++ G + +G Y HP+ ISSV L +EI+ICV Sbjct: 688 DTNRSKAREVQGSSDVNHCRDVELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICV 747 Query: 359 SCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPD 538 CG L D+ R LFLY ++ +E S G P VG+TS++L E+ ++ LQ TPD Sbjct: 748 LCGLLVDKDRTLFLYRVSIEEPSIGCPSFVGYTSVTLTF------SEIDSERCGLQFTPD 801 Query: 539 GRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVH 718 G+ LVL+D I+ PYCRE + C+C C S C +N VKIV V GYVS+V KL T V Sbjct: 802 GQCLVLLDGIKTPYCREGIIDCICSICSSGCSNENGVKIVQVNHGYVSLVAKLETVESVQ 861 Query: 719 CVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMV 898 C+LVCE +L+A SGR+++W+MNSTWS TE +++P+ + L +V+LKRIPK A +V Sbjct: 862 CILVCENNYLVAAGTSGRLHLWVMNSTWSAWTEEFILPAGDCLSPCVVELKRIPKCARLV 921 Query: 899 VGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKL 1078 +GHNG GEF +WDI+KR FLP+SLFSW D G + ++ Sbjct: 922 IGHNGIGEFVVWDILKRLILSRFSASGNPIKQFLPISLFSWQPVFSYAD---MNGRIDEI 978 Query: 1079 QEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGL 1258 T FS H + + P+EGED+A+WL + T SD Q SN W L L Sbjct: 979 FTTTKILFSEHKDCFFP-PLEGEDIALWLLLSTVSDFEDQYERLPSNCQANPARSWRLAL 1037 Query: 1259 MIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRI 1438 ++KD VILG+ LD VY+W+L+ G++ LH +G ++ I Sbjct: 1038 LVKDRVILGSTLDPRAAAIGASFDHGIIGRDDGLVYMWELSTGTRLGVLHHFKGGSVSCI 1097 Query: 1439 ATDDLTSSVVAIARDGGQLCVYLHT 1513 ATDDL VVA+A D GQL +YLH+ Sbjct: 1098 ATDDLRPDVVAVAADDGQLLIYLHS 1122 >ref|XP_006599179.1| PREDICTED: uncharacterized protein LOC100802319 isoform X2 [Glycine max] Length = 1115 Score = 322 bits (826), 
Expect = 3e-85 Identities = 194/550 (35%), Positives = 289/550 (52%), Gaps = 43/550 (7%) Frame = +2 Query: 2 LSQNCE--FQEKTIDFTINKKDLQIVE---GCYVHVRHSDGYI-----------EKDMTG 133 + QNC+ E +D ++ KDL I E +HV+ + ++ +D TG Sbjct: 566 MPQNCDVCIPESVLD-DMSPKDLIIYERSDDACLHVKENPAHVFLSSVQKDLPTAQDFTG 624 Query: 134 NQNSDLMYQS--------------VDQGRDSSR---LHIREEKTATSSSCE--------E 238 + + L Q+ VD SS+ L E K + + + Sbjct: 625 DDTAGLCVQTPQIRSDVLGGHSNLVDPNPTSSQNLTLFADENKCFGTKEVQLISEPMPLQ 684 Query: 239 KQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTK 418 QEL + + ++KF+G Y HPM +SS+ L EI++CV CG L + R LF Y + Sbjct: 685 NQELKNNLGSSVKFVGRYLHPMPVSSLFLSTREDEIHVCVLCGYLTGQYRTLFTYKVAIA 744 Query: 419 ESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNL 598 E + G P ++ H+S+ LP K F +E V++S +QLTP G+ +VL+ SI+ P CRE + Sbjct: 745 EPTLGCPSVMAHSSILLPDPKHNFIKETMVERSGVQLTPGGQYVVLIGSIKTPNCREGKI 804 Query: 599 HCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIY 778 C C C+S C EKNA+KIV V+ GYVS+V L T VHC+LVCEP L+++ ESG++ Sbjct: 805 DCHCSTCKSVCSEKNALKIVQVEHGYVSVVTTLETVDNVHCILVCEPNRLVSVGESGKLQ 864 Query: 779 IWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXX 958 +W+MNS WS + E ++IP+ + I++LKR+PK +VVGHN GEFSLWDI K Sbjct: 865 VWVMNSKWSEKIEYFIIPADGSVSPGIMELKRVPKCTHLVVGHNSRGEFSLWDIAKCNCV 924 Query: 959 XXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPV 1138 +F P+SLF W +KG V E KL EAT W+S + P+ Sbjct: 925 TSFSALKSPVNEFFPISLFQWQTKGSGFSNVNIEEQADKLLEATNLWYSEQRDICWFSPI 984 Query: 1139 EGEDVAVWLFIRTSS--DSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXX 1312 E EDVA+WLF+ T+S DS SS+ + + W L L++K+ +I G+ LD Sbjct: 985 E-EDVAMWLFVSTTSDLDSCHNHVSTSSSYDIHTARSWRLALLMKNSIIFGSPLDLRTSG 1043 Query: 1313 XXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQ 1492 VY+W+L++GSK + LH + + +ATDD + + +A G+ Sbjct: 1044 NGVSCGYGIISTSDGVVYMWELSKGSKLDTLHHFQDGNVTCVATDD-SRGALGVAGGRGE 1102 Query: 1493 LCVYLHT*DL 1522 L +YLH +L Sbjct: 1103 LLLYLHDPEL 1112 >ref|XP_006599178.1| PREDICTED: uncharacterized protein 
LOC100802319 isoform X1 [Glycine max] Length = 1217 Score = 322 bits (825), Expect = 4e-85 Identities = 170/431 (39%), Positives = 248/431 (57%), Gaps = 2/431 (0%) Frame = +2 Query: 236 EKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTT 415 + QEL + + ++KF+G Y HPM +SS+ L EI++CV CG L + R LF Y + Sbjct: 786 QNQELKNNLGSSVKFVGRYLHPMPVSSLFLSTREDEIHVCVLCGYLTGQYRTLFTYKVAI 845 Query: 416 KESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKN 595 E + G P ++ H+S+ LP K F +E V++S +QLTP G+ +VL+ SI+ P CRE Sbjct: 846 AEPTLGCPSVMAHSSILLPDPKHNFIKETMVERSGVQLTPGGQYVVLIGSIKTPNCREGK 905 Query: 596 LHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRI 775 + C C C+S C EKNA+KIV V+ GYVS+V L T VHC+LVCEP L+++ ESG++ Sbjct: 906 IDCHCSTCKSVCSEKNALKIVQVEHGYVSVVTTLETVDNVHCILVCEPNRLVSVGESGKL 965 Query: 776 YIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXX 955 +W+MNS WS + E ++IP+ + I++LKR+PK +VVGHN GEFSLWDI K Sbjct: 966 QVWVMNSKWSEKIEYFIIPADGSVSPGIMELKRVPKCTHLVVGHNSRGEFSLWDIAKCNC 1025 Query: 956 XXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCP 1135 +F P+SLF W +KG V E KL EAT W+S + P Sbjct: 1026 VTSFSALKSPVNEFFPISLFQWQTKGSGFSNVNIEEQADKLLEATNLWYSEQRDICWFSP 1085 Query: 1136 VEGEDVAVWLFIRTSS--DSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXX 1309 +E EDVA+WLF+ T+S DS SS+ + + W L L++K+ +I G+ LD Sbjct: 1086 IE-EDVAMWLFVSTTSDLDSCHNHVSTSSSYDIHTARSWRLALLMKNSIIFGSPLDLRTS 1144 Query: 1310 XXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGG 1489 VY+W+L++GSK + LH + + +ATDD + + +A G Sbjct: 1145 GNGVSCGYGIISTSDGVVYMWELSKGSKLDTLHHFQDGNVTCVATDD-SRGALGVAGGRG 1203 Query: 1490 QLCVYLHT*DL 1522 +L +YLH +L Sbjct: 1204 ELLLYLHDPEL 1214 >gb|ESW28388.1| hypothetical protein PHAVU_003G282800g [Phaseolus vulgaris] Length = 1211 Score = 322 bits (824), Expect = 5e-85 Identities = 174/435 (40%), Positives = 253/435 (58%), Gaps = 6/435 (1%) Frame = +2 Query: 236 EKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTT 415 + +EL + ++KF+GCY HPM +SS+ L E++ICV CG L D+ R LF Y + Sbjct: 
780 QNEELKSNLGSSVKFVGCYLHPMPVSSLFLSTKEDEVHICVLCGHLTDQYRTLFTYKVAI 839 Query: 416 KESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKN 595 E + G P ++ H+S+ LP K F +E V++S +QLTP G+ +VL+ SI+ P CRE Sbjct: 840 TEPTLGYPSVMAHSSILLPDPKHNFIKETMVERSGVQLTPGGQYIVLIGSIKAPNCREGK 899 Query: 596 LHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRI 775 + C C C S +EKNA+KIV V+ GYVS+V L T VHC+LVCEP L+++ ESG++ Sbjct: 900 IDCSCSTCTSVFYEKNALKIVQVEHGYVSVVTTLETADNVHCILVCEPNRLVSVGESGKL 959 Query: 776 YIWIMNSTWSVQTELYVIPSYEFLPS-RIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRX 952 +W+MNS WS +TE ++IP+ + S IV+LK++PKS +VVGHN YGEFSLWDI K Sbjct: 960 EVWVMNSKWSEKTEHFIIPTDDGSASPGIVELKKVPKSTHLVVGHNSYGEFSLWDIAKCN 1019 Query: 953 XXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISC 1132 +F P+SLF W +KG E KL +AT SW+S+ E Sbjct: 1020 CVARFSAIKSPINEFFPISLFQWQTKGSGFSYASMEEQADKLLKATNSWYSQQRETSWPS 1079 Query: 1133 PVEGEDVAVWLFIRTSSDSYPQSACY-----SSNNHLISNGCWWLGLMIKDVVILGAALD 1297 P+E E+VA+WLF+ T SD Q C+ SS+ + + W L LM+K+ + G+ L+ Sbjct: 1080 PLE-ENVAMWLFVSTYSD---QDCCHNPTSTSSSFDIHTARSWRLALMMKNSINFGSPLN 1135 Query: 1298 XXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIA 1477 VY+W+L++GSK LH + + +ATD+ + + +A Sbjct: 1136 LRTCGIGVSSGYGIIGTTEGVVYMWELSKGSKLYTLHQFQDGNVACVATDN-SRGALGVA 1194 Query: 1478 RDGGQLCVYLHT*DL 1522 GGQL +YLH +L Sbjct: 1195 -GGGQLLLYLHIPEL 1208 >ref|XP_004509752.1| PREDICTED: uncharacterized protein LOC101515165 [Cicer arietinum] Length = 1239 Score = 320 bits (820), Expect = 1e-84 Identities = 171/436 (39%), Positives = 242/436 (55%), Gaps = 5/436 (1%) Frame = +2 Query: 218 TSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLF 397 T +CE K LN + KF+G Y HPM +SS++++ EI+ICV CG L ++R LF Sbjct: 802 TQRNCELKNNLNSNV----KFVGRYMHPMPVSSLLIRTREDEIHICVICGLLMSQQRTLF 857 Query: 398 LYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMP 577 Y + KES+ G P ++ H+ + LP F RE V+ + ++LTPDG+ +VL+ SIR P Sbjct: 858 TYKVAIKESNFGFPSVMAHSPIILPDPNHNFIRETMVESTGVELTPDGQYIVLIGSIRTP 917 Query: 578 
YCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIAL 757 CRE + C C C S C EK+A+KIVHV+ GYVS++ L VHC+LVCEP L+++ Sbjct: 918 NCREGKIDCCCSTCTSVCSEKSALKIVHVQCGYVSLMATLEVIDDVHCILVCEPNRLVSV 977 Query: 758 DESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWD 937 ESGR+++W+MNSTWS E ++IP + IV+LK++PK A +VVG N GEFSLWD Sbjct: 978 GESGRLHVWVMNSTWSEMVEYFIIPPDGSMSPGIVELKKVPKCAHLVVGRNICGEFSLWD 1037 Query: 938 IVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSE 1117 I K +F P+SLF K + E KL EAT W S E Sbjct: 1038 ITKLNCVSSFSASKYPINEFSPISLFHLQRKDVGFSYASIEEKAEKLLEATKLWHSEQRE 1097 Query: 1118 AYISCPVEGEDVAVWLFIRTSS--DSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAA 1291 + P +DVA+W + T S D SS++ + S W L L++++ ++ G+ Sbjct: 1098 TSVFLP--SQDVAIWFLVSTPSDVDCCQNHVSTSSHHDVHSARSWRLALLVENSIVFGSP 1155 Query: 1292 LDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSS--- 1462 LD VY W+L+ GSK + LH E T+ +ATD+ S+ Sbjct: 1156 LDPRATAIGVSGGYGISSTSDGVVYTWELSRGSKVDTLHRFEDGTVTSLATDESNSNSRG 1215 Query: 1463 VVAIARDGGQLCVYLH 1510 V +A DGGQL +YLH Sbjct: 1216 AVGVAGDGGQLLLYLH 1231 >ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305752 [Fragaria vesca subsp. 
vesca] Length = 1259 Score = 317 bits (813), Expect = 9e-84 Identities = 178/472 (37%), Positives = 257/472 (54%), Gaps = 1/472 (0%) Frame = +2 Query: 92 VRHSDGYIEKDMTGNQNSDLMYQSVDQGRDSSRLHIREEKTATSSSCE-EKQELNDEIIG 268 V H + ++K + GN+N S + SS+ + K+E N+ + G Sbjct: 792 VSHLENQVDKKVVGNENLLQFIDSETSHKQGPSFSYDPNSIPFSSNTKPHKKEHNNGLAG 851 Query: 269 TMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKESSQGNPCMV 448 ++F+GCY P+ + SV+L IY+ V CG L + LF+Y + +E G+ +V Sbjct: 852 ILEFVGCYTQPVPVLSVLLSTKGRYIYVSVLCGLLVGKDVSLFIYKVAIEEPMVGHSSLV 911 Query: 449 GHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHCLCPQCESF 628 GHTS++LP L D + +A+++ LQ PDG+ LVL+D IR P+CR+ HCLC C S Sbjct: 912 GHTSLTLPDLTD-YSNGMALERFCLQFIPDGQCLVLLDKIRTPFCRQGKTHCLCTTCASS 970 Query: 629 CFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIWIMNSTWSV 808 C E++AVKIV VKLGYVS+V +L C+LVCEP +L+++ +SGR+++W+M+STWS Sbjct: 971 CSEEDAVKIVQVKLGYVSLVTRLKAAQSQRCILVCEPNNLVSVGKSGRLHLWVMDSTWSA 1030 Query: 809 QTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXX 988 Q E V+PS + + +VDLKRIP ++VGHNGYGEFSLWDI K Sbjct: 1031 QMEYIVMPSEDCISPGVVDLKRIPNCTHLIVGHNGYGEFSLWDITKCIFVSRFSAPSGSI 1090 Query: 989 LDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEGEDVAVWLF 1168 F+P+SLF+W E + ++ M+ S+ +Y EGEDVA+ L Sbjct: 1091 CQFVPISLFAWQMNFHASSHFEMEEHVNQM----MASISKTLSSY-----EGEDVAICLL 1141 Query: 1169 IRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXXXXXXXXXX 1348 + SSDS Q N H G W L LM+K++VILG ALD Sbjct: 1142 V-LSSDSDAQHDYELGNCHPNPVGRWRLALMVKNIVILGTALDSRASVIGASAGQGICGT 1200 Query: 1349 XXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVY 1504 VY W+L+ G+K +H +G ++ I+ DD S VAIA D Q+ VY Sbjct: 1201 CDGLVYTWELSSGTKLGTMHHFKGGSVSCISNDDSRSGAVAIAGD-NQVLVY 1251 >ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297339249|gb|EFH69666.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 1194 Score = 310 bits (795), Expect = 1e-81 Identities = 170/447 (38%), Positives = 245/447 (54%) Frame = +2 Query: 170 QGRDSSRLHIREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIY 349 + R ++E +++ ++N+E+ T++ +GCY HPM +SSV+LK +EIY Sbjct: 755 ENTSEKRTSVQEFPASSNLEINRDVKINNEMGKTVELLGCYFHPMPVSSVLLKSAGNEIY 814 Query: 350 ICVSCGALEDRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQL 529 ICV A EDR R LF+Y ++ K S+G P ++GHT LP + D+ G ++ S L Sbjct: 815 ICVLSFATEDRVRTLFMYKMSAKAPSKGFPSIIGHTPAILPIVDDKSGGNRTLEISNLHF 874 Query: 530 TPDGRGLVLVDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTF 709 TPDG L+L+ +I+ PYCR++ C C C S CFE+NAV+IV VK G+VS+V KL Sbjct: 875 TPDGLHLILIGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADD 934 Query: 710 PVHCVLVCEPEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSA 889 V CV+VC+P +LIA +SG + +W MNS WS TE VI + + S I++LK+IPK Sbjct: 935 SVQCVVVCDPNNLIAAVKSGNLIVWAMNSHWSGSTEESVILANPCISSCIMELKKIPKCP 994 Query: 890 SMVVGHNGYGEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCM 1069 +V+GHNG GEF++WDI KR +F+P SLF+W E + Sbjct: 995 HLVIGHNGIGEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDHV 1051 Query: 1070 RKLQEATMSWFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWW 1249 + AT WFS+ P E +D A+WL + T +S + S CW Sbjct: 1052 DMILAATKLWFSKGINNKTLVPAEVKDTAIWLLVSTDLESDAKCDRVESPAR-----CWR 1106 Query: 1250 LGLMIKDVVILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTI 1429 L L++K+ +ILG LD VY+WDL+ G+K +LHD +G + Sbjct: 1107 LALLVKNQLILGNQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGAKLGSLHDFKGQRV 1166 Query: 1430 LRIATDDLTSSVVAIARDGGQLCVYLH 1510 I+TDD S + IA + GQL VY H Sbjct: 1167 SCISTDD--SRNICIASEDGQLLVYCH 1191 >ref|XP_006415912.1| hypothetical protein EUTSA_v10006590mg [Eutrema salsugineum] gi|557093683|gb|ESQ34265.1| hypothetical protein EUTSA_v10006590mg [Eutrema salsugineum] Length = 1207 Score = 309 bits (791), Expect = 3e-81 Identities = 165/422 (39%), Positives = 240/422 (56%) Frame = +2 Query: 245 ELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALEDRKRDLFLYTLTTKES 424 ++N+E+ 
T++ +G Y HPM +S+V L+ +EIYICV A EDR LF+Y ++ K Sbjct: 786 KINNEMEKTVELLGYYFHPMPVSTVSLQYVGNEIYICVLSFATEDRVSTLFMYKISAKSP 845 Query: 425 SQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVLVDSIRMPYCREKNLHC 604 ++G P +VGHT LP + D+ GR +++S L TPDG+ L+ +I+ PYCR++ + C Sbjct: 846 TRGFPSVVGHTPAILPIVDDKSGRNRTLERSYLHFTPDGQHLIFTGNIKTPYCRQREIDC 905 Query: 605 LCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCEPEHLIALDESGRIYIW 784 LC C S FE+NAV+IV VK GYVS+V KL V CV+VC+P +LIA+ +SG + W Sbjct: 906 LCLTCTSASFEENAVRIVEVKAGYVSLVTKLQAVDSVQCVVVCDPNYLIAVVKSGNLIAW 965 Query: 785 IMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGYGEFSLWDIVKRXXXXX 964 MNS W TE +VI + + S IV+LK+IPK +++GHNG GEF++WDI KR Sbjct: 966 AMNSDWRGSTEEFVILANPCISSCIVELKKIPKCPHLIIGHNGIGEFTIWDISKRSLVSR 1025 Query: 965 XXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMSWFSRHSEAYISCPVEG 1144 +F+P SLF+W + + E + + AT WFS+ P E Sbjct: 1026 FVSPSNLIFEFIPTSLFAWHT---VHNHSTIEDHVDVILAATKLWFSKGVNNKTLVPAEV 1082 Query: 1145 EDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVVILGAALDXXXXXXXXX 1324 ED A+WL + T D P + C + CW L L++++ VILG+ LD Sbjct: 1083 EDTAIWLLVSTDPD--PDAICDRVES---PARCWRLALLVRNQVILGSQLDPRADVAGTV 1137 Query: 1325 XXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLTSSVVAIARDGGQLCVY 1504 VY+WDL+ G+K +LHD +G + I++DD S + IA + GQL VY Sbjct: 1138 SGHGVAGTLDGHVYMWDLSTGTKLGSLHDFKGQGVSCISSDD--SGNICIASEDGQLLVY 1195 Query: 1505 LH 1510 H Sbjct: 1196 CH 1197 >gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thaliana] Length = 1196 Score = 306 bits (785), Expect = 2e-80 Identities = 168/438 (38%), Positives = 244/438 (55%) Frame = +2 Query: 197 IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376 ++E +++ ++N+E+ T++ +GCY HPM +SSV+L+ +EIYI V A E Sbjct: 766 VQEFPASSNLKLNRDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATE 825 Query: 377 DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556 DR R LF+Y ++ + S+G P ++GHT LP + D+ ++ S L TPDG L+L Sbjct: 826 DRVRTLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLIL 885 Query: 557 
VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736 +I+ PYCR++ C C C S CFE+NAV+IV VK G+VS+V KL V CV+VC+ Sbjct: 886 TGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCD 945 Query: 737 PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916 P +LIA +SG + +W MNS WS TE YVI + + S I++LK+IPK +V+GHNG Sbjct: 946 PNNLIAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGI 1005 Query: 917 GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096 GEF++WDI KR +F+P SLF+W E + + AT Sbjct: 1006 GEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKL 1062 Query: 1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276 WFS+ P E +D A+WL + T DS + C + + CW L L++KD + Sbjct: 1063 WFSKGVNNKTLVPAEVKDTAIWLLVSTDLDS--DAKCDRVESPV---RCWRLALLVKDQL 1117 Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456 ILG+ LD VY+WDL+ G+K +LHD +G + I+TDD Sbjct: 1118 ILGSQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-- 1175 Query: 1457 SSVVAIARDGGQLCVYLH 1510 S + IA + GQL VY H Sbjct: 1176 SRNICIASEDGQLLVYCH 1193 >gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis thaliana] Length = 554 Score = 306 bits (785), Expect = 2e-80 Identities = 168/438 (38%), Positives = 244/438 (55%) Frame = +2 Query: 197 IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376 ++E +++ ++N+E+ T++ +GCY HPM +SSV+L+ +EIYI V A E Sbjct: 124 VQEFPASSNLKLNRDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATE 183 Query: 377 DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556 DR R LF+Y ++ + S+G P ++GHT LP + D+ ++ S L TPDG L+L Sbjct: 184 DRVRTLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLIL 243 Query: 557 VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736 +I+ PYCR++ C C C S CFE+NAV+IV VK G+VS+V KL V CV+VC+ Sbjct: 244 TGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCD 303 Query: 737 PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916 P +LIA +SG + +W MNS WS TE YVI + + S I++LK+IPK +V+GHNG 
Sbjct: 304 PNNLIAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGI 363 Query: 917 GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096 GEF++WDI KR +F+P SLF+W E + + AT Sbjct: 364 GEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKL 420 Query: 1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276 WFS+ P E +D A+WL + T DS + C + + CW L L++KD + Sbjct: 421 WFSKGVNNKTLVPAEVKDTAIWLLVSTDLDS--DAKCDRVESPV---RCWRLALLVKDQL 475 Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456 ILG+ LD VY+WDL+ G+K +LHD +G + I+TDD Sbjct: 476 ILGSQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-- 533 Query: 1457 SSVVAIARDGGQLCVYLH 1510 S + IA + GQL VY H Sbjct: 534 SRNICIASEDGQLLVYCH 551 >ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana] gi|332192557|gb|AEE30678.1| DNA binding protein [Arabidopsis thaliana] Length = 1194 Score = 306 bits (785), Expect = 2e-80 Identities = 168/438 (38%), Positives = 244/438 (55%) Frame = +2 Query: 197 IREEKTATSSSCEEKQELNDEIIGTMKFIGCYDHPMQISSVMLKRNPHEIYICVSCGALE 376 ++E +++ ++N+E+ T++ +GCY HPM +SSV+L+ +EIYI V A E Sbjct: 764 VQEFPASSNLKLNRDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATE 823 Query: 377 DRKRDLFLYTLTTKESSQGNPCMVGHTSMSLPSLKDEFGREVAVDKSALQLTPDGRGLVL 556 DR R LF+Y ++ + S+G P ++GHT LP + D+ ++ S L TPDG L+L Sbjct: 824 DRVRTLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLIL 883 Query: 557 VDSIRMPYCREKNLHCLCPQCESFCFEKNAVKIVHVKLGYVSIVVKLNTTFPVHCVLVCE 736 +I+ PYCR++ C C C S CFE+NAV+IV VK G+VS+V KL V CV+VC+ Sbjct: 884 TGNIKTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCD 943 Query: 737 PEHLIALDESGRIYIWIMNSTWSVQTELYVIPSYEFLPSRIVDLKRIPKSASMVVGHNGY 916 P +LIA +SG + +W MNS WS TE YVI + + S I++LK+IPK +V+GHNG Sbjct: 944 PNNLIAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGI 1003 Query: 917 GEFSLWDIVKRXXXXXXXXXXXXXLDFLPVSLFSWSSKGLTKDKVITEGCMRKLQEATMS 1096 GEF++WDI KR +F+P SLF+W E + + AT Sbjct: 1004 GEFTIWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKL 1060 Query: 
1097 WFSRHSEAYISCPVEGEDVAVWLFIRTSSDSYPQSACYSSNNHLISNGCWWLGLMIKDVV 1276 WFS+ P E +D A+WL + T DS + C + + CW L L++KD + Sbjct: 1061 WFSKGVNNKTLVPAEVKDTAIWLLVSTDLDS--DAKCDRVESPV---RCWRLALLVKDQL 1115 Query: 1277 ILGAALDXXXXXXXXXXXXXXXXXXXXQVYIWDLAEGSKREALHDLEGHTILRIATDDLT 1456 ILG+ LD VY+WDL+ G+K +LHD +G + I+TDD Sbjct: 1116 ILGSQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-- 1173 Query: 1457 SSVVAIARDGGQLCVYLH 1510 S + IA + GQL VY H Sbjct: 1174 SRNICIASEDGQLLVYCH 1191