BLASTX nr result
ID: Mentha25_contig00008557
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00008557
         (1051 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   209   1e-51
ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   205   3e-50
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   194   5e-47
ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Popu...   189   2e-45
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   187   5e-45
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   182   2e-43
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   182   2e-43
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   180   8e-43
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   139   2e-30
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   139   3e-30
ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757...   135   4e-29
ref|XP_002466179.1| hypothetical protein SORBIDRAFT_01g003040 [S...   132   3e-28
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       124   6e-26
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         113   1e-22
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   111   4e-22
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   109   2e-21
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   108   3e-21
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   107   6e-21
ref|NP_188861.2| hAT dimerization domain-containing protein [Ara...   100   2e-18
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal...    98   7e-18

>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum lycopersicum]
          Length = 748

 Score = 209 bits (533), Expect = 1e-51
 Identities = 124/305 (40%), Positives = 163/305 (53%), Gaps = 1/305 (0%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE VA T +K DPAW HCE K+G RV+LKCIYCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MGSNLEPVAVTSQKHDPAWKHCEMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           +TC +V PD+RL M + L G LA E+ Y + I +++A
Sbjct: 61  ASTCLRVQPDVRLLMQDSLNG-VVMKKRKKQKLAEEITTY--NAIDTSDIAAE------- 110

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
           +TCG +++ ++ P+S+ S+ +LN
Sbjct: 111 ----------------------FTDTCGLNTQ-------VDLLPMSQAIEHTSSLFLN--------- 139

Query: 678 DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKV-SVVDMAIGRFFF 854
           R+ G + + P +N K+V + V MA+ RF
Sbjct: 140 -RDQGPNNRKKKSRIRKGASSS-------------NNLPIINQSKRVNNQVHMAVARFLL 185

Query: 855 DVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWG 1034
           D +P DAVNS YFQPM+D IASQG V PSY+DLRSW+LK+SV EVR D++QC+S W
Sbjct: 186 DARVPLDAVNSVYFQPMIDVIASQGPPVSAPSYHDLRSWVLKSSVQEVRTDIDQCSSTWA 245

Query: 1035 RTGCS 1049
            RTGCS
Sbjct: 246  RTGCS 250

>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score = 205 bits (521), Expect = 3e-50
 Identities = 122/304 (40%), Positives = 156/304 (51%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE V T +K DPAW HCE K+G RV+LKCIYCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MGSNLEPVPVTSQKHDPAWKHCEMFKNGERVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           +TC +V PD+RL M + L G LA E+ Y T A
Sbjct: 61  ASTCLRVQPDVRLLMQDSLNG-VVMKKRKKQKLAEEITTYNAGTATSDIAAE-------- 111

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
           +TCG D++ ++ P+ + SN +LN + G
Sbjct: 112 -----------------------FTDTCGLDTQ-------VDLLPMPQAIEHTSNLFLNRDQGPNNIG 149

Query: 678 DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
           R+ K+ + P SK+ + V MA+ RF D
Sbjct: 150 ARK----------------KKSRIRKGASSSNNNAMLLPINQSKRVNNHVHMAVARFLLD 193

Query: 858 VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
           +P DAVNS YFQPM+D IASQG V PSY++LRSW+LK SV EVR D++QC+S W R
Sbjct: 194 ARVPLDAVNSVYFQPMIDVIASQGPQVSAPSYHELRSWVLKASVQEVRNDIDQCSSTWAR 253

Query: 1038 TGCS 1049
            +GCS
Sbjct: 254  SGCS 257

>ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao]
 gi|508726178|gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao]
          Length = 749

 Score = 194 bits (493), Expect = 5e-47
 Identities = 122/304 (40%), Positives = 159/304 (52%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE + T +K DPAW HC+ ++G RV+LKCIYCGK+F+GGGI+R KEHLAGQKGN
Sbjct: 1   MASNLEPIPITSQKHDPAWKHCQMFRNGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           +TC V D+RL M E L G E+ KI E +N + ++ +
Sbjct: 61  ASTCFHVPSDVRLLMRESLDG-------------VEVKKRKKQKI--AEEMSNANQVSSE 105

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
           + YD +V NT EG + P S+ +N G + SG
Sbjct: 106 ----IDTYD---NQVDTNTGLLMIEGPDTLQP------------SSSLLVNREGTSNVSG 146

Query: 678 DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
           DR V T+ L +K+ + V +AIGRF FD
Sbjct: 147 DRRKRGKGKSSAAESNALVVNTV----------------GLGAKRVNNHVHVAIGRFLFD 190

Query: 858 VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
           +G P DAVNS YFQPM+DAI S G+GV+ PS DL+ WILK SV EV+ D ++ T+AW R
Sbjct: 191 IGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWILKKSVEEVKSDNDKVTAAWVR 250

Query: 1038 TGCS 1049
            TGCS
Sbjct: 251  TGCS 254

>ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa]
 gi|550330253|gb|EEF02443.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa]
          Length = 608

 Score = 189 bits (480), Expect = 2e-45
 Identities = 118/304 (38%), Positives = 158/304 (51%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE + T +K DPAW HC+ K+G RV+LKC+YCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MGSNLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           ATC +V D+RL M++ + +A E IT ++ G+ D
Sbjct: 61  AATCVQVPSDVRL-MMQQSLDGVVVKKRKKQKIAEE--------ITNLNPVSSEIGV-FD 110

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
           ++V+ +G E+ T D P+S + V + G+ G
Sbjct: 111 KDVN------TGMELTGVTDAID-------------PVSSLLVTGED------GMGKKGG 145

Query: 678 DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
           +R T+ G P K+K + MAIGRF +D
Sbjct: 146 ERRKRGRGRGRGSVTNAKAVVTMGS-----------GMPLSGGKRKNDHIHMAIGRFLYD 194

Query: 858 VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
           +G DAVNSAYFQ M+ AIAS G+ VV PSY+DLR W+LKNSV EV+ DV++ + W R
Sbjct: 195 IGASLDAVNSAYFQLMVQAIASGGSEVVVPSYHDLRGWVLKNSVEEVKNDVDKHIATWER 254

Query: 1038 TGCS 1049
            TGCS
Sbjct: 255  TGCS 258

>ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao]
 gi|508701288|gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao]
          Length = 750

 Score = 187 bits (476), Expect = 5e-45
 Identities = 111/304 (36%), Positives = 158/304 (51%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M+ NL ++ T++KQDPAWNHCE K+G R+++KC+YCGK+FKGGGI+RFKEHLAG+KG
Sbjct: 1   MELNLTPISITKQKQDPAWNHCEAFKNGERLQIKCMYCGKMFKGGGIHRFKEHLAGRKGQ 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           G C +V P +R M E L G + +A G S G +D
Sbjct: 61  GPICEQVPPGVRALMQESLNGVLLKQDNKQNAIPELLACGGSSPHAG----------EID 110

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
Sbjct: 111 KS------------------AYSDDVNNGVKPIQV--LNSLEP-------DSSLVLNGKG 143

Query: 678 DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
           + G+ + + L AL S + V MAIGRF +D
Sbjct: 144 EVSQGIRDSKKRGRDRSLLANSHSCAKSDL---------ALVSIGAENPVHMAIGRFLYD 194

Query: 858 VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
           +G+ DAVNS YFQPM+DAIAS G+G+V PS DLR WILKN + EV+ D+++ + WG+
Sbjct: 195 IGVNLDAVNSVYFQPMIDAIASTGSGIVPPSSQDLRGWILKNVMEEVKDDIDRNKTMWGK 254

Query: 1038 TGCS 1049
            TGCS
Sbjct: 255  TGCS 258

>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine max]
          Length = 729

 Score = 182 bits (463), Expect = 2e-43
 Identities = 112/306 (36%), Positives = 155/306 (50%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE V T +K DPAW H + K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           +TCS+V D+RL M + L G M+ + + + NN ++V+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVNPLTTVVNSLPNNNNRVVDVN 120

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
           + + + + + V N EG + N+ ++ K+ Y NS G++
Sbjct: 121 QGLQAIGVEHNSSLVVN-----PGEGMSR----NMERRKKMRATKNPAAVYANSEGVIAV 171

Query: 672 SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
           + L KK + + MAIGRF
Sbjct: 172 EKN--------------------------------------GLFPKKMDNHIYMAIGRFL 193

Query: 852 FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
           +D+G P DAVNS YFQ M+DAIAS+G G P +++LR WILKNSV EV+ D+++C W
Sbjct: 194 YDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNSVEEVKNDIDRCKMTW 253

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 254  GRTGCS 259

>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine max]
 gi|571489936|ref|XP_006591345.1| PREDICTED: uncharacterized protein LOC100817502 isoform X2 [Glycine max]
 gi|571489939|ref|XP_006591346.1| PREDICTED: uncharacterized protein LOC100817502 isoform X3 [Glycine max]
          Length = 759

 Score = 182 bits (463), Expect = 2e-43
 Identities = 112/306 (36%), Positives = 155/306 (50%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE V T +K DPAW H + K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           +TCS+V D+RL M + L G M+ + + + NN ++V+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVNPLTTVVNSLPNNNNRVVDVN 120

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
           + + + + + V N EG + N+ ++ K+ Y NS G++
Sbjct: 121 QGLQAIGVEHNSSLVVN-----PGEGMSR----NMERRKKMRATKNPAAVYANSEGVIAV 171

Query: 672 SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
           + L KK + + MAIGRF
Sbjct: 172 EKN--------------------------------------GLFPKKMDNHIYMAIGRFL 193

Query: 852 FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
           +D+G P DAVNS YFQ M+DAIAS+G G P +++LR WILKNSV EV+ D+++C W
Sbjct: 194 YDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNSVEEVKNDIDRCKMTW 253

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 254  GRTGCS 259

>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine max]
 gi|571542833|ref|XP_006601996.1| PREDICTED: uncharacterized protein LOC100806265 isoform X2 [Glycine max]
          Length = 758

 Score = 180 bits (457), Expect = 8e-43
 Identities = 111/306 (36%), Positives = 155/306 (50%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE V T +K DPAW H + K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           +TCS+V D+RL M + L G + E+ + + NN ++V+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQR-IEEEIMSVNPLTTVVNSLPNNNQVVDVN 119

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
           + + + + + V N EG + N+ ++ AK+ Y NS ++
Sbjct: 120 QGLQAIGVEHNSTLVVN-----PGEGMSR----NMERRKKMRAAKNPAAVYANSEDVVAV 170

Query: 672 SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
           + L KK + + MAIGRF
Sbjct: 171 EKN--------------------------------------GLFPKKMDNHIYMAIGRFL 192

Query: 852 FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
           +D+G P DAVN +FQ M+DAIAS+G G PS+++LR WILKNSV EV+ D+++C W
Sbjct: 193 YDIGAPFDAVNLVFFQEMVDAIASKGTGFERPSHHELRGWILKNSVEEVKNDIDRCKMTW 252

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 253  GRTGCS 258

>ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
 gi|561036894|gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score = 139 bits (350), Expect = 2e-30
 Identities = 97/307 (31%), Positives = 148/307 (48%), Gaps = 2/307 (0%)
 Frame = +3

Query: 135 EMDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKG 314
           +M SNLE V T +K DPAW H + K+G +V+LKCIYC K+FKGGGI+R KEHLA QKG
Sbjct: 113 KMGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKG 172

Query: 315 NGATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNV 494
           N +TCS+V D+RL M + L G + E+ + + NN ++V
Sbjct: 173 NASTCSRVPHDVRLHMQQSLDG-VVVKKRRKQKIEEEIMSVNPLTTVVNSLPNNNQ-VDV 230

Query: 495 DENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMD 668
           ++ + D + V N G N+ ++ +K+ Y NS G++
Sbjct: 231 NQGLQAIGVDHNSSLVVNPGEGMSK---------NMERRKKMRASKNPAAIYANSEGVVA 281

Query: 669 ASGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRF 848
           ++G+ + + L ++ + A+NS +
Sbjct: 282 V---EKNGLFPKRVDNHIHMAIGRFL--------YDIGAPFDAVNSV------------Y 318

Query: 849 FFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSA 1028
           F ++ DA++S +GAG PS+++LR WILKNSV EV+ D+++C
Sbjct: 319 FHEM---VDAISS------------RGAGFERPSHHELRGWILKNSVEEVKNDIDRCKMT 363

Query: 1029 WGRTGCS 1049
            WGRTGCS
Sbjct: 364  WGRTGCS 370

>ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
 gi|561036895|gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score = 139 bits (349), Expect = 3e-30
 Identities = 97/306 (31%), Positives = 147/306 (48%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M SNLE V T +K DPAW H + K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
           +TCS+V D+RL M + L G + E+ + + NN ++V+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDG-VVVKKRRKQKIEEEIMSVNPLTTVVNSLPNNNQ-VDVN 118

Query: 498 ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
           + + D + V N G N+ ++ +K+ Y NS G++
Sbjct: 119 QGLQAIGVDHNSSLVVNPGEGMSK---------NMERRKKMRASKNPAAIYANSEGVVAV 169

Query: 672 SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
           ++G+ + + L ++ + A+NS +F
Sbjct: 170 ---EKNGLFPKRVDNHIHMAIGRFL--------YDIGAPFDAVNSV------------YF 206

Query: 852 FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
           ++ DA++S +GAG PS+++LR WILKNSV EV+ D+++C W
Sbjct: 207 HEM---VDAISS------------RGAGFERPSHHELRGWILKNSVEEVKNDIDRCKMTW 251

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 252  GRTGCS 257

>ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757413 [Setaria italica]
          Length = 803

 Score = 135 bits (339), Expect = 4e-29
 Identities = 99/297 (33%), Positives = 134/297 (45%), Gaps = 5/297 (1%)
 Frame = +3

Query: 174 KKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGNGATCSKVHPDIR 353
           +K DPAW HC ++ RV LKC YCGK F GGGI+RFKEHLA + GN C KV D++
Sbjct: 28  QKHDPAWKHCLMVRAEGRVRLKCAYCGKHFLGGGIHRFKEHLARRPGNACCCPKVPRDVQ 87

Query: 354 LQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVDENVH-VPVYDIS 530
           M+ L A T A+ SG D +H +P+ ++
Sbjct: 88  DTMMRSLDAVAAKKMQRKLANALPPGDMRRFAPTDASPASAASGGATDSPIHMIPLNEVL 147

Query: 531 GFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASGDREDGMXXXXX 710
           FE D + PPL E + + M ++
Sbjct: 148 DFE----PVPLDEQR---------PPLP--ETMRGSVSSKKKRKMLSNASTPPLTPPTLQ 192

Query: 711 XXXXXXXVTKTLXXXXXXLNIEVPP----GYPALNSKKKVSVVDMAIGRFFFDVGLPADA 878
           T L ++ P G+ L+ K++VSV A+GRF +DVG+P +A
Sbjct: 193 QHVPSTPQTNPLHQVVMAVDAVTPSSGHFGHAGLD-KEQVSV---AVGRFLYDVGVPLEA 248

Query: 879 VNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCS 1049
           VNS YFQPML+AIAS G SY+D R ILK S+ + +E +W RTGCS
Sbjct: 249 VNSVYFQPMLEAIASAGGRPEALSYHDFRGHILKKSLDDATSRLEFFKGSWTRTGCS 305

>ref|XP_002466179.1| hypothetical protein SORBIDRAFT_01g003040 [Sorghum bicolor]
 gi|241920033|gb|EER93177.1| hypothetical protein SORBIDRAFT_01g003040 [Sorghum bicolor]
          Length = 747

 Score = 132 bits (331), Expect = 3e-28
 Identities = 96/312 (30%), Positives = 140/312 (44%), Gaps = 20/312 (6%)
 Frame = +3

Query: 174 KKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGNGATCSKVHPDIR 353
           +K DPAW HC ++ R+ LKC YCGK F GGGI+RFKEHLA + GN C KV D++
Sbjct: 37  QKHDPAWKHCVMVRSDGRLRLKCAYCGKHFLGGGIHRFKEHLARRPGNACCCPKVPRDVQ 96

Query: 354 LQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGT-------EVANNGSGLNVDENVHV 512
           ML L LAA ++ + T A+ G+G + +
Sbjct: 97  DTMLRSL--DAVAAKKMQRKLAASLSPGDMRRFAATSAPPASVSTASGGTGSPIH---MI 151

Query: 513 PVYDISGFE----------VANNTCGFDSEG---NAHVSPCNIPPLSEVEVAKSNCYLNS 653
           P+ ++ FE V + S G V+ PPL + ++
Sbjct: 152 PLNEVLDFEPVPLEEQRPLVPEGSMRGSSSGKKKRKQVTSATAPPL----IPQTR---PQ 204

Query: 654 HGIMDASGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDM 833
           H + + G+ T L ++ P Y + + V +
Sbjct: 205 HVLATPQTNLLHGLQHVPPTPH-----TNPLHQVVMAVDAVTPAEYFEHAAPSEKEQVSV 259

Query: 834 AIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVE 1013
           A+GRF +D G+P +AVNS YFQPML+AIA+ G SY+D+R +LK S+ +V +E
Sbjct: 260 AVGRFLYDAGVPLEAVNSVYFQPMLEAIAAAGGRPDVLSYHDVRGHVLKRSLDDVMSHLE 319

Query: 1014 QCTSAWGRTGCS 1049
            +W RTGCS
Sbjct: 320  FFRGSWTRTGCS 331

>gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]
          Length = 724

 Score = 124 bits (311), Expect = 6e-26
 Identities = 57/75 (76%), Positives = 65/75 (86%)
 Frame = +3

Query: 825 VDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRY 1004
           V MA+GRFF DVGLPA+A NSAYFQPM++AIASQ AGV+GPSY DLRSWILKN VHE RY
Sbjct: 175 VHMAVGRFFVDVGLPAEAANSAYFQPMVEAIASQEAGVIGPSYQDLRSWILKNLVHETRY 234

Query: 1005 DVEQCTSAWGRTGCS 1049
            DV+Q +AW RTGC+
Sbjct: 235  DVDQYANAWERTGCT 249

 Score = 108 bits (271), Expect = 3e-21
 Identities = 46/81 (56%), Positives = 61/81 (75%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M+ ++E V T +K DPAW HC+ K ++ LKCIYCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MEPHMELVPMTSQKHDPAWKHCQMFKTEEKIHLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           +TC +V P+++ QML+ L G
Sbjct: 61  ASTCLRVLPEVKQQMLDSLNG 81

>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score = 113 bits (283), Expect = 1e-22
 Identities = 51/93 (54%), Positives = 68/93 (73%)
 Frame = +3

Query: 771 IEVPPGYPALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPS 950
           I +P G L+S + + V MAIGRF +D+G +AVNSAYFQPM+++IA G G++ PS
Sbjct: 165 IVIPNGGGILDSNRDRNQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPS 224

Query: 951 YYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCS 1049
           Y+D+R WILKNSV EVR D ++C + WG TGCS
Sbjct: 225 YHDIRGWILKNSVEEVRGDFDRCKATWGMTGCS 257

 Score = 104 bits (260), Expect = 5e-20
 Identities = 47/81 (58%), Positives = 59/81 (72%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M S L+ V T +K DPAW HC+ K+G RV+LKC+YC K+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           +TC V P+++ M E L G
Sbjct: 61  ASTCHSVPPEVQNIMQESLDG 81

>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score = 111 bits (278), Expect = 4e-22
 Identities = 49/93 (52%), Positives = 68/93 (73%)
 Frame = +3

Query: 771 IEVPPGYPALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPS 950
           I +P G L+S + + V MA+GRF +D+G +AVNSAYFQPM+++IA G G++ PS
Sbjct: 165 IVIPNGGGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPS 224

Query: 951 YYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCS 1049
           Y+D+R WILKNS+ EVR D ++C + WG TGCS
Sbjct: 225 YHDIRGWILKNSMEEVRSDFDRCKATWGITGCS 257

 Score = 104 bits (260), Expect = 5e-20
 Identities = 47/81 (58%), Positives = 59/81 (72%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M S L+ V T +K DPAW HC+ K+G RV+LKC+YC K+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           +TC V P+++ M E L G
Sbjct: 61  ASTCHSVPPEVQNIMQESLDG 81

>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca subsp. vesca]
          Length = 754

 Score = 109 bits (273), Expect = 2e-21
 Identities = 53/85 (62%), Positives = 65/85 (76%)
 Frame = +3

Query: 795 ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 974
           AL S+K S V AIGRF FD+G P +AVNSAYFQPM+DAIAS G G+ P+ +DLRSWI
Sbjct: 168 ALVSRKVNSYVHEAIGRFLFDIGAPPEAVNSAYFQPMIDAIASGGPGMEPPTCHDLRSWI 227

Query: 975 LKNSVHEVRYDVEQCTSAWGRTGCS 1049
           LKNSV E R ++++ + WGRTGCS
Sbjct: 228 LKNSVEEARNNIDKHRATWGRTGCS 252

 Score = 103 bits (258), Expect = 9e-20
 Identities = 45/77 (58%), Positives = 57/77 (74%)
 Frame = +3

Query: 150 LESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGNGATC 329
           +E V T +K DPAW HC+ K G R++LKCIYC K+F+GGGI+R KEHLAGQKGN +TC
Sbjct: 1   MEPVPITSQKHDPAWKHCQMFKSGDRIQLKCIYCSKLFRGGGIHRIKEHLAGQKGNASTC 60

Query: 330 SKVHPDIRLQMLEVLIG 380
           +V PD+R M + L G
Sbjct: 61  LRVPPDVRGLMQQSLDG 77

>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
 gi|223536481|gb|EEF38128.1| DNA binding protein, putative [Ricinus communis]
          Length = 753

 Score = 108 bits (271), Expect = 3e-21
 Identities = 49/82 (59%), Positives = 63/82 (76%), Gaps = 1/82 (1%)
 Frame = +3

Query: 138 MDSN-LESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKG 314
           MDS+ LE + T +K DPAW HC+ K+G RV+LKC+YCGK+FKGGGI+R KEHLAGQKG
Sbjct: 1   MDSDDLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKG 60

Query: 315 NGATCSKVHPDIRLQMLEVLIG 380
           N +TC +V D++L M + L G
Sbjct: 61  NASTCLQVPTDVKLIMQQSLDG 82

 Score = 104 bits (260), Expect = 5e-20
 Identities = 51/85 (60%), Positives = 62/85 (72%)
 Frame = +3

Query: 795 ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 974
           AL +K+ V MAIGRF +D+G P DAVNS YFQPM+DAIAS G V PS +DLR WI
Sbjct: 178 ALGAKRVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGLDVGMPSCHDLRGWI 237

Query: 975 LKNSVHEVRYDVEQCTSAWGRTGCS 1049
           LKNSV EV+ +V++ + W RTGCS
Sbjct: 238 LKNSVEEVKTEVDKHMATWARTGCS 262

>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score = 107 bits (268), Expect = 6e-21
 Identities = 48/85 (56%), Positives = 65/85 (76%)
 Frame = +3

Query: 795 ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 974
           +L++ + + + MA+GRF +D+G P DAVNS YFQPM+DAIAS G PSY+D+R WI
Sbjct: 170 SLDATRGNNPIFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIASGGPEAAMPSYHDIRGWI 229

Query: 975 LKNSVHEVRYDVEQCTSAWGRTGCS 1049
           LKNSV EV+ DV++ T+ WG+TGCS
Sbjct: 230 LKNSVEEVKNDVDRYTTTWGKTGCS 254

 Score = 102 bits (255), Expect = 2e-19
 Identities = 46/81 (56%), Positives = 60/81 (74%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M S LE + + +K DPAW HC+ K+G RV+LKC+YC K+F+GGGI+R KEHLA QKGN
Sbjct: 1   MASGLEPIPISSQKHDPAWKHCQMFKNGDRVQLKCLYCFKLFRGGGIHRIKEHLACQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           +TCS+V D+RL M + L G
Sbjct: 61  ASTCSRVPLDVRLAMQQSLDG 81

>ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana]
 gi|79313325|ref|NP_001030742.1| hAT dimerization domain-containing protein [Arabidopsis thaliana]
 gi|11994740|dbj|BAB03069.1| transposase-like protein [Arabidopsis thaliana]
 gi|28393360|gb|AAO42104.1| unknown protein [Arabidopsis thaliana]
 gi|28827622|gb|AAO50655.1| unknown protein [Arabidopsis thaliana]
 gi|332643084|gb|AEE76605.1| hAT dimerization domain-containing protein [Arabidopsis thaliana]
 gi|332643085|gb|AEE76606.1| hAT dimerization domain-containing protein [Arabidopsis thaliana]
          Length = 761

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 45/81 (55%), Positives = 59/81 (72%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           MDS+LE VA T +KQD AW HCE K G RV+++C+YC K+FKGGGI R KEHLAG+KG
Sbjct: 1   MDSDLEPVALTPQKQDSAWKHCEVYKYGDRVQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           G C +V ++RL + + + G
Sbjct: 61  GTICDQVPDEVRLFLQQCIDG 81

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 42/82 (51%), Positives = 56/82 (68%)
 Frame = +3

Query: 804 SKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKN 983
           SK++ V MA+GRF FD+G DA NS QP +DAI S G GV P++ DLR WILK+
Sbjct: 183 SKEREKTVHMAMGRFLFDIGADFDAANSVNVQPFIDAIVSGGFGVSIPTHEDLRGWILKS 242

Query: 984 SVHEVRYDVEQCTSAWGRTGCS 1049
           V EV+ ++++C + W RTGCS
Sbjct: 243 CVEEVKKEIDECKTLWKRTGCS 264

>ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana]
 gi|240255844|ref|NP_193238.5| hAT transposon superfamily [Arabidopsis thaliana]
 gi|332658140|gb|AEE83540.1| hAT transposon superfamily [Arabidopsis thaliana]
 gi|332658141|gb|AEE83541.1| hAT transposon superfamily [Arabidopsis thaliana]
          Length = 768

 Score = 97.8 bits (242), Expect = 7e-18
 Identities = 44/81 (54%), Positives = 58/81 (71%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           MD+ LE VA T +KQD AW HCE K G R++++C+YC K+FKGGGI R KEHLAG+KG
Sbjct: 1   MDAELEPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           G C +V D+RL + + + G
Sbjct: 61  GTICDQVPEDVRLFLQQCIDG 81

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 43/81 (53%), Positives = 57/81 (70%)
 Frame = +3

Query: 807 KKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNS 986
           + + + + MAIGRF F +G DAVNS FQPM+DAIAS G GV P++ DLR WILKN
Sbjct: 188 RDRENTIHMAIGRFLFGIGADFDAVNSVNFQPMIDAIASGGFGVSAPTHDDLRGWILKNC 247

Query: 987 VHEVRYDVEQCTSAWGRTGCS 1049
           V E+ ++++C + W RTGCS
Sbjct: 248 VEEMAKEIDECKAMWKRTGCS 268
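
Each entry listed under "Sequences producing significant alignments" in this report ends with a bit score and an E-value. The following is a minimal sketch, not part of the BLAST output, of how one such entry could be split into fields. It assumes Python; the helper name `parse_hit_line` and the regular expression are illustrative assumptions about this legacy text layout and may need adjusting for other BLAST versions.

```python
import re

# Hypothetical pattern for one hit-summary entry:
#   <identifier> <description> <bit score> <E-value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>(?:\d+(?:\.\d+)?)?e[-+]?\d+|\d+(?:\.\d+)?)\s*$"
)


def parse_hit_line(line):
    """Return a dict for one hit-summary entry, or None if the text does not match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    evalue_text = match.group("evalue")
    # Assumption: an E-value written without a leading digit (for example
    # "e-100") is read as 1e-100.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "bits": float(match.group("bits")),
        "evalue": float(evalue_text),
    }


if __name__ == "__main__":
    entry = "gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea] 124 6e-26"
    print(parse_hit_line(entry))
```

Text that does not end in a score and an E-value, such as the "Searching...done" progress line, yields `None`, so the helper can be applied to every candidate entry without filtering first.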