BLASTX nr result
ID: Catharanthus23_contig00016208
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00016208
         (720 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   221   2e-55
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   212   8e-53
ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Popu...   190   3e-46
gb|EOY18075.1| HAT and BED zinc finger domain-containing protein...   189   6e-46
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   186   5e-45
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         183   4e-44
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   179   1e-42
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       178   1e-42
gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus...   174   3e-41
gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus...   174   3e-41
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   168   1e-39
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   164   3e-38
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   164   3e-38
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   164   3e-38
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   163   5e-38
gb|ESW31639.1| hypothetical protein PHAVU_002G255200g [Phaseolus...   144   2e-32
gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma c...   134   4e-29
ref|NP_188861.2| hAT dimerization domain-containing protein [Ara...   121   3e-25
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal...   119   1e-24
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   119   1e-24

>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score = 221 bits (563), Expect = 2e-55
 Identities = 101/144 (70%), Positives = 124/144 (86%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VPVTSQKHDPAWKHC+M+KNGER+QLKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MGSNLEPVPVTSQKHDPAWKHCEMFKNGERVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+CLRVQPDVRLLMQ+SLNGVVM+KRKKQKLA+EI YN G+ + + C L+T
Sbjct: 61  ASTCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAGTATSDIAAEFTDTCGLDT 120

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
           +V+LLP P A++H ++LF++ ++G
Sbjct: 121 QVDLLPMPQAIEHTSNLFLNRDQG 144

>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum lycopersicum]
          Length = 748

 Score = 212 bits (540), Expect = 8e-53
 Identities = 100/144 (69%), Positives = 121/144 (84%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE V VTSQKHDPAWKHC+M+KNG+R+QLKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MGSNLEPVAVTSQKHDPAWKHCEMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+CLRVQPDVRLLMQ+SLNGVVM+KRKKQKLA+EI YN S+I + C LNT
Sbjct: 61  ASTCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAIDTSDI-AAEFTDTCGLNT 119

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
           +V+LLP A++H + LF++ ++G
Sbjct: 120 QVDLLPMSQAIEHTSSLFLNRDQG 143

>ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa]
 gi|550330253|gb|EEF02443.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa]
          Length = 608

 Score = 190 bits (483), Expect = 3e-46
 Identities = 93/146 (63%), Positives = 116/146 (79%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE +P+TSQKHDPAWKHCQM+KNGER+QLKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MGSNLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           A++C++V DVRL+MQ+SL+GVV++KRKKQK+A+EI N N S SEI D+NT
Sbjct: 61  AATCVQVPSDVRLMMQQSLDGVVVKKRKKQKIAEEITNLNPVS-SEIGVFDK----DVNT 115

Query: 158 EVNLLPAPDALDHNTDLFVDSEEGLG 81
           + L DA+D + L V E+G+G
Sbjct: 116 GMELTGVTDAIDPVSSLLVTGEDGMG 141

>gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao]
          Length = 749

 Score = 189 bits (481), Expect = 6e-46
 Identities = 92/142 (64%), Positives = 115/142 (80%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           MAS+LE +P+TSQKHDPAWKHCQM++NGER+QLKC+YCGKIF+GGGIHRIKEHLAGQKGN
Sbjct: 1   MASNLEPIPITSQKHDPAWKHCQMFRNGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C V DVRLLM+ESL+GV ++KRKKQK+A+E+ N N S SEIDT +N D NT
Sbjct: 61  ASTCFHVPSDVRLLMRESLDGVEVKKRKKQKIAEEMSNANQVS-SEIDTY--DNQVDTNT 117

Query: 158 EVNLLPAPDALDHNTDLFVDSE 93
           + ++ PD L ++ L V+ E
Sbjct: 118 GLLMIEGPDTLQPSSSLLVNRE 139

>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
 gi|223536481|gb|EEF38128.1| DNA binding protein, putative [Ricinus communis]
          Length = 753

 Score = 186 bits (473), Expect = 5e-45
 Identities = 87/144 (60%), Positives = 117/144 (81%), Gaps = 1/144 (0%)
 Frame = -2

Query: 515 ASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNA 336
           + DLE +P+TSQKHDPAWKHCQM+KNGER+QLKC+YCGKIFKGGGIHRIKEHLAGQKGNA
Sbjct: 3   SDDLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGNA 62

Query: 335 SSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYN-VGSVSEIDTLADNNHCDLNT 159
           S+CL+V DV+L+MQ+SL+GVV++KRKKQK+A+EI N N V EI+ A N+ +++T
Sbjct: 63  STCLQVPTDVKLIMQQSLDGVVVKKRKKQKIAEEITNLNPVIGGGEIEVFA-NDQIEVST 121

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
           + L+ + ++ ++ L + +EG
Sbjct: 122 GMELIGVSNVIEPSSSLLISGQEG 145

>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score = 183 bits (465), Expect = 4e-44
 Identities = 85/144 (59%), Positives = 115/144 (79%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M+S L+ VP+T QKHDPAWKHCQM+KNG+R+QLKCLYC K+FKGGGIHRIKEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C V P+V+ +MQESL+GV+M+KRK+QKL +E+ N N +E+D + +NH D+++
Sbjct: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN-AMTAEVDAI--SNHMDMDS 117

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
           ++L+ + LD N+ L + EEG
Sbjct: 118 SIHLIEVAEPLDTNSALLLTHEEG 141

>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score = 179 bits (453), Expect = 1e-42
 Identities = 83/144 (57%), Positives = 114/144 (79%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M+S L+ VP+T QKHDPAWKHCQM+KNG+R+QLKCLYC K+FKGGGIHRIKEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C V P+V+ +MQESL+GV+M+KRK+QKL +E+ N N E+D + +NH D+++
Sbjct: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNT-MTGEVDGI--SNHMDMDS 117

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
           ++L+ + L+ N+ L + E+G
Sbjct: 118 SIHLIEVAEPLETNSVLLLTHEKG 141

>gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]
          Length = 724

 Score = 178 bits (452), Expect = 1e-42
 Identities = 85/146 (58%), Positives = 107/146 (73%), Gaps = 2/146 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M +ELVP+TSQKHDPAWKHCQM+K E+I LKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MEPHMELVPMTSQKHDPAWKHCQMFKTEEKIHLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLAD--NNHCDL 165
           AS+CLRV P+V+ M +SLNGV ++K+KK KL +++ Y D AD N H L
Sbjct: 61  ASTCLRVLPEVKQQMLDSLNGVAVKKKKKLKLTEQLSGY--------DNPADRVNEHSSL 112

Query: 164 NTEVNLLPAPDALDHNTDLFVDSEEG 87
           N+E LP P+ ++H+ D + + EEG
Sbjct: 113 NSEAFFLPGPEIVEHDDDAYEEGEEG 138

>gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score = 174 bits (440), Expect = 3e-41
 Identities = 84/145 (57%), Positives = 113/145 (77%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QMYKNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C RV DVRL MQ+SL+GVV++KR+KQK+ +EI++ N + +++L +NN D+N
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSVN-PLTTVVNSLPNNNQVDVNQ 119

Query: 158 EVNLLPAPDALDHNTDLFVDSEEGL 84
           + + +DHN+ L V+ EG+
Sbjct: 120 GLQAI----GVDHNSSLVVNPGEGM 140

>gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score = 174 bits (440), Expect = 3e-41
 Identities = 84/145 (57%), Positives = 113/145 (77%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QMYKNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 114 MGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGN 173

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C RV DVRL MQ+SL+GVV++KR+KQK+ +EI++ N + +++L +NN D+N
Sbjct: 174 ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSVN-PLTTVVNSLPNNNQVDVNQ 232

Query: 158 EVNLLPAPDALDHNTDLFVDSEEGL 84
           + + +DHN+ L V+ EG+
Sbjct: 233 GLQAI----GVDHNSSLVVNPGEGM 253

>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca subsp. vesca]
          Length = 754

 Score = 168 bits (426), Expect = 1e-39
 Identities = 78/121 (64%), Positives = 98/121 (80%)
 Frame = -2

Query: 506 LELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASSC 327
           +E VP+TSQKHDPAWKHCQM+K+G+RIQLKC+YC K+F+GGGIHRIKEHLAGQKGNAS+C
Sbjct: 1   MEPVPITSQKHDPAWKHCQMFKSGDRIQLKCIYCSKLFRGGGIHRIKEHLAGQKGNASTC 60

Query: 326 LRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNTEVNL 147
           LRV PDVR LMQ+SL+GVV++KR +QKL +EI N ++D+L D+N V L
Sbjct: 61  LRVPPDVRGLMQQSLDGVVVKKRNRQKLDEEITNITPPQDGDVDSLG-GTQSDVNNAVQL 119

Query: 146 L 144
           +
Sbjct: 120 V 120

>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine max]
          Length = 729

 Score = 164 bits (415), Expect = 3e-38
 Identities = 81/147 (55%), Positives = 114/147 (77%), Gaps = 2/147 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QM+KNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNH--CDL 165
           AS+C RV DVRL MQ+SL+GVV++KR+KQ++ +EI++ N + +++L +NN+ D+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVN-PLTTVVNSLPNNNNRVVDV 119

Query: 164 NTEVNLLPAPDALDHNTDLFVDSEEGL 84
           N + + ++HN+ L V+ EG+
Sbjct: 120 NQGLQAI----GVEHNSSLVVNPGEGM 142

>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine max]
 gi|571489936|ref|XP_006591345.1| PREDICTED: uncharacterized protein LOC100817502 isoform X2 [Glycine max]
 gi|571489939|ref|XP_006591346.1| PREDICTED: uncharacterized protein LOC100817502 isoform X3 [Glycine max]
          Length = 759

 Score = 164 bits (415), Expect = 3e-38
 Identities = 81/147 (55%), Positives = 114/147 (77%), Gaps = 2/147 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QM+KNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNH--CDL 165
           AS+C RV DVRL MQ+SL+GVV++KR+KQ++ +EI++ N + +++L +NN+ D+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVN-PLTTVVNSLPNNNNRVVDV 119

Query: 164 NTEVNLLPAPDALDHNTDLFVDSEEGL 84
           N + + ++HN+ L V+ EG+
Sbjct: 120 NQGLQAI----GVEHNSSLVVNPGEGM 142

>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine max]
 gi|571542833|ref|XP_006601996.1| PREDICTED: uncharacterized protein LOC100806265 isoform X2 [Glycine max]
          Length = 758

 Score = 164 bits (414), Expect = 3e-38
 Identities = 81/146 (55%), Positives = 113/146 (77%), Gaps = 1/146 (0%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QM+KNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNH-CDLN 162
           AS+C RV DVRL MQ+SL+GVV++KR+KQ++ +EI++ N + +++L +NN D+N
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVN-PLTTVVNSLPNNNQVVDVN 119

Query: 161 TEVNLLPAPDALDHNTDLFVDSEEGL 84
           + + ++HN+ L V+ EG+
Sbjct: 120 QGLQAI----GVEHNSTLVVNPGEGM 141

>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score = 163 bits (413), Expect = 5e-38
 Identities = 74/100 (74%), Positives = 90/100 (90%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           MAS LE +P++SQKHDPAWKHCQM+KNG+R+QLKCLYC K+F+GGGIHRIKEHLA QKGN
Sbjct: 1   MASGLEPIPISSQKHDPAWKHCQMFKNGDRVQLKCLYCFKLFRGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYN 219
           AS+C RV DVRL MQ+SL+GVV++K+KKQK+A+EI N N
Sbjct: 61  ASTCSRVPLDVRLAMQQSLDGVVVKKKKKQKIAEEITNNN 100

>gb|ESW31639.1| hypothetical protein PHAVU_002G255200g [Phaseolus vulgaris]
 gi|561033061|gb|ESW31640.1| hypothetical protein PHAVU_002G255200g [Phaseolus vulgaris]
          Length = 172

 Score = 144 bits (364), Expect = 2e-32
 Identities = 72/139 (51%), Positives = 102/139 (73%)
 Frame = -2

Query: 500 LVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASSCLR 321
           LVP+TSQKHDP WKH QM+KN +++QLKC+Y K+F+GGGI RIKEHLA QKGNAS C R
Sbjct: 11  LVPITSQKHDPIWKHVQMFKNSDKVQLKCIYFLKMFEGGGIRRIKEHLACQKGNASICSR 70

Query: 320 VQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNTEVNLLP 141
           + DV+L MQ+SL+G V++K +KQK+ +EII+ N ++ ++ L +NN D+N + +
Sbjct: 71  LPHDVKLNMQQSLDGAVVKKMRKQKI-EEIISVNPLGIA-VNLLPNNNQVDVNQGLQAI- 127

Query: 140 APDALDHNTDLFVDSEEGL 84
           +DHN+ L V+ EG+
Sbjct: 128 ---GVDHNSSLVVNPSEGM 143

>gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao]
          Length = 750

 Score = 134 bits (336), Expect = 4e-29
 Identities = 66/144 (45%), Positives = 95/144 (65%), Gaps = 2/144 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M +L + +T QK DPAW HC+ +KNGER+Q+KC+YCGK+FKGGGIHR KEHLAG+KG
Sbjct: 1   MELNLTPISITKQKQDPAWNHCEAFKNGERLQIKCMYCGKMFKGGGIHRFKEHLAGRKGQ 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGS--VSEIDTLADNNHCDL 165
           C +V P VR LMQESLNGV++++ KQ E++ S EID A ++ D+
Sbjct: 61  GPICEQVPPGVRALMQESLNGVLLKQDNKQNAIPELLACGGSSPHAGEIDKSAYSD--DV 118

Query: 164 NTEVNLLPAPDALDHNTDLFVDSE 93
           N V + ++L+ ++ L ++ +
Sbjct: 119 NNGVKPIQVLNSLEPDSSLVLNGK 142

>ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana]
 gi|79313325|ref|NP_001030742.1| hAT dimerization domain-containing protein [Arabidopsis thaliana]
 gi|11994740|dbj|BAB03069.1| transposase-like protein [Arabidopsis thaliana]
 gi|28393360|gb|AAO42104.1| unknown protein [Arabidopsis thaliana]
 gi|28827622|gb|AAO50655.1| unknown protein [Arabidopsis thaliana]
 gi|332643084|gb|AEE76605.1| hAT dimerization domain-containing protein [Arabidopsis thaliana]
 gi|332643085|gb|AEE76606.1| hAT dimerization domain-containing protein [Arabidopsis thaliana]
          Length = 761

 Score = 121 bits (303), Expect = 3e-25
 Identities = 55/109 (50%), Positives = 79/109 (72%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M SDLE V +T QK D AWKHC++YK G+R+Q++CLYC K+FKGGGI R+KEHLAG+KG
Sbjct: 1   MDSDLEPVALTPQKQDSAWKHCEVYKYGDRVQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDT 192
           + C +V +VRL +Q+ ++G V R+RK++K + E + E++T
Sbjct: 61  GTICDQVPDEVRLFLQQCIDGTVRRQRKRRKSSPEPLPIAYFPPCEVET 109

>ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana]
 gi|240255844|ref|NP_193238.5| hAT transposon superfamily [Arabidopsis thaliana]
 gi|332658140|gb|AEE83540.1| hAT transposon superfamily [Arabidopsis thaliana]
 gi|332658141|gb|AEE83541.1| hAT transposon superfamily [Arabidopsis thaliana]
          Length = 768

 Score = 119 bits (297), Expect = 1e-24
 Identities = 52/95 (54%), Positives = 73/95 (76%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M ++LE V +T QK D AWKHC++YK G+R+Q++CLYC K+FKGGGI R+KEHLAG+KG
Sbjct: 1   MDAELEPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADE 234
           + C +V DVRL +Q+ ++G V R+RK+ K + E
Sbjct: 61  GTICDQVPEDVRLFLQQCIDGTVRRQRKRHKSSSE 95

>ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis]
 gi|223539752|gb|EEF41333.1| DNA binding protein, putative [Ricinus communis]
          Length = 854

 Score = 119 bits (297), Expect = 1e-24
 Identities = 65/137 (47%), Positives = 91/137 (66%), Gaps = 1/137 (0%)
 Frame = -2

Query: 491 VTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASSCLRVQP 312
           VT K D AWK+CQ K G+R+Q+KC YCGK+FKGGGIHR KEHLAG+KG A C RV
Sbjct: 122 VTRHKKDMAWKYCQPSKYGDRVQIKCNYCGKVFKGGGIHRFKEHLAGRKGAAPICDRVPS 181

Query: 311 DVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVS-EIDTLADNNHCDLNTEVNLLPAP 135
           DVRLLMQ+ L+ VV +++K++ + +E IN + V DT A NH + N AP
Sbjct: 182 DVRLLMQQCLHEVVPKQKKQKVVIEETINVDSPPVPLNTDTFA--NHFGDEDDDN--GAP 237

Query: 134 DALDHNTDLFVDSEEGL 84
           +++ N++L ++ ++ L
Sbjct: 238 ISVEFNSNLSLEEDDVL 254

 Score = 102 bits (255), Expect = 9e-20
 Identities = 49/114 (42%), Positives = 71/114 (62%)
 Frame = -2

Query: 509 DLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASS 330
           +++ P KHD WK+C+M K GE++ +KC YCGKIFKGGGI R KEHLAG+KG
Sbjct: 2   EIQTPPSLGHKHDLGWKYCEMIKEGEKVHIKCSYCGKIFKGGGIFRFKEHLAGRKGGGPM 61

Query: 329 CLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCD 168
           CL V DVRLLM+++L+ V K+ ++ + + E+ +L +N + D
Sbjct: 62  CLNVPADVRLLMEQTLD-VSSAKQSSRRQSSRL-----KMTPELPSLPNNKNSD 109