BLASTX nr result
ID: Rehmannia23_contig00018336
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00018336
         (801 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...  213   5e-53
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]      211   3e-52
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...  207   5e-51
ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Popu...  192   2e-46
gb|EOY18075.1| HAT and BED zinc finger domain-containing protein...  190   6e-46
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...  184   3e-44
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...  171   3e-40
gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus...  169   1e-39
gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus...  168   2e-39
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...  166   7e-39
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...  166   7e-39
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...  166   9e-39
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...  163   7e-38
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]        162   1e-37
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...  161   2e-37
gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma c...  135   1e-29
gb|ESW31639.1| hypothetical protein PHAVU_002G255200g [Phaseolus...  125   1e-26
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...  119   2e-24
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal...
118 3e-24 gb|AAM98154.1| putative protein [Arabidopsis thaliana] 118 3e-24 >ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum] Length = 755 Score = 213 bits (543), Expect = 5e-53 Identities = 111/198 (56%), Positives = 136/198 (68%), Gaps = 5/198 (2%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPV VTSQKHDPAWKHC+MFKNG+RVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA+TCL Sbjct: 6 EPVPVTSQKHDPAWKHCEMFKNGERVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNASTCL 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANN-----ALNTEVVL 238 RVQPDVRL M +SLNGV +KKRKKQKLAEEI+ Y N GT+ ++A L+T+V L Sbjct: 66 RVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTY-NAGTATSDIAAEFTDTCGLDTQVDL 124 Query: 237 LPVPEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATSSKKI 58 LP+P+ +EH +++ N++ A N+ P SK++ Sbjct: 125 LPMPQAIEHTSNLFLNRDQGPNNIGARKKKSRIRKGASSSNNNA-----MLLPINQSKRV 179 Query: 57 SNTVNMAVGRFFFDVGLP 4 +N V+MAV RF D +P Sbjct: 180 NNHVHMAVARFLLDARVP 197 >gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea] Length = 724 Score = 211 bits (536), Expect = 3e-52 Identities = 110/194 (56%), Positives = 133/194 (68%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 E V +TSQKHDPAWKHCQMFK +++ LKCIYCGKIFKGGGIHRIKEHLAGQKGNA+TCL Sbjct: 6 ELVPMTSQKHDPAWKHCQMFKTEEKIHLKCIYCGKIFKGGGIHRIKEHLAGQKGNASTCL 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPVPE 223 RV P+V+ QML+SLNGVAVKK+KK KL E++SGYDNP E +++LN+E LP PE Sbjct: 66 RVLPEVKQQMLDSLNGVAVKKKKKLKLTEQLSGYDNPADRVNE--HSSLNSEAFFLPGPE 123 Query: 222 MMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATSSKKISNTVN 43 ++EHD D Y E+ K P + ++A S + S V+ Sbjct: 124 IVEHDDDAYEEGEEGTTSKRGPRQKRPQIRKNP-------SESMALMSLPSVQPCSKKVH 176 Query: 42 MAVGRFFFDVGLPA 1 MAVGRFF DVGLPA Sbjct: 177 MAVGRFFVDVGLPA 190 >ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum lycopersicum] Length = 748 Score = 207 bits (526), Expect = 5e-51 Identities = 109/196 (55%), 
Positives = 134/196 (68%), Gaps = 3/196 (1%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPVAVTSQKHDPAWKHC+MFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA+TCL Sbjct: 6 EPVAVTSQKHDPAWKHCEMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNASTCL 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVA---NNALNTEVVLLP 232 RVQPDVRL M +SLNGV +KKRKKQKLAEEI+ Y+ TS + LNT+V LLP Sbjct: 66 RVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAIDTSDIAAEFTDTCGLNTQVDLLP 125 Query: 231 VPEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATSSKKISN 52 + + +EH ++ N++ K + ++++ SK+++N Sbjct: 126 MSQAIEHTSSLFLNRDQ-----------GPNNRKKKSRIRKGASSSNNLPIINQSKRVNN 174 Query: 51 TVNMAVGRFFFDVGLP 4 V+MAV RF D +P Sbjct: 175 QVHMAVARFLLDARVP 190 >ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa] gi|550330253|gb|EEF02443.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa] Length = 608 Score = 192 bits (487), Expect = 2e-46 Identities = 96/192 (50%), Positives = 128/192 (66%), Gaps = 1/192 (0%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EP+ +TSQKHDPAWKHCQMFKNG+RVQLKC+YCGKIFKGGGIHRIKEHLAGQKGNAATC+ Sbjct: 6 EPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGNAATCV 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPVPE 223 +V DVRL M +SL+GV VKKRKKQK+AEEI+ NP +S + V + +NT + L V + Sbjct: 66 QVPSDVRLMMQQSLDGVVVKKRKKQKIAEEITNL-NPVSSEIGVFDKDVNTGMELTGVTD 124 Query: 222 MMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALAC-FPATSSKKISNTV 46 ++ + ED + N+ + + P + K+ ++ + Sbjct: 125 AIDPVSSLLVTGEDGMGKKGGERRKRGRGRGRGSVTNAKAVVTMGSGMPLSGGKRKNDHI 184 Query: 45 NMAVGRFFFDVG 10 +MA+GRF +D+G Sbjct: 185 HMAIGRFLYDIG 196 >gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] Length = 749 Score = 190 bits (482), Expect = 6e-46 Identities = 98/195 (50%), Positives = 129/195 (66%), Gaps = 2/195 (1%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 
EP+ +TSQKHDPAWKHCQMF+NG+RVQLKCIYCGKIF+GGGIHRIKEHLAGQKGNA+TC Sbjct: 6 EPIPITSQKHDPAWKHCQMFRNGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGNASTCF 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPV-- 229 V DVRL M ESL+GV VKKRKKQK+AEE+S N +S ++ +N ++T LL + Sbjct: 66 HVPSDVRLLMRESLDGVEVKKRKKQKIAEEMSN-ANQVSSEIDTYDNQVDTNTGLLMIEG 124 Query: 228 PEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATSSKKISNT 49 P+ ++ + N+E A + S A + +K+++N Sbjct: 125 PDTLQPSSSLLVNREGTSNVSGDRRKRGKGKSSAAE-----SNALVVNTVGLGAKRVNNH 179 Query: 48 VNMAVGRFFFDVGLP 4 V++A+GRF FD+G P Sbjct: 180 VHVAIGRFLFDIGAP 194 >ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis] gi|223536481|gb|EEF38128.1| DNA binding protein, putative [Ricinus communis] Length = 753 Score = 184 bits (467), Expect = 3e-44 Identities = 97/198 (48%), Positives = 134/198 (67%), Gaps = 5/198 (2%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EP+ +TSQKHDPAWKHCQMFKNG+RVQLKC+YCGKIFKGGGIHRIKEHLAGQKGNA+TCL Sbjct: 7 EPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGNASTCL 66 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEV---ANNAL--NTEVVL 238 +V DV+L M +SL+GV VKKRKKQK+AEEI+ NP G E+ AN+ + +T + L Sbjct: 67 QVPTDVKLIMQQSLDGVVVKKRKKQKIAEEITNL-NPVIGGGEIEVFANDQIEVSTGMEL 125 Query: 237 LPVPEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATSSKKI 58 + V ++E + + ++ + N++ + + A +K++ Sbjct: 126 IGVSNVIEPSSSLLISGQEGKANKGGERRKRGRSKGSGANANAI-VSMNSNRMALGAKRV 184 Query: 57 SNTVNMAVGRFFFDVGLP 4 ++ V+MA+GRF +D+G P Sbjct: 185 NDHVHMAIGRFLYDIGAP 202 >ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca subsp. 
vesca] Length = 754 Score = 171 bits (433), Expect = 3e-40 Identities = 95/196 (48%), Positives = 122/196 (62%), Gaps = 3/196 (1%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPV +TSQKHDPAWKHCQMFK+GDR+QLKCIYC K+F+GGGIHRIKEHLAGQKGNA+TCL Sbjct: 2 EPVPITSQKHDPAWKHCQMFKSGDRIQLKCIYCSKLFRGGGIHRIKEHLAGQKGNASTCL 61 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEV---ANNALNTEVVLLP 232 RV PDVR M +SL+GV VKKR +QKL EEI+ P V+ + +N V L+ Sbjct: 62 RVPPDVRGLMQQSLDGVVVKKRNRQKLDEEITNITPPQDGDVDSLGGTQSDVNNAVQLVG 121 Query: 231 VPEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATSSKKISN 52 V +E + N+E + +S + A S+K+++ Sbjct: 122 V--SVEPISRLLVNRE---GVTSVRSMDRRKRGRGKSSWSSHGVHGVCNGGALVSRKVNS 176 Query: 51 TVNMAVGRFFFDVGLP 4 V+ A+GRF FD+G P Sbjct: 177 YVHEAIGRFLFDIGAP 192 >gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 869 Score = 169 bits (427), Expect = 1e-39 Identities = 97/203 (47%), Positives = 122/203 (60%), Gaps = 4/203 (1%) Frame = -1 Query: 600 KVSNMEEPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKG 421 K+ + EPV +TSQKHDPAWKH QM+KNGD+VQLKCIYC K+FKGGGIHRIKEHLA QKG Sbjct: 113 KMGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKG 172 Query: 420 NAATCLRVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVV 241 NA+TC RV DVRL M +SL+GV VKKR+KQK+ EEI NP T+ V N +V Sbjct: 173 NASTCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSV-NPLTTVVNSLPNNNQVDVN 231 Query: 240 LLPVPEMMEHDVDVYTNQ-EDXXXXXXXXXXXXXXXXKAPDMVNSVSTAAL---ACFPAT 73 ++H+ + N E A NS A+ FP Sbjct: 232 QGLQAIGVDHNSSLVVNPGEGMSKNMERRKKMRASKNPAAIYANSEGVVAVEKNGLFP-- 289 Query: 72 SSKKISNTVNMAVGRFFFDVGLP 4 K++ N ++MA+GRF +D+G P Sbjct: 290 --KRVDNHIHMAIGRFLYDIGAP 310 >gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 756 Score = 168 bits (425), Expect = 2e-39 Identities = 96/197 (48%), Positives = 119/197 (60%), Gaps = 4/197 (2%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPV 
+TSQKHDPAWKH QM+KNGD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+TC Sbjct: 6 EPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGNASTCS 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPVPE 223 RV DVRL M +SL+GV VKKR+KQK+ EEI NP T+ V N +V Sbjct: 66 RVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSV-NPLTTVVNSLPNNNQVDVNQGLQAI 124 Query: 222 MMEHDVDVYTNQ-EDXXXXXXXXXXXXXXXXKAPDMVNSVSTAAL---ACFPATSSKKIS 55 ++H+ + N E A NS A+ FP K++ Sbjct: 125 GVDHNSSLVVNPGEGMSKNMERRKKMRASKNPAAIYANSEGVVAVEKNGLFP----KRVD 180 Query: 54 NTVNMAVGRFFFDVGLP 4 N ++MA+GRF +D+G P Sbjct: 181 NHIHMAIGRFLYDIGAP 197 >ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine max] Length = 729 Score = 166 bits (421), Expect = 7e-39 Identities = 99/199 (49%), Positives = 120/199 (60%), Gaps = 6/199 (3%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPV +TSQKHDPAWKH QMFKNGD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+TC Sbjct: 6 EPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNASTCS 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPVPE 223 RV DVRL M +SL+GV VKKR+KQ++ EEI NP T+ V N N V + + Sbjct: 66 RVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSV-NPLTTVVNSLPNNNNRVVDVNQGLQ 124 Query: 222 M--MEHDVDVYTNQ-EDXXXXXXXXXXXXXXXXKAPDMVNSVSTAAL---ACFPATSSKK 61 +EH+ + N E A NS A+ FP KK Sbjct: 125 AIGVEHNSSLVVNPGEGMSRNMERRKKMRATKNPAAVYANSEGVIAVEKNGLFP----KK 180 Query: 60 ISNTVNMAVGRFFFDVGLP 4 + N + MA+GRF +D+G P Sbjct: 181 MDNHIYMAIGRFLYDIGAP 199 >ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine max] gi|571489936|ref|XP_006591345.1| PREDICTED: uncharacterized protein LOC100817502 isoform X2 [Glycine max] gi|571489939|ref|XP_006591346.1| PREDICTED: uncharacterized protein LOC100817502 isoform X3 [Glycine max] Length = 759 Score = 166 bits (421), Expect = 7e-39 Identities = 99/199 (49%), Positives = 120/199 (60%), Gaps = 6/199 (3%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 
EPV +TSQKHDPAWKH QMFKNGD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+TC Sbjct: 6 EPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNASTCS 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPVPE 223 RV DVRL M +SL+GV VKKR+KQ++ EEI NP T+ V N N V + + Sbjct: 66 RVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSV-NPLTTVVNSLPNNNNRVVDVNQGLQ 124 Query: 222 M--MEHDVDVYTNQ-EDXXXXXXXXXXXXXXXXKAPDMVNSVSTAAL---ACFPATSSKK 61 +EH+ + N E A NS A+ FP KK Sbjct: 125 AIGVEHNSSLVVNPGEGMSRNMERRKKMRATKNPAAVYANSEGVIAVEKNGLFP----KK 180 Query: 60 ISNTVNMAVGRFFFDVGLP 4 + N + MA+GRF +D+G P Sbjct: 181 MDNHIYMAIGRFLYDIGAP 199 >ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine max] gi|571542833|ref|XP_006601996.1| PREDICTED: uncharacterized protein LOC100806265 isoform X2 [Glycine max] Length = 758 Score = 166 bits (420), Expect = 9e-39 Identities = 100/200 (50%), Positives = 121/200 (60%), Gaps = 7/200 (3%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPV +TSQKHDPAWKH QMFKNGD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+TC Sbjct: 6 EPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNASTCS 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPVPE 223 RV DVRL M +SL+GV VKKR+KQ++ EEI NP T+ V N N +VV + Sbjct: 66 RVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSV-NPLTTVVNSLPN--NNQVVDVNQGL 122 Query: 222 M---MEHDVDVYTNQ-EDXXXXXXXXXXXXXXXXKAPDMVNSVSTAAL---ACFPATSSK 64 +EH+ + N E A NS A+ FP K Sbjct: 123 QAIGVEHNSTLVVNPGEGMSRNMERRKKMRAAKNPAAVYANSEDVVAVEKNGLFP----K 178 Query: 63 KISNTVNMAVGRFFFDVGLP 4 K+ N + MA+GRF +D+G P Sbjct: 179 KMDNHIYMAIGRFLYDIGAP 198 >ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis] Length = 745 Score = 163 bits (412), Expect = 7e-38 Identities = 96/202 (47%), Positives = 123/202 (60%), Gaps = 9/202 (4%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EP+ ++SQKHDPAWKHCQMFKNGDRVQLKC+YC K+F+GGGIHRIKEHLA QKGNA+TC Sbjct: 6 
EPIPISSQKHDPAWKHCQMFKNGDRVQLKCLYCFKLFRGGGIHRIKEHLACQKGNASTCS 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPV-- 229 RV DVRL M +SL+GV VKK+KKQK+AEEI+ +N T G A LP+ Sbjct: 66 RVPLDVRLAMQQSLDGVVVKKKKKQKIAEEIT--NNNPTFGEVYAFTDQGDVTPGLPLLD 123 Query: 228 ----PEMMEHDV---DVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATS 70 PE + V DV +N VN+ + A ++ + Sbjct: 124 DSNTPEACSNLVVSRDVISNTTGDKRKRWRGKN---------SSVNAYTGAMISA--SLD 172 Query: 69 SKKISNTVNMAVGRFFFDVGLP 4 + + +N + MAVGRF +D+G P Sbjct: 173 ATRGNNPIFMAVGRFLYDIGAP 194 >gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo] Length = 752 Score = 162 bits (410), Expect = 1e-37 Identities = 89/197 (45%), Positives = 122/197 (61%), Gaps = 6/197 (3%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 +PV +T QKHDPAWKHCQMFKNGDRVQLKC+YC K+FKGGGIHRIKEHLAGQKGNA+TC Sbjct: 6 QPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNASTCH 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANN--ALNTEVVLLPV 229 V P+V+ M ESL+GV +KKRK+QKL EE++ N T+ V+ +N +++ + L+ V Sbjct: 66 SVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNV-NAMTAEVDAISNHMDMDSSIHLIEV 124 Query: 228 PEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFP----ATSSKK 61 E ++ + + E+ + +S + P S + Sbjct: 125 AEPLDTNSALLLTHEE------GTSNKVGRKKGSKGKSSSCLDREMIVIPNGGGILDSNR 178 Query: 60 ISNTVNMAVGRFFFDVG 10 N V+MA+GRF +D+G Sbjct: 179 DRNQVHMAIGRFLYDIG 195 >ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus] Length = 752 Score = 161 bits (408), Expect = 2e-37 Identities = 92/197 (46%), Positives = 119/197 (60%), Gaps = 6/197 (3%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 +PV +T QKHDPAWKHCQMFKNGDRVQLKC+YC K+FKGGGIHRIKEHLAGQKGNA+TC Sbjct: 6 QPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNASTCH 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANN--ALNTEVVLLPV 229 V P+V+ M ESL+GV +KKRK+QKL EE++ N T V+ +N +++ + L+ V Sbjct: 66 
SVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNV-NTMTGEVDGISNHMDMDSSIHLIEV 124 Query: 228 PEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFP----ATSSKK 61 E +E TN + +S + P S + Sbjct: 125 AEPLE------TNSVLLLTHEKGTSNKVGRKKGSKGKSSSCLEREMIVIPNGGGILDSNR 178 Query: 60 ISNTVNMAVGRFFFDVG 10 N V+MAVGRF +D+G Sbjct: 179 DRNQVHMAVGRFLYDIG 195 >gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao] Length = 750 Score = 135 bits (341), Expect = 1e-29 Identities = 77/195 (39%), Positives = 104/195 (53%), Gaps = 4/195 (2%) Frame = -1 Query: 579 PVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCLR 400 P+++T QK DPAW HC+ FKNG+R+Q+KC+YCGK+FKGGGIHR KEHLAG+KG C + Sbjct: 7 PISITKQKQDPAWNHCEAFKNGERLQIKCMYCGKMFKGGGIHRFKEHLAGRKGQGPICEQ 66 Query: 399 VQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNA----LNTEVVLLP 232 V P VR M ESLNGV +K+ KQ E+ E+ +A +N V + Sbjct: 67 VPPGVRALMQESLNGVLLKQDNKQNAIPELLACGGSSPHAGEIDKSAYSDDVNNGVKPIQ 126 Query: 231 VPEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVSTAALACFPATSSKKISN 52 V +E D + N + + NS S A A S N Sbjct: 127 VLNSLEPDSSLVLNGKGEVSQGIRDSKKRGRDRSL--LANSHSCAKSDL--ALVSIGAEN 182 Query: 51 TVNMAVGRFFFDVGL 7 V+MA+GRF +D+G+ Sbjct: 183 PVHMAIGRFLYDIGV 197 >gb|ESW31639.1| hypothetical protein PHAVU_002G255200g [Phaseolus vulgaris] gi|561033061|gb|ESW31640.1| hypothetical protein PHAVU_002G255200g [Phaseolus vulgaris] Length = 172 Score = 125 bits (315), Expect = 1e-26 Identities = 60/90 (66%), Positives = 71/90 (78%) Frame = -1 Query: 576 VAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCLRV 397 V +TSQKHDP WKH QMFKN D+VQLKCIY K+F+GGGI RIKEHLA QKGNA+ C R+ Sbjct: 12 VPITSQKHDPIWKHVQMFKNSDKVQLKCIYFLKMFEGGGIRRIKEHLACQKGNASICSRL 71 Query: 396 QPDVRLQMLESLNGVAVKKRKKQKLAEEIS 307 DV+L M +SL+G VKK +KQK+ E IS Sbjct: 72 PHDVKLNMQQSLDGAVVKKMRKQKIEEIIS 101 >ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis] gi|223539752|gb|EEF41333.1| DNA binding protein, putative [Ricinus communis] Length = 854 Score = 119 bits (297), Expect = 2e-24 Identities = 72/204 
(35%), Positives = 101/204 (49%), Gaps = 14/204 (6%) Frame = -1 Query: 579 PVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCLR 400 P+ VT K D AWK+CQ K GDRVQ+KC YCGK+FKGGGIHR KEHLAG+KG A C R Sbjct: 119 PIIVTRHKKDMAWKYCQPSKYGDRVQIKCNYCGKVFKGGGIHRFKEHLAGRKGAAPICDR 178 Query: 399 VQPDVRLQMLESLNGVAVKKRKKQKLAEEISGYDNPGTSGVEVANNALNTEVVLLPVPEM 220 V DVRL M + L+ V K++K++ + EE D+P PVP Sbjct: 179 VPSDVRLLMQQCLHEVVPKQKKQKVVIEETINVDSP-------------------PVPLN 219 Query: 219 MEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSVS----------TAALA----CF 82 + + + +++D + D++N + T+A+ Sbjct: 220 TDTFANHFGDEDDDNGAPISVEFNSNLSLEEDDVLNQGNLHTRKRGRGKTSAIVDHGDPL 279 Query: 81 PATSSKKISNTVNMAVGRFFFDVG 10 K I N ++ VGRF +D+G Sbjct: 280 DVVHLKMIDNVIHTTVGRFLYDIG 303 Score = 97.1 bits (240), Expect = 7e-18 Identities = 42/78 (53%), Positives = 56/78 (71%) Frame = -1 Query: 558 KHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCLRVQPDVRL 379 KHD WK+C+M K G++V +KC YCGKIFKGGGI R KEHLAG+KG CL V DVRL Sbjct: 12 KHDLGWKYCEMIKEGEKVHIKCSYCGKIFKGGGIFRFKEHLAGRKGGGPMCLNVPADVRL 71 Query: 378 QMLESLNGVAVKKRKKQK 325 M ++L+ + K+ +++ Sbjct: 72 LMEQTLDVSSAKQSSRRQ 89 >ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana] gi|240255844|ref|NP_193238.5| hAT transposon superfamily [Arabidopsis thaliana] gi|332658140|gb|AEE83540.1| hAT transposon superfamily [Arabidopsis thaliana] gi|332658141|gb|AEE83541.1| hAT transposon superfamily [Arabidopsis thaliana] Length = 768 Score = 118 bits (295), Expect = 3e-24 Identities = 68/212 (32%), Positives = 109/212 (51%), Gaps = 21/212 (9%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPVA+T QK D AWKHC+++K GDR+Q++C+YC K+FKGGGI R+KEHLAG+KG C Sbjct: 6 EPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQGTICD 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEE---------------------ISGYDNPGT 286 +V DVRL + + ++G ++RK+ K + E G+ +PG+ Sbjct: 66 QVPEDVRLFLQQCIDGTVRRQRKRHKSSSEPLSVASLPPIEGDMMVVQPDVNDGFKSPGS 125 Query: 285 
SGVEVANNALNTEVVLLPVPEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSV 106 S V V N +L + Y ++++ +++ V Sbjct: 126 SDVVVQNESLLSG----------RTKQRTYRSKKNAFENGSASNNVDLIGRDMDNLI-PV 174 Query: 105 STAALACFPATSSKKISNTVNMAVGRFFFDVG 10 + +++ S + NT++MA+GRF F +G Sbjct: 175 AISSVKNIVHPSFRDRENTIHMAIGRFLFGIG 206 >gb|AAM98154.1| putative protein [Arabidopsis thaliana] Length = 768 Score = 118 bits (295), Expect = 3e-24 Identities = 68/212 (32%), Positives = 109/212 (51%), Gaps = 21/212 (9%) Frame = -1 Query: 582 EPVAVTSQKHDPAWKHCQMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCL 403 EPVA+T QK D AWKHC+++K GDR+Q++C+YC K+FKGGGI R+KEHLAG+KG C Sbjct: 6 EPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQGTICD 65 Query: 402 RVQPDVRLQMLESLNGVAVKKRKKQKLAEE---------------------ISGYDNPGT 286 +V DVRL + + ++G ++RK+ K + E G+ +PG+ Sbjct: 66 QVPEDVRLFLQQCIDGTVRRQRKRHKSSSEPLSVASLPPIEGDMMVVQPDVNDGFKSPGS 125 Query: 285 SGVEVANNALNTEVVLLPVPEMMEHDVDVYTNQEDXXXXXXXXXXXXXXXXKAPDMVNSV 106 S V V N +L + Y ++++ +++ V Sbjct: 126 SDVVVQNESLLSG----------RTKQRTYRSKKNAFENGSASNNVDLIGRDMDNLI-PV 174 Query: 105 STAALACFPATSSKKISNTVNMAVGRFFFDVG 10 + +++ S + NT++MA+GRF F +G Sbjct: 175 AISSVKNIVHPSFRDRENTIHMAIGRFLFGIG 206