BLASTX nr result
ID: Mentha25_contig00014446
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00014446
         (1152 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI22554.3| unnamed protein product [Vitis vinifera]             506  e-141
ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265...  503  e-140
ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496...  501  e-139
gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]    496  e-138
ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298...  491  e-136
ref|XP_002513602.1| protein dimerization, putative [Ricinus comm...  491  e-136
ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215...  489  e-136
ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobr...  486  e-135
ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobr...  486  e-134
ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593...  485  e-134
ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256...  484  e-134
ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  484  e-134
ref|XP_003602175.1| Protein dimerization [Medicago truncatula] g...  484  e-134
ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618...  483  e-134
ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808...  483  e-134
ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, part...  440  e-121
ref|NP_178092.4| hAT family dimerization domain-containing prote...  424  e-116
ref|XP_007218857.1| hypothetical protein PRUPE_ppa002763mg [Prun...  412  e-112
ref|XP_007038930.1| HAT transposon superfamily isoform 1 [Theobr...
384 e-104 gb|EEC70201.1| hypothetical protein OsI_00947 [Oryza sativa Indi... 340 5e-91 >emb|CBI22554.3| unnamed protein product [Vitis vinifera] Length = 731 Score = 506 bits (1302), Expect = e-141 Identities = 248/348 (71%), Positives = 293/348 (84%) Frame = -2 Query: 1046 SCLSMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRD 867 S +SMVREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRD Sbjct: 50 SFISMVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRD 109 Query: 866 DVTDKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXX 687 DVTD+VR II+SK+D +ET + KKQ + E K+P + S+ ALM+V K+F Sbjct: 110 DVTDRVRAIISSKEDGKETSSAKKQRVAEAKSPG-NYSAIKALMSVETPSPIAKIFPPIT 168 Query: 686 XXXXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTL 507 D ENAERSIALFFFEN+LDFSVARSSSYQ MI+AV KCG GF GPSA+ L Sbjct: 169 HMGPSSSN--DGENAERSIALFFFENKLDFSVARSSSYQLMIEAVSKCGHGFRGPSAEIL 226 Query: 506 KTTWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSV 327 KTTWLE IKSE+SLQSKDIE+EWA TGCTIIA+TWTDNKSRALINFLVSSPSRTFFHKSV Sbjct: 227 KTTWLERIKSEVSLQSKDIEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSV 286 Query: 326 DASSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCAS 147 DASSY+KN KYL+DLFDS+IQD G +NVVQ+I+D LN G+A+HI+QNYG++FV+PCAS Sbjct: 287 DASSYFKNTKYLADLFDSVIQDLGPDNVVQIIMDSTLNYTGVASHIVQNYGTVFVSPCAS 346 Query: 146 QCMNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 QC+N ILE+FCK+DW++RCILQAQ +SK+IYNN+SML +M+ TGGQD Sbjct: 347 QCLNLILEDFCKIDWVNRCILQAQTISKFIYNNASMLDLMKKSTGGQD 394 >ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265581 [Vitis vinifera] Length = 723 Score = 503 bits (1295), Expect = e-140 Identities = 246/346 (71%), Positives = 292/346 (84%) Frame = -2 Query: 1040 LSMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDV 861 L++VREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDV Sbjct: 44 LAVVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDV 103 Query: 860 TDKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXX 681 TD+VR II+SK+D +ET + KKQ + E 
K+P + S+ ALM+V K+F Sbjct: 104 TDRVRAIISSKEDGKETSSAKKQRVAEAKSPG-NYSAIKALMSVETPSPIAKIFPPITHM 162 Query: 680 XXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKT 501 D ENAERSIALFFFEN+LDFSVARSSSYQ MI+AV KCG GF GPSA+ LKT Sbjct: 163 GPSSSN--DGENAERSIALFFFENKLDFSVARSSSYQLMIEAVSKCGHGFRGPSAEILKT 220 Query: 500 TWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDA 321 TWLE IKSE+SLQSKDIE+EWA TGCTIIA+TWTDNKSRALINFLVSSPSRTFFHKSVDA Sbjct: 221 TWLERIKSEVSLQSKDIEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDA 280 Query: 320 SSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQC 141 SSY+KN KYL+DLFDS+IQD G +NVVQ+I+D LN G+A+HI+QNYG++FV+PCASQC Sbjct: 281 SSYFKNTKYLADLFDSVIQDLGPDNVVQIIMDSTLNYTGVASHIVQNYGTVFVSPCASQC 340 Query: 140 MNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 +N ILE+FCK+DW++RCILQAQ +SK+IYNN+SML +M+ TGGQD Sbjct: 341 LNLILEDFCKIDWVNRCILQAQTISKFIYNNASMLDLMKKSTGGQD 386 >ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496447 isoform X1 [Cicer arietinum] gi|502136218|ref|XP_004502604.1| PREDICTED: uncharacterized protein LOC101496447 isoform X2 [Cicer arietinum] Length = 679 Score = 501 bits (1291), Expect = e-139 Identities = 241/344 (70%), Positives = 290/344 (84%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSR PSKGVNPC+KVRDDVTD Sbjct: 1 MVREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTD 60 Query: 854 KVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXXX 675 +VR IIASKD+ +ET ++KKQ + E K+P S+S+ ALM++ +GK+F Sbjct: 61 RVRNIIASKDEIKETTSVKKQKVAEVKSPG-SLSATKALMSLETTSPTGKIFPTSNPLTP 119 Query: 674 XXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTTW 495 + ENAERSIALFFFEN+LDFSVARSSSYQ MIDA+ KCG GF GPSA+ LKTTW Sbjct: 120 SSTN--NQENAERSIALFFFENKLDFSVARSSSYQLMIDAIGKCGPGFTGPSAEILKTTW 177 Query: 494 LENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDASS 315 LE IKSE+ LQSKD+E+EWA TGCTIIA+TWTD KS+A+INFLVSSPSRTFFHKSVDAS+ Sbjct: 178 
LERIKSEVGLQSKDVEKEWATTGCTIIADTWTDYKSKAIINFLVSSPSRTFFHKSVDASA 237 Query: 314 YYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCMN 135 Y+KN K+L+DLFDS+IQ+FG ENVVQ+I+D N G+ANHI+QNYG+IFV+PCASQC+N Sbjct: 238 YFKNTKWLADLFDSVIQEFGPENVVQIIMDSSFNYTGIANHIVQNYGTIFVSPCASQCLN 297 Query: 134 GILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 ILEEF KVDWISRCILQAQ +SK IYNN+S+L +M+ ++GGQ+ Sbjct: 298 LILEEFTKVDWISRCILQAQTISKLIYNNASLLDLMKKYSGGQE 341 >gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis] Length = 694 Score = 496 bits (1278), Expect = e-138 Identities = 240/346 (69%), Positives = 287/346 (82%) Frame = -2 Query: 1040 LSMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDV 861 +++VREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDV Sbjct: 14 VTVVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDV 73 Query: 860 TDKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXX 681 TD+VR IIASK+D +ET + KKQ L E K+P ++S+ AL++ KVF Sbjct: 74 TDRVRAIIASKEDVKETSSTKKQKLVEVKSPG-NVSASKALVSTDTTSPVAKVFPAVTPV 132 Query: 680 XXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKT 501 ENAERSIALFFFEN+LDF +ARSSSYQ M+DA+ KCG GF GPSA+TLKT Sbjct: 133 APPSLN--SQENAERSIALFFFENKLDFGIARSSSYQLMVDAIAKCGPGFTGPSAETLKT 190 Query: 500 TWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDA 321 TWLE IKSE+SLQSKDIE+EW TGCTIIA+TWTDNKSRALINFLVSSPSRTFFHKSVDA Sbjct: 191 TWLERIKSEMSLQSKDIEKEWMTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDA 250 Query: 320 SSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQC 141 S+Y+KN+K L+DLFDS+IQDFG +NVVQVI+D N G+ANHILQNY +IFV+PC SQC Sbjct: 251 SAYFKNMKCLADLFDSVIQDFGPDNVVQVIMDSSFNYTGVANHILQNYSTIFVSPCVSQC 310 Query: 140 MNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 +N ILEEF KVDW++RCILQ Q +SK+IYN++SML +M+ +TGGQ+ Sbjct: 311 LNLILEEFSKVDWVNRCILQGQTISKFIYNSASMLDLMKKYTGGQE 356 >ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298657 [Fragaria vesca subsp. 
vesca] Length = 681 Score = 491 bits (1265), Expect = e-136 Identities = 238/345 (68%), Positives = 285/345 (82%), Gaps = 1/345 (0%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKD CWEYA++L+GNKV+CKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDVTD Sbjct: 1 MVREKDTCWEYAEKLDGNKVKCKFCQRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 60 Query: 854 KVREIIASKDDTRETLTI-KKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXX 678 KVR IIASK++ +ET + KK+ E K+P V++S ALM++ KV+ Sbjct: 61 KVRTIIASKEEVKETSSSSKKKKFVEVKSPPVNVSPVKALMSMETPSPIQKVYPNVTPMA 120 Query: 677 XXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTT 498 + ENAERSIALFFFEN++DFS+AR+SSYQ MIDA+ KCG GF GPSA+TLKTT Sbjct: 121 PLSMN--NQENAERSIALFFFENKIDFSIARTSSYQLMIDAITKCGPGFTGPSAETLKTT 178 Query: 497 WLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDAS 318 WLE +K+E+SLQSKDIE+EW TGCTIIA+TWTDNKSRALINFLVSSPSRTFFHKSVDAS Sbjct: 179 WLERVKTEMSLQSKDIEKEWTTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDAS 238 Query: 317 SYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCM 138 +Y+KN K L++LFDS+IQDFG ENVVQ+I+D N G+ANHIL NY +IFV+PCASQC+ Sbjct: 239 AYFKNTKCLAELFDSVIQDFGPENVVQIIMDSSFNYTGVANHILTNYTTIFVSPCASQCL 298 Query: 137 NGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 N ILEEF KVDW++RC LQAQ +SK+IYNN+SML +M+ FTGGQD Sbjct: 299 NLILEEFSKVDWVNRCFLQAQTISKFIYNNASMLDLMKRFTGGQD 343 >ref|XP_002513602.1| protein dimerization, putative [Ricinus communis] gi|223547510|gb|EEF49005.1| protein dimerization, putative [Ricinus communis] Length = 688 Score = 491 bits (1263), Expect = e-136 Identities = 243/346 (70%), Positives = 285/346 (82%) Frame = -2 Query: 1040 LSMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDV 861 LS+VREKDVCWEYA++L+GNKV+CKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDV Sbjct: 8 LSVVREKDVCWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDV 67 Query: 860 TDKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXX 681 TD+VR IIASK+D +E + KKQ E K+PA I + AL+ V + + KV+ Sbjct: 68 
TDRVRAIIASKEDIKEPSSAKKQRPAEAKSPA-HIYATKALVNVESVAPAAKVYPTVTSI 126 Query: 680 XXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKT 501 + ENAERSIALFFFEN+LDFSVARS SYQ MI+A+ KCG GF GPSA+ LKT Sbjct: 127 SPPSLS--NQENAERSIALFFFENKLDFSVARSPSYQLMIEAIEKCGPGFTGPSAEILKT 184 Query: 500 TWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDA 321 TWLE IKSE+SLQ KD E+EW TGCTIIA+TWTDNKSRALINF VSSPSRTFFHKSVDA Sbjct: 185 TWLERIKSEVSLQLKDTEKEWTTTGCTIIADTWTDNKSRALINFFVSSPSRTFFHKSVDA 244 Query: 320 SSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQC 141 SSY+KN K L+DLFDS+IQDFGAENVVQ+I+D N G+ANHILQNYG+IFV+PCASQC Sbjct: 245 SSYFKNTKCLADLFDSVIQDFGAENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQC 304 Query: 140 MNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 +N ILE+F KVDW++RCI QAQ +SK+IYNNSSML +M+ FTGGQ+ Sbjct: 305 LNLILEDFSKVDWVNRCISQAQTLSKFIYNNSSMLDLMKKFTGGQE 350 >ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215128, partial [Cucumis sativus] Length = 685 Score = 489 bits (1259), Expect = e-136 Identities = 237/347 (68%), Positives = 286/347 (82%), Gaps = 2/347 (0%) Frame = -2 Query: 1037 SMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVT 858 S+VREKD+CWEYA++L+GNKV+CKFC R+LNGGISRLKHHLSRLPS+GVNPC+KVRDDV+ Sbjct: 4 SVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVS 63 Query: 857 DKVREIIASKDDTRETLTIKKQNLQEFKT--PAVSISSGNALMAVGAAPISGKVFAXXXX 684 D+VR I+A++++ +E T KKQ L E KT SIS +++++ KVF Sbjct: 64 DRVRAILATREEIKEASTGKKQKLAEVKTVESVPSISMCKSVVSIETPSPVAKVFPTVTP 123 Query: 683 XXXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLK 504 +HENAE+SIALFFFEN+LDFS+ARSSSYQ MIDA+ KCG GF GPSA+TLK Sbjct: 124 MAPPSLH--NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLK 181 Query: 503 TTWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVD 324 TTWLE IK+E+SLQSKDIE+EW TGCTII +TWTDNKSRALINFLVSSPSRTFFHKSVD Sbjct: 182 TTWLERIKTEVSLQSKDIEKEWTTTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSVD 241 Query: 323 ASSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQ 144 AS+Y+KN 
K L DLFDS+IQDFG ENVVQ+I+D LN G ANHILQ YG+IFV+PCASQ Sbjct: 242 ASTYFKNTKCLGDLFDSVIQDFGHENVVQIIMDSSLNYSGTANHILQTYGTIFVSPCASQ 301 Query: 143 CMNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 C+N ILEEF KVDW++RCILQAQ +SK++YN+SS+L +MR FTGGQ+ Sbjct: 302 CLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQE 348 >ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobroma cacao] gi|590673575|ref|XP_007038932.1| HAT transposon superfamily isoform 2 [Theobroma cacao] gi|508776176|gb|EOY23432.1| HAT transposon superfamily isoform 2 [Theobroma cacao] gi|508776177|gb|EOY23433.1| HAT transposon superfamily isoform 2 [Theobroma cacao] Length = 678 Score = 486 bits (1251), Expect = e-135 Identities = 238/344 (69%), Positives = 285/344 (82%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDVTD Sbjct: 1 MVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 60 Query: 854 KVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXXX 675 +VR I++SK++ +ET ++KKQ + E ++P +IS+ + ++ + A+ KVF Sbjct: 61 RVRAILSSKEEIKETSSVKKQKIAEARSPG-NISTCSKIIPLEASSPVAKVFPATSPIAP 119 Query: 674 XXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTTW 495 EN ERSIALFFFEN+LDFSVARSSSYQ MIDAV K G GF GPS +TLKT W Sbjct: 120 PSLN--SQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMW 177 Query: 494 LENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDASS 315 LE IKSE+ LQSKD E+EWA TGCTIIA+TWTDNKSRALINFLVSSPSRTFFHKSVDASS Sbjct: 178 LERIKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASS 237 Query: 314 YYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCMN 135 Y+KN K L+DLFDS+IQDFG ENVVQ+I+D N G++NHILQNYG+IFV+PCASQC+N Sbjct: 238 YFKNTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLN 297 Query: 134 GILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 ILEEF KVDW++RCILQAQ +SK++YNN+SML +M+ FTG Q+ Sbjct: 298 LILEEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQE 341 >ref|XP_007038933.1| HAT transposon superfamily 
isoform 4 [Theobroma cacao] gi|508776178|gb|EOY23434.1| HAT transposon superfamily isoform 4 [Theobroma cacao] Length = 682 Score = 486 bits (1250), Expect = e-134 Identities = 237/346 (68%), Positives = 287/346 (82%) Frame = -2 Query: 1040 LSMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDV 861 +++VREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDV Sbjct: 3 MAVVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDV 62 Query: 860 TDKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXX 681 TD+VR I++SK++ +ET ++KKQ + E ++P +IS+ + ++ + A+ KVF Sbjct: 63 TDRVRAILSSKEEIKETSSVKKQKIAEARSPG-NISTCSKIIPLEASSPVAKVFPATSPI 121 Query: 680 XXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKT 501 EN ERSIALFFFEN+LDFSVARSSSYQ MIDAV K G GF GPS +TLKT Sbjct: 122 APPSLN--SQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKT 179 Query: 500 TWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDA 321 WLE IKSE+ LQSKD E+EWA TGCTIIA+TWTDNKSRALINFLVSSPSRTFFHKSVDA Sbjct: 180 MWLERIKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDA 239 Query: 320 SSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQC 141 SSY+KN K L+DLFDS+IQDFG ENVVQ+I+D N G++NHILQNYG+IFV+PCASQC Sbjct: 240 SSYFKNTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQC 299 Query: 140 MNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 +N ILEEF KVDW++RCILQAQ +SK++YNN+SML +M+ FTG Q+ Sbjct: 300 LNLILEEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQE 345 >ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593027 isoform X1 [Solanum tuberosum] gi|565367925|ref|XP_006350605.1| PREDICTED: uncharacterized protein LOC102593027 isoform X2 [Solanum tuberosum] Length = 675 Score = 485 bits (1248), Expect = e-134 Identities = 240/344 (69%), Positives = 284/344 (82%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKDVCWEYA++L+GNKVRCKFC RILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD Sbjct: 1 MVREKDVCWEYAEKLDGNKVRCKFCLRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 60 Query: 854 
KVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXXX 675 +VR+II SK E + KK L E K A +IS L++V ++F Sbjct: 61 RVRDIIGSK----EPPSTKKHKLIETKALA-NISPEKLLLSVEPITPIARIFPPIGQAIS 115 Query: 674 XXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTTW 495 + ENAERSIALFFFEN++DF VARSSSY QMI+AV KCGSGF+GPS +TLK TW Sbjct: 116 SSGN--NQENAERSIALFFFENKIDFGVARSSSYHQMIEAVGKCGSGFIGPSPETLKATW 173 Query: 494 LENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDASS 315 LE IKSE+SLQSKD+E+EWAMTGCT+IAETWTDNK +ALINFLVSSPSRTFF+KSVDASS Sbjct: 174 LERIKSEVSLQSKDVEKEWAMTGCTLIAETWTDNKMKALINFLVSSPSRTFFYKSVDASS 233 Query: 314 YYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCMN 135 Y+KN+K LS+LFDSIIQDFG ENVVQVI+D+ L+C G+ NHILQNYG++FV+PCASQC+N Sbjct: 234 YFKNLKCLSELFDSIIQDFGPENVVQVIVDNTLHCTGIVNHILQNYGNVFVSPCASQCIN 293 Query: 134 GILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 IL+EF K+DW++RCILQAQ +SK+IYNNS +L +M+ FTGGQ+ Sbjct: 294 AILDEFSKLDWVNRCILQAQSISKFIYNNSPLLDLMKKFTGGQE 337 >ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256946 [Solanum lycopersicum] Length = 739 Score = 484 bits (1247), Expect = e-134 Identities = 243/364 (66%), Positives = 293/364 (80%), Gaps = 3/364 (0%) Frame = -2 Query: 1085 III*MCRV*NWLNSCL---SMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHL 915 +I+ ++ + N C ++VREKDVCWEYA++LEGNKVRCKFC RILNGGISRLKHHL Sbjct: 45 VIVQKLKLIQFTNLCYFLPTVVREKDVCWEYAEKLEGNKVRCKFCLRILNGGISRLKHHL 104 Query: 914 SRLPSKGVNPCTKVRDDVTDKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALM 735 SRLPSKGVNPCTKVRDDVTD+VR+II SK E + KK L E K A +IS L+ Sbjct: 105 SRLPSKGVNPCTKVRDDVTDRVRDIIGSK----EPPSTKKHKLIETKALA-NISPEKPLL 159 Query: 734 AVGAAPISGKVFAXXXXXXXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDA 555 +V ++F + ENAERSIALFFFEN++DF VARSSSY QMI+A Sbjct: 160 SVEPITPIARIFPPIGQAISSSGN--NQENAERSIALFFFENKIDFGVARSSSYHQMIEA 217 Query: 554 VRKCGSGFLGPSADTLKTTWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALI 375 V KCGSGF+GPS +TLK TWLE IKSE+SLQSKD+E+EWAMTGCT+IAETWTDNK +ALI Sbjct: 218 
VGKCGSGFIGPSPETLKATWLERIKSEVSLQSKDVEKEWAMTGCTLIAETWTDNKMKALI 277 Query: 374 NFLVSSPSRTFFHKSVDASSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLAN 195 NFLVSSPSRTFF+KSVDASSY+KN+K LS+LFDSIIQDFG ENVVQVI+D+ L+C G+ N Sbjct: 278 NFLVSSPSRTFFYKSVDASSYFKNLKCLSELFDSIIQDFGPENVVQVIVDNTLHCTGIVN 337 Query: 194 HILQNYGSIFVTPCASQCMNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFT 15 HILQNYG++FV+PCASQC+N IL+EF K+DW++RCILQAQ +SK+IYNNS +L +M+ FT Sbjct: 338 HILQNYGNVFVSPCASQCINAILDEFSKLDWVNRCILQAQSLSKFIYNNSPLLDLMKKFT 397 Query: 14 GGQD 3 GGQ+ Sbjct: 398 GGQE 401 >ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101215128 [Cucumis sativus] Length = 784 Score = 484 bits (1247), Expect = e-134 Identities = 235/347 (67%), Positives = 284/347 (81%), Gaps = 2/347 (0%) Frame = -2 Query: 1037 SMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVT 858 S+VREKD+CWEYA++L+GNKV+CKFC R+LNGGISRLKHHLSRLPS+GVNPC+KVRDDV+ Sbjct: 103 SVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVS 162 Query: 857 DKVREIIASKDDTRETLTIKKQNLQEFKT--PAVSISSGNALMAVGAAPISGKVFAXXXX 684 D+VR I+A++++ +E T KKQ L E KT SIS +++++ KVF Sbjct: 163 DRVRAILATREEIKEASTGKKQKLAEVKTVESVPSISMCKSVVSIETPSPVAKVFPTVTP 222 Query: 683 XXXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLK 504 +HENAE+SIALF FEN+LDFS+ARSSSYQ MIDA+ KCG GF GPSA+TLK Sbjct: 223 MAPPSLH--NHENAEKSIALFXFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLK 280 Query: 503 TTWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVD 324 TTWLE IK+E+SLQSKDIE+EW TGCTII +TWTDNKSRALINF VSSPSRTFFHKSVD Sbjct: 281 TTWLERIKTEVSLQSKDIEKEWTTTGCTIIVDTWTDNKSRALINFXVSSPSRTFFHKSVD 340 Query: 323 ASSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQ 144 AS+Y+KN K L DLFDS+IQDFG ENVVQ+I+D LN G ANHILQ YG+IFV+PCASQ Sbjct: 341 ASTYFKNTKCLGDLFDSVIQDFGHENVVQIIMDSSLNYSGTANHILQTYGTIFVSPCASQ 400 Query: 143 CMNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 C+N ILEEF KVDW++RCILQAQ +SK++YN+SS+L +MR FTGGQ+ Sbjct: 401 CLNSILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQE 447 >ref|XP_003602175.1| 
Protein dimerization [Medicago truncatula] gi|355491223|gb|AES72426.1| Protein dimerization [Medicago truncatula] Length = 786 Score = 484 bits (1247), Expect = e-134 Identities = 233/344 (67%), Positives = 285/344 (82%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKDVCWEYA++L+GNKV+CKFC R+LNGGISRLKHHLSR PSKGVNPC+KVRDDVTD Sbjct: 107 MVREKDVCWEYAEKLDGNKVKCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTD 166 Query: 854 KVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXXX 675 +VR IIASK++ +ET ++KKQ + E +P S S+ AL+++ GK+F Sbjct: 167 RVRNIIASKEEVKETSSVKKQKVSEVISPG-SHSATKALISLDTTLPIGKMFPSSNPMTP 225 Query: 674 XXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTTW 495 + ENAERSIALFFFEN+LDFSVARSSSYQ MIDA+ KCG GF GPSA+ LKT W Sbjct: 226 SSTN--NQENAERSIALFFFENKLDFSVARSSSYQLMIDAITKCGPGFTGPSAEILKTIW 283 Query: 494 LENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDASS 315 LE IKSE+ LQSKD+E+EWA TGCTIIA+TWTD KS+A+INFLVSSPSR FFHKSVDAS+ Sbjct: 284 LERIKSEVGLQSKDVEKEWATTGCTIIADTWTDYKSKAIINFLVSSPSRIFFHKSVDASA 343 Query: 314 YYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCMN 135 Y+KN K+L+DLFDS+IQ+FG ENVVQ+I+D N G+ NHI+QNYG+IFV+PCASQC+N Sbjct: 344 YFKNTKWLADLFDSVIQEFGPENVVQIIMDSSFNYTGIGNHIVQNYGTIFVSPCASQCLN 403 Query: 134 GILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 ILEEF K+DWISRCILQAQ +SK IYNN+S+L +M++++GGQ+ Sbjct: 404 LILEEFTKIDWISRCILQAQTISKLIYNNASLLDLMKSYSGGQE 447 >ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618477 [Citrus sinensis] Length = 764 Score = 483 bits (1244), Expect = e-134 Identities = 238/343 (69%), Positives = 284/343 (82%) Frame = -2 Query: 1037 SMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVT 858 ++VREKD+CWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDVT Sbjct: 88 AVVREKDICWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVT 147 Query: 857 DKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXX 678 D+VR IIASK+D +ET KKQ + E K + SS + + +P++ KVFA Sbjct: 148 
DRVRAIIASKEDVKETPIGKKQRVAEAKPVGIVCSSKSLMPLETPSPVT-KVFATMTPMG 206 Query: 677 XXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTT 498 + ENAERSIALFFFEN+LDF+VARSSSYQQMIDAV KCG GF GPSA+ LKT Sbjct: 207 NSSLN--NQENAERSIALFFFENKLDFAVARSSSYQQMIDAVGKCGPGFTGPSAEALKTM 264 Query: 497 WLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDAS 318 WL+ IKSE+++QSKDIE+EWAMTGCTIIA+TWTDNKS+ALINFLVSSPSRTFF KSVD S Sbjct: 265 WLDRIKSEVNVQSKDIEKEWAMTGCTIIADTWTDNKSKALINFLVSSPSRTFFLKSVDTS 324 Query: 317 SYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCM 138 S +KN KYL+D+FDS+IQD G ENVVQ+I+D N G+ANHILQNYG+IFV+PCASQ + Sbjct: 325 SNFKNTKYLADIFDSVIQDIGPENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQSL 384 Query: 137 NGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGG 9 N ILEEF KVDW++RCILQAQ +SK+IYNN+SML +M+ FTGG Sbjct: 385 NIILEEFSKVDWVNRCILQAQTISKFIYNNASMLDLMKKFTGG 427 >ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808813 isoform X1 [Glycine max] gi|571460166|ref|XP_006581619.1| PREDICTED: uncharacterized protein LOC100808813 isoform X2 [Glycine max] Length = 679 Score = 483 bits (1244), Expect = e-134 Identities = 233/344 (67%), Positives = 288/344 (83%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSR PSKGVNPC+KVRDDVTD Sbjct: 1 MVREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTD 60 Query: 854 KVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXXX 675 +VR IIASK++ +ET + KKQ + E K+P+ ++S+ AL+++ AA K+F Sbjct: 61 RVRGIIASKEEVKETSSAKKQKIAEVKSPS-NLSASKALVSLDAASPVMKIFPTGHPMTP 119 Query: 674 XXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTTW 495 + E AERSIALFFFEN+LDFSVARSSSYQ MIDA+ KCG GF GPSA+TLKT W Sbjct: 120 SSTN--NQEIAERSIALFFFENKLDFSVARSSSYQLMIDAIAKCGPGFTGPSAETLKTIW 177 Query: 494 LENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDASS 315 LE +KSE+ LQ+KD+E+EWA TGCTI+A+TWTD KS+A+INFLVSSPSRTFFHKSVDAS+ Sbjct: 178 LERMKSEVGLQTKDVEKEWATTGCTILADTWTDYKSKAIINFLVSSPSRTFFHKSVDASA 
237 Query: 314 YYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCMN 135 Y+KN K+L+DLFDS+IQ+FG ENVVQ+I+D +N +ANHI+Q+YG+IFV+PCASQC+N Sbjct: 238 YFKNTKWLADLFDSVIQEFGPENVVQIIMDSSVNYTVIANHIVQSYGTIFVSPCASQCLN 297 Query: 134 GILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 ILEEF KVDWISRCILQAQ +SK IYNN+S+L + + +TGGQ+ Sbjct: 298 LILEEFSKVDWISRCILQAQTISKLIYNNASLLDLTKKYTGGQE 341 >ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, partial [Capsella rubella] gi|482569482|gb|EOA33670.1| hypothetical protein CARUB_v10019846mg, partial [Capsella rubella] Length = 768 Score = 440 bits (1131), Expect = e-121 Identities = 218/353 (61%), Positives = 267/353 (75%), Gaps = 8/353 (2%) Frame = -2 Query: 1037 SMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVT 858 SMVREKD+CWEYA++L+GNKV+CKFC R+LNGGISRLKHHLSRLPSKGVNPC KVRDDVT Sbjct: 99 SMVREKDICWEYAEKLDGNKVKCKFCSRVLNGGISRLKHHLSRLPSKGVNPCAKVRDDVT 158 Query: 857 DKVREIIASKDDTRET-LTIKKQNLQEFKTPA------VSISSGNALMAVGA-APISGKV 702 D+VR I+A+KDD +++ LT K E K P V++SSG+ L AP + Sbjct: 159 DRVRSILAAKDDPKDSPLTTNKYKPPEVKPPLSASLLPVTVSSGSKLFPTSILAPPTPNA 218 Query: 701 FAXXXXXXXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGP 522 AERSI+LFFFEN++D+ VARS SY M+DA+ KCG F P Sbjct: 219 QVI----------------AERSISLFFFENKIDWCVARSPSYHHMLDAIAKCGPAFFAP 262 Query: 521 SADTLKTTWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTF 342 S +LKT WL+ +KSE+SLQ KD E+EW TGCTIIAE WTDNKSRALINF VSSPSR F Sbjct: 263 SPLSLKTEWLDRVKSEISLQLKDSEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIF 322 Query: 341 FHKSVDASSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFV 162 FHKSVDASSY+KN K L+DLFDS+IQD G E++VQ+I+D+ + G++NHILQNYGSIFV Sbjct: 323 FHKSVDASSYFKNTKCLADLFDSVIQDIGQEHIVQIIMDNSFSYTGISNHILQNYGSIFV 382 Query: 161 TPCASQCMNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 +PCASQC++ ILEEF KVDW+++CI QAQV+SK++YNN +L +MR TGGQD Sbjct: 383 SPCASQCLSIILEEFSKVDWVNQCISQAQVISKFVYNNRPVLDLMRKLTGGQD 435 >ref|NP_178092.4| hAT family dimerization domain-containing protein [Arabidopsis thaliana] 
gi|332198172|gb|AEE36293.1| hAT family dimerization domain-containing protein [Arabidopsis thaliana] Length = 651 Score = 424 bits (1091), Expect = e-116 Identities = 211/344 (61%), Positives = 258/344 (75%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKD+CWEYA++L+GNKV+CKFC R+LNGGISRLKHHLSRLPSKGVNPC KVRDDVTD Sbjct: 1 MVREKDICWEYAEKLDGNKVKCKFCSRVLNGGISRLKHHLSRLPSKGVNPCAKVRDDVTD 60 Query: 854 KVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXXX 675 +VR I+++KDD T ++K P L AP S VF Sbjct: 61 RVRSILSAKDDPPIT--------NKYKPPP-------PLSPPFDAPASKLVFPSSPPNA- 104 Query: 674 XXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTTW 495 + AERSI+LFFFEN++DF+VARS SY M+DAV KCG GF+ PS KT W Sbjct: 105 -------QDIAERSISLFFFENKIDFAVARSPSYHHMLDAVAKCGPGFVAPSP---KTEW 154 Query: 494 LENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDASS 315 L+ +KS++SLQ KD E+EW TGCTIIAE WTDNKSRALINF VSSPSR FFHKSVDASS Sbjct: 155 LDRVKSDISLQLKDTEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIFFHKSVDASS 214 Query: 314 YYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCMN 135 Y+KN K L+DLFDS+IQD G E++VQ+I+D+ G++NH+LQNY +IFV+PCASQC+N Sbjct: 215 YFKNSKCLADLFDSVIQDIGQEHIVQIIMDNSFCYTGISNHLLQNYATIFVSPCASQCLN 274 Query: 134 GILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 ILEEF KVDW+++CI QAQV+SK++YNNS +L ++R TGGQD Sbjct: 275 IILEEFSKVDWVNQCISQAQVISKFVYNNSPVLDLLRKLTGGQD 318 >ref|XP_007218857.1| hypothetical protein PRUPE_ppa002763mg [Prunus persica] gi|462415319|gb|EMJ20056.1| hypothetical protein PRUPE_ppa002763mg [Prunus persica] Length = 636 Score = 412 bits (1059), Expect = e-112 Identities = 210/344 (61%), Positives = 249/344 (72%) Frame = -2 Query: 1034 MVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 855 MVREKDVCWEYA++L+GNKVRCKFC R+LNGGISRLKHHLSRLPSKGVNPC+KVRDDVTD Sbjct: 1 MVREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 60 Query: 854 KVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVFAXXXXXXX 675 +VR IIASK++ +ET + KKQ L E K+P ++S+ ALM+ 
KVF Sbjct: 61 RVRTIIASKEEVKETSSGKKQKLVEVKSPG-NVSASKALMSFDTPTPIQKVFPNVTPMVP 119 Query: 674 XXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPSADTLKTTW 495 + ENAER+IALFFFEN+LDFS+ARSSSYQ MIDA+ KCG GF+GPSA+TLKTTW Sbjct: 120 PPLN--NQENAERNIALFFFENKLDFSIARSSSYQLMIDAIEKCGPGFIGPSAETLKTTW 177 Query: 494 LENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFFHKSVDASS 315 LE IKSE+SLQSKDIE+EW TGCTIIA+TWTDNKSRALINFL Sbjct: 178 LERIKSEMSLQSKDIEKEWTTTGCTIIADTWTDNKSRALINFL----------------- 220 Query: 314 YYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVTPCASQCMN 135 +I+D N G+ANHILQNY +IFV+PCASQC+N Sbjct: 221 --------------------------IIMDSSFNYTGVANHILQNYATIFVSPCASQCLN 254 Query: 134 GILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 ILEEF KVDW++RCILQAQ +SK+IYNN+SML +M+ FTGGQ+ Sbjct: 255 LILEEFSKVDWVNRCILQAQTISKFIYNNASMLDLMKKFTGGQE 298 >ref|XP_007038930.1| HAT transposon superfamily isoform 1 [Theobroma cacao] gi|508776175|gb|EOY23431.1| HAT transposon superfamily isoform 1 [Theobroma cacao] Length = 640 Score = 384 bits (986), Expect = e-104 Identities = 192/292 (65%), Positives = 234/292 (80%) Frame = -2 Query: 878 KVRDDVTDKVREIIASKDDTRETLTIKKQNLQEFKTPAVSISSGNALMAVGAAPISGKVF 699 KVRDDVTD+VR I++SK++ +ET ++KKQ + E ++P +IS+ + ++ + A+ KVF Sbjct: 15 KVRDDVTDRVRAILSSKEEIKETSSVKKQKIAEARSPG-NISTCSKIIPLEASSPVAKVF 73 Query: 698 AXXXXXXXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMIDAVRKCGSGFLGPS 519 EN ERSIALFFFEN+LDFSVARSSSYQ MIDAV K G GF GPS Sbjct: 74 PATSPIAPPSLN--SQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPS 131 Query: 518 ADTLKTTWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRALINFLVSSPSRTFF 339 +TLKT WLE IKSE+ LQSKD E+EWA TGCTIIA+TWTDNKSRALINFLVSSPSRTFF Sbjct: 132 VETLKTMWLERIKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFF 191 Query: 338 HKSVDASSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGLANHILQNYGSIFVT 159 HKSVDASSY+KN K L+DLFDS+IQDFG ENVVQ+I+D N G++NHILQNYG+IFV+ Sbjct: 192 HKSVDASSYFKNTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVS 251 Query: 158 PCASQCMNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRNFTGGQD 3 
PCASQC+N ILEEF KVDW++RCILQAQ +SK++YNN+SML +M+ FTG Q+ Sbjct: 252 PCASQCLNLILEEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQE 303 >gb|EEC70201.1| hypothetical protein OsI_00947 [Oryza sativa Indica Group] Length = 1045 Score = 340 bits (873), Expect = 5e-91 Identities = 172/366 (46%), Positives = 246/366 (67%), Gaps = 20/366 (5%) Frame = -2 Query: 1040 LSMVREKDVCWEYADRLEGNKVRCKFCDRILNGGISRLKHHLSRLPSKGVNPCTKVRDDV 861 L+++RE+DVCWEY D++EGNKVRC+FC ++LNGGISRLK HLS++ SKGVNPCTKV+ DV Sbjct: 351 LTILRERDVCWEYCDKMEGNKVRCRFCYKVLNGGISRLKFHLSQISSKGVNPCTKVKPDV 410 Query: 860 TDKVREIIASKDDTRETLTIKKQNLQEFK--------------------TPAVSISSGNA 741 +KV+ +IA+K++ RET +K+Q E +PA++ +S Sbjct: 411 IEKVKAVIAAKEEHRETQVLKRQRDTELSVRPRRIRDLPSQPTSPERATSPAITSTSDQT 470 Query: 740 LMAVGAAPISGKVFAXXXXXXXXXXXXSDHENAERSIALFFFENRLDFSVARSSSYQQMI 561 A +S V AER IA FFFEN+LD+++A S SY+ M+ Sbjct: 471 QFL--ALEVSTPVLKLSSVTNKARSAP--QSEAERCIAEFFFENKLDYNIADSVSYRHMM 526 Query: 560 DAVRKCGSGFLGPSADTLKTTWLENIKSELSLQSKDIEREWAMTGCTIIAETWTDNKSRA 381 +A+ G GF GPSA+ LKT WL +KSE+ ++K+IE++WA TGCTI+A++WTDNKS+A Sbjct: 527 EALG--GQGFRGPSAEVLKTKWLHKLKSEVLQKTKEIEKDWATTGCTILADSWTDNKSKA 584 Query: 380 LINFLVSSPSRTFFHKSVDASSYYKNIKYLSDLFDSIIQDFGAENVVQVIIDDELNCPGL 201 LINF VSSP TFF K+VDAS + K+ L +LFD +I++ G +NVVQ+I D +N + Sbjct: 585 LINFSVSSPLGTFFLKTVDASPHIKS-HQLYELFDDVIREVGPDNVVQIITDRNINYGSV 643 Query: 200 ANHILQNYGSIFVTPCASQCMNGILEEFCKVDWISRCILQAQVVSKYIYNNSSMLIMMRN 21 I+QNY +IF +PCAS C+N +L++F K+DW++RCI QAQ +++++YNN +L +MR Sbjct: 644 DKLIMQNYNTIFWSPCASSCVNSMLDDFSKIDWVNRCICQAQTITRFVYNNKWVLDLMRK 703 Query: 20 FTGGQD 3 GQ+ Sbjct: 704 CIAGQE 709
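Each alignment record in this report carries a statistics line of the form "Score = ... bits (...), Expect = ... Identities = ... Positives = ... Frame = ...". The sketch below shows one way those fields could be pulled out of the text with a regular expression. It is an illustration only, not part of BLAST or of any official parser: the names STATS_RE, parse_evalue, parse_hsp_stats and SAMPLE are hypothetical, and the pattern assumes the legacy BLASTX 2.2.x wording seen in this report, where an E-value such as 1e-141 is abbreviated to "e-141" and the "Gaps" field is present only when the alignment contains gaps.

```python
import re

# Hypothetical pattern for the per-alignment statistics of legacy BLASTX
# text output. \s* is used between fields so it works both on the original
# multi-line layout and on text whose line breaks were collapsed.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[^\s,]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-141' means 1e-141."""
    return float("1" + text if text.startswith("e") else text)


def parse_hsp_stats(report):
    """Return one dict per statistics line found in the report text."""
    records = []
    for m in STATS_RE.finditer(report):
        ident, length = int(m["ident"]), int(m["length"])
        records.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identities": ident,
            "align_len": length,
            # Recomputed from the counts; BLAST prints a whole-number percentage.
            "pct_identity": round(100.0 * ident / length, 1),
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"] or 0),  # field is absent for ungapped hits
            "frame": int(m["frame"]),
        })
    return records


# Two statistics lines copied from the report above (last and first hit).
SAMPLE = (
    "Score = 340 bits (873), Expect = 5e-91 Identities = 172/366 (46%), "
    "Positives = 246/366 (67%), Gaps = 20/366 (5%) Frame = -2 "
    "Score = 506 bits (1302), Expect = e-141 Identities = 248/348 (71%), "
    "Positives = 293/348 (84%) Frame = -2"
)

if __name__ == "__main__":
    for rec in parse_hsp_stats(SAMPLE):
        print(rec)
```

Because the pattern does not depend on line breaks, the same function can be applied to the whole report as a single string. For routine work an established BLAST parser is likely a safer choice than a hand-written regular expression.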