BLASTX nr result
ID: Mentha29_contig00009665
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00009665
         (1292 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   557   e-156
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       556   e-156
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   551   e-154
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   503   e-140
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   488   e-135
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   478   e-132
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   476   e-132
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   475   e-131
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   473   e-130
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   473   e-130
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         472   e-130
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   470   e-130
ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310...   468   e-129
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   467   e-129
ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prun...   461   e-127
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   454   e-125
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...   447   e-123
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   445   e-122
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...
442 e-121 ref|NP_188861.2| hAT dimerization domain-containing protein [Ara... 409 e-111 >ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum] Length = 755 Score = 557 bits (1435), Expect = e-156 Identities = 265/429 (61%), Positives = 330/429 (76%) Frame = +3 Query: 6 VIPAGSFKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLR 185 ++P K+ N+ V+MAV RF D +P D+ NS YFQPMID IASQG + PSYH+LR Sbjct: 170 LLPINQSKRVNNHVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPQVSAPSYHELR 229 Query: 186 NSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXX 365 + +LK + EVR D+DQC + W R+GCS+LVDE +GKGKT +NF YC EGT+FL Sbjct: 230 SWVLKASVQEVRNDIDQCSSTWARSGCSVLVDEWITGKGKTLLNFLVYCPEGTMFLRSVD 289 Query: 366 XXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGH 545 LYEL+KE+VEEVG+RNVLQVVT+ E+RY+IAGKRLTD YP++FWTPCA H Sbjct: 290 ASTLINSTDYLYELLKEVVEEVGVRNVLQVVTSNEERYIIAGKRLTDAYPTLFWTPCAAH 349 Query: 546 CIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDF 725 IDLML+D+ + + +++QA+SISR+IY+N +++MMR++T GVDLVD+G TRS TDF Sbjct: 350 SIDLMLEDLKKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDF 409 Query: 726 MTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLL 905 +TLKR+VNI+H+LQSMV S EW ES YSK E FA+ D I NQSFWS+C+ + RLTDP+L Sbjct: 410 LTLKRMVNIKHNLQSMVTSVEWAESPYSKKPEGFALLDYIGNQSFWSTCSLVCRLTDPIL 469 Query: 906 RLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAA 1085 RL R+V S + PAM YV+AG+YRAKE IKKEL K++Y YW+IID RWE LQRHPLHAA Sbjct: 470 RLLRMVSSEERPAMAYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAA 529 Query: 1086 GFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMA 1265 GFYLNPKFFY+ E D H HI+SLV DCIE+LVPD K+ DKI+KE SY AGDFGRKMA Sbjct: 530 GFYLNPKFFYTTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMA 589 Query: 1266 IRARDTLLP 1292 +RARDTL P Sbjct: 590 VRARDTLFP 598 >gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea] Length = 724 Score = 556 bits (1433), Expect = e-156 Identities = 275/430 (63%), Positives = 335/430 (77%) Frame = +3 
Query: 3 AVIPAGSFKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDL 182 A++ S + + V+MAVGRFF DVGLPA++ANS YFQPM++AIASQ A +GPSY DL Sbjct: 161 ALMSLPSVQPCSKKVHMAVGRFFVDVGLPAEAANSAYFQPMVEAIASQEAGVIGPSYQDL 220 Query: 183 RNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXX 362 R+ ILKN++HE RYDVDQ AW RTGC++LVD+ SGKG+TFVNFF Y SE TIF Sbjct: 221 RSWILKNLVHETRYDVDQYANAWERTGCTVLVDDWNSGKGETFVNFFVYNSEATIFYRSA 280 Query: 363 XXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAG 542 LYEL+KE VE++G++NVLQV+T+ ED+Y AGKRL TYPS+FW+PCAG Sbjct: 281 NVSHGIVSADDLYELLKETVEQIGVKNVLQVITSCEDQYAFAGKRLATTYPSVFWSPCAG 340 Query: 543 HCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTD 722 C+DLMLQD+ P VK+ L+QA+SISRYIYSN V+NM+RR+TFG+DL+D G T S T+ Sbjct: 341 LCVDLMLQDMEHLPMVKVTLEQAKSISRYIYSNGFVLNMLRRHTFGLDLLDEGITPSSTN 400 Query: 723 FMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPL 902 FMTLKR++++RH LQSMV SE+W +S +S+ E FA+ D++++QSFWS+CASI L DPL Sbjct: 401 FMTLKRMLSMRHHLQSMVTSEDWIQSPHSQKPEGFALLDTMTSQSFWSACASITNLIDPL 460 Query: 903 LRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHA 1082 LRL RI+ S K PAMGYV+AGLYRAKEAIKK E+YL Y +IID RWEQLQ+HPLH Sbjct: 461 LRLLRIISSGKKPAMGYVYAGLYRAKEAIKKHF-VSEDYLVYLNIIDRRWEQLQQHPLHG 519 Query: 1083 AGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKM 1262 AGFYLNPKFFYSLEGD +S+V DCIERLVPD +V DKIMKE YH G GDFGRKM Sbjct: 520 AGFYLNPKFFYSLEGDALLRSRSMVYDCIERLVPDPEVQDKIMKEMTYYHGGVGDFGRKM 579 Query: 1263 AIRARDTLLP 1292 AIRARDTLLP Sbjct: 580 AIRARDTLLP 589 >ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum lycopersicum] Length = 748 Score = 551 bits (1419), Expect = e-154 Identities = 263/422 (62%), Positives = 325/422 (77%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 K+ N+ V+MAV RF D +P D+ NS YFQPMID IASQG PSYHDLR+ +LK+ Sbjct: 170 KRVNNQVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPPVSAPSYHDLRSWVLKSS 229 Query: 207 
IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EVR D+DQC + W RTGCS+L+DE +GKGK +NF YC +GT+FL Sbjct: 230 VQEVRTDIDQCSSTWARTGCSVLIDELITGKGKILLNFLVYCPQGTMFLRSVDASTLINS 289 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LYEL+KE+V+E+G+RNVLQVVT+ E+RYVIAGKRLTD YP++FWTPCA H IDLML+ Sbjct: 290 TDYLYELLKEVVDEIGVRNVLQVVTSNEERYVIAGKRLTDAYPTLFWTPCAAHSIDLMLE 349 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D + + +++QA+SISR+IY+N +++MMR++T GVDLVD+G TRS TDF+TLKR+ Sbjct: 350 DFNKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMQ 409 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 NI+H+LQSMV S EW ES YSK E FA+ D ISNQSFWS+C+ I RLTDP+LRL R+V Sbjct: 410 NIKHNLQSMVTSVEWAESPYSKKPEGFALLDYISNQSFWSTCSLICRLTDPILRLLRMVS 469 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S + PAM YV+AG+YRAKE IKKEL K++Y YW+IID RWE LQRHPLHAAGFYLNPK Sbjct: 470 SEERPAMPYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPK 529 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFY+ E D H HI+SLV DCIE+LVPD K+ DKI+KE SY AGDFGRKMA+RARDTL Sbjct: 530 FFYTTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTL 589 Query: 1287 LP 1292 P Sbjct: 590 FP 591 >ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis] Length = 745 Score = 503 bits (1296), Expect = e-140 Identities = 235/421 (55%), Positives = 319/421 (75%) Frame = +3 Query: 30 KANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVI 209 + N+ + MAVGRF +D+G P D+ NS YFQPM+DAIAS G EA PSYHD+R ILKN + Sbjct: 175 RGNNPIFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIASGGPEAAMPSYHDIRGWILKNSV 234 Query: 210 HEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXX 389 EV+ DVD+ WG+TGCSILVD+ + G+T + F AYC EGT+FL Sbjct: 235 EEVKNDVDRYTTTWGKTGCSILVDQWNTEAGRTLLCFLAYCPEGTVFLKSVDASGIMNSS 294 Query: 390 XVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQD 569 LYEL+K++VEEVG+R+VLQV+T+ E++++ 
AG+RLTDT+P+++WTPCA C+DL+L+D Sbjct: 295 DALYELLKQVVEEVGVRHVLQVITSSEEQFIAAGRRLTDTFPTLYWTPCAARCLDLILED 354 Query: 570 IAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVN 749 A+ + +++QAR+++R++Y+++ V+NM+RRYTFG D+V+ G TRS T+F TL+R+++ Sbjct: 355 FAKLEWINAIIEQARAVTRFVYNHSVVLNMLRRYTFGNDIVEPGITRSATNFTTLRRMIS 414 Query: 750 IRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRS 929 ++ +LQ+MV S+EW + YSK + D +SNQSFWSSC I+ LT+PLLRL RIV S Sbjct: 415 LKPNLQAMVTSQEWMDCPYSKKPGGLEMLDIVSNQSFWSSCGLIVCLTNPLLRLLRIVGS 474 Query: 930 LKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKF 1109 + P++GYV+AG+YRAK+A+KKEL ++EY+ YW+IID WEQL PLHAAGF+LNPKF Sbjct: 475 ERRPSIGYVYAGMYRAKDALKKELIKRDEYMVYWNIIDHWWEQLWHLPLHAAGFFLNPKF 534 Query: 1110 FYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLL 1289 FYS++GD H+ I S + DCIERLVPD KV DKI KE Y GDFGRKMAIRARDTLL Sbjct: 535 FYSIKGDIHNEIVSRMFDCIERLVPDTKVQDKISKEINLYKDAVGDFGRKMAIRARDTLL 594 Query: 1290 P 1292 P Sbjct: 595 P 595 >ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis] gi|223536481|gb|EEF38128.1| DNA binding protein, putative [Ricinus communis] Length = 753 Score = 488 bits (1256), Expect = e-135 Identities = 232/422 (54%), Positives = 313/422 (74%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 K+ N V+MA+GRF +D+G P D+ NS YFQPM+DAIAS G + PS HDLR ILKN Sbjct: 182 KRVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGLDVGMPSCHDLRGWILKNS 241 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ +VD+ +A W RTGCS+LVD+ + G+T ++F YCSEG +FL Sbjct: 242 VEEVKTEVDKHMATWARTGCSVLVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDASDIINS 301 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LYEL+K++VEEVG+R+VLQV+T++E++Y++ G+RLTDT+P+++ PCA HCIDL+L+ Sbjct: 302 SDALYELIKKVVEEVGVRHVLQVITSMEEQYIVVGRRLTDTFPTLYRAPCAAHCIDLILE 361 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D A+ + V+ QARSI+R++Y+++ V+NM++RYTFG ++V G T T+F TLKR+V Sbjct: 362 
DFAKLEWISTVILQARSITRFVYNHSVVLNMVKRYTFGSEIVATGLTHFATNFETLKRMV 421 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++H+LQ+MV S+EW + YSK + D +SNQSFWSSC I LT+PLLRL RIV Sbjct: 422 DLKHTLQTMVTSQEWMDCPYSKKPRGLEMLDLLSNQSFWSSCVLITNLTNPLLRLLRIVS 481 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S K P MGYV+AG+YRAKEAIKKEL +++Y+ YW+IID WEQ PLHAAGF+LNPK Sbjct: 482 SKKRPPMGYVYAGIYRAKEAIKKELVKRKDYMVYWNIIDHWWEQQSNLPLHAAGFFLNPK 541 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 YS+EGD H+ I S + DCIE+LVPD+ V DKI KE SY +GDFGRKMA+RAR+TL Sbjct: 542 VLYSIEGDLHNEILSGMFDCIEKLVPDVTVQDKITKEINSYKNASGDFGRKMAVRARETL 601 Query: 1287 LP 1292 LP Sbjct: 602 LP 603 >ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca subsp. vesca] Length = 754 Score = 478 bits (1231), Expect = e-132 Identities = 231/424 (54%), Positives = 307/424 (72%), Gaps = 2/424 (0%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 +K NS V+ A+GRF FD+G P ++ NS YFQPMIDAIAS G P+ HDLR+ ILKN Sbjct: 172 RKVNSYVHEAIGRFLFDIGAPPEAVNSAYFQPMIDAIASGGPGMEPPTCHDLRSWILKNS 231 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + E R ++D+ A WGRTGCSILVD+ + ++F Y EGT+FL Sbjct: 232 VEEARNNIDKHRATWGRTGCSILVDQWNTELDNVMLSFLVYSPEGTVFLESVDASAIINS 291 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LY+L++ +VE+VG+ +V+QV+T+ E+++V+AG+RL DT+P++FW PCA C+DL+L+ Sbjct: 292 SDALYDLLRRVVEDVGVGDVVQVITSGEEQFVVAGRRLADTFPNLFWIPCAARCLDLILE 351 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D + V++QARSI++++Y++ V+N++RR TFG D+V+ G TR T F TLKR+V Sbjct: 352 DFGSLDWIHAVIEQARSITKFVYNHNVVLNLVRRSTFGNDIVEPGVTRFGTSFTTLKRLV 411 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSIS--NQSFWSSCASIIRLTDPLLRLFRI 920 +++H LQ MV S+EW + YSK+ + D IS +QSFWSSC I+RLT PLLR+ R+ Sbjct: 412 DLKHCLQVMVTSQEWMDCPYSKEPGGLEISDLISDRDQSFWSSCTLIVRLTSPLLRVLRM 471 Query: 921 
VRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLN 1100 V K PAMG+++AG+YRAKEAIKKEL +EEY+ YW+IID RWEQ PLHAAGFYLN Sbjct: 472 VGCEKRPAMGFIYAGMYRAKEAIKKELVKREEYMVYWNIIDQRWEQHWNFPLHAAGFYLN 531 Query: 1101 PKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARD 1280 PK FYS+EGD H+ IQS + DCIER+VPD+KV DKIMKE SY AGDF RKMAIRARD Sbjct: 532 PKIFYSIEGDIHNSIQSGMYDCIERMVPDIKVQDKIMKEIISYKNAAGDFRRKMAIRARD 591 Query: 1281 TLLP 1292 TLLP Sbjct: 592 TLLP 595 >ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine max] gi|571489936|ref|XP_006591345.1| PREDICTED: uncharacterized protein LOC100817502 isoform X2 [Glycine max] gi|571489939|ref|XP_006591346.1| PREDICTED: uncharacterized protein LOC100817502 isoform X3 [Glycine max] Length = 759 Score = 476 bits (1225), Expect = e-132 Identities = 225/422 (53%), Positives = 307/422 (72%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 KK ++ + MA+GRF +D+G P D+ NS YFQ M+DAIAS+G P +H+LR ILKN Sbjct: 179 KKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNS 238 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ D+D+C WGRTGCSILVD+ T+ GK ++F AYC EG +FL Sbjct: 239 VEEVKNDIDRCKMTWGRTGCSILVDQWTTETGKILISFLAYCPEGLVFLRSLDATEISTS 298 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LY+L+K++VEEVG V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+ Sbjct: 299 ADFLYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILE 358 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D + V++QARS++R++Y+ +A++NM++RYT G D+VD + T+F TLKR+V Sbjct: 359 DFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMV 418 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++H+LQ++V S+EW +S YSK + D +SNQ+FWSSC I+ LT PLL++ RI Sbjct: 419 DLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIAS 478 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S PAMGYV+AG+YRAKEAIKK 
L +EEY+ YW+II RWE+L HPLHAAGFYLNPK Sbjct: 479 SEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 538 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFYS++GD H I S + DCIERLVPD ++ DKI+KE Y +GDFGRKMA+RARD L Sbjct: 539 FFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNL 598 Query: 1287 LP 1292 LP Sbjct: 599 LP 600 >ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] Length = 749 Score = 475 bits (1223), Expect = e-131 Identities = 225/422 (53%), Positives = 310/422 (73%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 K+ N+ V++A+GRF FD+G P D+ NS YFQPM+DAI S G+ + PS DL+ ILK Sbjct: 174 KRVNNHVHVAIGRFLFDIGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWILKKS 233 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ D D+ AAW RTGCSILV++ + G+ +NF YC EGT+FL Sbjct: 234 VEEVKSDNDKVTAAWVRTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSVINS 293 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LYEL+K++VEEVG ++VLQV+T E++Y++AG+RL +T+P+++WTPCA HCI+L+L+ Sbjct: 294 SDALYELLKQVVEEVGSKHVLQVITNAEEQYIVAGRRLAETFPTLYWTPCAAHCINLILE 353 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D A+ + ++++QARSI+R++Y+++ V+NM+RRYT G D+V+ T S T+F TLK+++ Sbjct: 354 DFAKLEWINVIIEQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMI 413 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++++LQ+MV S+EW + YSK + D +SN SFWSS I +LT+PLLR+ R+V Sbjct: 414 DLKNNLQAMVTSQEWMDCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVG 473 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S K PAMGYV+AG+YRAKE IKKEL + EY+ YW+IID WEQ HPLH AGFYLNPK Sbjct: 474 SKKRPAMGYVYAGMYRAKETIKKELVKRNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPK 533 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 
FFYS+EGD + + S + DCIE+LVPD+KV DKI KE SY GDFGRKMA+RARDTL Sbjct: 534 FFYSMEGDMPNEMLSGMLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTL 593 Query: 1287 LP 1292 LP Sbjct: 594 LP 595 >ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] gi|561036895|gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 756 Score = 473 bits (1216), Expect = e-130 Identities = 216/422 (51%), Positives = 308/422 (72%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 K+ ++ ++MA+GRF +D+G P D+ NS YF M+DAI+S+GA PS+H+LR ILKN Sbjct: 177 KRVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNS 236 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ D+D+C WGRTGCSILVD+ + G+ ++F AYC EG +FL Sbjct: 237 VEEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTS 296 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LY+++K++V+EVG+ VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+ Sbjct: 297 ADFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILE 356 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D + V++QA+S++R++Y+ +A++ M++RYT G D+VD ++ T+F TLKR+V Sbjct: 357 DFGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMV 416 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++H+LQ++V S+EW + YSK + D +S+Q+FWSSC I+RLT PLL++ RI Sbjct: 417 DLKHNLQALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIAS 476 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S PAMGY++AG+YRAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPK Sbjct: 477 SEMRPAMGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 536 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFYS++GD H I S + DCIERLV D ++ DKI+KE Y AGDFGRKMA+RARD L Sbjct: 537 FFYSIQGDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNL 596 Query: 1287 LP 1292 LP Sbjct: 597 LP 598 >ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g 
[Phaseolus vulgaris] gi|561036894|gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 869 Score = 473 bits (1216), Expect = e-130 Identities = 216/422 (51%), Positives = 308/422 (72%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 K+ ++ ++MA+GRF +D+G P D+ NS YF M+DAI+S+GA PS+H+LR ILKN Sbjct: 290 KRVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNS 349 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ D+D+C WGRTGCSILVD+ + G+ ++F AYC EG +FL Sbjct: 350 VEEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTS 409 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LY+++K++V+EVG+ VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+ Sbjct: 410 ADFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILE 469 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D + V++QA+S++R++Y+ +A++ M++RYT G D+VD ++ T+F TLKR+V Sbjct: 470 DFGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMV 529 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++H+LQ++V S+EW + YSK + D +S+Q+FWSSC I+RLT PLL++ RI Sbjct: 530 DLKHNLQALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIAS 589 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S PAMGY++AG+YRAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPK Sbjct: 590 SEMRPAMGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 649 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFYS++GD H I S + DCIERLV D ++ DKI+KE Y AGDFGRKMA+RARD L Sbjct: 650 FFYSIQGDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNL 709 Query: 1287 LP 1292 LP Sbjct: 710 LP 711 >gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. 
melo] Length = 752 Score = 472 bits (1215), Expect = e-130 Identities = 230/433 (53%), Positives = 308/433 (71%), Gaps = 4/433 (0%) Frame = +3 Query: 6 VIPAGS----FKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSY 173 VIP G + + V+MA+GRF +D+G ++ NS YFQPMI++IA G + PSY Sbjct: 166 VIPNGGGILDSNRDRNQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSY 225 Query: 174 HDLRNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFL 353 HD+R ILKN + EVR D D+C A WG TGCS++VD+ + G+T +NF YC +GT+FL Sbjct: 226 HDIRGWILKNSVEEVRGDFDRCKATWGMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFL 285 Query: 354 XXXXXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTP 533 +LYEL+K++VE+VG+++V+QV+T E+ + IAG++L+DTYP+++WTP Sbjct: 286 ESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTP 345 Query: 534 CAGHCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRS 713 CA C+DL+L DI V V++QARSI+R++Y+N+ V+NM+R+ TFG D+V+ TRS Sbjct: 346 CAASCVDLILADIGNIEDVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRS 405 Query: 714 FTDFMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLT 893 T+F TL R+V+++ LQ+MV S+EW +S YSK + D IS++SFWSSC SIIRLT Sbjct: 406 ATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIIRLT 465 Query: 894 DPLLRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHP 1073 +PLLR+ RIV S K PAMGYV+A +Y AK AIK EL ++ Y+ YW+IID RWE RHP Sbjct: 466 NPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHP 525 Query: 1074 LHAAGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFG 1253 L AAGFYLNPK+FYS+EGD H I S + DCIERLV D V DKI+KE SY +GDF Sbjct: 526 LCAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFA 585 Query: 1254 RKMAIRARDTLLP 1292 RK AIRAR TLLP Sbjct: 586 RKTAIRARGTLLP 598 >ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus] Length = 752 Score = 470 bits (1210), Expect = e-130 Identities = 230/433 (53%), Positives = 308/433 (71%), Gaps = 4/433 (0%) Frame = +3 Query: 6 VIPAGS----FKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSY 173 VIP G + + V+MAVGRF +D+G ++ NS YFQPMI++IA G + 
PSY Sbjct: 166 VIPNGGGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSY 225 Query: 174 HDLRNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFL 353 HD+R ILKN + EVR D D+C A WG TGCS++VD+ + G+T +NF YC +GT+FL Sbjct: 226 HDIRGWILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFL 285 Query: 354 XXXXXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTP 533 +LYEL+K++VE+VG+++V+QV+T E+ + IAG++L+DTYP+++WTP Sbjct: 286 ESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTP 345 Query: 534 CAGHCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRS 713 CA C+DL+L DI V V++QARSI+R++Y+N+ V+NM+R+ TFG D+V+ TRS Sbjct: 346 CAASCVDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRS 405 Query: 714 FTDFMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLT 893 T+F TL R+V+++ LQ+MV S+EW +S YSK + D IS++SFWSSC SII LT Sbjct: 406 ATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLT 465 Query: 894 DPLLRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHP 1073 +PLLR+ RIV S K PAMGYV+A +Y AK AIK EL ++ Y+ YW+IID RWE RHP Sbjct: 466 NPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHP 525 Query: 1074 LHAAGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFG 1253 L+AAGFYLNPK+FYS+EGD H I S + DCIERLV D V DKI+KE SY +GDF Sbjct: 526 LYAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFA 585 Query: 1254 RKMAIRARDTLLP 1292 RK AIRAR TLLP Sbjct: 586 RKTAIRARGTLLP 598 >ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310825 [Fragaria vesca subsp. 
vesca] Length = 869 Score = 468 bits (1203), Expect = e-129 Identities = 229/427 (53%), Positives = 301/427 (70%), Gaps = 2/427 (0%) Frame = +3 Query: 18 GSFKKANSV-VNMAVGRFFFDVGLPADSA-NSPYFQPMIDAIASQGAEAVGPSYHDLRNS 191 G +KANS + MA+GRF +++ P D+ NS YFQPMIDAIAS G E+ PSYHDLR Sbjct: 289 GEVEKANSQQIQMAIGRFLYEIQAPLDAVKNSLYFQPMIDAIASGGMESKAPSYHDLRGW 348 Query: 192 ILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXX 371 IL + EV+ ++ Q +W R GCS+LV++ S KG+ +NF YC EGT +L Sbjct: 349 ILNDAAEEVKNEIYQHTNSWERNGCSLLVNQFNSEKGRILLNFSVYCPEGTTYLKSVDAS 408 Query: 372 XXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCI 551 LYE++K++VEEVG+R VLQV+T E+ YV+AGKRL DT+P+++W+PCA CI Sbjct: 409 TFINSPDALYEILKQVVEEVGVRRVLQVITNSEEHYVVAGKRLMDTFPTLYWSPCAAACI 468 Query: 552 DLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMT 731 + +L+D +F + ++ QARS++R+IY + ++NMMRRYTFG D+V +G TR TDFMT Sbjct: 469 NSILEDFGKFEWINSIIAQARSVTRFIYKHVVILNMMRRYTFGNDIVKLGITRYATDFMT 528 Query: 732 LKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRL 911 LK++ +++ +LQ+MV S+EW YSK E A+ D +SN +FWSSC I R T+PLL++ Sbjct: 529 LKQMADLKFNLQTMVTSKEWEGCPYSKTPEGLAMLDLLSNHTFWSSCIMITRFTNPLLQV 588 Query: 912 FRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGF 1091 RIV S K AMGYVF G+YRAKE IK+EL KE Y YW+IID RW +L HPLHAAGF Sbjct: 589 LRIVGSQKKAAMGYVFGGMYRAKETIKRELVKKEVYTAYWNIIDYRWAKLWDHPLHAAGF 648 Query: 1092 YLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIR 1271 YLNPKFFYS++G+ H I S + DCIE+LVPDLKV D+I KE Y GD GR +AIR Sbjct: 649 YLNPKFFYSIKGEMHKVIMSRMFDCIEKLVPDLKVQDEISKEINLYQNAVGDMGRNLAIR 708 Query: 1272 ARDTLLP 1292 ARDTLLP Sbjct: 709 ARDTLLP 715 >ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine max] gi|571542833|ref|XP_006601996.1| PREDICTED: uncharacterized protein LOC100806265 isoform X2 [Glycine max] Length = 758 Score = 467 bits (1201), Expect = e-129 Identities = 218/422 (51%), Positives = 304/422 (72%) Frame = +3 Query: 27 
KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 KK ++ + MA+GRF +D+G P D+ N +FQ M+DAIAS+G PS+H+LR ILKN Sbjct: 178 KKMDNHIYMAIGRFLYDIGAPFDAVNLVFFQEMVDAIASKGTGFERPSHHELRGWILKNS 237 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ D+D+C WGRTGCSILVD+ T+ + ++F AYC EG +FL Sbjct: 238 VEEVKNDIDRCKMTWGRTGCSILVDQWTTETSRILISFLAYCPEGLVFLKSLDATEILTS 297 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LY+L+K++VEE+G+ V+QV+T+ E++Y IAG+RL DT+P+++W+P A HCIDL+L+ Sbjct: 298 PDFLYDLIKQVVEEIGVGKVVQVITSGEEQYGIAGRRLMDTFPTLYWSPSAAHCIDLILE 357 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D + V++QA+S++R++Y+ +A++NM++RYT G D+VD +R T+F TLKR+V Sbjct: 358 DFGNLEWISAVIEQAKSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSRFATNFTTLKRMV 417 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++H+LQ++V S+EW + YSK + D +SNQ+FWSSC I+ LT PLL++ RI Sbjct: 418 DLKHNLQALVTSQEWADCPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVLRIAG 477 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S P MGYV+AG+YR KEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPK Sbjct: 478 SEMRPGMGYVYAGMYRVKEAIKKALGKREEYMVYWNIIHHRWERLWNHPLHAAGFYLNPK 537 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFYS++GD I S + DCIERLVPD ++ DKI+KE Y AGDFGRKMA+RARD L Sbjct: 538 FFYSIQGDILGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNL 597 Query: 1287 LP 1292 LP Sbjct: 598 LP 599 >ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prunus persica] gi|462411082|gb|EMJ16131.1| hypothetical protein PRUPE_ppa001359mg [Prunus persica] Length = 845 Score = 461 bits (1186), Expect = e-127 Identities = 223/420 (53%), Positives = 299/420 (71%), Gaps = 1/420 (0%) Frame = +3 Query: 36 NSVVNMAVGRFFFDVGLPADSA-NSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIH 212 N ++MA+GRF +++ P D NS YFQPMIDAIAS G + PSY DLR ILKN + Sbjct: 275 NQQIHMAIGRFLYEIQAPLDVVKNSVYFQPMIDAIASGGKGTIAPSYDDLRGWILKNAVG 334 Query: 213 
EVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXX 392 EV+ D+ Q + W RTGCS+LV++ +S KGKT +NF C EGTI+L Sbjct: 335 EVKSDIHQHMETWARTGCSLLVNQWSSEKGKTLLNFAVQCPEGTIYLKSVDASYFIFSPD 394 Query: 393 VLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDI 572 L+E +KE+VEEVG+ +VLQV+T E+++ +AGKRL DT+P+++W+PC IDL+L+D Sbjct: 395 ALFEFLKEVVEEVGVGHVLQVITNTEEQFAVAGKRLMDTFPTLYWSPCVATSIDLILEDF 454 Query: 573 AEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNI 752 + + V++QARS++R+IY + ++NMMRRYTFG D+V +G TR T+F TLK++ ++ Sbjct: 455 GKVEWINSVIEQARSVTRFIYKHVVILNMMRRYTFGNDIVRLGVTRFATNFTTLKQMADL 514 Query: 753 RHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSL 932 + +LQSMV S+EW YSK E AV D +SN SFWS+C + LT+PLLR+ RIV S Sbjct: 515 KFNLQSMVTSKEWMCCPYSKTPEGSAVLDVLSNHSFWSACILVTHLTNPLLRVLRIVGSQ 574 Query: 933 KIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFF 1112 K AMGYVFAG+YRAKE IK+EL +EEY+ YW IID RW++L PLHAAGFYLNPKFF Sbjct: 575 KRAAMGYVFAGIYRAKETIKRELVKREEYMVYWDIIDYRWKKLWPLPLHAAGFYLNPKFF 634 Query: 1113 YSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLP 1292 YS++GD H+ I S + DCIERLVPD+K+ D+++KE Y GD GR +A+RARD LLP Sbjct: 635 YSVKGDLHNEIISRMFDCIERLVPDIKIQDEVIKEINLYKNAVGDLGRNLAVRARDNLLP 694 >ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis] gi|223539752|gb|EEF41333.1| DNA binding protein, putative [Ricinus communis] Length = 854 Score = 454 bits (1167), Expect = e-125 Identities = 214/422 (50%), Positives = 297/422 (70%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 K ++V++ VGRF +D+G D+ +S YF+ +ID ++S + AV PS HDLR ILK + Sbjct: 285 KMIDNVIHTTVGRFLYDIGANFDALDSIYFRSLIDMLSSGASGAVAPSNHDLRGWILKKL 344 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + E++ D+DQ W RTGCS+LV+E S G T +NF CS+GT+FL Sbjct: 345 VEEIKNDIDQSRTTWARTGCSVLVEEWNSESGITLLNFLVNCSQGTVFLKSVEASHIIYS 404 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LY L+K++VEEVG NVLQV+T + Y +AGKRL 
+ +PS+FW PCA HC+DL+L+ Sbjct: 405 PDGLYVLLKQVVEEVGASNVLQVITNGNEHYTVAGKRLMEAFPSLFWAPCAVHCLDLILE 464 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D A+ + V++QA+S++R++Y+++AV+N+MR++T+G D+V G TRS T+F L+R+ Sbjct: 465 DFAKLEWIDAVIEQAKSVTRFVYNHSAVLNLMRKFTYGKDIVQQGLTRSATNFTMLQRMA 524 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 + + +LQ+M+ S+EW + YSK A+ D ISN+SFWSSC IIRLT PL+R+ I Sbjct: 525 DFKLNLQTMITSQEWMDCPYSKQHGGLAMLDIISNRSFWSSCILIIRLTSPLIRVLGIAG 584 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 + AMGY+FAG+YRAKE IK+EL +E+Y+ YW+IID RW+Q + PLH AGF+LNPK Sbjct: 585 GKRKAAMGYIFAGIYRAKETIKRELVKREDYMVYWNIIDHRWDQRRHPPLHVAGFFLNPK 644 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFYS+EGD H+ I S V DCIERLVPD++V DKI KE Y GD GRKMAIR+R TL Sbjct: 645 FFYSIEGDVHNEILSRVFDCIERLVPDIEVQDKIAKELNIYKNAVGDLGRKMAIRSRGTL 704 Query: 1287 LP 1292 LP Sbjct: 705 LP 706 >ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] gi|550335284|gb|ERP58729.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] Length = 847 Score = 447 bits (1149), Expect = e-123 Identities = 219/430 (50%), Positives = 300/430 (69%) Frame = +3 Query: 3 AVIPAGSFKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDL 182 A+I GS + A++ ++ GRF +D+G D+ +S + QP+ID +A PS+ DL Sbjct: 275 ALIAMGS-ETADNPIHAIWGRFLYDIGASLDAMDSNFSQPLIDTVAYGRPGIAAPSHQDL 333 Query: 183 RNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXX 362 R ILK+++ EV+ D++Q W +TGCS+LV+E S G T +NF YCS+GT+FL Sbjct: 334 RGRILKSLVEEVKSDINQYKTRWVKTGCSLLVEECNSESGVTTLNFLVYCSKGTVFLKSV 393 Query: 363 XXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAG 542 LYEL+K +VEEVG N+LQV+T E+ Y+ AGK+L DT+PS++W PCA Sbjct: 394 DASNLIHSTDGLYELLKLMVEEVGAGNILQVITNGEEHYIAAGKKLMDTFPSLYWAPCAA 453 Query: 543 HCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTD 722 CIDL+L+DI + + VL+QA+S++R++Y+N+AV+N+MR++T G D+V G TRS T+ Sbjct: 
454 RCIDLILEDIGKLDWINTVLEQAKSVTRFVYNNSAVLNLMRKFTSGSDIVQQGITRSATN 513 Query: 723 FMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPL 902 F LKR+ N + +LQ+MV S+EW + YSK A+ D I+N+SFWSSC IIRLT PL Sbjct: 514 FTALKRMANFKLNLQTMVTSQEWMDCPYSKQPGGLAMVDIITNRSFWSSCILIIRLTSPL 573 Query: 903 LRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHA 1082 L++ IV S K AMGYVF+G+YRAKE IKKEL +E+Y+ YW+IID RWEQ + PLHA Sbjct: 574 LQVLVIVSSEKRAAMGYVFSGIYRAKETIKKELVKREDYMVYWNIIDHRWEQQWQTPLHA 633 Query: 1083 AGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKM 1262 AGF+ NPKFFYS+EGD H+ I S + DCIERLVPD +V DKI+KE Y G G+K+ Sbjct: 634 AGFFFNPKFFYSIEGDMHNKILSRMFDCIERLVPDTEVQDKIVKELTLYKNAEGHLGKKL 693 Query: 1263 AIRARDTLLP 1292 AIRAR T+LP Sbjct: 694 AIRARGTMLP 703 >ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao] gi|508701288|gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao] Length = 750 Score = 445 bits (1145), Expect = e-122 Identities = 210/420 (50%), Positives = 298/420 (70%) Frame = +3 Query: 33 ANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIH 212 A + V+MA+GRF +D+G+ D+ NS YFQPMIDAIAS G+ V PS DLR ILKNV+ Sbjct: 180 AENPVHMAIGRFLYDIGVNLDAVNSVYFQPMIDAIASTGSGIVPPSSQDLRGWILKNVME 239 Query: 213 EVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXX 392 EV+ D+D+ WG+TGCSILV++ + G+T ++F YC + T+FL Sbjct: 240 EVKDDIDRNKTMWGKTGCSILVEQWSPKSGRTLLSFLVYCPQATVFLKSVDASRVIFSAD 299 Query: 393 VLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDI 572 L EL+K++VEEVG+ NV+QV+T E++Y +AGKRL +++PS++W PC HC+D+ML+D Sbjct: 300 HLNELLKQVVEEVGVENVVQVITNCEEQYFLAGKRLMESFPSLYWAPCLVHCVDMMLEDF 359 Query: 573 AEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNI 752 A + ++QA+S++R++Y+++ V+NMMRR+TF D+V+ TR ++F TLKR+ ++ Sbjct: 360 ANLEWISETIEQAKSVTRFVYNHSVVLNMMRRFTFHNDIVEPAVTRFASNFATLKRMADL 419 Query: 753 RHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSL 932 + LQ+MVNS++W+E Y+K + D + N+SFW+SC I+RL PLL++ IV S Sbjct: 420 
KLKLQAMVNSQDWSECPYAKKPGGLVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIVGSK 479 Query: 933 KIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFF 1112 K MGYV+AG+YRAKE IKKEL K++Y+ YW+IID RWEQ + PL+AA F+LNPKFF Sbjct: 480 KRSTMGYVYAGIYRAKETIKKELVKKDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNPKFF 539 Query: 1113 YSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLP 1292 YS+EG+ H+ I S + DCIERLVPD V D+I++E Y GD GR MA+RARD LLP Sbjct: 540 YSIEGNIHNDILSSMFDCIERLVPDTNVQDQIVREIHLYKNATGDLGRPMAVRARDNLLP 599 >ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine max] Length = 729 Score = 442 bits (1136), Expect = e-121 Identities = 217/422 (51%), Positives = 296/422 (70%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 KK ++ + MA+GRF +D+G P D+ NS YFQ M+DAIAS+G P +H+LR ILKN Sbjct: 179 KKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNS 238 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ D+D+C WGRTGCSILVD+ T T +F Sbjct: 239 VEEVKNDIDRCKMTWGRTGCSILVDQWT-----TETDF---------------------- 271 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LY+L+K++VEEVG V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+ Sbjct: 272 ---LYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILE 328 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 D + V++QARS++R++Y+ +A++NM++RYT G D+VD + T+F TLKR+V Sbjct: 329 DFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMV 388 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++H+LQ++V S+EW +S YSK + D +SNQ+FWSSC I+ LT PLL++ RI Sbjct: 389 DLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIAS 448 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S PAMGYV+AG+YRAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPK Sbjct: 449 SEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 508 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFYS++GD H 
I S + DCIERLVPD ++ DKI+KE Y +GDFGRKMA+RARD L Sbjct: 509 FFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNL 568 Query: 1287 LP 1292 LP Sbjct: 569 LP 570 >ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana] gi|79313325|ref|NP_001030742.1| hAT dimerization domain-containing protein [Arabidopsis thaliana] gi|11994740|dbj|BAB03069.1| transposase-like protein [Arabidopsis thaliana] gi|28393360|gb|AAO42104.1| unknown protein [Arabidopsis thaliana] gi|28827622|gb|AAO50655.1| unknown protein [Arabidopsis thaliana] gi|332643084|gb|AEE76605.1| hAT dimerization domain-containing protein [Arabidopsis thaliana] gi|332643085|gb|AEE76606.1| hAT dimerization domain-containing protein [Arabidopsis thaliana] Length = 761 Score = 409 bits (1052), Expect = e-111 Identities = 199/422 (47%), Positives = 278/422 (65%) Frame = +3 Query: 27 KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206 K+ V+MA+GRF FD+G D+ANS QP IDAI S G P++ DLR ILK+ Sbjct: 184 KEREKTVHMAMGRFLFDIGADFDAANSVNVQPFIDAIVSGGFGVSIPTHEDLRGWILKSC 243 Query: 207 IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386 + EV+ ++D+C W RTGCS+LV E S +G + F YC E +FL Sbjct: 244 VEEVKKEIDECKTLWKRTGCSVLVQELNSNEGPLILKFLVYCPEKVVFLKSVDASEILDS 303 Query: 387 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566 LYEL+KE+VEE+G NV+QV+T ED Y AGK+L D YPS++W PCA HCID ML+ Sbjct: 304 EDKLYELLKEVVEEIGDTNVVQVITKCEDHYAAAGKKLMDVYPSLYWVPCAAHCIDKMLE 363 Query: 567 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746 + + ++ +++QAR+++R IY+++ V+N+MR++TFG D+V T S T+F T+ RI Sbjct: 364 EFGKMDWIREIIEQARTVTRIIYNHSGVLNLMRKFTFGNDIVQPVCTSSATNFTTMGRIA 423 Query: 747 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926 +++ LQ+MV S EW + SYSK+ A+ ++I+++ FW + +T P+LR+ RIV Sbjct: 424 DLKPYLQAMVTSSEWNDCSYSKEAGGLAMTETINDEDFWKALTLANHITAPILRVLRIVC 483 Query: 927 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106 S + PAMGYV+A +YRAKEAIK L +EEY+ YW IID W Q PL+AAGFYLNPK 
Sbjct: 484 SERKPAMGYVYAAMYRAKEAIKTNLAHREEYIVYWKIIDRWWLQ---QPLYAAGFYLNPK 540 Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286 FFYS++ + I V DCIE+LVPD+ + D ++K+ SY G FGR +AIRARDT+ Sbjct: 541 FFYSIDEEMRSEIHLAVVDCIEKLVPDVNIQDIVIKDINSYKNAVGIFGRNLAIRARDTM 600 Query: 1287 LP 1292 LP Sbjct: 601 LP 602