BLASTX nr result

ID: Mentha29_contig00009665 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00009665
         (1292 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   557   e-156
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       556   e-156
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   551   e-154
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   503   e-140
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   488   e-135
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   478   e-132
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   476   e-132
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   475   e-131
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   473   e-130
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   473   e-130
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         472   e-130
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   470   e-130
ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310...   468   e-129
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   467   e-129
ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prun...   461   e-127
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   454   e-125
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...   447   e-123
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   445   e-122
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   442   e-121
ref|NP_188861.2| hAT dimerization domain-containing protein [Ara...   409   e-111

>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score =  557 bits (1435), Expect = e-156
 Identities = 265/429 (61%), Positives = 330/429 (76%)
 Frame = +3

Query: 6    VIPAGSFKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLR 185
            ++P    K+ N+ V+MAV RF  D  +P D+ NS YFQPMID IASQG +   PSYH+LR
Sbjct: 170  LLPINQSKRVNNHVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPQVSAPSYHELR 229

Query: 186  NSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXX 365
            + +LK  + EVR D+DQC + W R+GCS+LVDE  +GKGKT +NF  YC EGT+FL    
Sbjct: 230  SWVLKASVQEVRNDIDQCSSTWARSGCSVLVDEWITGKGKTLLNFLVYCPEGTMFLRSVD 289

Query: 366  XXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGH 545
                      LYEL+KE+VEEVG+RNVLQVVT+ E+RY+IAGKRLTD YP++FWTPCA H
Sbjct: 290  ASTLINSTDYLYELLKEVVEEVGVRNVLQVVTSNEERYIIAGKRLTDAYPTLFWTPCAAH 349

Query: 546  CIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDF 725
             IDLML+D+ +   +  +++QA+SISR+IY+N  +++MMR++T GVDLVD+G TRS TDF
Sbjct: 350  SIDLMLEDLKKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDF 409

Query: 726  MTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLL 905
            +TLKR+VNI+H+LQSMV S EW ES YSK  E FA+ D I NQSFWS+C+ + RLTDP+L
Sbjct: 410  LTLKRMVNIKHNLQSMVTSVEWAESPYSKKPEGFALLDYIGNQSFWSTCSLVCRLTDPIL 469

Query: 906  RLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAA 1085
            RL R+V S + PAM YV+AG+YRAKE IKKEL  K++Y  YW+IID RWE LQRHPLHAA
Sbjct: 470  RLLRMVSSEERPAMAYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAA 529

Query: 1086 GFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMA 1265
            GFYLNPKFFY+ E D H HI+SLV DCIE+LVPD K+ DKI+KE  SY   AGDFGRKMA
Sbjct: 530  GFYLNPKFFYTTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMA 589

Query: 1266 IRARDTLLP 1292
            +RARDTL P
Sbjct: 590  VRARDTLFP 598


>gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]
          Length = 724

 Score =  556 bits (1433), Expect = e-156
 Identities = 275/430 (63%), Positives = 335/430 (77%)
 Frame = +3

Query: 3    AVIPAGSFKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDL 182
            A++   S +  +  V+MAVGRFF DVGLPA++ANS YFQPM++AIASQ A  +GPSY DL
Sbjct: 161  ALMSLPSVQPCSKKVHMAVGRFFVDVGLPAEAANSAYFQPMVEAIASQEAGVIGPSYQDL 220

Query: 183  RNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXX 362
            R+ ILKN++HE RYDVDQ   AW RTGC++LVD+  SGKG+TFVNFF Y SE TIF    
Sbjct: 221  RSWILKNLVHETRYDVDQYANAWERTGCTVLVDDWNSGKGETFVNFFVYNSEATIFYRSA 280

Query: 363  XXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAG 542
                       LYEL+KE VE++G++NVLQV+T+ ED+Y  AGKRL  TYPS+FW+PCAG
Sbjct: 281  NVSHGIVSADDLYELLKETVEQIGVKNVLQVITSCEDQYAFAGKRLATTYPSVFWSPCAG 340

Query: 543  HCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTD 722
             C+DLMLQD+   P VK+ L+QA+SISRYIYSN  V+NM+RR+TFG+DL+D G T S T+
Sbjct: 341  LCVDLMLQDMEHLPMVKVTLEQAKSISRYIYSNGFVLNMLRRHTFGLDLLDEGITPSSTN 400

Query: 723  FMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPL 902
            FMTLKR++++RH LQSMV SE+W +S +S+  E FA+ D++++QSFWS+CASI  L DPL
Sbjct: 401  FMTLKRMLSMRHHLQSMVTSEDWIQSPHSQKPEGFALLDTMTSQSFWSACASITNLIDPL 460

Query: 903  LRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHA 1082
            LRL RI+ S K PAMGYV+AGLYRAKEAIKK     E+YL Y +IID RWEQLQ+HPLH 
Sbjct: 461  LRLLRIISSGKKPAMGYVYAGLYRAKEAIKKHF-VSEDYLVYLNIIDRRWEQLQQHPLHG 519

Query: 1083 AGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKM 1262
            AGFYLNPKFFYSLEGD     +S+V DCIERLVPD +V DKIMKE   YH G GDFGRKM
Sbjct: 520  AGFYLNPKFFYSLEGDALLRSRSMVYDCIERLVPDPEVQDKIMKEMTYYHGGVGDFGRKM 579

Query: 1263 AIRARDTLLP 1292
            AIRARDTLLP
Sbjct: 580  AIRARDTLLP 589


>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum
            lycopersicum]
          Length = 748

 Score =  551 bits (1419), Expect = e-154
 Identities = 263/422 (62%), Positives = 325/422 (77%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            K+ N+ V+MAV RF  D  +P D+ NS YFQPMID IASQG     PSYHDLR+ +LK+ 
Sbjct: 170  KRVNNQVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPPVSAPSYHDLRSWVLKSS 229

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EVR D+DQC + W RTGCS+L+DE  +GKGK  +NF  YC +GT+FL           
Sbjct: 230  VQEVRTDIDQCSSTWARTGCSVLIDELITGKGKILLNFLVYCPQGTMFLRSVDASTLINS 289

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LYEL+KE+V+E+G+RNVLQVVT+ E+RYVIAGKRLTD YP++FWTPCA H IDLML+
Sbjct: 290  TDYLYELLKEVVDEIGVRNVLQVVTSNEERYVIAGKRLTDAYPTLFWTPCAAHSIDLMLE 349

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D  +   +  +++QA+SISR+IY+N  +++MMR++T GVDLVD+G TRS TDF+TLKR+ 
Sbjct: 350  DFNKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMQ 409

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            NI+H+LQSMV S EW ES YSK  E FA+ D ISNQSFWS+C+ I RLTDP+LRL R+V 
Sbjct: 410  NIKHNLQSMVTSVEWAESPYSKKPEGFALLDYISNQSFWSTCSLICRLTDPILRLLRMVS 469

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S + PAM YV+AG+YRAKE IKKEL  K++Y  YW+IID RWE LQRHPLHAAGFYLNPK
Sbjct: 470  SEERPAMPYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPK 529

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFY+ E D H HI+SLV DCIE+LVPD K+ DKI+KE  SY   AGDFGRKMA+RARDTL
Sbjct: 530  FFYTTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTL 589

Query: 1287 LP 1292
             P
Sbjct: 590  FP 591


>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score =  503 bits (1296), Expect = e-140
 Identities = 235/421 (55%), Positives = 319/421 (75%)
 Frame = +3

Query: 30   KANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVI 209
            + N+ + MAVGRF +D+G P D+ NS YFQPM+DAIAS G EA  PSYHD+R  ILKN +
Sbjct: 175  RGNNPIFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIASGGPEAAMPSYHDIRGWILKNSV 234

Query: 210  HEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXX 389
             EV+ DVD+    WG+TGCSILVD+  +  G+T + F AYC EGT+FL            
Sbjct: 235  EEVKNDVDRYTTTWGKTGCSILVDQWNTEAGRTLLCFLAYCPEGTVFLKSVDASGIMNSS 294

Query: 390  XVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQD 569
              LYEL+K++VEEVG+R+VLQV+T+ E++++ AG+RLTDT+P+++WTPCA  C+DL+L+D
Sbjct: 295  DALYELLKQVVEEVGVRHVLQVITSSEEQFIAAGRRLTDTFPTLYWTPCAARCLDLILED 354

Query: 570  IAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVN 749
             A+   +  +++QAR+++R++Y+++ V+NM+RRYTFG D+V+ G TRS T+F TL+R+++
Sbjct: 355  FAKLEWINAIIEQARAVTRFVYNHSVVLNMLRRYTFGNDIVEPGITRSATNFTTLRRMIS 414

Query: 750  IRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRS 929
            ++ +LQ+MV S+EW +  YSK      + D +SNQSFWSSC  I+ LT+PLLRL RIV S
Sbjct: 415  LKPNLQAMVTSQEWMDCPYSKKPGGLEMLDIVSNQSFWSSCGLIVCLTNPLLRLLRIVGS 474

Query: 930  LKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKF 1109
             + P++GYV+AG+YRAK+A+KKEL  ++EY+ YW+IID  WEQL   PLHAAGF+LNPKF
Sbjct: 475  ERRPSIGYVYAGMYRAKDALKKELIKRDEYMVYWNIIDHWWEQLWHLPLHAAGFFLNPKF 534

Query: 1110 FYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLL 1289
            FYS++GD H+ I S + DCIERLVPD KV DKI KE   Y    GDFGRKMAIRARDTLL
Sbjct: 535  FYSIKGDIHNEIVSRMFDCIERLVPDTKVQDKISKEINLYKDAVGDFGRKMAIRARDTLL 594

Query: 1290 P 1292
            P
Sbjct: 595  P 595


>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
            gi|223536481|gb|EEF38128.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 753

 Score =  488 bits (1256), Expect = e-135
 Identities = 232/422 (54%), Positives = 313/422 (74%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            K+ N  V+MA+GRF +D+G P D+ NS YFQPM+DAIAS G +   PS HDLR  ILKN 
Sbjct: 182  KRVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGLDVGMPSCHDLRGWILKNS 241

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ +VD+ +A W RTGCS+LVD+  +  G+T ++F  YCSEG +FL           
Sbjct: 242  VEEVKTEVDKHMATWARTGCSVLVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDASDIINS 301

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LYEL+K++VEEVG+R+VLQV+T++E++Y++ G+RLTDT+P+++  PCA HCIDL+L+
Sbjct: 302  SDALYELIKKVVEEVGVRHVLQVITSMEEQYIVVGRRLTDTFPTLYRAPCAAHCIDLILE 361

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D A+   +  V+ QARSI+R++Y+++ V+NM++RYTFG ++V  G T   T+F TLKR+V
Sbjct: 362  DFAKLEWISTVILQARSITRFVYNHSVVLNMVKRYTFGSEIVATGLTHFATNFETLKRMV 421

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++H+LQ+MV S+EW +  YSK      + D +SNQSFWSSC  I  LT+PLLRL RIV 
Sbjct: 422  DLKHTLQTMVTSQEWMDCPYSKKPRGLEMLDLLSNQSFWSSCVLITNLTNPLLRLLRIVS 481

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S K P MGYV+AG+YRAKEAIKKEL  +++Y+ YW+IID  WEQ    PLHAAGF+LNPK
Sbjct: 482  SKKRPPMGYVYAGIYRAKEAIKKELVKRKDYMVYWNIIDHWWEQQSNLPLHAAGFFLNPK 541

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
              YS+EGD H+ I S + DCIE+LVPD+ V DKI KE  SY   +GDFGRKMA+RAR+TL
Sbjct: 542  VLYSIEGDLHNEILSGMFDCIEKLVPDVTVQDKITKEINSYKNASGDFGRKMAVRARETL 601

Query: 1287 LP 1292
            LP
Sbjct: 602  LP 603


>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca
            subsp. vesca]
          Length = 754

 Score =  478 bits (1231), Expect = e-132
 Identities = 231/424 (54%), Positives = 307/424 (72%), Gaps = 2/424 (0%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            +K NS V+ A+GRF FD+G P ++ NS YFQPMIDAIAS G     P+ HDLR+ ILKN 
Sbjct: 172  RKVNSYVHEAIGRFLFDIGAPPEAVNSAYFQPMIDAIASGGPGMEPPTCHDLRSWILKNS 231

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + E R ++D+  A WGRTGCSILVD+  +      ++F  Y  EGT+FL           
Sbjct: 232  VEEARNNIDKHRATWGRTGCSILVDQWNTELDNVMLSFLVYSPEGTVFLESVDASAIINS 291

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LY+L++ +VE+VG+ +V+QV+T+ E+++V+AG+RL DT+P++FW PCA  C+DL+L+
Sbjct: 292  SDALYDLLRRVVEDVGVGDVVQVITSGEEQFVVAGRRLADTFPNLFWIPCAARCLDLILE 351

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D      +  V++QARSI++++Y++  V+N++RR TFG D+V+ G TR  T F TLKR+V
Sbjct: 352  DFGSLDWIHAVIEQARSITKFVYNHNVVLNLVRRSTFGNDIVEPGVTRFGTSFTTLKRLV 411

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSIS--NQSFWSSCASIIRLTDPLLRLFRI 920
            +++H LQ MV S+EW +  YSK+     + D IS  +QSFWSSC  I+RLT PLLR+ R+
Sbjct: 412  DLKHCLQVMVTSQEWMDCPYSKEPGGLEISDLISDRDQSFWSSCTLIVRLTSPLLRVLRM 471

Query: 921  VRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLN 1100
            V   K PAMG+++AG+YRAKEAIKKEL  +EEY+ YW+IID RWEQ    PLHAAGFYLN
Sbjct: 472  VGCEKRPAMGFIYAGMYRAKEAIKKELVKREEYMVYWNIIDQRWEQHWNFPLHAAGFYLN 531

Query: 1101 PKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARD 1280
            PK FYS+EGD H+ IQS + DCIER+VPD+KV DKIMKE  SY   AGDF RKMAIRARD
Sbjct: 532  PKIFYSIEGDIHNSIQSGMYDCIERMVPDIKVQDKIMKEIISYKNAAGDFRRKMAIRARD 591

Query: 1281 TLLP 1292
            TLLP
Sbjct: 592  TLLP 595


>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine
            max] gi|571489936|ref|XP_006591345.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X2 [Glycine
            max] gi|571489939|ref|XP_006591346.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X3 [Glycine
            max]
          Length = 759

 Score =  476 bits (1225), Expect = e-132
 Identities = 225/422 (53%), Positives = 307/422 (72%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            KK ++ + MA+GRF +D+G P D+ NS YFQ M+DAIAS+G     P +H+LR  ILKN 
Sbjct: 179  KKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNS 238

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ D+D+C   WGRTGCSILVD+ T+  GK  ++F AYC EG +FL           
Sbjct: 239  VEEVKNDIDRCKMTWGRTGCSILVDQWTTETGKILISFLAYCPEGLVFLRSLDATEISTS 298

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LY+L+K++VEEVG   V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+
Sbjct: 299  ADFLYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILE 358

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D      +  V++QARS++R++Y+ +A++NM++RYT G D+VD   +   T+F TLKR+V
Sbjct: 359  DFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMV 418

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++H+LQ++V S+EW +S YSK      + D +SNQ+FWSSC  I+ LT PLL++ RI  
Sbjct: 419  DLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIAS 478

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S   PAMGYV+AG+YRAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPK
Sbjct: 479  SEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 538

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS++GD H  I S + DCIERLVPD ++ DKI+KE   Y   +GDFGRKMA+RARD L
Sbjct: 539  FFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNL 598

Query: 1287 LP 1292
            LP
Sbjct: 599  LP 600


>ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative
            [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED
            zinc finger domain-containing protein, putative
            [Theobroma cacao]
          Length = 749

 Score =  475 bits (1223), Expect = e-131
 Identities = 225/422 (53%), Positives = 310/422 (73%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            K+ N+ V++A+GRF FD+G P D+ NS YFQPM+DAI S G+  + PS  DL+  ILK  
Sbjct: 174  KRVNNHVHVAIGRFLFDIGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWILKKS 233

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ D D+  AAW RTGCSILV++  +  G+  +NF  YC EGT+FL           
Sbjct: 234  VEEVKSDNDKVTAAWVRTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSVINS 293

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LYEL+K++VEEVG ++VLQV+T  E++Y++AG+RL +T+P+++WTPCA HCI+L+L+
Sbjct: 294  SDALYELLKQVVEEVGSKHVLQVITNAEEQYIVAGRRLAETFPTLYWTPCAAHCINLILE 353

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D A+   + ++++QARSI+R++Y+++ V+NM+RRYT G D+V+   T S T+F TLK+++
Sbjct: 354  DFAKLEWINVIIEQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMI 413

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++++LQ+MV S+EW +  YSK      + D +SN SFWSS   I +LT+PLLR+ R+V 
Sbjct: 414  DLKNNLQAMVTSQEWMDCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVG 473

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S K PAMGYV+AG+YRAKE IKKEL  + EY+ YW+IID  WEQ   HPLH AGFYLNPK
Sbjct: 474  SKKRPAMGYVYAGMYRAKETIKKELVKRNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPK 533

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS+EGD  + + S + DCIE+LVPD+KV DKI KE  SY    GDFGRKMA+RARDTL
Sbjct: 534  FFYSMEGDMPNEMLSGMLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTL 593

Query: 1287 LP 1292
            LP
Sbjct: 594  LP 595


>ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036895|gb|ESW35425.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score =  473 bits (1216), Expect = e-130
 Identities = 216/422 (51%), Positives = 308/422 (72%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            K+ ++ ++MA+GRF +D+G P D+ NS YF  M+DAI+S+GA    PS+H+LR  ILKN 
Sbjct: 177  KRVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNS 236

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ D+D+C   WGRTGCSILVD+  +  G+  ++F AYC EG +FL           
Sbjct: 237  VEEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTS 296

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LY+++K++V+EVG+  VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+
Sbjct: 297  ADFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILE 356

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D      +  V++QA+S++R++Y+ +A++ M++RYT G D+VD   ++  T+F TLKR+V
Sbjct: 357  DFGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMV 416

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++H+LQ++V S+EW +  YSK      + D +S+Q+FWSSC  I+RLT PLL++ RI  
Sbjct: 417  DLKHNLQALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIAS 476

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S   PAMGY++AG+YRAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPK
Sbjct: 477  SEMRPAMGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 536

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS++GD H  I S + DCIERLV D ++ DKI+KE   Y   AGDFGRKMA+RARD L
Sbjct: 537  FFYSIQGDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNL 596

Query: 1287 LP 1292
            LP
Sbjct: 597  LP 598


>ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036894|gb|ESW35424.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score =  473 bits (1216), Expect = e-130
 Identities = 216/422 (51%), Positives = 308/422 (72%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            K+ ++ ++MA+GRF +D+G P D+ NS YF  M+DAI+S+GA    PS+H+LR  ILKN 
Sbjct: 290  KRVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNS 349

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ D+D+C   WGRTGCSILVD+  +  G+  ++F AYC EG +FL           
Sbjct: 350  VEEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTS 409

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LY+++K++V+EVG+  VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+
Sbjct: 410  ADFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILE 469

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D      +  V++QA+S++R++Y+ +A++ M++RYT G D+VD   ++  T+F TLKR+V
Sbjct: 470  DFGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMV 529

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++H+LQ++V S+EW +  YSK      + D +S+Q+FWSSC  I+RLT PLL++ RI  
Sbjct: 530  DLKHNLQALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIAS 589

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S   PAMGY++AG+YRAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPK
Sbjct: 590  SEMRPAMGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 649

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS++GD H  I S + DCIERLV D ++ DKI+KE   Y   AGDFGRKMA+RARD L
Sbjct: 650  FFYSIQGDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNL 709

Query: 1287 LP 1292
            LP
Sbjct: 710  LP 711


>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score =  472 bits (1215), Expect = e-130
 Identities = 230/433 (53%), Positives = 308/433 (71%), Gaps = 4/433 (0%)
 Frame = +3

Query: 6    VIPAGS----FKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSY 173
            VIP G       +  + V+MA+GRF +D+G   ++ NS YFQPMI++IA  G   + PSY
Sbjct: 166  VIPNGGGILDSNRDRNQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSY 225

Query: 174  HDLRNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFL 353
            HD+R  ILKN + EVR D D+C A WG TGCS++VD+  +  G+T +NF  YC +GT+FL
Sbjct: 226  HDIRGWILKNSVEEVRGDFDRCKATWGMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFL 285

Query: 354  XXXXXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTP 533
                         +LYEL+K++VE+VG+++V+QV+T  E+ + IAG++L+DTYP+++WTP
Sbjct: 286  ESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTP 345

Query: 534  CAGHCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRS 713
            CA  C+DL+L DI     V  V++QARSI+R++Y+N+ V+NM+R+ TFG D+V+   TRS
Sbjct: 346  CAASCVDLILADIGNIEDVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRS 405

Query: 714  FTDFMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLT 893
             T+F TL R+V+++  LQ+MV S+EW +S YSK      + D IS++SFWSSC SIIRLT
Sbjct: 406  ATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIIRLT 465

Query: 894  DPLLRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHP 1073
            +PLLR+ RIV S K PAMGYV+A +Y AK AIK EL  ++ Y+ YW+IID RWE   RHP
Sbjct: 466  NPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHP 525

Query: 1074 LHAAGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFG 1253
            L AAGFYLNPK+FYS+EGD H  I S + DCIERLV D  V DKI+KE  SY   +GDF 
Sbjct: 526  LCAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFA 585

Query: 1254 RKMAIRARDTLLP 1292
            RK AIRAR TLLP
Sbjct: 586  RKTAIRARGTLLP 598


>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score =  470 bits (1210), Expect = e-130
 Identities = 230/433 (53%), Positives = 308/433 (71%), Gaps = 4/433 (0%)
 Frame = +3

Query: 6    VIPAGS----FKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSY 173
            VIP G       +  + V+MAVGRF +D+G   ++ NS YFQPMI++IA  G   + PSY
Sbjct: 166  VIPNGGGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSY 225

Query: 174  HDLRNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFL 353
            HD+R  ILKN + EVR D D+C A WG TGCS++VD+  +  G+T +NF  YC +GT+FL
Sbjct: 226  HDIRGWILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFL 285

Query: 354  XXXXXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTP 533
                         +LYEL+K++VE+VG+++V+QV+T  E+ + IAG++L+DTYP+++WTP
Sbjct: 286  ESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTP 345

Query: 534  CAGHCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRS 713
            CA  C+DL+L DI     V  V++QARSI+R++Y+N+ V+NM+R+ TFG D+V+   TRS
Sbjct: 346  CAASCVDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRS 405

Query: 714  FTDFMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLT 893
             T+F TL R+V+++  LQ+MV S+EW +S YSK      + D IS++SFWSSC SII LT
Sbjct: 406  ATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLT 465

Query: 894  DPLLRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHP 1073
            +PLLR+ RIV S K PAMGYV+A +Y AK AIK EL  ++ Y+ YW+IID RWE   RHP
Sbjct: 466  NPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHP 525

Query: 1074 LHAAGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFG 1253
            L+AAGFYLNPK+FYS+EGD H  I S + DCIERLV D  V DKI+KE  SY   +GDF 
Sbjct: 526  LYAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFA 585

Query: 1254 RKMAIRARDTLLP 1292
            RK AIRAR TLLP
Sbjct: 586  RKTAIRARGTLLP 598


>ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310825 [Fragaria vesca
            subsp. vesca]
          Length = 869

 Score =  468 bits (1203), Expect = e-129
 Identities = 229/427 (53%), Positives = 301/427 (70%), Gaps = 2/427 (0%)
 Frame = +3

Query: 18   GSFKKANSV-VNMAVGRFFFDVGLPADSA-NSPYFQPMIDAIASQGAEAVGPSYHDLRNS 191
            G  +KANS  + MA+GRF +++  P D+  NS YFQPMIDAIAS G E+  PSYHDLR  
Sbjct: 289  GEVEKANSQQIQMAIGRFLYEIQAPLDAVKNSLYFQPMIDAIASGGMESKAPSYHDLRGW 348

Query: 192  ILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXX 371
            IL +   EV+ ++ Q   +W R GCS+LV++  S KG+  +NF  YC EGT +L      
Sbjct: 349  ILNDAAEEVKNEIYQHTNSWERNGCSLLVNQFNSEKGRILLNFSVYCPEGTTYLKSVDAS 408

Query: 372  XXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCI 551
                    LYE++K++VEEVG+R VLQV+T  E+ YV+AGKRL DT+P+++W+PCA  CI
Sbjct: 409  TFINSPDALYEILKQVVEEVGVRRVLQVITNSEEHYVVAGKRLMDTFPTLYWSPCAAACI 468

Query: 552  DLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMT 731
            + +L+D  +F  +  ++ QARS++R+IY +  ++NMMRRYTFG D+V +G TR  TDFMT
Sbjct: 469  NSILEDFGKFEWINSIIAQARSVTRFIYKHVVILNMMRRYTFGNDIVKLGITRYATDFMT 528

Query: 732  LKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRL 911
            LK++ +++ +LQ+MV S+EW    YSK  E  A+ D +SN +FWSSC  I R T+PLL++
Sbjct: 529  LKQMADLKFNLQTMVTSKEWEGCPYSKTPEGLAMLDLLSNHTFWSSCIMITRFTNPLLQV 588

Query: 912  FRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGF 1091
             RIV S K  AMGYVF G+YRAKE IK+EL  KE Y  YW+IID RW +L  HPLHAAGF
Sbjct: 589  LRIVGSQKKAAMGYVFGGMYRAKETIKRELVKKEVYTAYWNIIDYRWAKLWDHPLHAAGF 648

Query: 1092 YLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIR 1271
            YLNPKFFYS++G+ H  I S + DCIE+LVPDLKV D+I KE   Y    GD GR +AIR
Sbjct: 649  YLNPKFFYSIKGEMHKVIMSRMFDCIEKLVPDLKVQDEISKEINLYQNAVGDMGRNLAIR 708

Query: 1272 ARDTLLP 1292
            ARDTLLP
Sbjct: 709  ARDTLLP 715


>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine
            max] gi|571542833|ref|XP_006601996.1| PREDICTED:
            uncharacterized protein LOC100806265 isoform X2 [Glycine
            max]
          Length = 758

 Score =  467 bits (1201), Expect = e-129
 Identities = 218/422 (51%), Positives = 304/422 (72%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            KK ++ + MA+GRF +D+G P D+ N  +FQ M+DAIAS+G     PS+H+LR  ILKN 
Sbjct: 178  KKMDNHIYMAIGRFLYDIGAPFDAVNLVFFQEMVDAIASKGTGFERPSHHELRGWILKNS 237

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ D+D+C   WGRTGCSILVD+ T+   +  ++F AYC EG +FL           
Sbjct: 238  VEEVKNDIDRCKMTWGRTGCSILVDQWTTETSRILISFLAYCPEGLVFLKSLDATEILTS 297

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LY+L+K++VEE+G+  V+QV+T+ E++Y IAG+RL DT+P+++W+P A HCIDL+L+
Sbjct: 298  PDFLYDLIKQVVEEIGVGKVVQVITSGEEQYGIAGRRLMDTFPTLYWSPSAAHCIDLILE 357

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D      +  V++QA+S++R++Y+ +A++NM++RYT G D+VD   +R  T+F TLKR+V
Sbjct: 358  DFGNLEWISAVIEQAKSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSRFATNFTTLKRMV 417

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++H+LQ++V S+EW +  YSK      + D +SNQ+FWSSC  I+ LT PLL++ RI  
Sbjct: 418  DLKHNLQALVTSQEWADCPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVLRIAG 477

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S   P MGYV+AG+YR KEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPK
Sbjct: 478  SEMRPGMGYVYAGMYRVKEAIKKALGKREEYMVYWNIIHHRWERLWNHPLHAAGFYLNPK 537

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS++GD    I S + DCIERLVPD ++ DKI+KE   Y   AGDFGRKMA+RARD L
Sbjct: 538  FFYSIQGDILGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNL 597

Query: 1287 LP 1292
            LP
Sbjct: 598  LP 599


>ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prunus persica]
            gi|462411082|gb|EMJ16131.1| hypothetical protein
            PRUPE_ppa001359mg [Prunus persica]
          Length = 845

 Score =  461 bits (1186), Expect = e-127
 Identities = 223/420 (53%), Positives = 299/420 (71%), Gaps = 1/420 (0%)
 Frame = +3

Query: 36   NSVVNMAVGRFFFDVGLPADSA-NSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIH 212
            N  ++MA+GRF +++  P D   NS YFQPMIDAIAS G   + PSY DLR  ILKN + 
Sbjct: 275  NQQIHMAIGRFLYEIQAPLDVVKNSVYFQPMIDAIASGGKGTIAPSYDDLRGWILKNAVG 334

Query: 213  EVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXX 392
            EV+ D+ Q +  W RTGCS+LV++ +S KGKT +NF   C EGTI+L             
Sbjct: 335  EVKSDIHQHMETWARTGCSLLVNQWSSEKGKTLLNFAVQCPEGTIYLKSVDASYFIFSPD 394

Query: 393  VLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDI 572
             L+E +KE+VEEVG+ +VLQV+T  E+++ +AGKRL DT+P+++W+PC    IDL+L+D 
Sbjct: 395  ALFEFLKEVVEEVGVGHVLQVITNTEEQFAVAGKRLMDTFPTLYWSPCVATSIDLILEDF 454

Query: 573  AEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNI 752
             +   +  V++QARS++R+IY +  ++NMMRRYTFG D+V +G TR  T+F TLK++ ++
Sbjct: 455  GKVEWINSVIEQARSVTRFIYKHVVILNMMRRYTFGNDIVRLGVTRFATNFTTLKQMADL 514

Query: 753  RHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSL 932
            + +LQSMV S+EW    YSK  E  AV D +SN SFWS+C  +  LT+PLLR+ RIV S 
Sbjct: 515  KFNLQSMVTSKEWMCCPYSKTPEGSAVLDVLSNHSFWSACILVTHLTNPLLRVLRIVGSQ 574

Query: 933  KIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFF 1112
            K  AMGYVFAG+YRAKE IK+EL  +EEY+ YW IID RW++L   PLHAAGFYLNPKFF
Sbjct: 575  KRAAMGYVFAGIYRAKETIKRELVKREEYMVYWDIIDYRWKKLWPLPLHAAGFYLNPKFF 634

Query: 1113 YSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLP 1292
            YS++GD H+ I S + DCIERLVPD+K+ D+++KE   Y    GD GR +A+RARD LLP
Sbjct: 635  YSVKGDLHNEIISRMFDCIERLVPDIKIQDEVIKEINLYKNAVGDLGRNLAVRARDNLLP 694


>ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis]
            gi|223539752|gb|EEF41333.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 854

 Score =  454 bits (1167), Expect = e-125
 Identities = 214/422 (50%), Positives = 297/422 (70%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            K  ++V++  VGRF +D+G   D+ +S YF+ +ID ++S  + AV PS HDLR  ILK +
Sbjct: 285  KMIDNVIHTTVGRFLYDIGANFDALDSIYFRSLIDMLSSGASGAVAPSNHDLRGWILKKL 344

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + E++ D+DQ    W RTGCS+LV+E  S  G T +NF   CS+GT+FL           
Sbjct: 345  VEEIKNDIDQSRTTWARTGCSVLVEEWNSESGITLLNFLVNCSQGTVFLKSVEASHIIYS 404

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LY L+K++VEEVG  NVLQV+T   + Y +AGKRL + +PS+FW PCA HC+DL+L+
Sbjct: 405  PDGLYVLLKQVVEEVGASNVLQVITNGNEHYTVAGKRLMEAFPSLFWAPCAVHCLDLILE 464

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D A+   +  V++QA+S++R++Y+++AV+N+MR++T+G D+V  G TRS T+F  L+R+ 
Sbjct: 465  DFAKLEWIDAVIEQAKSVTRFVYNHSAVLNLMRKFTYGKDIVQQGLTRSATNFTMLQRMA 524

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            + + +LQ+M+ S+EW +  YSK     A+ D ISN+SFWSSC  IIRLT PL+R+  I  
Sbjct: 525  DFKLNLQTMITSQEWMDCPYSKQHGGLAMLDIISNRSFWSSCILIIRLTSPLIRVLGIAG 584

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
              +  AMGY+FAG+YRAKE IK+EL  +E+Y+ YW+IID RW+Q +  PLH AGF+LNPK
Sbjct: 585  GKRKAAMGYIFAGIYRAKETIKRELVKREDYMVYWNIIDHRWDQRRHPPLHVAGFFLNPK 644

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS+EGD H+ I S V DCIERLVPD++V DKI KE   Y    GD GRKMAIR+R TL
Sbjct: 645  FFYSIEGDVHNEILSRVFDCIERLVPDIEVQDKIAKELNIYKNAVGDLGRKMAIRSRGTL 704

Query: 1287 LP 1292
            LP
Sbjct: 705  LP 706


>ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa]
            gi|550335284|gb|ERP58729.1| hypothetical protein
            POPTR_0006s02210g [Populus trichocarpa]
          Length = 847

 Score =  447 bits (1149), Expect = e-123
 Identities = 219/430 (50%), Positives = 300/430 (69%)
 Frame = +3

Query: 3    AVIPAGSFKKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDL 182
            A+I  GS + A++ ++   GRF +D+G   D+ +S + QP+ID +A        PS+ DL
Sbjct: 275  ALIAMGS-ETADNPIHAIWGRFLYDIGASLDAMDSNFSQPLIDTVAYGRPGIAAPSHQDL 333

Query: 183  RNSILKNVIHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXX 362
            R  ILK+++ EV+ D++Q    W +TGCS+LV+E  S  G T +NF  YCS+GT+FL   
Sbjct: 334  RGRILKSLVEEVKSDINQYKTRWVKTGCSLLVEECNSESGVTTLNFLVYCSKGTVFLKSV 393

Query: 363  XXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAG 542
                       LYEL+K +VEEVG  N+LQV+T  E+ Y+ AGK+L DT+PS++W PCA 
Sbjct: 394  DASNLIHSTDGLYELLKLMVEEVGAGNILQVITNGEEHYIAAGKKLMDTFPSLYWAPCAA 453

Query: 543  HCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTD 722
             CIDL+L+DI +   +  VL+QA+S++R++Y+N+AV+N+MR++T G D+V  G TRS T+
Sbjct: 454  RCIDLILEDIGKLDWINTVLEQAKSVTRFVYNNSAVLNLMRKFTSGSDIVQQGITRSATN 513

Query: 723  FMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPL 902
            F  LKR+ N + +LQ+MV S+EW +  YSK     A+ D I+N+SFWSSC  IIRLT PL
Sbjct: 514  FTALKRMANFKLNLQTMVTSQEWMDCPYSKQPGGLAMVDIITNRSFWSSCILIIRLTSPL 573

Query: 903  LRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHA 1082
            L++  IV S K  AMGYVF+G+YRAKE IKKEL  +E+Y+ YW+IID RWEQ  + PLHA
Sbjct: 574  LQVLVIVSSEKRAAMGYVFSGIYRAKETIKKELVKREDYMVYWNIIDHRWEQQWQTPLHA 633

Query: 1083 AGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKM 1262
            AGF+ NPKFFYS+EGD H+ I S + DCIERLVPD +V DKI+KE   Y    G  G+K+
Sbjct: 634  AGFFFNPKFFYSIEGDMHNKILSRMFDCIERLVPDTEVQDKIVKELTLYKNAEGHLGKKL 693

Query: 1263 AIRARDTLLP 1292
            AIRAR T+LP
Sbjct: 694  AIRARGTMLP 703


>ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao]
            gi|508701288|gb|EOX93184.1| HAT transposon superfamily,
            putative [Theobroma cacao]
          Length = 750

 Score =  445 bits (1145), Expect = e-122
 Identities = 210/420 (50%), Positives = 298/420 (70%)
 Frame = +3

Query: 33   ANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIH 212
            A + V+MA+GRF +D+G+  D+ NS YFQPMIDAIAS G+  V PS  DLR  ILKNV+ 
Sbjct: 180  AENPVHMAIGRFLYDIGVNLDAVNSVYFQPMIDAIASTGSGIVPPSSQDLRGWILKNVME 239

Query: 213  EVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXX 392
            EV+ D+D+    WG+TGCSILV++ +   G+T ++F  YC + T+FL             
Sbjct: 240  EVKDDIDRNKTMWGKTGCSILVEQWSPKSGRTLLSFLVYCPQATVFLKSVDASRVIFSAD 299

Query: 393  VLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDI 572
             L EL+K++VEEVG+ NV+QV+T  E++Y +AGKRL +++PS++W PC  HC+D+ML+D 
Sbjct: 300  HLNELLKQVVEEVGVENVVQVITNCEEQYFLAGKRLMESFPSLYWAPCLVHCVDMMLEDF 359

Query: 573  AEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNI 752
            A    +   ++QA+S++R++Y+++ V+NMMRR+TF  D+V+   TR  ++F TLKR+ ++
Sbjct: 360  ANLEWISETIEQAKSVTRFVYNHSVVLNMMRRFTFHNDIVEPAVTRFASNFATLKRMADL 419

Query: 753  RHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSL 932
            +  LQ+MVNS++W+E  Y+K      + D + N+SFW+SC  I+RL  PLL++  IV S 
Sbjct: 420  KLKLQAMVNSQDWSECPYAKKPGGLVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIVGSK 479

Query: 933  KIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFF 1112
            K   MGYV+AG+YRAKE IKKEL  K++Y+ YW+IID RWEQ +  PL+AA F+LNPKFF
Sbjct: 480  KRSTMGYVYAGIYRAKETIKKELVKKDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNPKFF 539

Query: 1113 YSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLP 1292
            YS+EG+ H+ I S + DCIERLVPD  V D+I++E   Y    GD GR MA+RARD LLP
Sbjct: 540  YSIEGNIHNDILSSMFDCIERLVPDTNVQDQIVREIHLYKNATGDLGRPMAVRARDNLLP 599


>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine
            max]
          Length = 729

 Score =  442 bits (1136), Expect = e-121
 Identities = 217/422 (51%), Positives = 296/422 (70%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            KK ++ + MA+GRF +D+G P D+ NS YFQ M+DAIAS+G     P +H+LR  ILKN 
Sbjct: 179  KKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNS 238

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ D+D+C   WGRTGCSILVD+ T     T  +F                      
Sbjct: 239  VEEVKNDIDRCKMTWGRTGCSILVDQWT-----TETDF---------------------- 271

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LY+L+K++VEEVG   V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+
Sbjct: 272  ---LYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILE 328

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            D      +  V++QARS++R++Y+ +A++NM++RYT G D+VD   +   T+F TLKR+V
Sbjct: 329  DFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMV 388

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++H+LQ++V S+EW +S YSK      + D +SNQ+FWSSC  I+ LT PLL++ RI  
Sbjct: 389  DLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIAS 448

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S   PAMGYV+AG+YRAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPK
Sbjct: 449  SEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 508

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS++GD H  I S + DCIERLVPD ++ DKI+KE   Y   +GDFGRKMA+RARD L
Sbjct: 509  FFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNL 568

Query: 1287 LP 1292
            LP
Sbjct: 569  LP 570


>ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana]
            gi|79313325|ref|NP_001030742.1| hAT dimerization
            domain-containing protein [Arabidopsis thaliana]
            gi|11994740|dbj|BAB03069.1| transposase-like protein
            [Arabidopsis thaliana] gi|28393360|gb|AAO42104.1| unknown
            protein [Arabidopsis thaliana] gi|28827622|gb|AAO50655.1|
            unknown protein [Arabidopsis thaliana]
            gi|332643084|gb|AEE76605.1| hAT dimerization
            domain-containing protein [Arabidopsis thaliana]
            gi|332643085|gb|AEE76606.1| hAT dimerization
            domain-containing protein [Arabidopsis thaliana]
          Length = 761

 Score =  409 bits (1052), Expect = e-111
 Identities = 199/422 (47%), Positives = 278/422 (65%)
 Frame = +3

Query: 27   KKANSVVNMAVGRFFFDVGLPADSANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 206
            K+    V+MA+GRF FD+G   D+ANS   QP IDAI S G     P++ DLR  ILK+ 
Sbjct: 184  KEREKTVHMAMGRFLFDIGADFDAANSVNVQPFIDAIVSGGFGVSIPTHEDLRGWILKSC 243

Query: 207  IHEVRYDVDQCIAAWGRTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 386
            + EV+ ++D+C   W RTGCS+LV E  S +G   + F  YC E  +FL           
Sbjct: 244  VEEVKKEIDECKTLWKRTGCSVLVQELNSNEGPLILKFLVYCPEKVVFLKSVDASEILDS 303

Query: 387  XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 566
               LYEL+KE+VEE+G  NV+QV+T  ED Y  AGK+L D YPS++W PCA HCID ML+
Sbjct: 304  EDKLYELLKEVVEEIGDTNVVQVITKCEDHYAAAGKKLMDVYPSLYWVPCAAHCIDKMLE 363

Query: 567  DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIV 746
            +  +   ++ +++QAR+++R IY+++ V+N+MR++TFG D+V    T S T+F T+ RI 
Sbjct: 364  EFGKMDWIREIIEQARTVTRIIYNHSGVLNLMRKFTFGNDIVQPVCTSSATNFTTMGRIA 423

Query: 747  NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 926
            +++  LQ+MV S EW + SYSK+    A+ ++I+++ FW +      +T P+LR+ RIV 
Sbjct: 424  DLKPYLQAMVTSSEWNDCSYSKEAGGLAMTETINDEDFWKALTLANHITAPILRVLRIVC 483

Query: 927  SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1106
            S + PAMGYV+A +YRAKEAIK  L  +EEY+ YW IID  W Q    PL+AAGFYLNPK
Sbjct: 484  SERKPAMGYVYAAMYRAKEAIKTNLAHREEYIVYWKIIDRWWLQ---QPLYAAGFYLNPK 540

Query: 1107 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1286
            FFYS++ +    I   V DCIE+LVPD+ + D ++K+  SY    G FGR +AIRARDT+
Sbjct: 541  FFYSIDEEMRSEIHLAVVDCIEKLVPDVNIQDIVIKDINSYKNAVGIFGRNLAIRARDTM 600

Query: 1287 LP 1292
            LP
Sbjct: 601  LP 602

