BLASTX nr result

ID: Mentha25_contig00017749 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00017749
         (1189 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   534   e-149
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   530   e-148
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       517   e-144
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   469   e-129
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   465   e-128
ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310...   462   e-127
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   462   e-127
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   460   e-127
ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prun...   456   e-126
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   456   e-125
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   451   e-124
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   451   e-124
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...   450   e-124
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   448   e-123
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   444   e-122
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         441   e-121
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   437   e-120
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   426   e-117
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   418   e-114
ref|NP_188861.2| hAT dimerization domain-containing protein [Ara...   387   e-105

>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score =  534 bits (1376), Expect = e-149
 Identities = 256/392 (65%), Positives = 311/392 (79%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            R+GCS+LVDE  +GKGKT +NF  YCPEGT+FL            D LYEL+KE+VEEVG
Sbjct: 253  RSGCSVLVDEWITGKGKTLLNFLVYCPEGTMFLRSVDASTLINSTDYLYELLKEVVEEVG 312

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
            +RNVLQVVT+ E+RY+IAGKRLTD YP++FWTPCA H IDLML+D+ +   +  +++QA+
Sbjct: 313  VRNVLQVVTSNEERYIIAGKRLTDAYPTLFWTPCAAHSIDLMLEDLKKLEWIDTIMEQAK 372

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            SISR+IY+N  +++MMR++T GVDLVD+G TRS TDF+TLKR+VNI+H+LQSMV S EW 
Sbjct: 373  SISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMVNIKHNLQSMVTSVEWA 432

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            ES YSK  E FA+ D I NQSFWS+C+ + RLTDP+LRL R+V S + PAM YV+AG+YR
Sbjct: 433  ESPYSKKPEGFALLDYIGNQSFWSTCSLVCRLTDPILRLLRMVSSEERPAMAYVYAGVYR 492

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKE IKKEL  K++Y  YW+IID RWE LQRHPLHAAGFYLNPKFFY+ E D H HI+SL
Sbjct: 493  AKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKFFYTTEEDVHLHIRSL 552

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            V DCIE+LVPD K+ DKI+KE  SY   AGDFGRKMA+RARDTL P EWW TYGG CPNL
Sbjct: 553  VYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLFPAEWWSTYGGGCPNL 612

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9
            ARLAIRILSQT  LI+ K  +VPLE +H+  N
Sbjct: 613  ARLAIRILSQTSSLIRSKPGRVPLEEMHETKN 644


>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum
            lycopersicum]
          Length = 748

 Score =  530 bits (1365), Expect = e-148
 Identities = 253/392 (64%), Positives = 310/392 (79%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            RTGCS+L+DE  +GKGK  +NF  YCP+GT+FL            D LYEL+KE+V+E+G
Sbjct: 246  RTGCSVLIDELITGKGKILLNFLVYCPQGTMFLRSVDASTLINSTDYLYELLKEVVDEIG 305

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
            +RNVLQVVT+ E+RYVIAGKRLTD YP++FWTPCA H IDLML+D  +   +  +++QA+
Sbjct: 306  VRNVLQVVTSNEERYVIAGKRLTDAYPTLFWTPCAAHSIDLMLEDFNKLEWIDTIMEQAK 365

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            SISR+IY+N  +++MMR++T GVDLVD+G TRS TDF+TLKR+ NI+H+LQSMV S EW 
Sbjct: 366  SISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMQNIKHNLQSMVTSVEWA 425

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            ES YSK  E FA+ D ISNQSFWS+C+ I RLTDP+LRL R+V S + PAM YV+AG+YR
Sbjct: 426  ESPYSKKPEGFALLDYISNQSFWSTCSLICRLTDPILRLLRMVSSEERPAMPYVYAGVYR 485

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKE IKKEL  K++Y  YW+IID RWE LQRHPLHAAGFYLNPKFFY+ E D H HI+SL
Sbjct: 486  AKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKFFYTTEEDVHLHIRSL 545

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            V DCIE+LVPD K+ DKI+KE  SY   AGDFGRKMA+RARDTL P EWW TYGG CPNL
Sbjct: 546  VYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLFPAEWWSTYGGGCPNL 605

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9
            ARLAIRILSQT  LI+ K  ++P+E +H+ TN
Sbjct: 606  ARLAIRILSQTSSLIRSKPGRIPIEEMHETTN 637


>gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]
          Length = 724

 Score =  517 bits (1331), Expect = e-144
 Identities = 254/393 (64%), Positives = 309/393 (78%), Gaps = 1/393 (0%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            RTGC++LVD+  SGKG+TFVNFF Y  E TIF             D LYEL+KE VE++G
Sbjct: 245  RTGCTVLVDDWNSGKGETFVNFFVYNSEATIFYRSANVSHGIVSADDLYELLKETVEQIG 304

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
            ++NVLQV+T+ ED+Y  AGKRL  TYPS+FW+PCAG C+DLMLQD+   P VK+ L+QA+
Sbjct: 305  VKNVLQVITSCEDQYAFAGKRLATTYPSVFWSPCAGLCVDLMLQDMEHLPMVKVTLEQAK 364

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            SISRYIYSN  V+NM+RR+TFG+DL+D G T S T+FMTLKR++++RH LQSMV SE+W 
Sbjct: 365  SISRYIYSNGFVLNMLRRHTFGLDLLDEGITPSSTNFMTLKRMLSMRHHLQSMVTSEDWI 424

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            +S +S+  E FA+ D++++QSFWS+CASI  L DPLLRL RI+ S K PAMGYV+AGLYR
Sbjct: 425  QSPHSQKPEGFALLDTMTSQSFWSACASITNLIDPLLRLLRIISSGKKPAMGYVYAGLYR 484

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKEAIKK     E+YL Y +IID RWEQLQ+HPLH AGFYLNPKFFYSLEGD     +S+
Sbjct: 485  AKEAIKKHF-VSEDYLVYLNIIDRRWEQLQQHPLHGAGFYLNPKFFYSLEGDALLRSRSM 543

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            V DCIERLVPD +V DKIMKE   YH G GDFGRKMAIRARDTLLPTEWW+ YGG CPNL
Sbjct: 544  VYDCIERLVPDPEVQDKIMKEMTYYHGGVGDFGRKMAIRARDTLLPTEWWIAYGGSCPNL 603

Query: 104  ARLAIRILSQTCCLIQHK-LDKVPLEHLHKRTN 9
            +RLA+++LSQTC  IQ K LDK+PLE +H+  N
Sbjct: 604  SRLAVQVLSQTCGFIQLKLLDKLPLETMHRIKN 636


>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score =  469 bits (1206), Expect = e-129
 Identities = 219/395 (55%), Positives = 298/395 (75%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            G+TGCSILVD+  +  G+T + F AYCPEGT+FL            D LYEL+K++VEEV
Sbjct: 249  GKTGCSILVDQWNTEAGRTLLCFLAYCPEGTVFLKSVDASGIMNSSDALYELLKQVVEEV 308

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+R+VLQV+T+ E++++ AG+RLTDT+P+++WTPCA  C+DL+L+D A+   +  +++QA
Sbjct: 309  GVRHVLQVITSSEEQFIAAGRRLTDTFPTLYWTPCAARCLDLILEDFAKLEWINAIIEQA 368

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            R+++R++Y+++ V+NM+RRYTFG D+V+ G TRS T+F TL+R+++++ +LQ+MV S+EW
Sbjct: 369  RAVTRFVYNHSVVLNMLRRYTFGNDIVEPGITRSATNFTTLRRMISLKPNLQAMVTSQEW 428

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +  YSK      + D +SNQSFWSSC  I+ LT+PLLRL RIV S + P++GYV+AG+Y
Sbjct: 429  MDCPYSKKPGGLEMLDIVSNQSFWSSCGLIVCLTNPLLRLLRIVGSERRPSIGYVYAGMY 488

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
            RAK+A+KKEL  ++EY+ YW+IID  WEQL   PLHAAGF+LNPKFFYS++GD H+ I S
Sbjct: 489  RAKDALKKELIKRDEYMVYWNIIDHWWEQLWHLPLHAAGFFLNPKFFYSIKGDIHNEIVS 548

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108
             + DCIERLVPD KV DKI KE   Y    GDFGRKMAIRARDTLLP EWW TYGG CPN
Sbjct: 549  RMFDCIERLVPDTKVQDKISKEINLYKDAVGDFGRKMAIRARDTLLPAEWWSTYGGSCPN 608

Query: 107  LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            LARLA RI SQTC  +    +++  E ++   N L
Sbjct: 609  LARLATRIQSQTCSSLADTRNQIHFERIYDTRNCL 643


>ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative
            [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED
            zinc finger domain-containing protein, putative
            [Theobroma cacao]
          Length = 749

 Score =  465 bits (1196), Expect = e-128
 Identities = 217/394 (55%), Positives = 294/394 (74%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            RTGCSILV++  +  G+  +NF  YCPEGT+FL            D LYEL+K++VEEVG
Sbjct: 250  RTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSVINSSDALYELLKQVVEEVG 309

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
             ++VLQV+T  E++Y++AG+RL +T+P+++WTPCA HCI+L+L+D A+   + ++++QAR
Sbjct: 310  SKHVLQVITNAEEQYIVAGRRLAETFPTLYWTPCAAHCINLILEDFAKLEWINVIIEQAR 369

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            SI+R++Y+++ V+NM+RRYT G D+V+   T S T+F TLK++++++++LQ+MV S+EW 
Sbjct: 370  SITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMIDLKNNLQAMVTSQEWM 429

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            +  YSK      + D +SN SFWSS   I +LT+PLLR+ R+V S K PAMGYV+AG+YR
Sbjct: 430  DCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVGSKKRPAMGYVYAGMYR 489

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKE IKKEL  + EY+ YW+IID  WEQ   HPLH AGFYLNPKFFYS+EGD  + + S 
Sbjct: 490  AKETIKKELVKRNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPKFFYSMEGDMPNEMLSG 549

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            + DCIE+LVPD+KV DKI KE  SY    GDFGRKMA+RARDTLLP EWW TYGG CPNL
Sbjct: 550  MLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTLLPAEWWSTYGGSCPNL 609

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            ARLAI +LSQTC  +  K + +P E LH+  N+L
Sbjct: 610  ARLAIHVLSQTCSTLGLKQNSIPFEKLHETRNFL 643


>ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310825 [Fragaria vesca
            subsp. vesca]
          Length = 869

 Score =  462 bits (1189), Expect = e-127
 Identities = 220/394 (55%), Positives = 285/394 (72%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            R GCS+LV++  S KG+  +NF  YCPEGT +L            D LYE++K++VEEVG
Sbjct: 370  RNGCSLLVNQFNSEKGRILLNFSVYCPEGTTYLKSVDASTFINSPDALYEILKQVVEEVG 429

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
            +R VLQV+T  E+ YV+AGKRL DT+P+++W+PCA  CI+ +L+D  +F  +  ++ QAR
Sbjct: 430  VRRVLQVITNSEEHYVVAGKRLMDTFPTLYWSPCAAACINSILEDFGKFEWINSIIAQAR 489

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            S++R+IY +  ++NMMRRYTFG D+V +G TR  TDFMTLK++ +++ +LQ+MV S+EW 
Sbjct: 490  SVTRFIYKHVVILNMMRRYTFGNDIVKLGITRYATDFMTLKQMADLKFNLQTMVTSKEWE 549

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
               YSK  E  A+ D +SN +FWSSC  I R T+PLL++ RIV S K  AMGYVF G+YR
Sbjct: 550  GCPYSKTPEGLAMLDLLSNHTFWSSCIMITRFTNPLLQVLRIVGSQKKAAMGYVFGGMYR 609

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKE IK+EL  KE Y  YW+IID RW +L  HPLHAAGFYLNPKFFYS++G+ H  I S 
Sbjct: 610  AKETIKRELVKKEVYTAYWNIIDYRWAKLWDHPLHAAGFYLNPKFFYSIKGEMHKVIMSR 669

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            + DCIE+LVPDLKV D+I KE   Y    GD GR +AIRARDTLLP EWW TYG  CPN+
Sbjct: 670  MFDCIEKLVPDLKVQDEISKEINLYQNAVGDMGRNLAIRARDTLLPAEWWSTYGSGCPNM 729

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            ARLA+ ILSQTC LIQ K +++P + LHK  N L
Sbjct: 730  ARLAVHILSQTCSLIQCKENQIPFDQLHKTRNSL 763


>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
            gi|223536481|gb|EEF38128.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 753

 Score =  462 bits (1188), Expect = e-127
 Identities = 219/394 (55%), Positives = 292/394 (74%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            RTGCS+LVD+  +  G+T ++F  YC EG +FL            D LYEL+K++VEEVG
Sbjct: 258  RTGCSVLVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDASDIINSSDALYELIKKVVEEVG 317

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
            +R+VLQV+T++E++Y++ G+RLTDT+P+++  PCA HCIDL+L+D A+   +  V+ QAR
Sbjct: 318  VRHVLQVITSMEEQYIVVGRRLTDTFPTLYRAPCAAHCIDLILEDFAKLEWISTVILQAR 377

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            SI+R++Y+++ V+NM++RYTFG ++V  G T   T+F TLKR+V+++H+LQ+MV S+EW 
Sbjct: 378  SITRFVYNHSVVLNMVKRYTFGSEIVATGLTHFATNFETLKRMVDLKHTLQTMVTSQEWM 437

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            +  YSK      + D +SNQSFWSSC  I  LT+PLLRL RIV S K P MGYV+AG+YR
Sbjct: 438  DCPYSKKPRGLEMLDLLSNQSFWSSCVLITNLTNPLLRLLRIVSSKKRPPMGYVYAGIYR 497

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKEAIKKEL  +++Y+ YW+IID  WEQ    PLHAAGF+LNPK  YS+EGD H+ I S 
Sbjct: 498  AKEAIKKELVKRKDYMVYWNIIDHWWEQQSNLPLHAAGFFLNPKVLYSIEGDLHNEILSG 557

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            + DCIE+LVPD+ V DKI KE  SY   +GDFGRKMA+RAR+TLLP EWW TYGG CPNL
Sbjct: 558  MFDCIEKLVPDVTVQDKITKEINSYKNASGDFGRKMAVRARETLLPAEWWSTYGGSCPNL 617

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            ARLAIR+LSQ C    +KL+ + LE +H   N L
Sbjct: 618  ARLAIRVLSQPCSSFGYKLNHISLEQIHDTKNCL 651


>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca
            subsp. vesca]
          Length = 754

 Score =  460 bits (1184), Expect = e-127
 Identities = 222/397 (55%), Positives = 290/397 (73%), Gaps = 2/397 (0%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            GRTGCSILVD+  +      ++F  Y PEGT+FL            D LY+L++ +VE+V
Sbjct: 247  GRTGCSILVDQWNTELDNVMLSFLVYSPEGTVFLESVDASAIINSSDALYDLLRRVVEDV 306

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+ +V+QV+T+ E+++V+AG+RL DT+P++FW PCA  C+DL+L+D      +  V++QA
Sbjct: 307  GVGDVVQVITSGEEQFVVAGRRLADTFPNLFWIPCAARCLDLILEDFGSLDWIHAVIEQA 366

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            RSI++++Y++  V+N++RR TFG D+V+ G TR  T F TLKR+V+++H LQ MV S+EW
Sbjct: 367  RSITKFVYNHNVVLNLVRRSTFGNDIVEPGVTRFGTSFTTLKRLVDLKHCLQVMVTSQEW 426

Query: 647  TESSYSKDQEAFAVQDSISN--QSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAG 474
             +  YSK+     + D IS+  QSFWSSC  I+RLT PLLR+ R+V   K PAMG+++AG
Sbjct: 427  MDCPYSKEPGGLEISDLISDRDQSFWSSCTLIVRLTSPLLRVLRMVGCEKRPAMGFIYAG 486

Query: 473  LYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHI 294
            +YRAKEAIKKEL  +EEY+ YW+IID RWEQ    PLHAAGFYLNPK FYS+EGD H+ I
Sbjct: 487  MYRAKEAIKKELVKREEYMVYWNIIDQRWEQHWNFPLHAAGFYLNPKIFYSIEGDIHNSI 546

Query: 293  QSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGEC 114
            QS + DCIER+VPD+KV DKIMKE  SY   AGDF RKMAIRARDTLLP EWW TYGG C
Sbjct: 547  QSGMYDCIERMVPDIKVQDKIMKEIISYKNAAGDFRRKMAIRARDTLLPAEWWSTYGGGC 606

Query: 113  PNLARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            PNLARLAIRILSQTC  I ++  ++P E  H   N L
Sbjct: 607  PNLARLAIRILSQTCGSIGYRQSQIPFEKAHGIRNCL 643


>ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prunus persica]
            gi|462411082|gb|EMJ16131.1| hypothetical protein
            PRUPE_ppa001359mg [Prunus persica]
          Length = 845

 Score =  456 bits (1174), Expect = e-126
 Identities = 218/394 (55%), Positives = 289/394 (73%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            RTGCS+LV++ +S KGKT +NF   CPEGTI+L            D L+E +KE+VEEVG
Sbjct: 349  RTGCSLLVNQWSSEKGKTLLNFAVQCPEGTIYLKSVDASYFIFSPDALFEFLKEVVEEVG 408

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
            + +VLQV+T  E+++ +AGKRL DT+P+++W+PC    IDL+L+D  +   +  V++QAR
Sbjct: 409  VGHVLQVITNTEEQFAVAGKRLMDTFPTLYWSPCVATSIDLILEDFGKVEWINSVIEQAR 468

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            S++R+IY +  ++NMMRRYTFG D+V +G TR  T+F TLK++ +++ +LQSMV S+EW 
Sbjct: 469  SVTRFIYKHVVILNMMRRYTFGNDIVRLGVTRFATNFTTLKQMADLKFNLQSMVTSKEWM 528

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
               YSK  E  AV D +SN SFWS+C  +  LT+PLLR+ RIV S K  AMGYVFAG+YR
Sbjct: 529  CCPYSKTPEGSAVLDVLSNHSFWSACILVTHLTNPLLRVLRIVGSQKRAAMGYVFAGIYR 588

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKE IK+EL  +EEY+ YW IID RW++L   PLHAAGFYLNPKFFYS++GD H+ I S 
Sbjct: 589  AKETIKRELVKREEYMVYWDIIDYRWKKLWPLPLHAAGFYLNPKFFYSVKGDLHNEIISR 648

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            + DCIERLVPD+K+ D+++KE   Y    GD GR +A+RARD LLP EWW TYG  CPNL
Sbjct: 649  MFDCIERLVPDIKIQDEVIKEINLYKNAVGDLGRNLAVRARDNLLPAEWWSTYGSSCPNL 708

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            ARLAIRILSQTC ++Q + +++P E LHK  N L
Sbjct: 709  ARLAIRILSQTCSIVQGQENQIPFELLHKTRNSL 742


>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine
            max] gi|571489936|ref|XP_006591345.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X2 [Glycine
            max] gi|571489939|ref|XP_006591346.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X3 [Glycine
            max]
          Length = 759

 Score =  456 bits (1172), Expect = e-125
 Identities = 215/395 (54%), Positives = 292/395 (73%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            GRTGCSILVD+ T+  GK  ++F AYCPEG +FL            D LY+L+K++VEEV
Sbjct: 254  GRTGCSILVDQWTTETGKILISFLAYCPEGLVFLRSLDATEISTSADFLYDLIKQVVEEV 313

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G   V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+D      +  V++QA
Sbjct: 314  GAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILEDFGNLEWISAVIEQA 373

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            RS++R++Y+ +A++NM++RYT G D+VD   +   T+F TLKR+V+++H+LQ++V S+EW
Sbjct: 374  RSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMVDLKHNLQALVTSQEW 433

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +S YSK      + D +SNQ+FWSSC  I+ LT PLL++ RI  S   PAMGYV+AG+Y
Sbjct: 434  ADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIASSEMRPAMGYVYAGMY 493

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
            RAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPKFFYS++GD H  I S
Sbjct: 494  RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHGQIVS 553

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108
             + DCIERLVPD ++ DKI+KE   Y   +GDFGRKMA+RARD LLP+EWW TYGG CPN
Sbjct: 554  GMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 613

Query: 107  LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            L+RLAIRILSQT  ++  K +++P E +    N++
Sbjct: 614  LSRLAIRILSQTSSVMSCKRNQIPFEQIINTRNYI 648


>ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036895|gb|ESW35425.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score =  451 bits (1159), Expect = e-124
 Identities = 207/395 (52%), Positives = 291/395 (73%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            GRTGCSILVD+  +  G+  ++F AYCPEG +FL            D LY+++K++V+EV
Sbjct: 252  GRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYDMIKQVVDEV 311

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+  VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+D      +  V++QA
Sbjct: 312  GVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLEWISAVIEQA 371

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            +S++R++Y+ +A++ M++RYT G D+VD   ++  T+F TLKR+V+++H+LQ++V S+EW
Sbjct: 372  KSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNLQALVTSQEW 431

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +  YSK      + D +S+Q+FWSSC  I+RLT PLL++ RI  S   PAMGY++AG+Y
Sbjct: 432  ADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPAMGYIYAGIY 491

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
            RAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPKFFYS++GD H  I S
Sbjct: 492  RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHSQIVS 551

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108
             + DCIERLV D ++ DKI+KE   Y   AGDFGRKMA+RARD LLP+EWW TYGG CPN
Sbjct: 552  GMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 611

Query: 107  LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            L+RLAIRILSQT  ++  K +++P E +    N++
Sbjct: 612  LSRLAIRILSQTSSVMSCKRNQIPFEQIVNTRNYI 646


>ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036894|gb|ESW35424.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score =  451 bits (1159), Expect = e-124
 Identities = 207/395 (52%), Positives = 291/395 (73%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            GRTGCSILVD+  +  G+  ++F AYCPEG +FL            D LY+++K++V+EV
Sbjct: 365  GRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYDMIKQVVDEV 424

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+  VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+D      +  V++QA
Sbjct: 425  GVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLEWISAVIEQA 484

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            +S++R++Y+ +A++ M++RYT G D+VD   ++  T+F TLKR+V+++H+LQ++V S+EW
Sbjct: 485  KSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNLQALVTSQEW 544

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +  YSK      + D +S+Q+FWSSC  I+RLT PLL++ RI  S   PAMGY++AG+Y
Sbjct: 545  ADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPAMGYIYAGIY 604

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
            RAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPKFFYS++GD H  I S
Sbjct: 605  RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHSQIVS 664

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108
             + DCIERLV D ++ DKI+KE   Y   AGDFGRKMA+RARD LLP+EWW TYGG CPN
Sbjct: 665  GMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 724

Query: 107  LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            L+RLAIRILSQT  ++  K +++P E +    N++
Sbjct: 725  LSRLAIRILSQTSSVMSCKRNQIPFEQIVNTRNYI 759


>ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa]
            gi|550335284|gb|ERP58729.1| hypothetical protein
            POPTR_0006s02210g [Populus trichocarpa]
          Length = 847

 Score =  450 bits (1158), Expect = e-124
 Identities = 217/394 (55%), Positives = 284/394 (72%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            +TGCS+LV+E  S  G T +NF  YC +GT+FL            D LYEL+K +VEEVG
Sbjct: 358  KTGCSLLVEECNSESGVTTLNFLVYCSKGTVFLKSVDASNLIHSTDGLYELLKLMVEEVG 417

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
              N+LQV+T  E+ Y+ AGK+L DT+PS++W PCA  CIDL+L+DI +   +  VL+QA+
Sbjct: 418  AGNILQVITNGEEHYIAAGKKLMDTFPSLYWAPCAARCIDLILEDIGKLDWINTVLEQAK 477

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            S++R++Y+N+AV+N+MR++T G D+V  G TRS T+F  LKR+ N + +LQ+MV S+EW 
Sbjct: 478  SVTRFVYNNSAVLNLMRKFTSGSDIVQQGITRSATNFTALKRMANFKLNLQTMVTSQEWM 537

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            +  YSK     A+ D I+N+SFWSSC  IIRLT PLL++  IV S K  AMGYVF+G+YR
Sbjct: 538  DCPYSKQPGGLAMVDIITNRSFWSSCILIIRLTSPLLQVLVIVSSEKRAAMGYVFSGIYR 597

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKE IKKEL  +E+Y+ YW+IID RWEQ  + PLHAAGF+ NPKFFYS+EGD H+ I S 
Sbjct: 598  AKETIKKELVKREDYMVYWNIIDHRWEQQWQTPLHAAGFFFNPKFFYSIEGDMHNKILSR 657

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            + DCIERLVPD +V DKI+KE   Y    G  G+K+AIRAR T+LPT+WW  YGG CPNL
Sbjct: 658  MFDCIERLVPDTEVQDKIVKELTLYKNAEGHLGKKLAIRARGTMLPTDWWSMYGGSCPNL 717

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            ARLAIRILSQTC  I    + +P E +H+  N+L
Sbjct: 718  ARLAIRILSQTCSAIGCSHNHIPFEKVHRTRNFL 751


>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine
            max] gi|571542833|ref|XP_006601996.1| PREDICTED:
            uncharacterized protein LOC100806265 isoform X2 [Glycine
            max]
          Length = 758

 Score =  448 bits (1152), Expect = e-123
 Identities = 210/395 (53%), Positives = 289/395 (73%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            GRTGCSILVD+ T+   +  ++F AYCPEG +FL            D LY+L+K++VEE+
Sbjct: 253  GRTGCSILVDQWTTETSRILISFLAYCPEGLVFLKSLDATEILTSPDFLYDLIKQVVEEI 312

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+  V+QV+T+ E++Y IAG+RL DT+P+++W+P A HCIDL+L+D      +  V++QA
Sbjct: 313  GVGKVVQVITSGEEQYGIAGRRLMDTFPTLYWSPSAAHCIDLILEDFGNLEWISAVIEQA 372

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            +S++R++Y+ +A++NM++RYT G D+VD   +R  T+F TLKR+V+++H+LQ++V S+EW
Sbjct: 373  KSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSRFATNFTTLKRMVDLKHNLQALVTSQEW 432

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +  YSK      + D +SNQ+FWSSC  I+ LT PLL++ RI  S   P MGYV+AG+Y
Sbjct: 433  ADCPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVLRIAGSEMRPGMGYVYAGMY 492

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
            R KEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPKFFYS++GD    I S
Sbjct: 493  RVKEAIKKALGKREEYMVYWNIIHHRWERLWNHPLHAAGFYLNPKFFYSIQGDILGQIVS 552

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108
             + DCIERLVPD ++ DKI+KE   Y   AGDFGRKMA+RARD LLP+EWW TYGG CPN
Sbjct: 553  GMFDCIERLVPDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 612

Query: 107  LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            L+RLAIRILSQT  ++  K ++VP E +    N++
Sbjct: 613  LSRLAIRILSQTSSVMSCKRNQVPFEQIINTRNYI 647


>ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis]
            gi|223539752|gb|EEF41333.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 854

 Score =  444 bits (1143), Expect = e-122
 Identities = 211/394 (53%), Positives = 281/394 (71%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            RTGCS+LV+E  S  G T +NF   C +GT+FL            D LY L+K++VEEVG
Sbjct: 361  RTGCSVLVEEWNSESGITLLNFLVNCSQGTVFLKSVEASHIIYSPDGLYVLLKQVVEEVG 420

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
              NVLQV+T   + Y +AGKRL + +PS+FW PCA HC+DL+L+D A+   +  V++QA+
Sbjct: 421  ASNVLQVITNGNEHYTVAGKRLMEAFPSLFWAPCAVHCLDLILEDFAKLEWIDAVIEQAK 480

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            S++R++Y+++AV+N+MR++T+G D+V  G TRS T+F  L+R+ + + +LQ+M+ S+EW 
Sbjct: 481  SVTRFVYNHSAVLNLMRKFTYGKDIVQQGLTRSATNFTMLQRMADFKLNLQTMITSQEWM 540

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            +  YSK     A+ D ISN+SFWSSC  IIRLT PL+R+  I    +  AMGY+FAG+YR
Sbjct: 541  DCPYSKQHGGLAMLDIISNRSFWSSCILIIRLTSPLIRVLGIAGGKRKAAMGYIFAGIYR 600

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKE IK+EL  +E+Y+ YW+IID RW+Q +  PLH AGF+LNPKFFYS+EGD H+ I S 
Sbjct: 601  AKETIKRELVKREDYMVYWNIIDHRWDQRRHPPLHVAGFFLNPKFFYSIEGDVHNEILSR 660

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            V DCIERLVPD++V DKI KE   Y    GD GRKMAIR+R TLLP EWW TYGG CPNL
Sbjct: 661  VFDCIERLVPDIEVQDKIAKELNIYKNAVGDLGRKMAIRSRGTLLPAEWWSTYGGGCPNL 720

Query: 104  ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            ARLA+RILSQTC  I  + + +P E +H   N L
Sbjct: 721  ARLALRILSQTCSSIGCRSNHIPFEKVHATRNCL 754


>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score =  441 bits (1133), Expect = e-121
 Identities = 216/394 (54%), Positives = 284/394 (72%), Gaps = 1/394 (0%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            G TGCS++VD+  +  G+T +NF  YCP+GT+FL            D+LYEL+K++VE+V
Sbjct: 252  GMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQV 311

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+++V+QV+T  E+ + IAG++L+DTYP+++WTPCA  C+DL+L DI     V  V++QA
Sbjct: 312  GVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILADIGNIEDVNTVIEQA 371

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            RSI+R++Y+N+ V+NM+R+ TFG D+V+   TRS T+F TL R+V+++  LQ+MV S+EW
Sbjct: 372  RSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKRCLQNMVTSQEW 431

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +S YSK      + D IS++SFWSSC SIIRLT+PLLR+ RIV S K PAMGYV+A +Y
Sbjct: 432  MDSPYSKRPGGLEMLDLISSESFWSSCNSIIRLTNPLLRVLRIVGSGKRPAMGYVYAAMY 491

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
             AK AIK EL  ++ Y+ YW+IID RWE   RHPL AAGFYLNPK+FYS+EGD H  I S
Sbjct: 492  NAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLCAAGFYLNPKYFYSIEGDMHGEILS 551

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYG-GECP 111
             + DCIERLV D  V DKI+KE  SY   +GDF RK AIRAR TLLP EWW T G G CP
Sbjct: 552  GMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCP 611

Query: 110  NLARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9
            NL RLA RILSQTC  +  K ++V  + LH   N
Sbjct: 612  NLTRLATRILSQTCSSVGFKQNQVFFDKLHDTRN 645


>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score =  437 bits (1123), Expect = e-120
 Identities = 214/394 (54%), Positives = 282/394 (71%), Gaps = 1/394 (0%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            G TGCS++VD+  +  G+T +NF  YCP+GT+FL            D+LYEL+K++VE+V
Sbjct: 252  GITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQV 311

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+++V+QV+T  E+ + IAG++L+DTYP+++WTPCA  C+DL+L DI     V  V++QA
Sbjct: 312  GVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTVIEQA 371

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            RSI+R++Y+N+ V+NM+R+ TFG D+V+   TRS T+F TL R+V+++  LQ+MV S+EW
Sbjct: 372  RSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKRCLQNMVTSQEW 431

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +S YSK      + D IS++SFWSSC SII LT+PLLR+ RIV S K PAMGYV+A +Y
Sbjct: 432  MDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKRPAMGYVYAAMY 491

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
             AK AIK EL  ++ Y+ YW+IID RWE   RHPL+AAGFYLNPK+FYS+EGD H  I S
Sbjct: 492  NAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYSIEGDMHGEILS 551

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYG-GECP 111
             + DCIERLV D  V DKI+KE  SY   +GDF RK AIRAR TLLP EWW T G G CP
Sbjct: 552  GMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCP 611

Query: 110  NLARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9
            NL RLA RILSQTC  +  K +    + LH   N
Sbjct: 612  NLTRLATRILSQTCSSVGFKQNDALFDKLHDTRN 645


>ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao]
            gi|508701288|gb|EOX93184.1| HAT transposon superfamily,
            putative [Theobroma cacao]
          Length = 750

 Score =  426 bits (1096), Expect = e-117
 Identities = 199/395 (50%), Positives = 280/395 (70%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            G+TGCSILV++ +   G+T ++F  YCP+ T+FL            D L EL+K++VEEV
Sbjct: 253  GKTGCSILVEQWSPKSGRTLLSFLVYCPQATVFLKSVDASRVIFSADHLNELLKQVVEEV 312

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G+ NV+QV+T  E++Y +AGKRL +++PS++W PC  HC+D+ML+D A    +   ++QA
Sbjct: 313  GVENVVQVITNCEEQYFLAGKRLMESFPSLYWAPCLVHCVDMMLEDFANLEWISETIEQA 372

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            +S++R++Y+++ V+NMMRR+TF  D+V+   TR  ++F TLKR+ +++  LQ+MVNS++W
Sbjct: 373  KSVTRFVYNHSVVLNMMRRFTFHNDIVEPAVTRFASNFATLKRMADLKLKLQAMVNSQDW 432

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
            +E  Y+K      + D + N+SFW+SC  I+RL  PLL++  IV S K   MGYV+AG+Y
Sbjct: 433  SECPYAKKPGGLVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIVGSKKRSTMGYVYAGIY 492

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
            RAKE IKKEL  K++Y+ YW+IID RWEQ +  PL+AA F+LNPKFFYS+EG+ H+ I S
Sbjct: 493  RAKETIKKELVKKDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNPKFFYSIEGNIHNDILS 552

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108
             + DCIERLVPD  V D+I++E   Y    GD GR MA+RARD LLP EWW  YGG CPN
Sbjct: 553  SMFDCIERLVPDTNVQDQIVREIHLYKNATGDLGRPMAVRARDNLLPGEWWSMYGGGCPN 612

Query: 107  LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            L  LAIRILSQTC  I  K +K+ +E +H   N+L
Sbjct: 613  LQHLAIRILSQTCSSIGSKPNKISIEEIHDTRNFL 647


>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine
            max]
          Length = 729

 Score =  418 bits (1075), Expect = e-114
 Identities = 205/395 (51%), Positives = 279/395 (70%)
 Frame = -3

Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008
            GRTGCSILVD+ T     T  +F                         LY+L+K++VEEV
Sbjct: 254  GRTGCSILVDQWT-----TETDF-------------------------LYDLIKQVVEEV 283

Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828
            G   V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+D      +  V++QA
Sbjct: 284  GAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILEDFGNLEWISAVIEQA 343

Query: 827  RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648
            RS++R++Y+ +A++NM++RYT G D+VD   +   T+F TLKR+V+++H+LQ++V S+EW
Sbjct: 344  RSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMVDLKHNLQALVTSQEW 403

Query: 647  TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468
             +S YSK      + D +SNQ+FWSSC  I+ LT PLL++ RI  S   PAMGYV+AG+Y
Sbjct: 404  ADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIASSEMRPAMGYVYAGMY 463

Query: 467  RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288
            RAKEAIKK L  +EEY+ YW+II  RWE+L  HPLHAAGFYLNPKFFYS++GD H  I S
Sbjct: 464  RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHGQIVS 523

Query: 287  LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108
             + DCIERLVPD ++ DKI+KE   Y   +GDFGRKMA+RARD LLP+EWW TYGG CPN
Sbjct: 524  GMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 583

Query: 107  LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3
            L+RLAIRILSQT  ++  K +++P E +    N++
Sbjct: 584  LSRLAIRILSQTSSVMSCKRNQIPFEQIINTRNYI 618


>ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana]
            gi|79313325|ref|NP_001030742.1| hAT dimerization
            domain-containing protein [Arabidopsis thaliana]
            gi|11994740|dbj|BAB03069.1| transposase-like protein
            [Arabidopsis thaliana] gi|28393360|gb|AAO42104.1| unknown
            protein [Arabidopsis thaliana] gi|28827622|gb|AAO50655.1|
            unknown protein [Arabidopsis thaliana]
            gi|332643084|gb|AEE76605.1| hAT dimerization
            domain-containing protein [Arabidopsis thaliana]
            gi|332643085|gb|AEE76606.1| hAT dimerization
            domain-containing protein [Arabidopsis thaliana]
          Length = 761

 Score =  387 bits (995), Expect = e-105
 Identities = 186/372 (50%), Positives = 253/372 (68%)
 Frame = -3

Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005
            RTGCS+LV E  S +G   + F  YCPE  +FL            D LYEL+KE+VEE+G
Sbjct: 260  RTGCSVLVQELNSNEGPLILKFLVYCPEKVVFLKSVDASEILDSEDKLYELLKEVVEEIG 319

Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825
              NV+QV+T  ED Y  AGK+L D YPS++W PCA HCID ML++  +   ++ +++QAR
Sbjct: 320  DTNVVQVITKCEDHYAAAGKKLMDVYPSLYWVPCAAHCIDKMLEEFGKMDWIREIIEQAR 379

Query: 824  SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645
            +++R IY+++ V+N+MR++TFG D+V    T S T+F T+ RI +++  LQ+MV S EW 
Sbjct: 380  TVTRIIYNHSGVLNLMRKFTFGNDIVQPVCTSSATNFTTMGRIADLKPYLQAMVTSSEWN 439

Query: 644  ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465
            + SYSK+    A+ ++I+++ FW +      +T P+LR+ RIV S + PAMGYV+A +YR
Sbjct: 440  DCSYSKEAGGLAMTETINDEDFWKALTLANHITAPILRVLRIVCSERKPAMGYVYAAMYR 499

Query: 464  AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285
            AKEAIK  L  +EEY+ YW IID  W Q    PL+AAGFYLNPKFFYS++ +    I   
Sbjct: 500  AKEAIKTNLAHREEYIVYWKIIDRWWLQ---QPLYAAGFYLNPKFFYSIDEEMRSEIHLA 556

Query: 284  VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105
            V DCIE+LVPD+ + D ++K+  SY    G FGR +AIRARDT+LP EWW TYG  C NL
Sbjct: 557  VVDCIEKLVPDVNIQDIVIKDINSYKNAVGIFGRNLAIRARDTMLPAEWWSTYGESCLNL 616

Query: 104  ARLAIRILSQTC 69
            +R AIRILSQTC
Sbjct: 617  SRFAIRILSQTC 628

