BLASTX nr result

ID: Mentha29_contig00016986 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00016986
         (2154 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006351643.1| PREDICTED: transcription factor bHLH155-like...   602   e-169
ref|XP_004247231.1| PREDICTED: transcription factor EMB1444-like...   601   e-169
ref|XP_006351645.1| PREDICTED: transcription factor bHLH155-like...   597   e-168
ref|XP_003632423.1| PREDICTED: uncharacterized basic helix-loop-...   568   e-159
emb|CBI37092.3| unnamed protein product [Vitis vinifera]              568   e-159
ref|XP_004292200.1| PREDICTED: transcription factor bHLH155-like...   531   e-148
ref|XP_007200308.1| hypothetical protein PRUPE_ppa001930mg [Prun...   530   e-148
gb|EXC31934.1| hypothetical protein L484_009784 [Morus notabilis]     525   e-146
ref|XP_007050338.1| Basic helix-loop-helix DNA-binding superfami...   520   e-144
ref|XP_004161538.1| PREDICTED: transcription factor EMB1444-like...   517   e-144
ref|XP_007050336.1| Basic helix-loop-helix-containing protein, p...   507   e-140
ref|XP_006443866.1| hypothetical protein CICLE_v10018993mg [Citr...   505   e-140
ref|XP_007050337.1| Basic helix-loop-helix DNA-binding superfami...   503   e-139
ref|XP_002532375.1| basic helix-loop-helix-containing protein, p...   501   e-139
emb|CCX35476.1| hypothetical protein [Malus domestica]                501   e-139
ref|XP_006383698.1| basic helix-loop-helix family protein [Popul...   498   e-138
ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citr...   496   e-137
ref|XP_004146986.1| PREDICTED: transcription factor EMB1444-like...   491   e-136
ref|XP_003553489.1| PREDICTED: transcription factor EMB1444-like...   481   e-133
ref|XP_006588678.1| PREDICTED: transcription factor EMB1444-like...   460   e-126

>ref|XP_006351643.1| PREDICTED: transcription factor bHLH155-like isoform X1 [Solanum
            tuberosum]
          Length = 722

 Score =  602 bits (1552), Expect = e-169
 Identities = 347/708 (49%), Positives = 459/708 (64%), Gaps = 19/708 (2%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDKNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVYSLG 180
            KL  RARMM TWEDAYYD + +  K    S AG L +G YS++ LG+AVAKMSY VYSLG
Sbjct: 25   KLTHRARMMLTWEDAYYDNDGFPGKKSPGSTAGNLYDGHYSNNHLGVAVAKMSYHVYSLG 84

Query: 181  EGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVVQLG 360
            EG+VG VA++GKH W+ +D+    +S + E+   W++QFSAGIKTI V AV PHGV+QLG
Sbjct: 85   EGIVGQVAITGKHLWLSADKVAAITSLAPEHCDGWQAQFSAGIKTIVVAAVAPHGVIQLG 144

Query: 361  SLHKIAEDLKLVDHIRNVFSNLQD----SLTGSMPSSLKNTSPSDKCSRPS-NPAFNQCL 525
            SL  I EDL+ + HIR+VFS LQ+     L  SM  S++N+  S+  +R S +  F  C+
Sbjct: 145  SLDSIPEDLRAIKHIRDVFSELQELMASCLRSSMQYSMENSCLSEISTRTSGSEVFQDCV 204

Query: 526  AKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIK----MHETTECSTA 693
              L   V  D  N+WS L   + +SV++SC     GG  NK L+      +H T+   + 
Sbjct: 205  NNLGRSVCEDGRNMWSPLYTSVEKSVDHSCIFSQPGGFPNKILEAVHNQGLHRTSVQGSD 264

Query: 694  GSDTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGRVEKSR 873
             S+ L+ +SCE+   +  +EG M    +PK  G+T+ +  LG K  ++   P  R + S 
Sbjct: 265  DSENLLPASCESSIIKHQEEGQMWEETDPKFEGQTSNLRVLG-KGSVDKCEPTFRSDASI 323

Query: 874  LCNAILPTEASEVVASIVPPHNLEPPVFTAHAD------ISFPELPSLQMHQDFANSELP 1035
                 +  +A +V     P  N       + AD      +   +LP+    +  A + L 
Sbjct: 324  ---GSVSYDAGQVTECPQPNRNN----LASEADNDRNRKLGLSDLPNAYADK-CAETNLG 375

Query: 1036 ITSNSSEMQASPFSFCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSCSQ 1212
              +  ++   +PF FCAGYEL+EALGP FQK N+   WE      +MAV + EGI + S 
Sbjct: 376  FETQCNDTMHTPFRFCAGYELYEALGPVFQKGNSSKDWEAGK-REEMAVDMLEGIGTSSL 434

Query: 1213 LMENS-DMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCNS-VGTLSSAGF 1386
            +M N+ + HLL+AV+   +   +   S  S   + +SL+T E    PC+S +G +SS G+
Sbjct: 435  VMSNTGNEHLLEAVIANVNRYDNDCSSVKSFCKSVDSLLTTEITAEPCSSDIGAISSIGY 494

Query: 1387 SFDRDTSSSFNSV-TCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPGESSRPRP 1563
            SFDR+T +SFNS  TC + S +G+S  S SRGS HVER   P K+ KKRARPGES RPRP
Sbjct: 495  SFDRETLNSFNSSGTCSIRSSRGLSSTSCSRGSGHVERPLEPVKMHKKRARPGESCRPRP 554

Query: 1564 RDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSVSKLLDK 1743
            RDRQLIQDRIKELR+L+PNGSKCSIDSLLERTIKHM+FMQSVTKHA+KL KCS SKL+DK
Sbjct: 555  RDRQLIQDRIKELRDLVPNGSKCSIDSLLERTIKHMLFMQSVTKHADKLSKCSASKLVDK 614

Query: 1744 DTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLEVAETIR 1923
            ++ +   S  E GSSWAVEVGNNQKVCP+ VEN+ MNGQMLVE+  ++   FL++AE IR
Sbjct: 615  ESDICGSSSHEVGSSWAVEVGNNQKVCPMRVENLGMNGQMLVEIF-EDGSHFLDIAEAIR 673

Query: 1924 SLGLNIMKGVSEAYGNKAWMCFVVENNRSMHRMDVLWSLMQLLQPKIS 2067
            SLGL I+KG++EAY  +  MCFVVEN+R++HRMDVLWSLMQLLQ KI+
Sbjct: 674  SLGLTILKGLAEAYSERTRMCFVVENDRTLHRMDVLWSLMQLLQAKIN 721


>ref|XP_004247231.1| PREDICTED: transcription factor EMB1444-like [Solanum lycopersicum]
          Length = 724

 Score =  601 bits (1550), Expect = e-169
 Identities = 347/708 (49%), Positives = 464/708 (65%), Gaps = 19/708 (2%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDKNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVYSLG 180
            KL  RARMM TWEDAYYD + +  K   +S AG L +G YS++ LG+AVAKMSY VYSLG
Sbjct: 25   KLTHRARMMLTWEDAYYDNDGFPGKKSPDSTAGNLYDGHYSNNHLGVAVAKMSYHVYSLG 84

Query: 181  EGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVVQLG 360
            EG+VG VA++GKH W+ +++    ++ + E+   W++QFSAGIKTI V AV PHGVVQLG
Sbjct: 85   EGIVGQVAITGKHLWLSANKVAAITNLAPEHCDGWQAQFSAGIKTIVVAAVAPHGVVQLG 144

Query: 361  SLHKIAEDLKLVDHIRNVFSNLQDSLTG----SMPSSLKNTSPSDKCSRPS-NPAFNQCL 525
            SL  I EDL+ + HIR+VFS LQ+ +T     SM  S++N+  S+  +R S +  F  C+
Sbjct: 145  SLDSIPEDLRAIKHIRDVFSELQELMTSCLRSSMQHSMENSCLSEISTRTSGSEIFQDCV 204

Query: 526  AKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKI----KMHETTECSTA 693
              L   V  D  N+WS L     +SV++SC     GG  NK L++    ++H ++   + 
Sbjct: 205  NNLGRSVCEDRRNMWSPLYTSFEKSVDHSCIFLQPGGYPNKILEVVNNQRLHRSSVQGSD 264

Query: 694  GSDTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGRVEKSR 873
             S  L+ +SCE+   +  +EG M    +PK  G+T+ +  LG K  ++ + P  + + S 
Sbjct: 265  DSTNLLPASCESSIIKHQEEGQMWEETDPKFEGQTSNLRVLG-KGSVDKSEPNFKSDTSI 323

Query: 874  LCNAILPTEASEVVASIVPPHNLEPPVFTAHAD----ISFPELPSLQMHQDFANSELPIT 1041
                 +  +A +V     P  N       A+ D    +   +LP+    +  A + L   
Sbjct: 324  ---GSVSYDAGQVTEC--PQRNRNNLASEAYNDRNRMLGLSDLPNAYADK-CAETNLGFG 377

Query: 1042 SNSSEMQASPFSFCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSCSQLM 1218
            +  ++   +PF FCAGYEL+EALGP FQK N+   WE      +MAV + EGI + S +M
Sbjct: 378  TECNDTMHTPFRFCAGYELYEALGPVFQKGNSSKDWEAGK-REEMAVDMLEGIGTSSLVM 436

Query: 1219 ENS-DMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCNS-VGTLSSAGFSF 1392
             N+ + HLL+AV+   +   +   S  S   + +SL+T E    PC+S +GT+SS G+SF
Sbjct: 437  SNTGNEHLLEAVIANVNRHDNDCSSVKSFCKSVDSLLTTEITAEPCSSDIGTISSTGYSF 496

Query: 1393 DRDTSSSFNSV-TCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPGESSRPRPRD 1569
            DR+T +SFNS  TC + S +G+S  S SRGS HVER   P K+ KKRARPGES RPRPRD
Sbjct: 497  DRETLNSFNSSGTCSIRSSRGLSSTSCSRGSGHVERPLEPVKMHKKRARPGESCRPRPRD 556

Query: 1570 RQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSVSKLLDKDT 1749
            RQLIQDRIKELR+L+PNGSKCSIDSLLERTIKHM+FMQSVTKHA+KL KCS SKL DK++
Sbjct: 557  RQLIQDRIKELRDLVPNGSKCSIDSLLERTIKHMLFMQSVTKHADKLSKCSASKLADKES 616

Query: 1750 GMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLEVAETIRSL 1929
            G+   S  E GSSWAVEVGNNQKVCP+ VEN+ MNGQMLVE+  ++   FL++AE IRSL
Sbjct: 617  GICGSSSHEVGSSWAVEVGNNQKVCPMRVENLGMNGQMLVEIF-EDGSHFLDIAEAIRSL 675

Query: 1930 GLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLLQPKIS 2067
            GL I+KG++EAYG +  MCFVVE  N+R++HRMDVLWSLMQLLQ KI+
Sbjct: 676  GLTILKGLAEAYGERTRMCFVVEGQNDRTLHRMDVLWSLMQLLQAKIN 723


>ref|XP_006351645.1| PREDICTED: transcription factor bHLH155-like isoform X3 [Solanum
            tuberosum]
          Length = 752

 Score =  597 bits (1539), Expect = e-168
 Identities = 347/710 (48%), Positives = 459/710 (64%), Gaps = 21/710 (2%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDKNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVYSLG 180
            KL  RARMM TWEDAYYD + +  K    S AG L +G YS++ LG+AVAKMSY VYSLG
Sbjct: 53   KLTHRARMMLTWEDAYYDNDGFPGKKSPGSTAGNLYDGHYSNNHLGVAVAKMSYHVYSLG 112

Query: 181  EGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVVQLG 360
            EG+VG VA++GKH W+ +D+    +S + E+   W++QFSAGIKTI V AV PHGV+QLG
Sbjct: 113  EGIVGQVAITGKHLWLSADKVAAITSLAPEHCDGWQAQFSAGIKTIVVAAVAPHGVIQLG 172

Query: 361  SLHKIAEDLKLVDHIRNVFSNLQD----SLTGSMPSSLKNTSPSDKCSRPS-NPAFNQCL 525
            SL  I EDL+ + HIR+VFS LQ+     L  SM  S++N+  S+  +R S +  F  C+
Sbjct: 173  SLDSIPEDLRAIKHIRDVFSELQELMASCLRSSMQYSMENSCLSEISTRTSGSEVFQDCV 232

Query: 526  AKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIK----MHETTECSTA 693
              L   V  D  N+WS L   + +SV++SC     GG  NK L+      +H T+   + 
Sbjct: 233  NNLGRSVCEDGRNMWSPLYTSVEKSVDHSCIFSQPGGFPNKILEAVHNQGLHRTSVQGSD 292

Query: 694  GSDTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGRVEKSR 873
             S+ L+ +SCE+   +  +EG M    +PK  G+T+ +  LG K  ++   P  R + S 
Sbjct: 293  DSENLLPASCESSIIKHQEEGQMWEETDPKFEGQTSNLRVLG-KGSVDKCEPTFRSDASI 351

Query: 874  LCNAILPTEASEVVASIVPPHNLEPPVFTAHAD------ISFPELPSLQMHQDFANSELP 1035
                 +  +A +V     P  N       + AD      +   +LP+    +  A + L 
Sbjct: 352  ---GSVSYDAGQVTECPQPNRNN----LASEADNDRNRKLGLSDLPNAYADK-CAETNLG 403

Query: 1036 ITSNSSEMQASPFSFCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSCSQ 1212
              +  ++   +PF FCAGYEL+EALGP FQK N+   WE      +MAV + EGI + S 
Sbjct: 404  FETQCNDTMHTPFRFCAGYELYEALGPVFQKGNSSKDWEAGK-REEMAVDMLEGIGTSSL 462

Query: 1213 LMENS-DMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCNS-VGTLSSAGF 1386
            +M N+ + HLL+AV+   +   +   S  S   + +SL+T E    PC+S +G +SS G+
Sbjct: 463  VMSNTGNEHLLEAVIANVNRYDNDCSSVKSFCKSVDSLLTTEITAEPCSSDIGAISSIGY 522

Query: 1387 SFDRDTSSSFNSV-TCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPGESSRPRP 1563
            SFDR+T +SFNS  TC + S +G+S  S SRGS HVER   P K+ KKRARPGES RPRP
Sbjct: 523  SFDRETLNSFNSSGTCSIRSSRGLSSTSCSRGSGHVERPLEPVKMHKKRARPGESCRPRP 582

Query: 1564 RDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSVSKLLDK 1743
            RDRQLIQDRIKELR+L+PNGSKCSIDSLLERTIKHM+FMQSVTKHA+KL KCS SKL+DK
Sbjct: 583  RDRQLIQDRIKELRDLVPNGSKCSIDSLLERTIKHMLFMQSVTKHADKLSKCSASKLVDK 642

Query: 1744 DTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLEVAETIR 1923
            ++ +   S  E GSSWAVEVGNNQKVCP+ VEN+ MNGQMLVE+  ++   FL++AE IR
Sbjct: 643  ESDICGSSSHEVGSSWAVEVGNNQKVCPMRVENLGMNGQMLVEIF-EDGSHFLDIAEAIR 701

Query: 1924 SLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLLQPKIS 2067
            SLGL I+KG++EAY  +  MCFVVE  N+R++HRMDVLWSLMQLLQ KI+
Sbjct: 702  SLGLTILKGLAEAYSERTRMCFVVEGQNDRTLHRMDVLWSLMQLLQAKIN 751


>ref|XP_003632423.1| PREDICTED: uncharacterized basic helix-loop-helix protein
            At1g06150-like [Vitis vinifera]
          Length = 749

 Score =  568 bits (1463), Expect = e-159
 Identities = 334/723 (46%), Positives = 452/723 (62%), Gaps = 36/723 (4%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDK---NWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD +   D      F+     L++G YSHD LGLAVAKMSY VY
Sbjct: 25   KLKHRARMVLTWEDAYYDNHDQHDPLEDKCFSKTPDTLHDGHYSHDALGLAVAKMSYHVY 84

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WIFSD+H  +SS S EY   W++QFSAGIKTI VVAV+PHGVV
Sbjct: 85   SLGEGIVGQVAVTGKHQWIFSDKHTTNSSSSFEYCDGWQAQFSAGIKTIVVVAVVPHGVV 144

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLK-----NTSPSDKCSRPS-NPAF 513
            QLGSL ++ EDLKLV  I++VF  LQDS    +P  ++     + + SD  +R S +   
Sbjct: 145  QLGSLQQVVEDLKLVSRIKDVFFALQDSSVAYIPHPIQCSMKSSLAMSDISTRGSASDIV 204

Query: 514  NQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIKMHET-TECST 690
               L  L   +H +  NVWS + P  G+  ++S          N+ + +   +   E S+
Sbjct: 205  PDSLFNLDKGIHKERPNVWSPMFPIFGKHNDSSFIFQLPAIHQNRAVNMFNKDGGLELSS 264

Query: 691  AGSDT---LIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTA-PVGR 858
            + SD     ++   EN      ++  M+L  N K   E +G  D    +E N T+ P   
Sbjct: 265  SQSDESTKFLQPRSENFVLEGQKQVQMKLISNTKR-EEASGWRDADVSSEHNDTSYPYNS 323

Query: 859  -VEKSRLCNAILPTEASEVVASIVP--------PHNLEPPVFTAHAD--ISFPELPSLQM 1005
             +E    C+  L  + S+V  +  P         + ++      H +  +  P+   +Q+
Sbjct: 324  FMENINSCSTALAADKSQVDFACFPFGFFDSVDCNRIKLHGVNCHENGVLHLPDPSDMQL 383

Query: 1006 HQDFANS-ELPITSNSSEMQASPFSFCAGYELFEALGPSFQKQNN-CFWEGENIGVDMAV 1179
             ++     E P   +  +   +   F AG EL EALGP+F KQ+N C WE E    +  +
Sbjct: 384  QKNLEKKLEFPSELSHVDTSYTSLRFSAGSELHEALGPAFLKQSNYCDWETEKAETETTI 443

Query: 1180 GVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCN- 1356
             + EG+SS     ++   +LL+AVV +    G   +SE S   + +SL+T E    P + 
Sbjct: 444  ELPEGMSSSQLTSDSGSENLLEAVVAKVCQSGSDVKSEKSFCQSMQSLLTTEKIPEPSSH 503

Query: 1357 SVGTLSSAGFSFDR-----DTSSSF-NSVTCDLESLKGISPASSSRGSEHVERSRLPAKL 1518
            ++ T++SAG+S D+     +T + F +S  C + S +GIS    S  SE +ERS  P+K+
Sbjct: 504  TIHTVTSAGYSIDQSSLVEETQNCFKSSEVCGVTSQQGISSICPSSCSEQLERSAEPSKV 563

Query: 1519 SKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKH 1698
            +KKRARPGES RPRPRDRQLIQDRIKELREL+PNGSKCSIDSLLERTIKHM+F+QS+T+H
Sbjct: 564  NKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITRH 623

Query: 1699 AEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEML 1878
            A+KL KC+ SKL  K+TG+   S  EQGSSWAVEVG++ KVCPIIVEN+NM+GQM+VEM+
Sbjct: 624  ADKLNKCAESKLHSKETGVLGSSNYEQGSSWAVEVGSHMKVCPIIVENLNMDGQMVVEMV 683

Query: 1879 CKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLL 2052
            C+EC +FLE+AE IRSLGL I+KGV+EA G K W+CFVVE  N+R+M RMD+LWSL+Q+L
Sbjct: 684  CEECSRFLEIAEAIRSLGLTILKGVTEARGEKTWICFVVEGQNSRNMRRMDILWSLVQIL 743

Query: 2053 QPK 2061
            QPK
Sbjct: 744  QPK 746


>emb|CBI37092.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  568 bits (1463), Expect = e-159
 Identities = 334/723 (46%), Positives = 452/723 (62%), Gaps = 36/723 (4%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDK---NWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD +   D      F+     L++G YSHD LGLAVAKMSY VY
Sbjct: 50   KLKHRARMVLTWEDAYYDNHDQHDPLEDKCFSKTPDTLHDGHYSHDALGLAVAKMSYHVY 109

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WIFSD+H  +SS S EY   W++QFSAGIKTI VVAV+PHGVV
Sbjct: 110  SLGEGIVGQVAVTGKHQWIFSDKHTTNSSSSFEYCDGWQAQFSAGIKTIVVVAVVPHGVV 169

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLK-----NTSPSDKCSRPS-NPAF 513
            QLGSL ++ EDLKLV  I++VF  LQDS    +P  ++     + + SD  +R S +   
Sbjct: 170  QLGSLQQVVEDLKLVSRIKDVFFALQDSSVAYIPHPIQCSMKSSLAMSDISTRGSASDIV 229

Query: 514  NQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIKMHET-TECST 690
               L  L   +H +  NVWS + P  G+  ++S          N+ + +   +   E S+
Sbjct: 230  PDSLFNLDKGIHKERPNVWSPMFPIFGKHNDSSFIFQLPAIHQNRAVNMFNKDGGLELSS 289

Query: 691  AGSDT---LIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTA-PVGR 858
            + SD     ++   EN      ++  M+L  N K   E +G  D    +E N T+ P   
Sbjct: 290  SQSDESTKFLQPRSENFVLEGQKQVQMKLISNTKR-EEASGWRDADVSSEHNDTSYPYNS 348

Query: 859  -VEKSRLCNAILPTEASEVVASIVP--------PHNLEPPVFTAHAD--ISFPELPSLQM 1005
             +E    C+  L  + S+V  +  P         + ++      H +  +  P+   +Q+
Sbjct: 349  FMENINSCSTALAADKSQVDFACFPFGFFDSVDCNRIKLHGVNCHENGVLHLPDPSDMQL 408

Query: 1006 HQDFANS-ELPITSNSSEMQASPFSFCAGYELFEALGPSFQKQNN-CFWEGENIGVDMAV 1179
             ++     E P   +  +   +   F AG EL EALGP+F KQ+N C WE E    +  +
Sbjct: 409  QKNLEKKLEFPSELSHVDTSYTSLRFSAGSELHEALGPAFLKQSNYCDWETEKAETETTI 468

Query: 1180 GVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCN- 1356
             + EG+SS     ++   +LL+AVV +    G   +SE S   + +SL+T E    P + 
Sbjct: 469  ELPEGMSSSQLTSDSGSENLLEAVVAKVCQSGSDVKSEKSFCQSMQSLLTTEKIPEPSSH 528

Query: 1357 SVGTLSSAGFSFDR-----DTSSSF-NSVTCDLESLKGISPASSSRGSEHVERSRLPAKL 1518
            ++ T++SAG+S D+     +T + F +S  C + S +GIS    S  SE +ERS  P+K+
Sbjct: 529  TIHTVTSAGYSIDQSSLVEETQNCFKSSEVCGVTSQQGISSICPSSCSEQLERSAEPSKV 588

Query: 1519 SKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKH 1698
            +KKRARPGES RPRPRDRQLIQDRIKELREL+PNGSKCSIDSLLERTIKHM+F+QS+T+H
Sbjct: 589  NKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITRH 648

Query: 1699 AEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEML 1878
            A+KL KC+ SKL  K+TG+   S  EQGSSWAVEVG++ KVCPIIVEN+NM+GQM+VEM+
Sbjct: 649  ADKLNKCAESKLHSKETGVLGSSNYEQGSSWAVEVGSHMKVCPIIVENLNMDGQMVVEMV 708

Query: 1879 CKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLL 2052
            C+EC +FLE+AE IRSLGL I+KGV+EA G K W+CFVVE  N+R+M RMD+LWSL+Q+L
Sbjct: 709  CEECSRFLEIAEAIRSLGLTILKGVTEARGEKTWICFVVEGQNSRNMRRMDILWSLVQIL 768

Query: 2053 QPK 2061
            QPK
Sbjct: 769  QPK 771


>ref|XP_004292200.1| PREDICTED: transcription factor bHLH155-like [Fragaria vesca subsp.
            vesca]
          Length = 756

 Score =  531 bits (1368), Expect = e-148
 Identities = 315/708 (44%), Positives = 421/708 (59%), Gaps = 21/708 (2%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDKNW---FNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD  +  D +    F      L+     HD LGLA+AKMSY VY
Sbjct: 59   KLKHRARMVLTWEDAYYDNCEQYDNSGNRSFIKTLEALHGNHNMHDSLGLAMAKMSYHVY 118

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            +LGEG+VG VA++GKH WIF+D  V D+   SEY   W+SQF AGI+TI VVAV+PHGVV
Sbjct: 119  TLGEGIVGQVAITGKHQWIFADNIVKDNCSPSEYCDGWQSQFLAGIRTIVVVAVVPHGVV 178

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLKNTSPSDKCSRP--SNPAFNQCL 525
            QLGSL KI E+++L+ HI++ F        GS    L++   S   S    ++ AF  CL
Sbjct: 179  QLGSLKKITENVELISHIKDAF-------IGSKIPHLQHIQSSIVISPKILASGAFPDCL 231

Query: 526  AKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIKMHETTECSTAGSDT 705
              L   ++ ++ +VW    P  G+  ++S   P  G   N    +  H   E S  G D 
Sbjct: 232  QNLDKAINREKSDVWLSAFPHSGKDGDSSYIFPLTGNFKNAVEVVNKHGELESSNIGGDE 291

Query: 706  L-----IRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGRVEKS 870
                   +SS  N+   +L    +EL D+ K  GE++G  D+G    I+ T     +  +
Sbjct: 292  SPKLHQSKSSIFNLENSKLV--GVELLDSRKCTGESSGCKDMG----ISSTNSADPLSHA 345

Query: 871  RLCNAILPTEASEVVASIVPPHNLEPPVFTAHADISFPELPSLQMHQDFANSELPITSNS 1050
              C  +  T  +  V   V   NL+      +  +   E   ++   +  N +       
Sbjct: 346  NDCADLSSTFVNSDVNDRV---NLDSIDLYRNEVLHVSEPSDVKFQSNLDNLKFQTELGQ 402

Query: 1051 SEMQASPFSFCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSCSQLMENS 1227
            ++  +S   F AG EL EALGP+F  ++N F WE E IG      + EG++S     ++ 
Sbjct: 403  ADTSSSSLMFPAGCELHEALGPAFMHKSNFFDWEAEKIGDRTTAEMPEGMNSSQLTSDSC 462

Query: 1228 DMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCN-SVGTLSSAGFSFDR-- 1398
              HLL+AVV +  H G   +SE S   + +SL+T E    P + +  TL S  +S D+  
Sbjct: 463  PEHLLEAVVAKVCHSGSHVKSEKSFCKSMQSLLTTEKYPEPSSHTTHTLDSENYSIDQPS 522

Query: 1399 ----DTSSSFNSV-TCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPGESSRPRP 1563
                DT    +S   C + S K  S    S  SE  ERS  PA+ +KKRARPGE+SRPRP
Sbjct: 523  MRGEDTQQCLSSSGICGVISPKWFSSPCPSACSEQQERSSGPARNNKKRARPGETSRPRP 582

Query: 1564 RDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSVSKLLDK 1743
            RDRQLIQDRIKELREL PNG+KCSIDSLLERTIKHM+F+QS+TKHA+KL KC+ +KL  K
Sbjct: 583  RDRQLIQDRIKELRELTPNGAKCSIDSLLERTIKHMLFLQSITKHADKLNKCADAKLCPK 642

Query: 1744 DTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLEVAETIR 1923
            +T M   +  E+GSSWAVEVG N KVC I+VEN+N NGQM+VEM+C+EC  FLE+AE IR
Sbjct: 643  ETSMLGSTNYERGSSWAVEVGGNLKVCSIVVENLNKNGQMVVEMICEECSHFLEIAEAIR 702

Query: 1924 SLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLLQPK 2061
            SL L I+KG++EA G+K W+CF+VE  NNR++HRMD+LWSL+Q+LQPK
Sbjct: 703  SLSLTILKGLTEARGDKTWICFIVEAQNNRNIHRMDILWSLVQILQPK 750


>ref|XP_007200308.1| hypothetical protein PRUPE_ppa001930mg [Prunus persica]
            gi|462395708|gb|EMJ01507.1| hypothetical protein
            PRUPE_ppa001930mg [Prunus persica]
          Length = 739

 Score =  530 bits (1366), Expect = e-148
 Identities = 320/715 (44%), Positives = 434/715 (60%), Gaps = 28/715 (3%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDKN---WFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD  +  D +    FN     L++  YSHDPLGLAVAKMSY VY
Sbjct: 26   KLKYRARMVLTWEDAYYDNCEQHDSSENRCFNKTLDRLHDSHYSHDPLGLAVAKMSYHVY 85

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            +LGEG+VG VAV+ KH WIF+D    ++    +Y   W+SQFSAGI+TI VVAV PHGVV
Sbjct: 86   TLGEGIVGQVAVTRKHQWIFADNLFKNNCSPFQYCDGWQSQFSAGIRTIVVVAV-PHGVV 144

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLKNTSPSDKCSRPSNP------AF 513
            QLGSL+K+ E++KLV  IR+VFS LQDS    + + L++   S  C    +P        
Sbjct: 145  QLGSLNKVIENVKLVSEIRDVFSTLQDSPVEQIRNPLQSGINSSACLTSISPKGLASGVI 204

Query: 514  NQCLAKLKGPVHADEV-NVWSDLIPPLGESVNNSCTLPPRGGISNKKLKI-KMHETTECS 687
              CL  L    + +E  +VWS + P +G+  ++S   P       K +++   H   E S
Sbjct: 205  TDCLHNLDKAANREESPDVWSSIFPHIGKDSDSSYVFPLPENCLKKAVELANKHGGLESS 264

Query: 688  TAG---SDTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPV-- 852
              G   S  L +S    ++    +   +EL D  K  GE++G  D    A +  + P+  
Sbjct: 265  NLGCLESAKLHQSKSSILNSEHCKLVGVELLDRTKCKGESSGCKDT-RMASMIYSNPLSH 323

Query: 853  GRVEKS-RLCNAILPTEASEVVASIVPPHNLEPPVFTAHADISFPELPSLQMHQDFANSE 1029
            G V+++  LC++     A+ + ++     N++   F  +  +   E   ++  +D  N +
Sbjct: 324  GSVQENVNLCDSA-DLSATFLNSAAHGRVNVDRVDFYQNEVLQVSEPSDVKFQKDLENLD 382

Query: 1030 LPITSNSSEMQASPFSFCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSC 1206
                S   +  ++  +F AG EL EALGP+F  + N F WE E  G  + + + EG+ + 
Sbjct: 383  FQTESGHMDTSSTSMAFPAGCELHEALGPAFLNKGNYFDWEAEKNGDGITIEMPEGMKTG 442

Query: 1207 SQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCN-SVGTLSSAG 1383
                ++   HLL+AVV    H G   +SE S   + +SL+T E    P + +  T+ S  
Sbjct: 443  QLTSDSCQEHLLEAVVANVCHSGTDVKSEKSFCKSMQSLLTTEKYPEPSSHTTHTIDSEN 502

Query: 1384 FSFDR------DTSSSFNSV-TCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPG 1542
            +S D+      DT    +S   C + S K  S    S  SE +ERS  P+K +KKRARPG
Sbjct: 503  YSIDQPSLIAEDTQQCLSSSGVCGVISPKWFSSPCPSACSEQLERSSGPSKNNKKRARPG 562

Query: 1543 ESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCS 1722
            E+SRPRPRDRQLIQDRIKELRELIPNG+KCSIDSLLERTIKHM+F+QS+TKHA+KL KC+
Sbjct: 563  ENSRPRPRDRQLIQDRIKELRELIPNGAKCSIDSLLERTIKHMLFLQSITKHADKLNKCA 622

Query: 1723 VSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFL 1902
             +    K+  M   S  E+GSSWAVEVG N KVC I+VEN+N NGQM+VEM+C+EC  FL
Sbjct: 623  DA----KEASMLGSSNYERGSSWAVEVGGNLKVCSIMVENLNKNGQMVVEMMCEECSHFL 678

Query: 1903 EVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLLQPK 2061
            E+AE IRSLGL I+KGV+EA  +K W+CFVVE  NNRS+HRMD+LWSL+Q+LQPK
Sbjct: 679  EIAEAIRSLGLTILKGVTEARSDKTWICFVVEGQNNRSIHRMDILWSLVQILQPK 733


>gb|EXC31934.1| hypothetical protein L484_009784 [Morus notabilis]
          Length = 750

 Score =  525 bits (1352), Expect = e-146
 Identities = 322/734 (43%), Positives = 436/734 (59%), Gaps = 47/734 (6%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRD---KNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD ++  D      F+      ++G YSHDPLGLAVAK+SY VY
Sbjct: 25   KLKHRARMVLTWEDAYYDKSEQHDPAENKCFSKKLEKSHDGLYSHDPLGLAVAKLSYHVY 84

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYG-AWKSQFSAGIKTIAVVAVIPHGV 348
            SLGEG+VG VAVSGKH WIF+D+H + +  S E+Y   W++QFSAGIKTIAVVAV+PHGV
Sbjct: 85   SLGEGIVGQVAVSGKHQWIFADKHKLSTYSSFEHYSDGWQNQFSAGIKTIAVVAVVPHGV 144

Query: 349  VQLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLKNTS---------PSDKCSRPS 501
            VQLGS +++ ED++LV+HIR+VF +LQDSL G +P  ++++          PS   +  +
Sbjct: 145  VQLGSFNEVLEDMELVNHIRDVFMSLQDSLVGHVPVPIQSSVNSSVNLQDIPSKSFTSET 204

Query: 502  NPAFNQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKI-KMHETT 678
             P    CL  L   ++ +  ++W  + P +G+  ++   L        K + +   H   
Sbjct: 205  VP---DCLHNLDKTLNGEGPDIWFSIFPYVGKDGDSPYVLSLPNNYQEKAVDVVNKHGGL 261

Query: 679  ECSTAGSDT---LIRSSCENISKRQLQEGDMELSDNPKHFGETNG-----VGDLGEKAEI 834
            E ST G+D    L++S    +     +   M L DN K  GE +      VG +      
Sbjct: 262  EFSTNGTDESAKLLQSRTNILEHENHKVIGMNLRDNWKCAGEIDSCKDAAVGPVNNGNPF 321

Query: 835  NCTAPVGRVEKSRLCNAILPTEASEV---------VASIVPPH-NLEPPVFTAHADISFP 984
             C + +G V    L + +LP E  EV         V S V     L+   +  +  +   
Sbjct: 322  LCGSVMGDVN---LPSIVLPAEKVEVDSAHFSSGLVGSAVCDRVRLDSVDYYQNGVLHVS 378

Query: 985  ELPSLQMHQDFANSELPITSNSSEMQASPFSFCAGYELFEALGPSFQKQNNCF-WEG-EN 1158
               + +  +D  N E     +  +  ++   F AGYEL EALGP+F K +  F WE  E 
Sbjct: 379  GPSNTKFQKDPDNLEFQTELSHIDTSSTSLKFPAGYELHEALGPAFLKNSKYFDWEATET 438

Query: 1159 IGVDMAVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAET 1338
             G   A+ + E +SS     ++   HLL+AV+          +SE S   + +SL++ E 
Sbjct: 439  EGT--ALEMPEQMSSRQLAADSHPEHLLEAVIANVCQSHSDVKSEKSFCKSVQSLLSTEK 496

Query: 1339 ERTPCN-----------SVGTLSSAGFSFDRDTSSSFNSVTCDLESLKGISPASSSRGSE 1485
               P +           S+G  S  G       SSS     C + S KG S    S  SE
Sbjct: 497  YPKPSSHTTLITDSSNHSIGQPSVKGEDKQHCLSSSG---ICGVMSPKGFSSTCPSASSE 553

Query: 1486 HVERSRLPAKLSKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIK 1665
             +ERS +  K +KKRARPGE+ RPRPRDRQLIQDRIKELRELIPNG+KCSIDSLLERTIK
Sbjct: 554  QLERSSVHNKNNKKRARPGENCRPRPRDRQLIQDRIKELRELIPNGAKCSIDSLLERTIK 613

Query: 1666 HMIFMQSVTKHAEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENI 1845
            HM+++QS+ KHA+KL K + +KL  K+T M + S  E+GSSWAVEVG N KVC I+VEN+
Sbjct: 614  HMLYLQSIAKHADKLNKYADTKLCHKETSMLESSTYERGSSWAVEVGGNLKVCSIVVENL 673

Query: 1846 NMNGQMLVEMLCKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHR 2019
            N +GQM+VEM+C+EC  FLE+AE I+SLGL I+KGV+EA+G K W+CFVVE  +NRS+HR
Sbjct: 674  NKSGQMVVEMMCEECSHFLEIAEAIKSLGLTILKGVTEAHGEKTWICFVVEGQSNRSLHR 733

Query: 2020 MDVLWSLMQLLQPK 2061
            MD+LWSL+Q+LQPK
Sbjct: 734  MDILWSLVQILQPK 747


>ref|XP_007050338.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 3
            [Theobroma cacao] gi|508702599|gb|EOX94495.1| Basic
            helix-loop-helix DNA-binding superfamily protein isoform
            3 [Theobroma cacao]
          Length = 737

 Score =  520 bits (1339), Expect = e-144
 Identities = 329/730 (45%), Positives = 425/730 (58%), Gaps = 40/730 (5%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRD---KNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD +   D    N F+     L  G  SHDPLGLAVAKMSY VY
Sbjct: 30   KLKHRARMVLTWEDAYYDNHDQHDPSENNCFHHTLDNLQSGYCSHDPLGLAVAKMSYHVY 89

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAVSGKH WIF+D+HV  S    E+   W+SQF+AGI+TI VVAV+ HGVV
Sbjct: 90   SLGEGIVGQVAVSGKHQWIFADKHVNSSCSLFEFCDGWQSQFAAGIRTIVVVAVVQHGVV 149

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTG--------SMPSSL-----------KNTS 474
            QLGSL+K+ ED+KLV HIR+VF  LQDS  G        SM SSL            +  
Sbjct: 150  QLGSLNKVFEDVKLVSHIRDVFFALQDSSVGHIASPIECSMKSSLFQLDLPTKLLDSDGI 209

Query: 475  PSDKCSRPSNPAFNQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKL 654
            P DK      P  +  L +   P        +SD +  L  S N+     P+G +  +  
Sbjct: 210  PLDKTVDEQGP--DALLPEFSHP------RKYSDRLFVLPLSNNH-----PKGAVEVEN- 255

Query: 655  KIKMHETTECSTAGSDTLI-----RSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLG 819
                HE  E S+A +D        RS+  N+ + Q Q G + L +N    GE +G  +  
Sbjct: 256  ---KHEGLELSSARNDESAKLLTPRSNVSNL-EHQNQLGRI-LINNGVWKGENSGWKNSS 310

Query: 820  EKAE-INCTAPVGRVEKSRLCNAILPTE-ASEVVASIVPPHNLEPPVFTAHADISFPELP 993
               E +    PVG  E+  + +A   +   +   +  V   +L       +  +  PE  
Sbjct: 311  LVPENVYANNPVGGRERYGVDHAYFSSNFLNSAHSDTVKSSSLSS---YPNEVLDIPESS 367

Query: 994  SLQMHQDFANSELPITSNSSEMQASPFSFCAGYELFEALGPSF-QKQNNCFWEGENIGVD 1170
             ++  +D          +  +   +   F  G EL+EALGP+F +K     W+ EN+   
Sbjct: 368  DMKFQKDLKKLGNQNEISHLDPMNTSLKFSVGCELYEALGPAFIRKSIYADWQAENMEAG 427

Query: 1171 MAVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTP 1350
              + + EG+SS     E+   +LL+AVV    H G   ++E S   +  SL+T      P
Sbjct: 428  GNIEMPEGMSSSQLTFESGSENLLEAVVANVCHSGSDIKAERSSCRSAPSLLTTGNTPEP 487

Query: 1351 CN-SVGTLSSAGFSFDRDTSSSFN-------SVTCDLESLKGISPASSSRGSEHVERSRL 1506
             + S  T++SAG+S ++ +    N       S  C   S KG S    S  SE  ERS  
Sbjct: 488  SSQSKHTINSAGYSINQSSLVEDNTQHCLNSSELCGAMSSKGFSSTCPSNCSEQFERSSE 547

Query: 1507 PAKLSKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQS 1686
            PAK +KKRARPGE+ RPRPRDRQLIQDRIKELREL+PNG+KCSIDSLLERTIKHM+F+Q 
Sbjct: 548  PAKNNKKRARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMVFLQG 607

Query: 1687 VTKHAEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQML 1866
            +TKHA+KL KC+ SK+  K  GM   S  EQGSSWAVEVG++ KVC I+VEN N NGQ+L
Sbjct: 608  ITKHADKLSKCAESKIHHKGAGMLGSSNYEQGSSWAVEVGSHLKVCSIVVENTNKNGQIL 667

Query: 1867 VEMLCKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSL 2040
            VEMLC+EC  FLE+AE IRSLGL I+KGV+EA+G K W+CFVVE  NNR MHRMD+LWSL
Sbjct: 668  VEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGEKTWICFVVEGQNNRVMHRMDILWSL 727

Query: 2041 MQLLQPKISN 2070
            +Q+LQ + +N
Sbjct: 728  VQILQSQATN 737


>ref|XP_004161538.1| PREDICTED: transcription factor EMB1444-like [Cucumis sativus]
          Length = 691

 Score =  517 bits (1331), Expect = e-144
 Identities = 313/706 (44%), Positives = 410/706 (58%), Gaps = 21/706 (2%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDK---NWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWED YYD ++  +     +F        +G YSHD LGLAVAKMSY VY
Sbjct: 26   KLKHRARMVLTWEDGYYDNSEQHEPPEGKFFRKTLETFYDGHYSHDALGLAVAKMSYHVY 85

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WI +D+ + + S + EY   W++QFSAGIKTI VVAV+PHGV+
Sbjct: 86   SLGEGIVGQVAVTGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGIKTIVVVAVVPHGVL 145

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLKNTSPSDKCSRPSNPAFNQCLAK 531
            QLGSL K+ ED+ LV  IRNVF  LQ+S  G +       S       PS     + LA 
Sbjct: 146  QLGSLDKVTEDVNLVTRIRNVFLTLQESSAGEIKPMHSCKSSGYMADIPS-----RSLAT 200

Query: 532  LKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRG-GISNKKLKIKMHETTECSTAGSDTL 708
             KG V +   NV  +L    G     S T  P G  + N K ++++ +   C        
Sbjct: 201  EKGEVASVSKNVGLELS---GSEAFESLTTKPDGINVENFKSQVRLLDDRMCG------- 250

Query: 709  IRSSCENISKRQLQEGDMELSDNPKHFGETNGVGD--LGEKAEINCTAPVGRVEKSRLCN 882
                                       GE +G  D  +G K +IN  +    ++   +C 
Sbjct: 251  ---------------------------GEPSGCKDKAVGLKQKINVQSQNSTMDMVNICG 283

Query: 883  AILPTEASEVVASIVPPHNLEPPVFTAHADISFPELPSLQMHQDFANSELPITSNSSEMQ 1062
             +LP E    + +     ++ P   +A+  ++   +     H +         S + EM 
Sbjct: 284  NLLPAEK---IMTNDAYFSMNPHPSSAYDGVNHNGMFIRTNHTEMYLQNDMEASETIEMY 340

Query: 1063 ASPFS--FCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSCSQLMENSDM 1233
             S  S  F AGYEL E LGP+F K      W+ E +    A  +SEG+S      ++   
Sbjct: 341  PSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVLGGKAFELSEGMSGSQLTSDSPTE 400

Query: 1234 HLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCNSVGTLS-SAGFSFDRDTSS 1410
             LL+AVV    H G   +S+ S   + +SL+T E    P  +V T + S G+S  +  +S
Sbjct: 401  RLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTERIPEPSTNVTTSACSEGYSMGQSQTS 460

Query: 1411 SF---------NSVTCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPGESSRPRP 1563
                       +S  C + S KG S   S  GSEH+++S  PAK SK+RARPGESSRPRP
Sbjct: 461  FTGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLDKSSEPAKNSKRRARPGESSRPRP 520

Query: 1564 RDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSVSKLLDK 1743
            RDRQLIQDRIKELREL+PNG+KCSIDSLLERTIKHM+F+Q +TKHA+KL KC+  KL  K
Sbjct: 521  RDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLTKCANMKLHQK 580

Query: 1744 DTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLEVAETIR 1923
             +GM   S ++QGSSWAVEVG   KVC IIVEN+N NGQ+LVEMLC+EC  FLE+AE IR
Sbjct: 581  GSGMLGTSDTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR 640

Query: 1924 SLGLNIMKGVSEAYGNKAWMCFVV--ENNRSMHRMDVLWSLMQLLQ 2055
            SLGL I+KG++EA+G K W+CFVV  ENNR++HRMD+LWSL+Q+LQ
Sbjct: 641  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ 686


>ref|XP_007050336.1| Basic helix-loop-helix-containing protein, putative isoform 1
            [Theobroma cacao] gi|508702597|gb|EOX94493.1| Basic
            helix-loop-helix-containing protein, putative isoform 1
            [Theobroma cacao]
          Length = 708

 Score =  507 bits (1305), Expect = e-140
 Identities = 325/722 (45%), Positives = 414/722 (57%), Gaps = 32/722 (4%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRD---KNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD +   D    N F+     L  G  SHDPLGLAVAKMSY VY
Sbjct: 30   KLKHRARMVLTWEDAYYDNHDQHDPSENNCFHHTLDNLQSGYCSHDPLGLAVAKMSYHVY 89

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAVSGKH WIF+D+HV  S    E+   W+SQF+AGI+TI VVAV+ HGVV
Sbjct: 90   SLGEGIVGQVAVSGKHQWIFADKHVNSSCSLFEFCDGWQSQFAAGIRTIVVVAVVQHGVV 149

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTG--------SMPSSL-----------KNTS 474
            QLGSL+K+ ED+KLV HIR+VF  LQDS  G        SM SSL            +  
Sbjct: 150  QLGSLNKVFEDVKLVSHIRDVFFALQDSSVGHIASPIECSMKSSLFQLDLPTKLLDSDGI 209

Query: 475  PSDKCSRPSNPAFNQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKL 654
            P DK      P  +  L +   P        +SD +  L  S N+     P+G +  +  
Sbjct: 210  PLDKTVDEQGP--DALLPEFSHP------RKYSDRLFVLPLSNNH-----PKGAVEVEN- 255

Query: 655  KIKMHETTECSTAGSDTLI-----RSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLG 819
                HE  E S+A +D        RS+  N+ + Q Q G + L +N    GE +G  +  
Sbjct: 256  ---KHEGLELSSARNDESAKLLTPRSNVSNL-EHQNQLGRI-LINNGVWKGENSGWKNSS 310

Query: 820  EKAE-INCTAPVGRVEKSRLCNAILPTE-ASEVVASIVPPHNLEPPVFTAHADISFPELP 993
               E +    PVG  E+  + +A   +   +   +  V   +L       +  +  PE  
Sbjct: 311  LVPENVYANNPVGGRERYGVDHAYFSSNFLNSAHSDTVKSSSLSS---YPNEVLDIPESS 367

Query: 994  SLQMHQDFANSELPITSNSSEMQASPFSFCAGYELFEALGPSF-QKQNNCFWEGENIGVD 1170
             ++  +D          +  +   +   F  G EL+EALGP+F +K     W+ EN+   
Sbjct: 368  DMKFQKDLKKLGNQNEISHLDPMNTSLKFSVGCELYEALGPAFIRKSIYADWQAENMEAG 427

Query: 1171 MAVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTP 1350
              + + EG+SS     E+   +LL+AVV    H G   ++E S   +  SL+T  T  TP
Sbjct: 428  GNIEMPEGMSSSQLTFESGSENLLEAVVANVCHSGSDIKAERSSCRSAPSLLT--TGNTP 485

Query: 1351 CNSVGTLSSAGFSFDRDTSSSFNSVTCDLESLKGISPASSSRGSEHVERSRLPAKLSKKR 1530
              S   L                   C   S KG S    S  SE  ERS  PAK +KKR
Sbjct: 486  EPSSQKL-------------------CGAMSSKGFSSTCPSNCSEQFERSSEPAKNNKKR 526

Query: 1531 ARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKL 1710
            ARPGE+ RPRPRDRQLIQDRIKELREL+PNG+KCSIDSLLERTIKHM+F+Q +TKHA+KL
Sbjct: 527  ARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMVFLQGITKHADKL 586

Query: 1711 QKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKEC 1890
             KC+ SK+  K  GM   S  EQGSSWAVEVG++ KVC I+VEN N NGQ+LVEMLC+EC
Sbjct: 587  SKCAESKIHHKGAGMLGSSNYEQGSSWAVEVGSHLKVCSIVVENTNKNGQILVEMLCEEC 646

Query: 1891 DQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLLQPKI 2064
              FLE+AE IRSLGL I+KGV+EA+G K W+CFVVE  NNR MHRMD+LWSL+Q+LQ + 
Sbjct: 647  SHFLEIAEAIRSLGLTILKGVTEAHGEKTWICFVVEGQNNRVMHRMDILWSLVQILQSQA 706

Query: 2065 SN 2070
            +N
Sbjct: 707  TN 708


>ref|XP_006443866.1| hypothetical protein CICLE_v10018993mg [Citrus clementina]
            gi|557546128|gb|ESR57106.1| hypothetical protein
            CICLE_v10018993mg [Citrus clementina]
          Length = 748

 Score =  505 bits (1301), Expect = e-140
 Identities = 328/737 (44%), Positives = 428/737 (58%), Gaps = 47/737 (6%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYD--GNQYRDKNWFNSNA-GILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK R RM+ TWED YYD  G Q   +N  +S +    + G YSHDPLGLAVAKMSY VY
Sbjct: 30   KLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVY 89

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WIFSDQ V +S  S E+   W+SQFSAGI+TIAVVAV+PHGVV
Sbjct: 90   SLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVV 149

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTG----SMPSSLKNTSPSDKCSRPSNP---- 507
            QLGSL ++ ED+K+V HIR+VF+ L D   G    ++ SS+KNT         S P    
Sbjct: 150  QLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSSVKNTLSLPDLPTKSIPNRWH 209

Query: 508  ----AFNQCLAKLKGPV------HADEVNVWSDLIPPLGESV---NNSCTLPPRGGISNK 648
                  N+    ++ P+      H D    +S + P +G+ V   N    L   GG+ + 
Sbjct: 210  NLDEVVNRGGPDVQFPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSA 269

Query: 649  KLKIKMHETTECSTAGSDTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKA 828
            K+   +H              +S+  N+   Q Q G   +SD      E++G  DLG  +
Sbjct: 270  KI---LHP-------------KSNVINLD-YQNQMGIHFISDGMSRV-ESSGWKDLGVIS 311

Query: 829  EINCT--APVGRVEKSRLCNAILPTE---------ASEVVASIVPPH-NLEPPVFTAHAD 972
            E N T  +    ++   LC+  L  E         AS  + +++     LE      +  
Sbjct: 312  EQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGM 371

Query: 973  ISFPELPSLQMHQDFANSELPITSNSSEMQASPFSFCAGYELFEALGPSF-QKQNNCFWE 1149
            +  PE+  ++  +D    +     N  +       F A  EL EALGP+F +K      E
Sbjct: 372  LHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPAFLRKDIYNDRE 431

Query: 1150 GENIGVDMAVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMT 1329
             EN      VG+ E  SS   + ++   +LLDAVV    + G   +SE +   + +SL+T
Sbjct: 432  PENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCNSGSDVKSERTVCRSMQSLLT 491

Query: 1330 AETE-------RTPCNSVGTLSSAGFSFDRDTSSSFN-SVTCDLESLKGISPASSSRGSE 1485
             E +       +   NSV    S     + D     N S  C   S KG S    S  SE
Sbjct: 492  TEKKPESSSQSKNTNNSVSYSISQSSLVEEDAKHFLNSSEVCGAVSSKGFSSTCPSTCSE 551

Query: 1486 HVERSRLPAKLSKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIK 1665
             ++ S  PAK +KKRAR GE+ RPRPRDRQLIQDRIKELREL+PNGSKCSIDSLLERTIK
Sbjct: 552  QLDMSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIK 611

Query: 1666 HMIFMQSVTKHAEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENI 1845
            HM+F+QS+TKHA+KL KC+ SK+  K  G+   S  EQGSSWAVE+G++ KVC I+VEN+
Sbjct: 612  HMLFLQSITKHADKLSKCAESKMHQKGNGIHG-SNYEQGSSWAVEMGSHLKVCSIVVENL 670

Query: 1846 NMNGQMLVEMLCKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHR 2019
            N NGQMLVEMLC+EC  FLE+AE IRSLGL I+KGV+EA+G+K W+CFVVE  +NR MHR
Sbjct: 671  NKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHR 730

Query: 2020 MDVLWSLMQLLQPKISN 2070
            MDVLWSL+QLLQ K ++
Sbjct: 731  MDVLWSLVQLLQSKTTS 747


>ref|XP_007050337.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 2
            [Theobroma cacao] gi|508702598|gb|EOX94494.1| Basic
            helix-loop-helix DNA-binding superfamily protein isoform
            2 [Theobroma cacao]
          Length = 709

 Score =  503 bits (1295), Expect = e-139
 Identities = 325/723 (44%), Positives = 414/723 (57%), Gaps = 33/723 (4%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRD---KNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYYD +   D    N F+     L  G  SHDPLGLAVAKMSY VY
Sbjct: 30   KLKHRARMVLTWEDAYYDNHDQHDPSENNCFHHTLDNLQSGYCSHDPLGLAVAKMSYHVY 89

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAVSGKH WIF+D+HV  S    E+   W+SQF+AGI+TI VVAV+ HGVV
Sbjct: 90   SLGEGIVGQVAVSGKHQWIFADKHVNSSCSLFEFCDGWQSQFAAGIRTIVVVAVVQHGVV 149

Query: 352  QLGSLHKIA-EDLKLVDHIRNVFSNLQDSLTG--------SMPSSL-----------KNT 471
            QLGSL+K+  ED+KLV HIR+VF  LQDS  G        SM SSL            + 
Sbjct: 150  QLGSLNKVVFEDVKLVSHIRDVFFALQDSSVGHIASPIECSMKSSLFQLDLPTKLLDSDG 209

Query: 472  SPSDKCSRPSNPAFNQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKK 651
             P DK      P  +  L +   P        +SD +  L  S N+     P+G +  + 
Sbjct: 210  IPLDKTVDEQGP--DALLPEFSHP------RKYSDRLFVLPLSNNH-----PKGAVEVEN 256

Query: 652  LKIKMHETTECSTAGSDTLI-----RSSCENISKRQLQEGDMELSDNPKHFGETNGVGDL 816
                 HE  E S+A +D        RS+  N+ + Q Q G + L +N    GE +G  + 
Sbjct: 257  ----KHEGLELSSARNDESAKLLTPRSNVSNL-EHQNQLGRI-LINNGVWKGENSGWKNS 310

Query: 817  GEKAE-INCTAPVGRVEKSRLCNAILPTE-ASEVVASIVPPHNLEPPVFTAHADISFPEL 990
                E +    PVG  E+  + +A   +   +   +  V   +L       +  +  PE 
Sbjct: 311  SLVPENVYANNPVGGRERYGVDHAYFSSNFLNSAHSDTVKSSSLSS---YPNEVLDIPES 367

Query: 991  PSLQMHQDFANSELPITSNSSEMQASPFSFCAGYELFEALGPSF-QKQNNCFWEGENIGV 1167
              ++  +D          +  +   +   F  G EL+EALGP+F +K     W+ EN+  
Sbjct: 368  SDMKFQKDLKKLGNQNEISHLDPMNTSLKFSVGCELYEALGPAFIRKSIYADWQAENMEA 427

Query: 1168 DMAVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERT 1347
               + + EG+SS     E+   +LL+AVV    H G   ++E S   +  SL+T  T  T
Sbjct: 428  GGNIEMPEGMSSSQLTFESGSENLLEAVVANVCHSGSDIKAERSSCRSAPSLLT--TGNT 485

Query: 1348 PCNSVGTLSSAGFSFDRDTSSSFNSVTCDLESLKGISPASSSRGSEHVERSRLPAKLSKK 1527
            P  S   L                   C   S KG S    S  SE  ERS  PAK +KK
Sbjct: 486  PEPSSQKL-------------------CGAMSSKGFSSTCPSNCSEQFERSSEPAKNNKK 526

Query: 1528 RARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEK 1707
            RARPGE+ RPRPRDRQLIQDRIKELREL+PNG+KCSIDSLLERTIKHM+F+Q +TKHA+K
Sbjct: 527  RARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMVFLQGITKHADK 586

Query: 1708 LQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKE 1887
            L KC+ SK+  K  GM   S  EQGSSWAVEVG++ KVC I+VEN N NGQ+LVEMLC+E
Sbjct: 587  LSKCAESKIHHKGAGMLGSSNYEQGSSWAVEVGSHLKVCSIVVENTNKNGQILVEMLCEE 646

Query: 1888 CDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLLQPK 2061
            C  FLE+AE IRSLGL I+KGV+EA+G K W+CFVVE  NNR MHRMD+LWSL+Q+LQ +
Sbjct: 647  CSHFLEIAEAIRSLGLTILKGVTEAHGEKTWICFVVEGQNNRVMHRMDILWSLVQILQSQ 706

Query: 2062 ISN 2070
             +N
Sbjct: 707  ATN 709


>ref|XP_002532375.1| basic helix-loop-helix-containing protein, putative [Ricinus
            communis] gi|223527931|gb|EEF30018.1| basic
            helix-loop-helix-containing protein, putative [Ricinus
            communis]
          Length = 749

 Score =  501 bits (1291), Expect = e-139
 Identities = 331/752 (44%), Positives = 441/752 (58%), Gaps = 62/752 (8%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRD---KNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK R RM+ TWEDAYY+  +  D      F      L  G YS+DP+GLAVAKMSY VY
Sbjct: 25   KLKHRTRMVLTWEDAYYNNCEQHDLLENKCFGETFENLCGGRYSNDPVGLAVAKMSYHVY 84

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WI +D+HV +S  S E+   W+SQFSAGI+TI VVAV+PHGVV
Sbjct: 85   SLGEGIVGQVAVTGKHRWIVADKHVTNSISSFEFSDGWQSQFSAGIRTIIVVAVVPHGVV 144

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDS--------LTGSMPSSL------KNTSPSDKC 489
            QLGSL+K+AED+KLV+HI++VFS+LQDS        L  SM +SL        +  S+  
Sbjct: 145  QLGSLNKVAEDMKLVNHIKDVFSSLQDSSVEQISIPLQYSMKTSLYLPDVPTQSLDSESV 204

Query: 490  SRPSNPAFNQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSC--TLPPRGGISNKKLKIK 663
              P N   N   A  KGP +       S + P L +  ++S   +LP   GI ++K  ++
Sbjct: 205  VIPDNLC-NLDKAADKGPYNQ------STMFPYLQKQSDDSYFYSLP---GI-HQKTAVE 253

Query: 664  MHETTECSTAGSDTLIRSSCENISKRQLQEGDMELSDNPKHF-------------GETNG 804
            +      +  G   L  S   NIS  +L +    +S   +H              G+T+ 
Sbjct: 254  L-----VNKYGGGGL--SLPVNISSVKLLQPRSNISYLEQHNQVGINLVVDHTCGGKTSV 306

Query: 805  VGDLGEKAEINCTAPVGRVEKSR--LCNAILPTEASEVVASIVPPHNLEPPVFTAHAD-- 972
              D G  +E+N T  +    K    LC+ ILP +      +  P   L+  V   H    
Sbjct: 307  WKDPGRGSELNVTPHLDNSVKDNINLCDVILPDQKFGADPANFPMDLLDSTVCDRHKSDE 366

Query: 973  -------ISFPELPSLQMHQDFANSELPITSNSSEMQASP--FSFCAGYELFEALGPSFQ 1125
                   +  PE  S+ + +     +L   + SS +++S     F AG EL EALGP+F 
Sbjct: 367  IDILNGALDMPESSSIDLKKHL-EKKLEYQAGSSHLESSSTFLKFSAGCELHEALGPAFS 425

Query: 1126 KQNNCFW----EGENIGVDMAVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGD-GTES 1290
            K   C +    EG+    D+ + V EGIS+     +    +LLDAVVG   + G    + 
Sbjct: 426  K--GCLYFDCEEGKTESADI-IEVPEGISTSQMTFDTGSENLLDAVVGNVCYSGSTDVKR 482

Query: 1291 EISGLNTEESLMTAETERTPCNSVGTLS-SAGFSFDRDTSSSFNSVTCDLESLKGISPAS 1467
            E S   + +SL+T E    P      ++ SAG+S +R +    ++  C   S  G+  A+
Sbjct: 483  EKSVCKSAQSLLTTEKMPEPSFQAKHITHSAGYSINRQSVVQNDTHNCS--SSTGVRGAT 540

Query: 1468 SSRG---------SEHVERSRLPAKLSKKRARPGESSRPRPRDRQLIQDRIKELRELIPN 1620
            SS G         SE ++R   PA+ +KKRARPGE+ RPRPRDRQLIQDRIKELREL+PN
Sbjct: 541  SSNGYSSNCPSTCSEQLDRRSEPAEKNKKRARPGENCRPRPRDRQLIQDRIKELRELVPN 600

Query: 1621 GSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVE 1800
            G+KCSIDSLLERTIKHM+F++S+TKHA+KL KC+ SK+  K T     S  E+GSSWAVE
Sbjct: 601  GAKCSIDSLLERTIKHMLFLESITKHADKLNKCAESKMYQKGTDT---SNYEKGSSWAVE 657

Query: 1801 VGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAW 1980
            VG + KV  IIVE++N NGQMLVEMLC+EC  FLE+AE IRSLGL I+KG++E +G K W
Sbjct: 658  VGGHLKVSSIIVESLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGITEVHGEKTW 717

Query: 1981 MCFVVE--NNRSMHRMDVLWSLMQLLQPKISN 2070
            +CF+VE  NN+ MHRMD+LWSL+Q+LQPK SN
Sbjct: 718  ICFMVEGQNNKVMHRMDILWSLVQILQPKTSN 749


>emb|CCX35476.1| hypothetical protein [Malus domestica]
          Length = 741

 Score =  501 bits (1290), Expect = e-139
 Identities = 307/714 (42%), Positives = 422/714 (59%), Gaps = 27/714 (3%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDG--NQYRDKN-WFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ T EDAY+D    Q+  +N  F+     L++  YSHDPLGLAVAKMS  VY
Sbjct: 25   KLKHRARMVLTCEDAYFDNCEQQHSSENRCFSKTMDKLHDSHYSHDPLGLAVAKMSCHVY 84

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            +LGEG+VG VAV+G+H WI++D  V ++    +Y   W+SQ+SAGI+TI VVAV+PH V+
Sbjct: 85   NLGEGIVGQVAVTGEHQWIYADDLVKNNCSPFQYCDGWQSQYSAGIRTIVVVAVVPHRVI 144

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLKNTSPSDKCSRP------SNPAF 513
            QLGSL+K+AE++KL+  I + F  LQD     + +  +++  S  CS        ++   
Sbjct: 145  QLGSLNKVAENVKLISQITDAFKTLQDFPIEHILNPKQSSINSSVCSTNISLEGLASGVL 204

Query: 514  NQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKI-KMHETTECST 690
              C+  L    + +  ++W+ + P L +  ++S           +++++   H   E S 
Sbjct: 205  PDCVNNLDTATNRESSDIWASIFPHLVKDNDSSYVSSLTENCLKEEVELANKHGGLESSN 264

Query: 691  AGS---DTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGR- 858
             GS     L +S    +S    +   +EL D+ K  GE++G  D G  A +    P+   
Sbjct: 265  FGSVEIGKLPQSKSSALSMEHHRLVGVELLDSRKCKGESSGCKDTG-MASVIYAHPLSHD 323

Query: 859  -VEKSRLCN-AILPTEASEVVASIVPPHNLEPPVFTAHADISFPELPSLQMHQDFANSEL 1032
             V    LC+ A LPT   +  A      N +      +  +   E   ++  +   N E 
Sbjct: 324  PVNIVNLCDFADLPTTFLDSTAH--ERINADRVDLHQNEVLHVSEPSVVKFQKGLENLEF 381

Query: 1033 PITSNSSEMQASPFSFCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSCS 1209
               S   +  ++  +F AG EL EALGP+F  Q N F W     G  +   + EG+++  
Sbjct: 382  QTESGHMDTSSTSMTFPAGCELHEALGPAFLNQGNYFDWVAGKNGDRITPEIPEGMNTSQ 441

Query: 1210 QLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCNSVG-TLSSAGF 1386
                +   HLL+AVV      G   +SE S   + +SL+T E    P + +  T+ S  +
Sbjct: 442  LTSASCQEHLLEAVVANVCQSGSLVKSEKSFCKSMQSLLTTEKCPEPSSRITHTIDSENY 501

Query: 1387 SFDRDTSSS-------FNSVTCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPGE 1545
            S D+ + +         +S  C + S K  S    S  SE +ERS  P+K SKKRARPGE
Sbjct: 502  SIDQPSLTGEDMQQCLSSSGVCGVISPKWFSSPCPSACSEQLERSSGPSKNSKKRARPGE 561

Query: 1546 SSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSV 1725
            SSRPRPRDRQLIQDRIKELRELIP G+KCSIDSLLERTIKHM+F+QSVTKHA+KL KC+ 
Sbjct: 562  SSRPRPRDRQLIQDRIKELRELIPTGAKCSIDSLLERTIKHMLFLQSVTKHADKLNKCAD 621

Query: 1726 SKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLE 1905
            +KL  K+  M   S  E+GSSWAVEVG N KVC IIVEN+N NGQM+VE++C+EC  FLE
Sbjct: 622  AKLCPKEASMLGSSNYERGSSWAVEVGGNLKVCSIIVENLNKNGQMVVELMCEECSHFLE 681

Query: 1906 VAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLLQPK 2061
            +AE IRS GL I+KGV+EA G+K W+CFVVE  NNRS+HRMD+LWSL+Q+LQPK
Sbjct: 682  IAEAIRSSGLTILKGVTEARGDKTWICFVVEGQNNRSIHRMDILWSLVQILQPK 735


>ref|XP_006383698.1| basic helix-loop-helix family protein [Populus trichocarpa]
            gi|550339661|gb|ERP61495.1| basic helix-loop-helix family
            protein [Populus trichocarpa]
          Length = 694

 Score =  498 bits (1281), Expect = e-138
 Identities = 319/726 (43%), Positives = 407/726 (56%), Gaps = 36/726 (4%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRD---KNWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWED YYD  +  D      F      L  G Y  DPLGLAVAKMSY VY
Sbjct: 27   KLKHRARMVLTWEDGYYDNCEQHDALENKCFRQTQENLRGGHYPRDPLGLAVAKMSYHVY 86

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAVSGKH WIF+D+HV +S  S E+   W+SQFSAGI+TI VVAV+P+GVV
Sbjct: 87   SLGEGIVGQVAVSGKHQWIFADKHVTNSFSSYEFSDGWQSQFSAGIRTIVVVAVVPYGVV 146

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLKNTSPSDKCSRPSNPAFNQCLAK 531
            QLGSL+K++ED+ LV HI++VF  LQDS    +      TSPS                 
Sbjct: 147  QLGSLNKVSEDVNLVTHIKDVFFALQDSTVSHV------TSPSQ---------------- 184

Query: 532  LKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIKMHETTECSTAGSDTLI 711
                                   + N+  L     + NK+  +++   T  +    D L 
Sbjct: 185  ---------------------HGMKNALCLKTAAELKNKQEVLEI--PTPTNDESIDLLN 221

Query: 712  RSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGRV--EKSRLCNA 885
              S  +    + Q G   +SD     GET+   DLG  +E N T        E   L + 
Sbjct: 222  LKSNASYLDHRSQLGMNIISDR-MFGGETSVWKDLGRGSEHNTTMHSNSFMRENVSLSDL 280

Query: 886  ILPTEASEVVASIVPPH---------------NLEPPVFTAHADISFPELPSLQMHQDFA 1020
            +LP E      +  P                 NL P V      ++ PE   +   +D  
Sbjct: 281  VLPNEKLGADLAGFPADLFDSTICDRDKSDSINLRPNVV-----LNAPESSDITFKRDLE 335

Query: 1021 NS-ELPITSNSSEMQASPFSFCAGYELFEALGPSFQKQNNCFWEGENIGVDMAVGV---S 1188
               + P  S       + F F AG EL EALGPSF   N C       G   A  +    
Sbjct: 336  KKLDHPAESTHFNSSDTFFKFSAGCELLEALGPSFL--NRCMPFDYQTGKSEAGNIFEMP 393

Query: 1189 EGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCNSVGT 1368
            EG+SS     +    +LL+AVVG   H G   +SE SG  + +SL+TAE  + P  S+ T
Sbjct: 394  EGMSSSQMTFDFGSENLLEAVVGNVCHSGSDVKSEKSGCKSVQSLVTAE--KLPEPSIQT 451

Query: 1369 ---LSSAGFSFDRDT-------SSSFNSVTCDLESLKGISPASSSRGSEHVERSRLPAKL 1518
               ++SAG+S ++ +       + S ++  C   S KG S    S  SE +++    AK 
Sbjct: 452  KHIMNSAGYSINQSSVVEEDVHNLSNSTEVCGGMSSKGFSSTCPSTYSEQLDKRSESAKN 511

Query: 1519 SKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKH 1698
            SKKRA+PGE+ RPRPRDRQLIQDRIKELREL+PNGSKCSIDSLLERTIKHM+F++++TKH
Sbjct: 512  SKKRAKPGENCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLENITKH 571

Query: 1699 AEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEML 1878
            A+KL KC+  K+  K T   + S  EQGSSWAVEVG + KV  IIVEN+N NGQMLVEML
Sbjct: 572  ADKLNKCAEPKMHQKGT---EASNYEQGSSWAVEVGGHLKVSSIIVENLNKNGQMLVEML 628

Query: 1879 CKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLMQLL 2052
            C+EC  FLE+AE IRSLGL I+KG++E  G K W+CFVVE  NN+ MHRMD+LWSL+Q+L
Sbjct: 629  CEECSHFLEIAEAIRSLGLTILKGITEVQGEKTWICFVVEGQNNKIMHRMDILWSLVQIL 688

Query: 2053 QPKISN 2070
            QPK +N
Sbjct: 689  QPKTTN 694


>ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citrus clementina]
            gi|568851769|ref|XP_006479559.1| PREDICTED: transcription
            factor EMB1444-like [Citrus sinensis]
            gi|557546129|gb|ESR57107.1| hypothetical protein
            CICLE_v10018993mg [Citrus clementina]
          Length = 714

 Score =  496 bits (1277), Expect = e-137
 Identities = 324/729 (44%), Positives = 425/729 (58%), Gaps = 39/729 (5%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYD--GNQYRDKNWFNSNA-GILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK R RM+ TWED YYD  G Q   +N  +S +    + G YSHDPLGLAVAKMSY VY
Sbjct: 30   KLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVY 89

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WIFSDQ V +S  S E+   W+SQFSAGI+TIAVVAV+PHGVV
Sbjct: 90   SLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQSQFSAGIRTIAVVAVVPHGVV 149

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTG----SMPSSLKNTSPSDKCSRPSNP---- 507
            QLGSL ++ ED+K+V HIR+VF+ L D   G    ++ SS+KNT         S P    
Sbjct: 150  QLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSSVKNTLSLPDLPTKSIPNRWH 209

Query: 508  ----AFNQCLAKLKGPV------HADEVNVWSDLIPPLGESV---NNSCTLPPRGGISNK 648
                  N+    ++ P+      H D    +S + P +G+ V   N    L   GG+ + 
Sbjct: 210  NLDEVVNRGGPDVQFPMFPYVEKHNDGSYAFSGMQPKIGDGVVNRNEGILLSSAGGVGSA 269

Query: 649  KLKIKMHETTECSTAGSDTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKA 828
            K+   +H              +S+  N+   Q Q G   +SD      E++G  DLG  +
Sbjct: 270  KI---LHP-------------KSNVINLD-YQNQMGIHFISDGMSRV-ESSGWKDLGVIS 311

Query: 829  EINCT--APVGRVEKSRLCNAILPTE---------ASEVVASIVPPH-NLEPPVFTAHAD 972
            E N T  +    ++   LC+  L  E         AS  + +++     LE      +  
Sbjct: 312  EQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLECTDSCQNGM 371

Query: 973  ISFPELPSLQMHQDFANSELPITSNSSEMQASPFSFCAGYELFEALGPSF-QKQNNCFWE 1149
            +  PE+  ++  +D    +     N  +       F A  EL EALGP+F +K      E
Sbjct: 372  LHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPAFLRKDIYNDRE 431

Query: 1150 GENIGVDMAVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMT 1329
             EN      VG+ E  SS   + ++   +LLDAVV    + G   +SE +   + +SL+T
Sbjct: 432  PENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASVCNSGSDVKSERTVCRSMQSLLT 491

Query: 1330 AETERTPCNSVGTLSSAGFSFDRDTSSSFNSVTCDLESLKGISPASSSRGSEHVERSRLP 1509
              TE+ P                ++SS  +S        KG S    S  SE ++ S  P
Sbjct: 492  --TEKKP----------------ESSSQMSS--------KGFSSTCPSTCSEQLDMSSEP 525

Query: 1510 AKLSKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSV 1689
            AK +KKRAR GE+ RPRPRDRQLIQDRIKELREL+PNGSKCSIDSLLERTIKHM+F+QS+
Sbjct: 526  AKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSI 585

Query: 1690 TKHAEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLV 1869
            TKHA+KL KC+ SK+  K  G+   S  EQGSSWAVE+G++ KVC I+VEN+N NGQMLV
Sbjct: 586  TKHADKLSKCAESKMHQKGNGIHG-SNYEQGSSWAVEMGSHLKVCSIVVENLNKNGQMLV 644

Query: 1870 EMLCKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRMDVLWSLM 2043
            EMLC+EC  FLE+AE IRSLGL I+KGV+EA+G+K W+CFVVE  +NR MHRMDVLWSL+
Sbjct: 645  EMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRIMHRMDVLWSLV 704

Query: 2044 QLLQPKISN 2070
            QLLQ K ++
Sbjct: 705  QLLQSKTTS 713


>ref|XP_004146986.1| PREDICTED: transcription factor EMB1444-like [Cucumis sativus]
          Length = 677

 Score =  491 bits (1265), Expect = e-136
 Identities = 305/706 (43%), Positives = 401/706 (56%), Gaps = 21/706 (2%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDK---NWFNSNAGILNEGPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWED YYD ++  +     +F        +G YSHD LGLAVAKMSY VY
Sbjct: 26   KLKHRARMVLTWEDGYYDNSEQHEPPEGKFFRKTLETFYDGHYSHDALGLAVAKMSYHVY 85

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WI +D+ + +              FS+ I+TI VVAV+PHGV+
Sbjct: 86   SLGEGIVGQVAVTGKHQWITADEQIPN--------------FSSTIETIVVVAVVPHGVL 131

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQDSLTGSMPSSLKNTSPSDKCSRPSNPAFNQCLAK 531
            QLGSL K+ ED+ LV  IRNVF  LQ+S  G +       S       PS     + LA 
Sbjct: 132  QLGSLDKVTEDVNLVTRIRNVFLTLQESSAGEIKPMHSCKSSGYMADIPS-----RSLAT 186

Query: 532  LKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRG-GISNKKLKIKMHETTECSTAGSDTL 708
             KG V +   NV  +L    G     S T  P G  + N K ++++ +   C        
Sbjct: 187  EKGEVASVSKNVGLELS---GSEAFESLTTKPDGINVENFKSQVRLLDDRMCG------- 236

Query: 709  IRSSCENISKRQLQEGDMELSDNPKHFGETNGVGD--LGEKAEINCTAPVGRVEKSRLCN 882
                                       GE +G  D  +G K +IN  +    ++   +C 
Sbjct: 237  ---------------------------GEPSGCKDKAVGLKQKINVQSQNSTMDMVNICG 269

Query: 883  AILPTEASEVVASIVPPHNLEPPVFTAHADISFPELPSLQMHQDFANSELPITSNSSEMQ 1062
             +LP E    + +     ++ P   +A+  ++   +     H +         S + EM 
Sbjct: 270  NLLPAEK---IMTNDAYFSMNPHPSSAYDGVNHNGMFIRTNHTEMYLQNDMEASETIEMY 326

Query: 1063 ASPFS--FCAGYELFEALGPSFQKQNNCF-WEGENIGVDMAVGVSEGISSCSQLMENSDM 1233
             S  S  F AGYEL E LGP+F K      W+ E +    A  +SEG+S      ++   
Sbjct: 327  PSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVLGGKAFELSEGMSGSQLTSDSPTE 386

Query: 1234 HLLDAVVGRASHKGDGTESEISGLNTEESLMTAETERTPCNSVGTLS-SAGFSFDRDTSS 1410
             LL+AVV    H G   +S+ S   + +SL+T E    P  +V T + S G+S  +  +S
Sbjct: 387  RLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTERIPEPSTNVTTSACSEGYSMGQSQTS 446

Query: 1411 SF---------NSVTCDLESLKGISPASSSRGSEHVERSRLPAKLSKKRARPGESSRPRP 1563
                       +S  C + S KG S   S  GSEH+++S  PAK SK+RARPGESSRPRP
Sbjct: 447  FTGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLDKSSEPAKNSKRRARPGESSRPRP 506

Query: 1564 RDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQSVTKHAEKLQKCSVSKLLDK 1743
            RDRQLIQDRIKELREL+PNG+KCSIDSLLERTIKHM+F+Q +TKHA+KL KC+  KL  K
Sbjct: 507  RDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLTKCANMKLHQK 566

Query: 1744 DTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQMLVEMLCKECDQFLEVAETIR 1923
             +GM   S ++QGSSWAVEVG   KVC IIVEN+N NGQ+LVEMLC+EC  FLE+AE IR
Sbjct: 567  GSGMLGTSDTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIR 626

Query: 1924 SLGLNIMKGVSEAYGNKAWMCFVV--ENNRSMHRMDVLWSLMQLLQ 2055
            SLGL I+KG++EA+G K W+CFVV  ENNR++HRMD+LWSL+Q+LQ
Sbjct: 627  SLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQ 672


>ref|XP_003553489.1| PREDICTED: transcription factor EMB1444-like [Glycine max]
          Length = 756

 Score =  481 bits (1238), Expect = e-133
 Identities = 307/733 (41%), Positives = 424/733 (57%), Gaps = 46/733 (6%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDGNQYRDKNWFNSNAGILNE---GPYSHDPLGLAVAKMSYQVY 171
            KLK RARM+ TWEDAYY+     D +        L +   G +SH  LGLAVAKMSY  Y
Sbjct: 25   KLKHRARMILTWEDAYYNNPDDFDSSENKHCQKTLEQIGCGKFSHSALGLAVAKMSYHAY 84

Query: 172  SLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGVV 351
            SLGEG+VG VAV+GKH WI +D  V  S  S E+   W+SQFSAGI+TIAVVAV+P GVV
Sbjct: 85   SLGEGIVGQVAVTGKHRWICADNQVASSGLSFEFADGWQSQFSAGIRTIAVVAVVPLGVV 144

Query: 352  QLGSLHKIAEDLKLVDHIRNVFSNLQD---SLTGSMPSSLKNTSPSDKCSRP-SNPAFNQ 519
            QLGSL+K+ ED+  V HIRN+F + Q+        +  SLK++S  DK     S+     
Sbjct: 145  QLGSLNKVIEDMGFVTHIRNLFLSTQNYSIQCPSQIQGSLKSSSQLDKSKENFSSDIMRT 204

Query: 520  CLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIKMHETTECSTAGS 699
            C    +  + ++  +V   L+P        +CT  P          +   E  E     S
Sbjct: 205  CFYDTQKSMKSETADV---LMPLQCSGTGRNCT--PPSACEKMSDNVAKQEGPELYNDES 259

Query: 700  DTLIRS--SCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGR--VEK 867
              L++S  +  N+  ++ +E  M+     K+ G ++G  D+  ++E N ++ +     + 
Sbjct: 260  SILLQSISNMMNVDCQEFEE--MKPLYGTKYEGGSSGCKDMRLESEKNVSSFLNDFVTDN 317

Query: 868  SRLCNAILPTEASEVVASIVPPHNLEPPVFTA----HADIS------FPELPSLQMHQDF 1017
            +   + I P+E   V ++  P   L+  V  +    +ADI+      F +       Q  
Sbjct: 318  ASFNDVICPSEKVRVDSACFPSVFLDTVVCESDKLHYADINQKGAVNFAQPSEANSQQHI 377

Query: 1018 ANSEL---PITSNSSEMQASP--------FSFCAGYELFEALGPSFQKQNNCFWEGENIG 1164
              S+    P   +  + Q  P          F AG EL EALGP+F K   C      I 
Sbjct: 378  EKSKFHTEPCYKDIPDFQTEPCYKDASHILKFPAGCELHEALGPAFLKGGKCLDWPAQIN 437

Query: 1165 VDM-AVGVSEGISSCSQLMENSDMHLLDAVVGRASHKGDGTESEISGLNTEES-LMTAET 1338
             +M +V +S+ IS+     E+   HLL+A++   SH  +   SE+S   +++S +++A+ 
Sbjct: 438  QEMKSVEMSDEISTSQLTSESCPEHLLEAMLANFSHSNNDVNSELSFCKSKQSAIVSAKN 497

Query: 1339 ERTPCNSVGTLSSAGFSFD--------RDTSSSFNSVTCDLESLKGISPASSSRGSEHVE 1494
                 ++V T++S G+S D        +  S S +S  C + S KGIS    S  S  +E
Sbjct: 498  HEASIHNVHTINSEGYSIDQLSLVREDKHHSLSSSSGICGVMSSKGISSTFHSSNSGQLE 557

Query: 1495 RSRLPAKLSKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMI 1674
            RS  P+K SKKRARPGES RPRPRDRQLIQDRIKELREL+PNG+KCSIDSLLERTIKHM+
Sbjct: 558  RSSEPSKNSKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHML 617

Query: 1675 FMQSVTKHAEKLQKCS--VSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENIN 1848
            F+QS+TKHA+KL   S   SKL  K+  +   S  EQGSSWA+EVG + KV  I+VEN++
Sbjct: 618  FLQSITKHADKLTDFSDTKSKLHHKEADILGSSSYEQGSSWAMEVGGHLKVHSILVENLS 677

Query: 1849 MNGQMLVEMLCKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE--NNRSMHRM 2022
             NGQMLVEMLC+EC+ FLE+AE IRSLGL I+KG ++A+G K W+CFVVE  N R++HR+
Sbjct: 678  KNGQMLVEMLCEECNHFLEIAEAIRSLGLTILKGATKAHGEKMWICFVVEGQNKRNVHRL 737

Query: 2023 DVLWSLMQLLQPK 2061
            D+LW L+Q+LQ K
Sbjct: 738  DILWPLVQILQSK 750


>ref|XP_006588678.1| PREDICTED: transcription factor EMB1444-like [Glycine max]
          Length = 733

 Score =  460 bits (1184), Expect = e-126
 Identities = 305/732 (41%), Positives = 407/732 (55%), Gaps = 45/732 (6%)
 Frame = +1

Query: 1    KLKQRARMMFTWEDAYYDG----NQYRDKNWFNSNAGILNEGPYSHDPLGLAVAKMSYQV 168
            KLKQRARM+ TWEDAYYD         +K+  NS   I     +SHDPLGLAVAKMSY V
Sbjct: 25   KLKQRARMILTWEDAYYDNPSICESSENKSCHNSLEQI-GSADFSHDPLGLAVAKMSYHV 83

Query: 169  YSLGEGVVGHVAVSGKHSWIFSDQHVVDSSFSSEYYGAWKSQFSAGIKTIAVVAVIPHGV 348
            YSLGEG++G VAV+GKH WI  D HV  S  S E+   W+SQFSAGI+TI VVAV+  GV
Sbjct: 84   YSLGEGIIGQVAVTGKHRWICVDNHVTSSGPSFEFADGWQSQFSAGIRTIVVVAVVALGV 143

Query: 349  VQLGSLHKIAEDLKLVDHIRNVFSNLQD----SLTGSMPSSLKNTSPS-DKCSRPSNPAF 513
            VQLGSL+K+ ED+ +V  IR++F + QD     +   + +S+KN+S   D  +  S PA 
Sbjct: 144  VQLGSLNKVTEDMGVVSCIRSLFLSTQDYTISHVHNQVQNSVKNSSSVLDTKTSKSMPAL 203

Query: 514  NQCLAKLKGPVHADEVNVWSDLIPPLGESVNNSCTLPPRGGISNKKLKIKMHETTECSTA 693
            +     +K            D++ P      N     P        + +  H+  E ++ 
Sbjct: 204  HDTEKTMKHEA--------LDILMPFQCPRKN---YSPHAVHQKMVVDVAKHDFPELNSD 252

Query: 694  GSDTLIRSSCENISKRQLQEGDMELSDNPKHFGETNGVGDLGEKAEINCTAPVGRVEKSR 873
             S  L++S    ++  Q +   M   +  K  G + G  D   ++        G+   S 
Sbjct: 253  RSSILLQSMSNMMNVEQQKLVGMRPVNESKFEGNS-GCEDKSLES--------GKNVSSF 303

Query: 874  LCNAILPTEASEVVASIVPPHNLEPPVFTA--------------HADISFPELPSLQMHQ 1011
            L N ++       +A       ++P  F++              + DI+   + ++    
Sbjct: 304  LHNLVMDNNGVNDLACPSENVGVDPVSFSSGFLDAAVCVSDKFQYVDINEKGVLNVPRPS 363

Query: 1012 DFANSELPITSNSSEMQASP--------FSFCAGYELFEALGPSFQKQNNCFWEGENIGV 1167
            D   +   I S  S+ Q  P          F AGYEL EALGPSF K + CF        
Sbjct: 364  D---ANFQIKSEKSKFQTEPCYKDTSYTMKFPAGYELHEALGPSFLKGSKCFNWAAEANQ 420

Query: 1168 DMAVGVSEGISSCSQLM-ENSDMHLLDAVVGRASHKGDGTESEISGLNTEESLMTAETER 1344
            D+         SCSQL  E    HLL+A+V   SH  +   SE+S   + ++ + +   R
Sbjct: 421  DVKNAEMSDEISCSQLTSEFRPEHLLEAMVANISHSNNNVNSELSFSTSMQAAIASG--R 478

Query: 1345 TPCNSVGTLSSAGFSFDR------DTSSSFNSV-TCDLESLKGISPASSSRGSEHVERSR 1503
             P  SV T++S G S D+      D   S +S   C + S KG S    S  SE  ERS 
Sbjct: 479  NPEGSVHTINSEGCSIDQLPFVKEDKHYSLSSSGICGVMSPKGFSSTCPSSCSEQFERSS 538

Query: 1504 LPAKLSKKRARPGESSRPRPRDRQLIQDRIKELRELIPNGSKCSIDSLLERTIKHMIFMQ 1683
             P K SKKRARPGES RPRPRDRQLIQDRIKELREL+PNG+KCSIDSLLE TIKHM+F+Q
Sbjct: 539  EPTKNSKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLECTIKHMLFLQ 598

Query: 1684 SVTKHAEKLQKCSVSKLLDKDTGMRKFSRSEQGSSWAVEVGNNQKVCPIIVENINMNGQM 1863
            ++TKHA+KL K + +K   K   M K    +QGSSWA+EVG + KV  I+VEN+N NGQM
Sbjct: 599  NITKHADKLNKFADTK--TKLHHMEKDIPGQQGSSWAMEVGGHLKVSSILVENLNQNGQM 656

Query: 1864 LVEMLCKECDQFLEVAETIRSLGLNIMKGVSEAYGNKAWMCFVVE------NNRSMHRMD 2025
             VEM+C+EC  FLE+A+ IRSLG+ I+ G +EA+G K ++CFVVE      NNR++HR+D
Sbjct: 657  FVEMVCEECSHFLEIADAIRSLGMTILNGATEAHGEKTFVCFVVEAGSEGQNNRNLHRLD 716

Query: 2026 VLWSLMQLLQPK 2061
            +LWSL+QLLQ K
Sbjct: 717  ILWSLVQLLQSK 728

