BLASTX nr result

ID: Catharanthus22_contig00001750 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00001750
         (4678 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006351645.1| PREDICTED: transcription factor bHLH155-like...   690   0.0  
ref|XP_004247231.1| PREDICTED: transcription factor EMB1444-like...   669   0.0  
ref|XP_006351643.1| PREDICTED: transcription factor bHLH155-like...   664   0.0  
emb|CBI37092.3| unnamed protein product [Vitis vinifera]              644   0.0  
ref|XP_003632423.1| PREDICTED: uncharacterized basic helix-loop-...   630   e-177
gb|EXC31934.1| hypothetical protein L484_009784 [Morus notabilis]     575   e-161
gb|EOX94495.1| Basic helix-loop-helix DNA-binding superfamily pr...   568   e-159
gb|EMJ01507.1| hypothetical protein PRUPE_ppa001930mg [Prunus pe...   557   e-155
ref|XP_004292200.1| PREDICTED: transcription factor bHLH155-like...   556   e-155
ref|XP_002532375.1| basic helix-loop-helix-containing protein, p...   551   e-153
gb|EOX94493.1| Basic helix-loop-helix-containing protein, putati...   547   e-152
emb|CCX35476.1| hypothetical protein [Malus domestica]                546   e-152
gb|EOX94494.1| Basic helix-loop-helix DNA-binding superfamily pr...   543   e-151
ref|XP_006443866.1| hypothetical protein CICLE_v10018993mg [Citr...   525   e-146
ref|XP_006383698.1| basic helix-loop-helix family protein [Popul...   525   e-146
ref|XP_006351644.1| PREDICTED: transcription factor bHLH155-like...   516   e-143
ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citr...   512   e-142
ref|XP_004161538.1| PREDICTED: transcription factor EMB1444-like...   501   e-139
gb|ESW16913.1| hypothetical protein PHAVU_007G194600g [Phaseolus...   498   e-137
ref|XP_006588678.1| PREDICTED: transcription factor EMB1444-like...   494   e-136

>ref|XP_006351645.1| PREDICTED: transcription factor bHLH155-like isoform X3 [Solanum
            tuberosum]
          Length = 752

 Score =  690 bits (1780), Expect = 0.0
 Identities = 408/819 (49%), Positives = 504/819 (61%), Gaps = 33/819 (4%)
 Frame = -2

Query: 4062 MGKEDSMLL-PTVGPPIKRRAGLRRKQAGRGSYRGS*KFLRGKIRFFSSRVLLIGPFLRF 3886
            MGK+D MLL  TVGPPIKRRAGLRRKQAGRG                             
Sbjct: 1    MGKDDGMLLLSTVGPPIKRRAGLRRKQAGRG----------------------------- 31

Query: 3885 RIESWEGGGVIF*SSKVDLGARMGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWED 3706
               S+ G                       LRSLC +T WKYAVFWKL HR RMMLTWED
Sbjct: 32   ---SYRG----------------------TLRSLCCNTPWKYAVFWKLTHRARMMLTWED 66

Query: 3705 AYYDNDN-PEKNCSSSSADNLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHL 3529
            AYYDND  P K    S+A NL+DGH S++ LG+AVAKMSYHVYSL           G+HL
Sbjct: 67   AYYDNDGFPGKKSPGSTAGNLYDGHYSNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHL 126

Query: 3528 WISADKHMNDPGLLFECHDGWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNH 3349
            W+SADK      L  E  DGW+ QFSAGIKTI          +QLGSLD I EDL+ + H
Sbjct: 127  WLSADKVAAITSLAPEHCDGWQAQFSAGIKTIVVAAVAPHGVIQLGSLDSIPEDLRAIKH 186

Query: 3348 IRGVFFELQDPLAT--------SMESSCLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDF 3193
            IR VF ELQ+ +A+        SME+SCLS+++TRTSGS V +DC+ NL R + + G + 
Sbjct: 187  IRDVFSELQELMASCLRSSMQYSMENSCLSEISTRTSGSEVFQDCVNNLGRSVCEDGRNM 246

Query: 3192 WSSIYSSIEKPDHYPYNFSIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDA 3013
            WS +Y+S+EK   +   FS PG + N+I+E ++        V+G    +N       S  
Sbjct: 247  WSPLYTSVEKSVDHSCIFSQPGGFPNKILEAVHNQGLHRTSVQGSDDSENLLPASCESSI 306

Query: 3012 EQQQKHEPSGGIGNTRCEGQSSGFKSL-------VECLENNNVSLSDKVCCKNSVCDTAH 2854
             + Q+        + + EGQ+S  + L        E    ++ S+         V +   
Sbjct: 307  IKHQEEGQMWEETDPKFEGQTSNLRVLGKGSVDKCEPTFRSDASIGSVSYDAGQVTECPQ 366

Query: 2853 PASNSAMLIAGLPSEVQDSTACNDERDSSCVPELPDMHV--CKDTT--------NVLHMP 2704
            P  N+             S A ND      + +LP+ +   C +T         + +H P
Sbjct: 367  PNRNNLA-----------SEADNDRNRKLGLSDLPNAYADKCAETNLGFETQCNDTMHTP 415

Query: 2703 FRFCAGYELYEALGPAFRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLE 2524
            FRFCAGYELYEALGP F+K N+   W A   E +MAV+M E +  SSL+ +N+G EHLLE
Sbjct: 416  FRFCAGYELYEALGPVFQKGNSSKDWEAGKRE-EMAVDMLEGIGTSSLVMSNTGNEHLLE 474

Query: 2523 AVVANISSEVDKTD--KSFVXXXXXXXXXXXXE-PCTSDVGSISSAGYSFDRETLNSLNS 2353
            AV+AN++   +     KSF               PC+SD+G+ISS GYSFDRETLNS NS
Sbjct: 475  AVIANVNRYDNDCSSVKSFCKSVDSLLTTEITAEPCSSDIGAISSIGYSFDRETLNSFNS 534

Query: 2352 SGAYGVRYSKGLSSNSCSRGS---EKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKE 2182
            SG   +R S+GLSS SCSRGS   E+P E  K+HKKRARPGESCRPRPRDRQLIQDRIKE
Sbjct: 535  SGTCSIRSSRGLSSTSCSRGSGHVERPLEPVKMHKKRARPGESCRPRPRDRQLIQDRIKE 594

Query: 2181 LRELVPNGSKCSIDSLLERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQ 2002
            LR+LVPNGSKCSIDSLLERTIKHM+FMQS+TKHA+KL+KC+ SKL++++    G+S  E 
Sbjct: 595  LRDLVPNGSKCSIDSLLERTIKHMLFMQSVTKHADKLSKCSASKLVDKESDICGSSSHEV 654

Query: 2001 GSSWAVEVGTNSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTE 1822
            GSSWAVEVG N KVCP+ VEN+ MNGQMLVE+  E+ SHFL+IAEAIRS+GLTILKG+ E
Sbjct: 655  GSSWAVEVGNNQKVCPMRVENLGMNGQMLVEIF-EDGSHFLDIAEAIRSLGLTILKGLAE 713

Query: 1821 SCGEKTWMRFVVEGQNNRSLHRMDVLWSLMQLLQPKINV 1705
            +  E+T M FVVEGQN+R+LHRMDVLWSLMQLLQ KINV
Sbjct: 714  AYSERTRMCFVVEGQNDRTLHRMDVLWSLMQLLQAKINV 752


>ref|XP_004247231.1| PREDICTED: transcription factor EMB1444-like [Solanum lycopersicum]
          Length = 724

 Score =  669 bits (1725), Expect = 0.0
 Identities = 384/734 (52%), Positives = 480/734 (65%), Gaps = 29/734 (3%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDNDN-PEKNCSSSSADNLH 3643
            M S+L +ALRSLC +T WKYAVFWKL HR RMMLTWEDAYYDND  P K    S+A NL+
Sbjct: 1    MASQLQQALRSLCCNTPWKYAVFWKLTHRARMMLTWEDAYYDNDGFPGKKSPDSTAGNLY 60

Query: 3642 DGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGWK 3463
            DGH S++ LG+AVAKMSYHVYSL           G+HLW+SA+K      L  E  DGW+
Sbjct: 61   DGHYSNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHLWLSANKVAAITNLAPEHCDGWQ 120

Query: 3462 TQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLAT-------- 3307
             QFSAGIKTI          VQLGSLD I EDL+ + HIR VF ELQ+ + +        
Sbjct: 121  AQFSAGIKTIVVAAVAPHGVVQLGSLDSIPEDLRAIKHIRDVFSELQELMTSCLRSSMQH 180

Query: 3306 SMESSCLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPG 3127
            SME+SCLS+++TRTSGS + +DC+ NL R + +   + WS +Y+S EK   +   F  PG
Sbjct: 181  SMENSCLSEISTRTSGSEIFQDCVNNLGRSVCEDRRNMWSPLYTSFEKSVDHSCIFLQPG 240

Query: 3126 SYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEGQSS 2947
             Y N+I+E++N      + V+G     N       S   + Q+        + + EGQ+S
Sbjct: 241  GYPNKILEVVNNQRLHRSSVQGSDDSTNLLPASCESSIIKHQEEGQMWEETDPKFEGQTS 300

Query: 2946 GFKSLVECLENNNVSLSDKVCCKNSVCDTA-HPASNSAMLIAGLPSEVQD---STACNDE 2779
              +     L   +V  S+     N   DT+    S  A  +   P   ++   S A ND 
Sbjct: 301  NLR----VLGKGSVDKSEP----NFKSDTSIGSVSYDAGQVTECPQRNRNNLASEAYNDR 352

Query: 2778 RDSSCVPELPDMHV--CKDTT--------NVLHMPFRFCAGYELYEALGPAFRKQNTDCG 2629
                 + +LP+ +   C +T         + +H PFRFCAGYELYEALGP F+K N+   
Sbjct: 353  NRMLGLSDLPNAYADKCAETNLGFGTECNDTMHTPFRFCAGYELYEALGPVFQKGNSSKD 412

Query: 2628 WGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANISSEVDKTD--KSFVXXXXX 2455
            W A   E +MAV+M E +  SSL+ +N+G EHLLEAV+AN++   +     KSF      
Sbjct: 413  WEAGKRE-EMAVDMLEGIGTSSLVMSNTGNEHLLEAVIANVNRHDNDCSSVKSFCKSVDS 471

Query: 2454 XXXXXXXE-PCTSDVGSISSAGYSFDRETLNSLNSSGAYGVRYSKGLSSNSCSRGS---E 2287
                     PC+SD+G+ISS GYSFDRETLNS NSSG   +R S+GLSS SCSRGS   E
Sbjct: 472  LLTTEITAEPCSSDIGTISSTGYSFDRETLNSFNSSGTCSIRSSRGLSSTSCSRGSGHVE 531

Query: 2286 KPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMI 2107
            +P E  K+HKKRARPGESCRPRPRDRQLIQDRIKELR+LVPNGSKCSIDSLLERTIKHM+
Sbjct: 532  RPLEPVKMHKKRARPGESCRPRPRDRQLIQDRIKELRDLVPNGSKCSIDSLLERTIKHML 591

Query: 2106 FMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVVENINMN 1927
            FMQS+TKHA+KL+KC+ SKL +++    G+S  E GSSWAVEVG N KVCP+ VEN+ MN
Sbjct: 592  FMQSVTKHADKLSKCSASKLADKESGICGSSSHEVGSSWAVEVGNNQKVCPMRVENLGMN 651

Query: 1926 GQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRSLHRMDV 1747
            GQMLVE+  E+ SHFL+IAEAIRS+GLTILKG+ E+ GE+T M FVVEGQN+R+LHRMDV
Sbjct: 652  GQMLVEIF-EDGSHFLDIAEAIRSLGLTILKGLAEAYGERTRMCFVVEGQNDRTLHRMDV 710

Query: 1746 LWSLMQLLQPKINV 1705
            LWSLMQLLQ KIN+
Sbjct: 711  LWSLMQLLQAKINL 724


>ref|XP_006351643.1| PREDICTED: transcription factor bHLH155-like isoform X1 [Solanum
            tuberosum]
          Length = 722

 Score =  664 bits (1712), Expect = 0.0
 Identities = 381/737 (51%), Positives = 477/737 (64%), Gaps = 32/737 (4%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDNDN-PEKNCSSSSADNLH 3643
            M S+L +ALRSLC +T WKYAVFWKL HR RMMLTWEDAYYDND  P K    S+A NL+
Sbjct: 1    MASQLQQALRSLCCNTPWKYAVFWKLTHRARMMLTWEDAYYDNDGFPGKKSPGSTAGNLY 60

Query: 3642 DGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGWK 3463
            DGH S++ LG+AVAKMSYHVYSL           G+HLW+SADK      L  E  DGW+
Sbjct: 61   DGHYSNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHLWLSADKVAAITSLAPEHCDGWQ 120

Query: 3462 TQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLAT-------- 3307
             QFSAGIKTI          +QLGSLD I EDL+ + HIR VF ELQ+ +A+        
Sbjct: 121  AQFSAGIKTIVVAAVAPHGVIQLGSLDSIPEDLRAIKHIRDVFSELQELMASCLRSSMQY 180

Query: 3306 SMESSCLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPG 3127
            SME+SCLS+++TRTSGS V +DC+ NL R + + G + WS +Y+S+EK   +   FS PG
Sbjct: 181  SMENSCLSEISTRTSGSEVFQDCVNNLGRSVCEDGRNMWSPLYTSVEKSVDHSCIFSQPG 240

Query: 3126 SYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEGQSS 2947
             + N+I+E ++        V+G    +N       S   + Q+        + + EGQ+S
Sbjct: 241  GFPNKILEAVHNQGLHRTSVQGSDDSENLLPASCESSIIKHQEEGQMWEETDPKFEGQTS 300

Query: 2946 GFKSL-------VECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDSTAC 2788
              + L        E    ++ S+         V +   P  N+             S A 
Sbjct: 301  NLRVLGKGSVDKCEPTFRSDASIGSVSYDAGQVTECPQPNRNNLA-----------SEAD 349

Query: 2787 NDERDSSCVPELPDMHV--CKDTT--------NVLHMPFRFCAGYELYEALGPAFRKQNT 2638
            ND      + +LP+ +   C +T         + +H PFRFCAGYELYEALGP F+K N+
Sbjct: 350  NDRNRKLGLSDLPNAYADKCAETNLGFETQCNDTMHTPFRFCAGYELYEALGPVFQKGNS 409

Query: 2637 DCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANISSEVDKTD--KSFVXX 2464
               W A   E +MAV+M E +  SSL+ +N+G EHLLEAV+AN++   +     KSF   
Sbjct: 410  SKDWEAGKRE-EMAVDMLEGIGTSSLVMSNTGNEHLLEAVIANVNRYDNDCSSVKSFCKS 468

Query: 2463 XXXXXXXXXXE-PCTSDVGSISSAGYSFDRETLNSLNSSGAYGVRYSKGLSSNSCSRGS- 2290
                        PC+SD+G+ISS GYSFDRETLNS NSSG   +R S+GLSS SCSRGS 
Sbjct: 469  VDSLLTTEITAEPCSSDIGAISSIGYSFDRETLNSFNSSGTCSIRSSRGLSSTSCSRGSG 528

Query: 2289 --EKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIK 2116
              E+P E  K+HKKRARPGESCRPRPRDRQLIQDRIKELR+LVPNGSKCSIDSLLERTIK
Sbjct: 529  HVERPLEPVKMHKKRARPGESCRPRPRDRQLIQDRIKELRDLVPNGSKCSIDSLLERTIK 588

Query: 2115 HMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVVENI 1936
            HM+FMQS+TKHA+KL+KC+ SKL++++    G+S  E GSSWAVEVG N KVCP+ VEN+
Sbjct: 589  HMLFMQSVTKHADKLSKCSASKLVDKESDICGSSSHEVGSSWAVEVGNNQKVCPMRVENL 648

Query: 1935 NMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRSLHR 1756
             MNGQMLVE+  E+ SHFL+IAEAIRS+GLTILKG+ E+  E+T M FVVE  N+R+LHR
Sbjct: 649  GMNGQMLVEIF-EDGSHFLDIAEAIRSLGLTILKGLAEAYSERTRMCFVVE--NDRTLHR 705

Query: 1755 MDVLWSLMQLLQPKINV 1705
            MDVLWSLMQLLQ KINV
Sbjct: 706  MDVLWSLMQLLQAKINV 722


>emb|CBI37092.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  644 bits (1661), Expect = 0.0
 Identities = 390/825 (47%), Positives = 484/825 (58%), Gaps = 46/825 (5%)
 Frame = -2

Query: 4050 DSMLLPTVGPPIKRRAGLRRKQAGRGSYRGS*KFLRGKIRFFSSRVLLIGPFLRFRIESW 3871
            D +LLPTVGPPIKRRAGLR KQ                                      
Sbjct: 2    DRLLLPTVGPPIKRRAGLRIKQ-------------------------------------- 23

Query: 3870 EGGGVIF*SSKVDLGARMGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN 3691
                           A M + L + LRSLC +T WKYAVFWKLKHR RM+LTWEDAYYDN
Sbjct: 24   ---------------AEMATDLQQTLRSLCFNTEWKYAVFWKLKHRARMVLTWEDAYYDN 68

Query: 3690 ----DNPEKNCSSSSADNLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWI 3523
                D  E  C S + D LHDGH SHD LGLAVAKMSYHVYSL           G+H WI
Sbjct: 69   HDQHDPLEDKCFSKTPDTLHDGHYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWI 128

Query: 3522 SADKHMNDPGLLFECHDGWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIR 3343
             +DKH  +    FE  DGW+ QFSAGIKTI          VQLGSL Q+ EDLK+V+ I+
Sbjct: 129  FSDKHTTNSSSSFEYCDGWQAQFSAGIKTIVVVAVVPHGVVQLGSLQQVVEDLKLVSRIK 188

Query: 3342 GVFFELQD--------PLATSMESS-CLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFW 3190
             VFF LQD        P+  SM+SS  +SD++TR S S +  D + NLD+ I K   + W
Sbjct: 189  DVFFALQDSSVAYIPHPIQCSMKSSLAMSDISTRGSASDIVPDSLFNLDKGIHKERPNVW 248

Query: 3189 SSIYSSIEKPDHYPYNFSIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAE 3010
            S ++    K +   + F +P  +QN+ V M N   G             F  P S +   
Sbjct: 249  SPMFPIFGKHNDSSFIFQLPAIHQNRAVNMFNKDGGLELSSSQSDESTKFLQPRSENFVL 308

Query: 3009 QQQKHEPSGGIGNTRCEGQSSGFKSLVECLENNNVSLSDKVCCKN-SVCDTAHPASNSAM 2833
            + QK      I NT+ E ++SG++      E+N+ S       +N + C TA  A  S +
Sbjct: 309  EGQKQVQMKLISNTKRE-EASGWRDADVSSEHNDTSYPYNSFMENINSCSTALAADKSQV 367

Query: 2832 LIAGLPSEVQDSTACN---------DERDSSCVPELPDMHVCKDTTNVLHMP-------- 2704
              A  P    DS  CN          E     +P+  DM + K+    L  P        
Sbjct: 368  DFACFPFGFFDSVDCNRIKLHGVNCHENGVLHLPDPSDMQLQKNLEKKLEFPSELSHVDT 427

Query: 2703 ----FRFCAGYELYEALGPAFRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTE 2536
                 RF AG EL+EALGPAF KQ+  C W  E  ET+  +E+ E M  SS LT++SG+E
Sbjct: 428  SYTSLRFSAGSELHEALGPAFLKQSNYCDWETEKAETETTIELPEGM-SSSQLTSDSGSE 486

Query: 2535 HLLEAVVANI--SSEVDKTDKSFVXXXXXXXXXXXXE-PCTSDVGSISSAGYSFDR---- 2377
            +LLEAVVA +  S    K++KSF               P +  + +++SAGYS D+    
Sbjct: 487  NLLEAVVAKVCQSGSDVKSEKSFCQSMQSLLTTEKIPEPSSHTIHTVTSAGYSIDQSSLV 546

Query: 2376 -ETLNSLNSSGAYGVRYSKGLSS---NSCSRGSEKPQESNKVHKKRARPGESCRPRPRDR 2209
             ET N   SS   GV   +G+SS   +SCS   E+  E +KV+KKRARPGESCRPRPRDR
Sbjct: 547  EETQNCFKSSEVCGVTSQQGISSICPSSCSEQLERSAEPSKVNKKRARPGESCRPRPRDR 606

Query: 2208 QLIQDRIKELRELVPNGSKCSIDSLLERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPR 2029
            QLIQDRIKELRELVPNGSKCSIDSLLERTIKHM+F+QSIT+HA+KLNKC  SKL  ++  
Sbjct: 607  QLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITRHADKLNKCAESKLHSKETG 666

Query: 2028 GRGASVAEQGSSWAVEVGTNSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMG 1849
              G+S  EQGSSWAVEVG++ KVCPI+VEN+NM+GQM+VE++CEECS FLEIAEAIRS+G
Sbjct: 667  VLGSSNYEQGSSWAVEVGSHMKVCPIIVENLNMDGQMVVEMVCEECSRFLEIAEAIRSLG 726

Query: 1848 LTILKGVTESCGEKTWMRFVVEGQNNRSLHRMDVLWSLMQLLQPK 1714
            LTILKGVTE+ GEKTW+ FVVEGQN+R++ RMD+LWSL+Q+LQPK
Sbjct: 727  LTILKGVTEARGEKTWICFVVEGQNSRNMRRMDILWSLVQILQPK 771


>ref|XP_003632423.1| PREDICTED: uncharacterized basic helix-loop-helix protein
            At1g06150-like [Vitis vinifera]
          Length = 749

 Score =  630 bits (1625), Expect = e-177
 Identities = 370/748 (49%), Positives = 463/748 (61%), Gaps = 46/748 (6%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSAD 3652
            M + L + LRSLC +T WKYAVFWKLKHR RM+LTWEDAYYDN    D  E  C S + D
Sbjct: 1    MATDLQQTLRSLCFNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPLEDKCFSKTPD 60

Query: 3651 NLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHD 3472
             LHDGH SHD LGLAVAKMSYHVYSL           G+H WI +DKH  +    FE  D
Sbjct: 61   TLHDGHYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDKHTTNSSSSFEYCD 120

Query: 3471 GWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD--------P 3316
            GW+ QFSAGIKTI          VQLGSL Q+ EDLK+V+ I+ VFF LQD        P
Sbjct: 121  GWQAQFSAGIKTIVVVAVVPHGVVQLGSLQQVVEDLKLVSRIKDVFFALQDSSVAYIPHP 180

Query: 3315 LATSMESS-CLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNF 3139
            +  SM+SS  +SD++TR S S +  D + NLD+ I K   + WS ++    K +   + F
Sbjct: 181  IQCSMKSSLAMSDISTRGSASDIVPDSLFNLDKGIHKERPNVWSPMFPIFGKHNDSSFIF 240

Query: 3138 SIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCE 2959
             +P  +QN+ V M N   G             F  P S +   + QK      I NT+ E
Sbjct: 241  QLPAIHQNRAVNMFNKDGGLELSSSQSDESTKFLQPRSENFVLEGQKQVQMKLISNTKRE 300

Query: 2958 GQSSGFKSLVECLENNNVSLSDKVCCKN-SVCDTAHPASNSAMLIAGLPSEVQDSTACN- 2785
             ++SG++      E+N+ S       +N + C TA  A  S +  A  P    DS  CN 
Sbjct: 301  -EASGWRDADVSSEHNDTSYPYNSFMENINSCSTALAADKSQVDFACFPFGFFDSVDCNR 359

Query: 2784 --------DERDSSCVPELPDMHVCKDTTNVLHMP------------FRFCAGYELYEAL 2665
                     E     +P+  DM + K+    L  P             RF AG EL+EAL
Sbjct: 360  IKLHGVNCHENGVLHLPDPSDMQLQKNLEKKLEFPSELSHVDTSYTSLRFSAGSELHEAL 419

Query: 2664 GPAFRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI--SSEVD 2491
            GPAF KQ+  C W  E  ET+  +E+ E M  SS LT++SG+E+LLEAVVA +  S    
Sbjct: 420  GPAFLKQSNYCDWETEKAETETTIELPEGM-SSSQLTSDSGSENLLEAVVAKVCQSGSDV 478

Query: 2490 KTDKSFVXXXXXXXXXXXXE-PCTSDVGSISSAGYSFDR-----ETLNSLNSSGAYGVRY 2329
            K++KSF               P +  + +++SAGYS D+     ET N   SS   GV  
Sbjct: 479  KSEKSFCQSMQSLLTTEKIPEPSSHTIHTVTSAGYSIDQSSLVEETQNCFKSSEVCGVTS 538

Query: 2328 SKGLSS---NSCSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNG 2158
             +G+SS   +SCS   E+  E +KV+KKRARPGESCRPRPRDRQLIQDRIKELRELVPNG
Sbjct: 539  QQGISSICPSSCSEQLERSAEPSKVNKKRARPGESCRPRPRDRQLIQDRIKELRELVPNG 598

Query: 2157 SKCSIDSLLERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEV 1978
            SKCSIDSLLERTIKHM+F+QSIT+HA+KLNKC  SKL  ++    G+S  EQGSSWAVEV
Sbjct: 599  SKCSIDSLLERTIKHMLFLQSITRHADKLNKCAESKLHSKETGVLGSSNYEQGSSWAVEV 658

Query: 1977 GTNSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWM 1798
            G++ KVCPI+VEN+NM+GQM+VE++CEECS FLEIAEAIRS+GLTILKGVTE+ GEKTW+
Sbjct: 659  GSHMKVCPIIVENLNMDGQMVVEMVCEECSRFLEIAEAIRSLGLTILKGVTEARGEKTWI 718

Query: 1797 RFVVEGQNNRSLHRMDVLWSLMQLLQPK 1714
             FVVEGQN+R++ RMD+LWSL+Q+LQPK
Sbjct: 719  CFVVEGQNSRNMRRMDILWSLVQILQPK 746


>gb|EXC31934.1| hypothetical protein L484_009784 [Morus notabilis]
          Length = 750

 Score =  575 bits (1483), Expect = e-161
 Identities = 352/751 (46%), Positives = 453/751 (60%), Gaps = 49/751 (6%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYD----NDNPEKNCSSSSAD 3652
            MG+ L + LRSLC +T WKYAVFWKLKHR RM+LTWEDAYYD    +D  E  C S   +
Sbjct: 1    MGTDLQQILRSLCFNTEWKYAVFWKLKHRARMVLTWEDAYYDKSEQHDPAENKCFSKKLE 60

Query: 3651 NLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECH- 3475
              HDG  SHDPLGLAVAK+SYHVYSL           G+H WI ADKH       FE + 
Sbjct: 61   KSHDGLYSHDPLGLAVAKLSYHVYSLGEGIVGQVAVSGKHQWIFADKHKLSTYSSFEHYS 120

Query: 3474 DGWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD-------- 3319
            DGW+ QFSAGIKTIA         VQLGS +++ ED+++VNHIR VF  LQD        
Sbjct: 121  DGWQNQFSAGIKTIAVVAVVPHGVVQLGSFNEVLEDMELVNHIRDVFMSLQDSLVGHVPV 180

Query: 3318 PLATSMESSC-LSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYN 3142
            P+ +S+ SS  L D+ +++  S    DC+ NLD+ ++  G D W SI+  + K    PY 
Sbjct: 181  PIQSSVNSSVNLQDIPSKSFTSETVPDCLHNLDKTLNGEGPDIWFSIFPYVGKDGDSPYV 240

Query: 3141 FSIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGI--GNT 2968
             S+P +YQ + V+++N + G      G    ++  L  SR++  + + H+  G     N 
Sbjct: 241  LSLPNNYQEKAVDVVNKHGGLEFSTNGTD--ESAKLLQSRTNILEHENHKVIGMNLRDNW 298

Query: 2967 RCEGQSSGFK-SLVECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDSTA 2791
            +C G+    K + V  + N N  L   V    ++     PA    +  A   S +  S  
Sbjct: 299  KCAGEIDSCKDAAVGPVNNGNPFLCGSVMGDVNLPSIVLPAEKVEVDSAHFSSGLVGSAV 358

Query: 2790 CNDER-DSSCVPELPDMHVC--------KDTTNVLHMP-----------FRFCAGYELYE 2671
            C+  R DS    +   +HV         KD  N+                +F AGYEL+E
Sbjct: 359  CDRVRLDSVDYYQNGVLHVSGPSNTKFQKDPDNLEFQTELSHIDTSSTSLKFPAGYELHE 418

Query: 2670 ALGPAFRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI--SSE 2497
            ALGPAF K +    W A  TE   A+EM E+M  S  L A+S  EHLLEAV+AN+  S  
Sbjct: 419  ALGPAFLKNSKYFDWEATETE-GTALEMPEQM-SSRQLAADSHPEHLLEAVIANVCQSHS 476

Query: 2496 VDKTDKSFVXXXXXXXXXXXXEPCTSDVGSIS-SAGYSFDRETLNS------LNSSGAYG 2338
              K++KSF                +S    I+ S+ +S  + ++        L+SSG  G
Sbjct: 477  DVKSEKSFCKSVQSLLSTEKYPKPSSHTTLITDSSNHSIGQPSVKGEDKQHCLSSSGICG 536

Query: 2337 VRYSKGLSSNSCSRGSEKPQES---NKVHKKRARPGESCRPRPRDRQLIQDRIKELRELV 2167
            V   KG SS   S  SE+ + S   NK +KKRARPGE+CRPRPRDRQLIQDRIKELREL+
Sbjct: 537  VMSPKGFSSTCPSASSEQLERSSVHNKNNKKRARPGENCRPRPRDRQLIQDRIKELRELI 596

Query: 2166 PNGSKCSIDSLLERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWA 1987
            PNG+KCSIDSLLERTIKHM+++QSI KHA+KLNK   +KL  ++     +S  E+GSSWA
Sbjct: 597  PNGAKCSIDSLLERTIKHMLYLQSIAKHADKLNKYADTKLCHKETSMLESSTYERGSSWA 656

Query: 1986 VEVGTNSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEK 1807
            VEVG N KVC IVVEN+N +GQM+VE++CEECSHFLEIAEAI+S+GLTILKGVTE+ GEK
Sbjct: 657  VEVGGNLKVCSIVVENLNKSGQMVVEMMCEECSHFLEIAEAIKSLGLTILKGVTEAHGEK 716

Query: 1806 TWMRFVVEGQNNRSLHRMDVLWSLMQLLQPK 1714
            TW+ FVVEGQ+NRSLHRMD+LWSL+Q+LQPK
Sbjct: 717  TWICFVVEGQSNRSLHRMDILWSLVQILQPK 747


>gb|EOX94495.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 3
            [Theobroma cacao]
          Length = 737

 Score =  568 bits (1464), Expect = e-159
 Identities = 350/734 (47%), Positives = 449/734 (61%), Gaps = 36/734 (4%)
 Frame = -2

Query: 3813 SRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSADNL 3646
            S LH+ LRSLCL+T WKYAVFWKLKHR RM+LTWEDAYYDN    D  E NC   + DNL
Sbjct: 8    SGLHQILRSLCLNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPSENNCFHHTLDNL 67

Query: 3645 HDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGW 3466
              G+ SHDPLGLAVAKMSYHVYSL           G+H WI ADKH+N    LFE  DGW
Sbjct: 68   QSGYCSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVNSSCSLFEFCDGW 127

Query: 3465 KTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD--------PLA 3310
            ++QF+AGI+TI          VQLGSL+++ ED+K+V+HIR VFF LQD        P+ 
Sbjct: 128  QSQFAAGIRTIVVVAVVQHGVVQLGSLNKVFEDVKLVSHIRDVFFALQDSSVGHIASPIE 187

Query: 3309 TSMESSCLS-DVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSI 3133
             SM+SS    D+ T+   S    D I  LD+ +D+ G D     +S   K     +   +
Sbjct: 188  CSMKSSLFQLDLPTKLLDS----DGIP-LDKTVDEQGPDALLPEFSHPRKYSDRLFVLPL 242

Query: 3132 PGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGI--GNTRCE 2959
              ++    VE+ N +EG    +      ++  L   RS+    +     G I   N   +
Sbjct: 243  SNNHPKGAVEVENKHEGLE--LSSARNDESAKLLTPRSNVSNLEHQNQLGRILINNGVWK 300

Query: 2958 GQSSGFK--SLV--ECLENNNVSLS-----DKVCCKNSVCDTAHPASNSAMLIAGLPSEV 2806
            G++SG+K  SLV      NN V        D     ++  ++AH  +  +  ++  P+EV
Sbjct: 301  GENSGWKNSSLVPENVYANNPVGGRERYGVDHAYFSSNFLNSAHSDTVKSSSLSSYPNEV 360

Query: 2805 QDSTACNDERDSSCVPELPDMHVCKDTTNVLHMPFRFCAGYELYEALGPAFRKQNTDCGW 2626
             D    +D +    + +L + +      + ++   +F  G ELYEALGPAF +++    W
Sbjct: 361  LDIPESSDMKFQKDLKKLGNQNEISHL-DPMNTSLKFSVGCELYEALGPAFIRKSIYADW 419

Query: 2625 GAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI---SSEVDKTDKSFVXXXXX 2455
             AE  E    +EM E M  SS LT  SG+E+LLEAVVAN+    S++     S       
Sbjct: 420  QAENMEAGGNIEMPEGM-SSSQLTFESGSENLLEAVVANVCHSGSDIKAERSSCRSAPSL 478

Query: 2454 XXXXXXXEPCTSDVGSISSAGYSFDRETL------NSLNSSGAYGVRYSKGLSS---NSC 2302
                   EP +    +I+SAGYS ++ +L      + LNSS   G   SKG SS   ++C
Sbjct: 479  LTTGNTPEPSSQSKHTINSAGYSINQSSLVEDNTQHCLNSSELCGAMSSKGFSSTCPSNC 538

Query: 2301 SRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERT 2122
            S   E+  E  K +KKRARPGE+ RPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLERT
Sbjct: 539  SEQFERSSEPAKNNKKRARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERT 598

Query: 2121 IKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVVE 1942
            IKHM+F+Q ITKHA+KL+KC  SK+  +     G+S  EQGSSWAVEVG++ KVC IVVE
Sbjct: 599  IKHMVFLQGITKHADKLSKCAESKIHHKGAGMLGSSNYEQGSSWAVEVGSHLKVCSIVVE 658

Query: 1941 NINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRSL 1762
            N N NGQ+LVE+LCEECSHFLEIAEAIRS+GLTILKGVTE+ GEKTW+ FVVEGQNNR +
Sbjct: 659  NTNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGEKTWICFVVEGQNNRVM 718

Query: 1761 HRMDVLWSLMQLLQ 1720
            HRMD+LWSL+Q+LQ
Sbjct: 719  HRMDILWSLVQILQ 732


>gb|EMJ01507.1| hypothetical protein PRUPE_ppa001930mg [Prunus persica]
          Length = 739

 Score =  557 bits (1435), Expect = e-155
 Identities = 340/739 (46%), Positives = 439/739 (59%), Gaps = 39/739 (5%)
 Frame = -2

Query: 3813 SRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSADNL 3646
            S LH  LRSLC +T W YA+FWKLK+R RM+LTWEDAYYDN    D+ E  C + + D L
Sbjct: 4    SDLHHVLRSLCFNTEWNYAIFWKLKYRARMVLTWEDAYYDNCEQHDSSENRCFNKTLDRL 63

Query: 3645 HDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGW 3466
            HD H SHDPLGLAVAKMSYHVY+L            +H WI AD    +    F+  DGW
Sbjct: 64   HDSHYSHDPLGLAVAKMSYHVYTLGEGIVGQVAVTRKHQWIFADNLFKNNCSPFQYCDGW 123

Query: 3465 KTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD--------PLA 3310
            ++QFSAGI+TI           QLGSL+++ E++K+V+ IR VF  LQD        PL 
Sbjct: 124  QSQFSAGIRTIVVVAVPHGVV-QLGSLNKVIENVKLVSEIRDVFSTLQDSPVEQIRNPLQ 182

Query: 3309 TSMESS-CLSDVTTRTSGSGVSRDCIQNLDRRIDKS-GVDFWSSIYSSIEKPDHYPYNFS 3136
            + + SS CL+ ++ +   SGV  DC+ NLD+  ++    D WSSI+  I K     Y F 
Sbjct: 183  SGINSSACLTSISPKGLASGVITDCLHNLDKAANREESPDVWSSIFPHIGKDSDSSYVFP 242

Query: 3135 IPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEG 2956
            +P +   + VE+ N + G  +   GC      H   S     +  K      +  T+C+G
Sbjct: 243  LPENCLKKAVELANKHGGLESSNLGCLESAKLHQSKSSILNSEHCKLVGVELLDRTKCKG 302

Query: 2955 QSSGFKS--LVECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDSTACND 2782
            +SSG K   +   + +N +S    V    ++CD+A    ++  L +     V        
Sbjct: 303  ESSGCKDTRMASMIYSNPLS-HGSVQENVNLCDSAD--LSATFLNSAAHGRVNVDRVDFY 359

Query: 2781 ERDSSCVPELPDMHVCKDTTNVL------HMP-----FRFCAGYELYEALGPAFRKQNTD 2635
            + +   V E  D+   KD  N+       HM        F AG EL+EALGPAF  +   
Sbjct: 360  QNEVLQVSEPSDVKFQKDLENLDFQTESGHMDTSSTSMAFPAGCELHEALGPAFLNKGNY 419

Query: 2634 CGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI--SSEVDKTDKSFVXXX 2461
              W AE     + +EM E M    L T++S  EHLLEAVVAN+  S    K++KSF    
Sbjct: 420  FDWEAEKNGDGITIEMPEGMKTGQL-TSDSCQEHLLEAVVANVCHSGTDVKSEKSFCKSM 478

Query: 2460 XXXXXXXXXE-PCTSDVGSISSAGYSFDR------ETLNSLNSSGAYGVRYSKGLSS--- 2311
                       P +    +I S  YS D+      +T   L+SSG  GV   K  SS   
Sbjct: 479  QSLLTTEKYPEPSSHTTHTIDSENYSIDQPSLIAEDTQQCLSSSGVCGVISPKWFSSPCP 538

Query: 2310 NSCSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLL 2131
            ++CS   E+    +K +KKRARPGE+ RPRPRDRQLIQDRIKELREL+PNG+KCSIDSLL
Sbjct: 539  SACSEQLERSSGPSKNNKKRARPGENSRPRPRDRQLIQDRIKELRELIPNGAKCSIDSLL 598

Query: 2130 ERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPI 1951
            ERTIKHM+F+QSITKHA+KLNKC  +K    +    G+S  E+GSSWAVEVG N KVC I
Sbjct: 599  ERTIKHMLFLQSITKHADKLNKCADAK----EASMLGSSNYERGSSWAVEVGGNLKVCSI 654

Query: 1950 VVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNN 1771
            +VEN+N NGQM+VE++CEECSHFLEIAEAIRS+GLTILKGVTE+  +KTW+ FVVEGQNN
Sbjct: 655  MVENLNKNGQMVVEMMCEECSHFLEIAEAIRSLGLTILKGVTEARSDKTWICFVVEGQNN 714

Query: 1770 RSLHRMDVLWSLMQLLQPK 1714
            RS+HRMD+LWSL+Q+LQPK
Sbjct: 715  RSIHRMDILWSLVQILQPK 733


>ref|XP_004292200.1| PREDICTED: transcription factor bHLH155-like [Fragaria vesca subsp.
            vesca]
          Length = 756

 Score =  556 bits (1434), Expect = e-155
 Identities = 339/797 (42%), Positives = 448/797 (56%), Gaps = 18/797 (2%)
 Frame = -2

Query: 4050 DSMLLPTVGPPIKRRAGLRRKQAGRGSYRGS*KFLRGKIRFFSSRVLLIGPFLRFRIESW 3871
            D + L  VGPPIKRRAGLRRKQAGRGSYRG                              
Sbjct: 4    DRLPLAAVGPPIKRRAGLRRKQAGRGSYRG------------------------------ 33

Query: 3870 EGGGVIF*SSKVDLGARMGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN 3691
                            +MG+ LHR LRSLC +T W YA+FWKLKHR RM+LTWEDAYYDN
Sbjct: 34   ----------------KMGTDLHRVLRSLCFNTEWNYAIFWKLKHRARMVLTWEDAYYDN 77

Query: 3690 ----DNPEKNCSSSSADNLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWI 3523
                DN        + + LH  H+ HD LGLA+AKMSYHVY+L           G+H WI
Sbjct: 78   CEQYDNSGNRSFIKTLEALHGNHNMHDSLGLAMAKMSYHVYTLGEGIVGQVAITGKHQWI 137

Query: 3522 SADKHMNDPGLLFECHDGWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIR 3343
             AD  + D     E  DGW++QF AGI+TI          VQLGSL +I E++++++HI+
Sbjct: 138  FADNIVKDNCSPSEYCDGWQSQFLAGIRTIVVVAVVPHGVVQLGSLKKITENVELISHIK 197

Query: 3342 GVFFELQDPLATSMESSCLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEK 3163
              F   + P    ++SS +  ++ +   SG   DC+QNLD+ I++   D W S +    K
Sbjct: 198  DAFIGSKIPHLQHIQSSIV--ISPKILASGAFPDCLQNLDKAINREKSDVWLSAFPHSGK 255

Query: 3162 PDHYPYNFSIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSG 2983
                 Y F + G+++N  VE++N +    +   G       H   S     +  K     
Sbjct: 256  DGDSSYIFPLTGNFKNA-VEVVNKHGELESSNIGGDESPKLHQSKSSIFNLENSKLVGVE 314

Query: 2982 GIGNTRCEGQSSGFKSLVECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQ 2803
             + + +C G+SSG K +     N+   LS    C +      +   N  + +  +     
Sbjct: 315  LLDSRKCTGESSGCKDMGISSTNSADPLSHANDCADLSSTFVNSDVNDRVNLDSIDLYRN 374

Query: 2802 DSTACNDERDSSCVPELPDMHVCKDT--TNVLHMPFRFCAGYELYEALGPAFRKQNTDCG 2629
            +    ++  D      L ++    +    +       F AG EL+EALGPAF  ++    
Sbjct: 375  EVLHVSEPSDVKFQSNLDNLKFQTELGQADTSSSSLMFPAGCELHEALGPAFMHKSNFFD 434

Query: 2628 WGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI--SSEVDKTDKSFVXXXXX 2455
            W AE        EM E M +SS LT++S  EHLLEAVVA +  S    K++KSF      
Sbjct: 435  WEAEKIGDRTTAEMPEGM-NSSQLTSDSCPEHLLEAVVAKVCHSGSHVKSEKSFCKSMQS 493

Query: 2454 XXXXXXXE-PCTSDVGSISSAGYSFDR------ETLNSLNSSGAYGVRYSKGLSS---NS 2305
                     P +    ++ S  YS D+      +T   L+SSG  GV   K  SS   ++
Sbjct: 494  LLTTEKYPEPSSHTTHTLDSENYSIDQPSMRGEDTQQCLSSSGICGVISPKWFSSPCPSA 553

Query: 2304 CSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLER 2125
            CS   E+     + +KKRARPGE+ RPRPRDRQLIQDRIKELREL PNG+KCSIDSLLER
Sbjct: 554  CSEQQERSSGPARNNKKRARPGETSRPRPRDRQLIQDRIKELRELTPNGAKCSIDSLLER 613

Query: 2124 TIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVV 1945
            TIKHM+F+QSITKHA+KLNKC  +KL  ++    G++  E+GSSWAVEVG N KVC IVV
Sbjct: 614  TIKHMLFLQSITKHADKLNKCADAKLCPKETSMLGSTNYERGSSWAVEVGGNLKVCSIVV 673

Query: 1944 ENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRS 1765
            EN+N NGQM+VE++CEECSHFLEIAEAIRS+ LTILKG+TE+ G+KTW+ F+VE QNNR+
Sbjct: 674  ENLNKNGQMVVEMICEECSHFLEIAEAIRSLSLTILKGLTEARGDKTWICFIVEAQNNRN 733

Query: 1764 LHRMDVLWSLMQLLQPK 1714
            +HRMD+LWSL+Q+LQPK
Sbjct: 734  IHRMDILWSLVQILQPK 750


>ref|XP_002532375.1| basic helix-loop-helix-containing protein, putative [Ricinus
            communis] gi|223527931|gb|EEF30018.1| basic
            helix-loop-helix-containing protein, putative [Ricinus
            communis]
          Length = 749

 Score =  551 bits (1419), Expect = e-153
 Identities = 345/755 (45%), Positives = 453/755 (60%), Gaps = 53/755 (7%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSAD 3652
            MG+ LH  LRSLC +T WKYAVFWKLKHRTRM+LTWEDAYY+N    D  E  C   + +
Sbjct: 1    MGTDLHNTLRSLCFNTDWKYAVFWKLKHRTRMVLTWEDAYYNNCEQHDLLENKCFGETFE 60

Query: 3651 NLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHD 3472
            NL  G  S+DP+GLAVAKMSYHVYSL           G+H WI ADKH+ +    FE  D
Sbjct: 61   NLCGGRYSNDPVGLAVAKMSYHVYSLGEGIVGQVAVTGKHRWIVADKHVTNSISSFEFSD 120

Query: 3471 GWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD--------P 3316
            GW++QFSAGI+TI          VQLGSL+++AED+K+VNHI+ VF  LQD        P
Sbjct: 121  GWQSQFSAGIRTIIVVAVVPHGVVQLGSLNKVAEDMKLVNHIKDVFSSLQDSSVEQISIP 180

Query: 3315 LATSMESSC-LSDVTTRT--SGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPY 3145
            L  SM++S  L DV T++  S S V  D + NLD+  DK   +  S+++  ++K     Y
Sbjct: 181  LQYSMKTSLYLPDVPTQSLDSESVVIPDNLCNLDKAADKGPYN-QSTMFPYLQKQSDDSY 239

Query: 3144 NFSIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGG--IGN 2971
             +S+PG +Q   VE++N Y G   L     I  +  L   RS+    ++H   G   + +
Sbjct: 240  FYSLPGIHQKTAVELVNKY-GGGGLSLPVNIS-SVKLLQPRSNISYLEQHNQVGINLVVD 297

Query: 2970 TRCEGQSSGFKSLVECLENN-NVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDST 2794
              C G++S +K      E N    L + V    ++CD   P        A  P ++ DST
Sbjct: 298  HTCGGKTSVWKDPGRGSELNVTPHLDNSVKDNINLCDVILPDQKFGADPANFPMDLLDST 357

Query: 2793 ACNDERDSSC--------VPELPDMHVCKDTTNVLHMP------------FRFCAGYELY 2674
             C+  +            +PE   + + K     L                +F AG EL+
Sbjct: 358  VCDRHKSDEIDILNGALDMPESSSIDLKKHLEKKLEYQAGSSHLESSSTFLKFSAGCELH 417

Query: 2673 EALGPAFRKQNT--DCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI-- 2506
            EALGPAF K     DC  G   TE+   +E+ E +  +S +T ++G+E+LL+AVV N+  
Sbjct: 418  EALGPAFSKGCLYFDCEEGK--TESADIIEVPEGI-STSQMTFDTGSENLLDAVVGNVCY 474

Query: 2505 --SSEVDKTDKSFVXXXXXXXXXXXXEPCTSDVGSISSAGYSFDRETL------NSLNSS 2350
              S++V +                  EP         SAGYS +R+++      N  +S+
Sbjct: 475  SGSTDVKREKSVCKSAQSLLTTEKMPEPSFQAKHITHSAGYSINRQSVVQNDTHNCSSST 534

Query: 2349 GAYGVRYSKGLSSN---SCSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKEL 2179
            G  G   S G SSN   +CS   ++  E  + +KKRARPGE+CRPRPRDRQLIQDRIKEL
Sbjct: 535  GVRGATSSNGYSSNCPSTCSEQLDRRSEPAEKNKKRARPGENCRPRPRDRQLIQDRIKEL 594

Query: 2178 RELVPNGSKCSIDSLLERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQG 1999
            RELVPNG+KCSIDSLLERTIKHM+F++SITKHA+KLNKC  SK+ +   +G   S  E+G
Sbjct: 595  RELVPNGAKCSIDSLLERTIKHMLFLESITKHADKLNKCAESKMYQ---KGTDTSNYEKG 651

Query: 1998 SSWAVEVGTNSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTES 1819
            SSWAVEVG + KV  I+VE++N NGQMLVE+LCEECSHFLEIAEAIRS+GLTILKG+TE 
Sbjct: 652  SSWAVEVGGHLKVSSIIVESLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGITEV 711

Query: 1818 CGEKTWMRFVVEGQNNRSLHRMDVLWSLMQLLQPK 1714
             GEKTW+ F+VEGQNN+ +HRMD+LWSL+Q+LQPK
Sbjct: 712  HGEKTWICFMVEGQNNKVMHRMDILWSLVQILQPK 746


>gb|EOX94493.1| Basic helix-loop-helix-containing protein, putative isoform 1
            [Theobroma cacao]
          Length = 708

 Score =  547 bits (1410), Expect = e-152
 Identities = 341/728 (46%), Positives = 435/728 (59%), Gaps = 30/728 (4%)
 Frame = -2

Query: 3813 SRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSADNL 3646
            S LH+ LRSLCL+T WKYAVFWKLKHR RM+LTWEDAYYDN    D  E NC   + DNL
Sbjct: 8    SGLHQILRSLCLNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPSENNCFHHTLDNL 67

Query: 3645 HDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGW 3466
              G+ SHDPLGLAVAKMSYHVYSL           G+H WI ADKH+N    LFE  DGW
Sbjct: 68   QSGYCSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVNSSCSLFEFCDGW 127

Query: 3465 KTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD--------PLA 3310
            ++QF+AGI+TI          VQLGSL+++ ED+K+V+HIR VFF LQD        P+ 
Sbjct: 128  QSQFAAGIRTIVVVAVVQHGVVQLGSLNKVFEDVKLVSHIRDVFFALQDSSVGHIASPIE 187

Query: 3309 TSMESSCLS-DVTTRTSGS-GVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFS 3136
             SM+SS    D+ T+   S G+       LD+ +D+ G D     +S   K     +   
Sbjct: 188  CSMKSSLFQLDLPTKLLDSDGIP------LDKTVDEQGPDALLPEFSHPRKYSDRLFVLP 241

Query: 3135 IPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGG--IGNTRC 2962
            +  ++    VE+ N +EG    +      ++  L   RS+    +     G   I N   
Sbjct: 242  LSNNHPKGAVEVENKHEGLE--LSSARNDESAKLLTPRSNVSNLEHQNQLGRILINNGVW 299

Query: 2961 EGQSSGFK--SLV--ECLENNNVSLS-----DKVCCKNSVCDTAHPASNSAMLIAGLPSE 2809
            +G++SG+K  SLV      NN V        D     ++  ++AH  +  +  ++  P+E
Sbjct: 300  KGENSGWKNSSLVPENVYANNPVGGRERYGVDHAYFSSNFLNSAHSDTVKSSSLSSYPNE 359

Query: 2808 VQDSTACNDERDSSCVPELPDMHVCKDTTNVLHMPFRFCAGYELYEALGPAFRKQNTDCG 2629
            V D    +D +    + +L + +      + ++   +F  G ELYEALGPAF +++    
Sbjct: 360  VLDIPESSDMKFQKDLKKLGNQNEISH-LDPMNTSLKFSVGCELYEALGPAFIRKSIYAD 418

Query: 2628 WGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI--SSEVDKTDKSFVXXXXX 2455
            W AE  E    +EM E M  SS LT  SG+E+LLEAVVAN+  S    K ++S       
Sbjct: 419  WQAENMEAGGNIEMPEGM-SSSQLTFESGSENLLEAVVANVCHSGSDIKAERS------- 470

Query: 2454 XXXXXXXEPCTSDVGSISSAGYSFDRETLNSLNSSGAYGVRYSKGLSS---NSCSRGSEK 2284
                           S  SA            +S    G   SKG SS   ++CS   E+
Sbjct: 471  ---------------SCRSAPSLLTTGNTPEPSSQKLCGAMSSKGFSSTCPSNCSEQFER 515

Query: 2283 PQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMIF 2104
              E  K +KKRARPGE+ RPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLERTIKHM+F
Sbjct: 516  SSEPAKNNKKRARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMVF 575

Query: 2103 MQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVVENINMNG 1924
            +Q ITKHA+KL+KC  SK+  +     G+S  EQGSSWAVEVG++ KVC IVVEN N NG
Sbjct: 576  LQGITKHADKLSKCAESKIHHKGAGMLGSSNYEQGSSWAVEVGSHLKVCSIVVENTNKNG 635

Query: 1923 QMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRSLHRMDVL 1744
            Q+LVE+LCEECSHFLEIAEAIRS+GLTILKGVTE+ GEKTW+ FVVEGQNNR +HRMD+L
Sbjct: 636  QILVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGEKTWICFVVEGQNNRVMHRMDIL 695

Query: 1743 WSLMQLLQ 1720
            WSL+Q+LQ
Sbjct: 696  WSLVQILQ 703


>emb|CCX35476.1| hypothetical protein [Malus domestica]
          Length = 741

 Score =  546 bits (1406), Expect = e-152
 Identities = 340/755 (45%), Positives = 444/755 (58%), Gaps = 53/755 (7%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSAD 3652
            MG+ LH  LRSLC +T W YAV WKLKHR RM+LT EDAY+DN     + E  C S + D
Sbjct: 1    MGTDLHNILRSLCFNTEWNYAVSWKLKHRARMVLTCEDAYFDNCEQQHSSENRCFSKTMD 60

Query: 3651 NLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHD 3472
             LHD H SHDPLGLAVAKMS HVY+L           G H WI AD  + +    F+  D
Sbjct: 61   KLHDSHYSHDPLGLAVAKMSCHVYNLGEGIVGQVAVTGEHQWIYADDLVKNNCSPFQYCD 120

Query: 3471 GWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD--------P 3316
            GW++Q+SAGI+TI          +QLGSL+++AE++K+++ I   F  LQD        P
Sbjct: 121  GWQSQYSAGIRTIVVVAVVPHRVIQLGSLNKVAENVKLISQITDAFKTLQDFPIEHILNP 180

Query: 3315 LATSMESS-CLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNF 3139
              +S+ SS C ++++     SGV  DC+ NLD   ++   D W+SI+  + K +   Y  
Sbjct: 181  KQSSINSSVCSTNISLEGLASGVLPDCVNNLDTATNRESSDIWASIFPHLVKDNDSSYVS 240

Query: 3138 SIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGG--IGNTR 2965
            S+  +   + VE+ N + G  +     G  +   LP S+S A   + H   G   + + +
Sbjct: 241  SLTENCLKEEVELANKHGGLES--SNFGSVEIGKLPQSKSSALSMEHHRLVGVELLDSRK 298

Query: 2964 CEGQSSGFKSLVECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLI------AGLPSEVQ 2803
            C+G+SSG      C +    S+             AHP S+  + I      A LP+   
Sbjct: 299  CKGESSG------CKDTGMASVI-----------YAHPLSHDPVNIVNLCDFADLPTTFL 341

Query: 2802 DSTA---CNDER---DSSCVPELPDMHVCKDTTNVLHMPFR--------------FCAGY 2683
            DSTA    N +R     + V  + +  V K    + ++ F+              F AG 
Sbjct: 342  DSTAHERINADRVDLHQNEVLHVSEPSVVKFQKGLENLEFQTESGHMDTSSTSMTFPAGC 401

Query: 2682 ELYEALGPAFRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI- 2506
            EL+EALGPAF  Q     W A      +  E+ E M ++S LT+ S  EHLLEAVVAN+ 
Sbjct: 402  ELHEALGPAFLNQGNYFDWVAGKNGDRITPEIPEGM-NTSQLTSASCQEHLLEAVVANVC 460

Query: 2505 -SSEVDKTDKSFVXXXXXXXXXXXXE-PCTSDVGSISSAGYSFDRETLNS------LNSS 2350
             S  + K++KSF               P +    +I S  YS D+ +L        L+SS
Sbjct: 461  QSGSLVKSEKSFCKSMQSLLTTEKCPEPSSRITHTIDSENYSIDQPSLTGEDMQQCLSSS 520

Query: 2349 GAYGVRYSKGLSS---NSCSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKEL 2179
            G  GV   K  SS   ++CS   E+    +K  KKRARPGES RPRPRDRQLIQDRIKEL
Sbjct: 521  GVCGVISPKWFSSPCPSACSEQLERSSGPSKNSKKRARPGESSRPRPRDRQLIQDRIKEL 580

Query: 2178 RELVPNGSKCSIDSLLERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQG 1999
            REL+P G+KCSIDSLLERTIKHM+F+QS+TKHA+KLNKC  +KL  ++    G+S  E+G
Sbjct: 581  RELIPTGAKCSIDSLLERTIKHMLFLQSVTKHADKLNKCADAKLCPKEASMLGSSNYERG 640

Query: 1998 SSWAVEVGTNSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTES 1819
            SSWAVEVG N KVC I+VEN+N NGQM+VEL+CEECSHFLEIAEAIRS GLTILKGVTE+
Sbjct: 641  SSWAVEVGGNLKVCSIIVENLNKNGQMVVELMCEECSHFLEIAEAIRSSGLTILKGVTEA 700

Query: 1818 CGEKTWMRFVVEGQNNRSLHRMDVLWSLMQLLQPK 1714
             G+KTW+ FVVEGQNNRS+HRMD+LWSL+Q+LQPK
Sbjct: 701  RGDKTWICFVVEGQNNRSIHRMDILWSLVQILQPK 735


>gb|EOX94494.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 2
            [Theobroma cacao]
          Length = 709

 Score =  543 bits (1400), Expect = e-151
 Identities = 341/729 (46%), Positives = 435/729 (59%), Gaps = 31/729 (4%)
 Frame = -2

Query: 3813 SRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSADNL 3646
            S LH+ LRSLCL+T WKYAVFWKLKHR RM+LTWEDAYYDN    D  E NC   + DNL
Sbjct: 8    SGLHQILRSLCLNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPSENNCFHHTLDNL 67

Query: 3645 HDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGW 3466
              G+ SHDPLGLAVAKMSYHVYSL           G+H WI ADKH+N    LFE  DGW
Sbjct: 68   QSGYCSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVNSSCSLFEFCDGW 127

Query: 3465 KTQFSAGIKTIAXXXXXXXXXVQLGSLDQIA-EDLKMVNHIRGVFFELQD--------PL 3313
            ++QF+AGI+TI          VQLGSL+++  ED+K+V+HIR VFF LQD        P+
Sbjct: 128  QSQFAAGIRTIVVVAVVQHGVVQLGSLNKVVFEDVKLVSHIRDVFFALQDSSVGHIASPI 187

Query: 3312 ATSMESSCLS-DVTTRTSGS-GVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNF 3139
              SM+SS    D+ T+   S G+       LD+ +D+ G D     +S   K     +  
Sbjct: 188  ECSMKSSLFQLDLPTKLLDSDGIP------LDKTVDEQGPDALLPEFSHPRKYSDRLFVL 241

Query: 3138 SIPGSYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGG--IGNTR 2965
             +  ++    VE+ N +EG    +      ++  L   RS+    +     G   I N  
Sbjct: 242  PLSNNHPKGAVEVENKHEGLE--LSSARNDESAKLLTPRSNVSNLEHQNQLGRILINNGV 299

Query: 2964 CEGQSSGFK--SLV--ECLENNNVSLS-----DKVCCKNSVCDTAHPASNSAMLIAGLPS 2812
             +G++SG+K  SLV      NN V        D     ++  ++AH  +  +  ++  P+
Sbjct: 300  WKGENSGWKNSSLVPENVYANNPVGGRERYGVDHAYFSSNFLNSAHSDTVKSSSLSSYPN 359

Query: 2811 EVQDSTACNDERDSSCVPELPDMHVCKDTTNVLHMPFRFCAGYELYEALGPAFRKQNTDC 2632
            EV D    +D +    + +L + +      + ++   +F  G ELYEALGPAF +++   
Sbjct: 360  EVLDIPESSDMKFQKDLKKLGNQNEISH-LDPMNTSLKFSVGCELYEALGPAFIRKSIYA 418

Query: 2631 GWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI--SSEVDKTDKSFVXXXX 2458
             W AE  E    +EM E M  SS LT  SG+E+LLEAVVAN+  S    K ++S      
Sbjct: 419  DWQAENMEAGGNIEMPEGM-SSSQLTFESGSENLLEAVVANVCHSGSDIKAERS------ 471

Query: 2457 XXXXXXXXEPCTSDVGSISSAGYSFDRETLNSLNSSGAYGVRYSKGLSS---NSCSRGSE 2287
                            S  SA            +S    G   SKG SS   ++CS   E
Sbjct: 472  ----------------SCRSAPSLLTTGNTPEPSSQKLCGAMSSKGFSSTCPSNCSEQFE 515

Query: 2286 KPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMI 2107
            +  E  K +KKRARPGE+ RPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLERTIKHM+
Sbjct: 516  RSSEPAKNNKKRARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMV 575

Query: 2106 FMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVVENINMN 1927
            F+Q ITKHA+KL+KC  SK+  +     G+S  EQGSSWAVEVG++ KVC IVVEN N N
Sbjct: 576  FLQGITKHADKLSKCAESKIHHKGAGMLGSSNYEQGSSWAVEVGSHLKVCSIVVENTNKN 635

Query: 1926 GQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRSLHRMDV 1747
            GQ+LVE+LCEECSHFLEIAEAIRS+GLTILKGVTE+ GEKTW+ FVVEGQNNR +HRMD+
Sbjct: 636  GQILVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGEKTWICFVVEGQNNRVMHRMDI 695

Query: 1746 LWSLMQLLQ 1720
            LWSL+Q+LQ
Sbjct: 696  LWSLVQILQ 704


>ref|XP_006443866.1| hypothetical protein CICLE_v10018993mg [Citrus clementina]
            gi|557546128|gb|ESR57106.1| hypothetical protein
            CICLE_v10018993mg [Citrus clementina]
          Length = 748

 Score =  525 bits (1353), Expect = e-146
 Identities = 341/746 (45%), Positives = 437/746 (58%), Gaps = 48/746 (6%)
 Frame = -2

Query: 3807 LHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSADNLHD 3640
            LH  L+SLC +T WKYAVFWKLKHRTRM+LTWED YYDN    D+ E  CSS S +N H 
Sbjct: 10   LHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHG 69

Query: 3639 GHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGWKT 3460
            G  SHDPLGLAVAKMSYHVYSL           G+H WI +D+ + +    FE  DGW++
Sbjct: 70   GRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQS 129

Query: 3459 QFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLATSMESSCLSD 3280
            QFSAGI+TIA         VQLGSLD++ ED+K+V HIR VF  L D     + S+  S 
Sbjct: 130  QFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSS 189

Query: 3279 VTTRTSGSGVSRDCI----QNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPGSYQNQ 3112
            V    S   +    I     NLD  +++ G D    ++  +EK +   Y FS     Q +
Sbjct: 190  VKNTLSLPDLPTKSIPNRWHNLDEVVNRGGPDVQFPMFPYVEKHNDGSYAFS---GMQPK 246

Query: 3111 IVE-MINTYEGAA-NLVRGCGIGDNFHLPISRSDAEQQQK---HEPSGGIGNTRCEGQSS 2947
            I + ++N  EG   +   G G     H   +  + + Q +   H  S G+       +SS
Sbjct: 247  IGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRV----ESS 302

Query: 2946 GFKSLVECLENNNVSLSDK-------VC-----CKNSVCDTAHPASNSAMLIAGLPSEVQ 2803
            G+K L    E N    S         +C      +  V D  + ASN    + G   +++
Sbjct: 303  GWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLE 362

Query: 2802 DSTACNDERDSSCVPELPDMHVCKDT------TNVLH-----MPFRFCAGYELYEALGPA 2656
             + +C  +     +PE+ D+   KD       T + H     M  +F A  EL+EALGPA
Sbjct: 363  CTDSC--QNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPA 420

Query: 2655 FRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI-SSEVDKTDK 2479
            F +++       E T     V M E +  SS L  +SG+E+LL+AVVA++ +S  D   +
Sbjct: 421  FLRKDIYNDREPENTVDGETVGMPE-LTSSSHLMFDSGSENLLDAVVASVCNSGSDVKSE 479

Query: 2478 SFVXXXXXXXXXXXXEPCTSDVG--SISSAGYSFDRETL------NSLNSSGAYGVRYSK 2323
              V            +P +S     + +S  YS  + +L      + LNSS   G   SK
Sbjct: 480  RTVCRSMQSLLTTEKKPESSSQSKNTNNSVSYSISQSSLVEEDAKHFLNSSEVCGAVSSK 539

Query: 2322 GLSS---NSCSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSK 2152
            G SS   ++CS   +   E  K +KKRAR GE+ RPRPRDRQLIQDRIKELRELVPNGSK
Sbjct: 540  GFSSTCPSTCSEQLDMSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSK 599

Query: 2151 CSIDSLLERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGT 1972
            CSIDSLLERTIKHM+F+QSITKHA+KL+KC  SK+  Q   G   S  EQGSSWAVE+G+
Sbjct: 600  CSIDSLLERTIKHMLFLQSITKHADKLSKCAESKM-HQKGNGIHGSNYEQGSSWAVEMGS 658

Query: 1971 NSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRF 1792
            + KVC IVVEN+N NGQMLVE+LCEECSHFLEIAEAIRS+GLTILKGVTE+ G+KTW+ F
Sbjct: 659  HLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICF 718

Query: 1791 VVEGQNNRSLHRMDVLWSLMQLLQPK 1714
            VVEGQ+NR +HRMDVLWSL+QLLQ K
Sbjct: 719  VVEGQDNRIMHRMDVLWSLVQLLQSK 744


>ref|XP_006383698.1| basic helix-loop-helix family protein [Populus trichocarpa]
            gi|550339661|gb|ERP61495.1| basic helix-loop-helix family
            protein [Populus trichocarpa]
          Length = 694

 Score =  525 bits (1353), Expect = e-146
 Identities = 327/740 (44%), Positives = 418/740 (56%), Gaps = 38/740 (5%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSAD 3652
            MG+ LH  LRSLC +T W YAVFWKLKHR RM+LTWED YYDN    D  E  C   + +
Sbjct: 3    MGTDLHDTLRSLCFNTDWNYAVFWKLKHRARMVLTWEDGYYDNCEQHDALENKCFRQTQE 62

Query: 3651 NLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHD 3472
            NL  GH   DPLGLAVAKMSYHVYSL           G+H WI ADKH+ +    +E  D
Sbjct: 63   NLRGGHYPRDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVTNSFSSYEFSD 122

Query: 3471 GWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLATSMESS 3292
            GW++QFSAGI+TI          VQLGSL++++ED+ +V HI+ VFF LQD        S
Sbjct: 123  GWQSQFSAGIRTIVVVAVVPYGVVQLGSLNKVSEDVNLVTHIKDVFFALQD--------S 174

Query: 3291 CLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPGSYQNQ 3112
             +S VT+ +     +  C++      +K  V                     IP    ++
Sbjct: 175  TVSHVTSPSQHGMKNALCLKTAAELKNKQEV-------------------LEIPTPTNDE 215

Query: 3111 IVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEGQSSGFKSL 2932
             ++++N    A+ L     +G N                     I +    G++S +K L
Sbjct: 216  SIDLLNLKSNASYLDHRSQLGMNI--------------------ISDRMFGGETSVWKDL 255

Query: 2931 VECLENNNVSLSDKVCCKN-SVCDTAHPASNSAMLIAGLPSEVQDSTACNDERDSSC--- 2764
                E+N    S+    +N S+ D   P       +AG P+++ DST C+ ++  S    
Sbjct: 256  GRGSEHNTTMHSNSFMRENVSLSDLVLPNEKLGADLAGFPADLFDSTICDRDKSDSINLR 315

Query: 2763 ------VPELPDMHVCKDTTNVLHMP------------FRFCAGYELYEALGPAFRKQNT 2638
                   PE  D+   +D    L  P            F+F AG EL EALGP+F  +  
Sbjct: 316  PNVVLNAPESSDITFKRDLEKKLDHPAESTHFNSSDTFFKFSAGCELLEALGPSFLNRCM 375

Query: 2637 DCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANI---SSEVDKTDKSFVX 2467
               +    +E     EM E M  SS +T + G+E+LLEAVV N+    S+V         
Sbjct: 376  PFDYQTGKSEAGNIFEMPEGM-SSSQMTFDFGSENLLEAVVGNVCHSGSDVKSEKSGCKS 434

Query: 2466 XXXXXXXXXXXEPCTSDVGSISSAGYSFDRETL------NSLNSSGAYGVRYSKGLSSNS 2305
                       EP       ++SAGYS ++ ++      N  NS+   G   SKG SS  
Sbjct: 435  VQSLVTAEKLPEPSIQTKHIMNSAGYSINQSSVVEEDVHNLSNSTEVCGGMSSKGFSSTC 494

Query: 2304 CSRGSE---KPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSL 2134
             S  SE   K  ES K  KKRA+PGE+CRPRPRDRQLIQDRIKELRELVPNGSKCSIDSL
Sbjct: 495  PSTYSEQLDKRSESAKNSKKRAKPGENCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSL 554

Query: 2133 LERTIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCP 1954
            LERTIKHM+F+++ITKHA+KLNKC   K+ +   +G  AS  EQGSSWAVEVG + KV  
Sbjct: 555  LERTIKHMLFLENITKHADKLNKCAEPKMHQ---KGTEASNYEQGSSWAVEVGGHLKVSS 611

Query: 1953 IVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQN 1774
            I+VEN+N NGQMLVE+LCEECSHFLEIAEAIRS+GLTILKG+TE  GEKTW+ FVVEGQN
Sbjct: 612  IIVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGITEVQGEKTWICFVVEGQN 671

Query: 1773 NRSLHRMDVLWSLMQLLQPK 1714
            N+ +HRMD+LWSL+Q+LQPK
Sbjct: 672  NKIMHRMDILWSLVQILQPK 691


>ref|XP_006351644.1| PREDICTED: transcription factor bHLH155-like isoform X2 [Solanum
            tuberosum]
          Length = 605

 Score =  516 bits (1328), Expect = e-143
 Identities = 300/615 (48%), Positives = 378/615 (61%), Gaps = 32/615 (5%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDNDN-PEKNCSSSSADNLH 3643
            M S+L +ALRSLC +T WKYAVFWKL HR RMMLTWEDAYYDND  P K    S+A NL+
Sbjct: 1    MASQLQQALRSLCCNTPWKYAVFWKLTHRARMMLTWEDAYYDNDGFPGKKSPGSTAGNLY 60

Query: 3642 DGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGWK 3463
            DGH S++ LG+AVAKMSYHVYSL           G+HLW+SADK      L  E  DGW+
Sbjct: 61   DGHYSNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHLWLSADKVAAITSLAPEHCDGWQ 120

Query: 3462 TQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLAT-------- 3307
             QFSAGIKTI          +QLGSLD I EDL+ + HIR VF ELQ+ +A+        
Sbjct: 121  AQFSAGIKTIVVAAVAPHGVIQLGSLDSIPEDLRAIKHIRDVFSELQELMASCLRSSMQY 180

Query: 3306 SMESSCLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPG 3127
            SME+SCLS+++TRTSGS V +DC+ NL R + + G + WS +Y+S+EK   +   FS PG
Sbjct: 181  SMENSCLSEISTRTSGSEVFQDCVNNLGRSVCEDGRNMWSPLYTSVEKSVDHSCIFSQPG 240

Query: 3126 SYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEGQSS 2947
             + N+I+E ++        V+G    +N       S   + Q+        + + EGQ+S
Sbjct: 241  GFPNKILEAVHNQGLHRTSVQGSDDSENLLPASCESSIIKHQEEGQMWEETDPKFEGQTS 300

Query: 2946 GFKSL-------VECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDSTAC 2788
              + L        E    ++ S+         V +   P  N+             S A 
Sbjct: 301  NLRVLGKGSVDKCEPTFRSDASIGSVSYDAGQVTECPQPNRNNLA-----------SEAD 349

Query: 2787 NDERDSSCVPELPDMHV--CKDTT--------NVLHMPFRFCAGYELYEALGPAFRKQNT 2638
            ND      + +LP+ +   C +T         + +H PFRFCAGYELYEALGP F+K N+
Sbjct: 350  NDRNRKLGLSDLPNAYADKCAETNLGFETQCNDTMHTPFRFCAGYELYEALGPVFQKGNS 409

Query: 2637 DCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANISSEVDKTD--KSFVXX 2464
               W A   E +MAV+M E +  SSL+ +N+G EHLLEAV+AN++   +     KSF   
Sbjct: 410  SKDWEAGKRE-EMAVDMLEGIGTSSLVMSNTGNEHLLEAVIANVNRYDNDCSSVKSFCKS 468

Query: 2463 XXXXXXXXXXE-PCTSDVGSISSAGYSFDRETLNSLNSSGAYGVRYSKGLSSNSCSRGS- 2290
                        PC+SD+G+ISS GYSFDRETLNS NSSG   +R S+GLSS SCSRGS 
Sbjct: 469  VDSLLTTEITAEPCSSDIGAISSIGYSFDRETLNSFNSSGTCSIRSSRGLSSTSCSRGSG 528

Query: 2289 --EKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIK 2116
              E+P E  K+HKKRARPGESCRPRPRDRQLIQDRIKELR+LVPNGSKCSIDSLLERTIK
Sbjct: 529  HVERPLEPVKMHKKRARPGESCRPRPRDRQLIQDRIKELRDLVPNGSKCSIDSLLERTIK 588

Query: 2115 HMIFMQSITKHAEKL 2071
            HM+FMQS+TKHA+KL
Sbjct: 589  HMLFMQSVTKHADKL 603


>ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citrus clementina]
            gi|568851769|ref|XP_006479559.1| PREDICTED: transcription
            factor EMB1444-like [Citrus sinensis]
            gi|557546129|gb|ESR57107.1| hypothetical protein
            CICLE_v10018993mg [Citrus clementina]
          Length = 714

 Score =  512 bits (1318), Expect = e-142
 Identities = 334/737 (45%), Positives = 427/737 (57%), Gaps = 39/737 (5%)
 Frame = -2

Query: 3807 LHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSADNLHD 3640
            LH  L+SLC +T WKYAVFWKLKHRTRM+LTWED YYDN    D+ E  CSS S +N H 
Sbjct: 10   LHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCSSESLENFHG 69

Query: 3639 GHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGWKT 3460
            G  SHDPLGLAVAKMSYHVYSL           G+H WI +D+ + +    FE  DGW++
Sbjct: 70   GRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSSFEFSDGWQS 129

Query: 3459 QFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLATSMESSCLSD 3280
            QFSAGI+TIA         VQLGSLD++ ED+K+V HIR VF  L D     + S+  S 
Sbjct: 130  QFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVGHVSSTIQSS 189

Query: 3279 VTTRTSGSGVSRDCI----QNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPGSYQNQ 3112
            V    S   +    I     NLD  +++ G D    ++  +EK +   Y FS     Q +
Sbjct: 190  VKNTLSLPDLPTKSIPNRWHNLDEVVNRGGPDVQFPMFPYVEKHNDGSYAFS---GMQPK 246

Query: 3111 IVE-MINTYEG-AANLVRGCGIGDNFHLPISRSDAEQQQK---HEPSGGIGNTRCEGQSS 2947
            I + ++N  EG   +   G G     H   +  + + Q +   H  S G+       +SS
Sbjct: 247  IGDGVVNRNEGILLSSAGGVGSAKILHPKSNVINLDYQNQMGIHFISDGMSRV----ESS 302

Query: 2946 GFKSLVECLEN-------NNVSLSDKVC-----CKNSVCDTAHPASNSAMLIAGLPSEVQ 2803
            G+K L    E        N+V  S  +C      +  V D  + ASN    + G   +++
Sbjct: 303  GWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNPLEAVLGEQVKLE 362

Query: 2802 DSTACNDERDSSCVPELPDMHVCKD------TTNVLH-----MPFRFCAGYELYEALGPA 2656
             + +C  +     +PE+ D+   KD       T + H     M  +F A  EL+EALGPA
Sbjct: 363  CTDSC--QNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAVSELHEALGPA 420

Query: 2655 FRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANISSEVDKTDKS 2476
            F +++       E T     V M  E+  SS L  +SG+E+LL+AVVA++ +        
Sbjct: 421  FLRKDIYNDREPENTVDGETVGM-PELTSSSHLMFDSGSENLLDAVVASVCNS------- 472

Query: 2475 FVXXXXXXXXXXXXEPCTSDVGSISSAGYSFDRETLNSLNSSGAYGVRYSKGLSS---NS 2305
                              SDV S  +   S  +  L +     +     SKG SS   ++
Sbjct: 473  -----------------GSDVKSERTVCRSM-QSLLTTEKKPESSSQMSSKGFSSTCPST 514

Query: 2304 CSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLER 2125
            CS   +   E  K +KKRAR GE+ RPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLER
Sbjct: 515  CSEQLDMSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLER 574

Query: 2124 TIKHMIFMQSITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVV 1945
            TIKHM+F+QSITKHA+KL+KC  SK + Q   G   S  EQGSSWAVE+G++ KVC IVV
Sbjct: 575  TIKHMLFLQSITKHADKLSKCAESK-MHQKGNGIHGSNYEQGSSWAVEMGSHLKVCSIVV 633

Query: 1944 ENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRS 1765
            EN+N NGQMLVE+LCEECSHFLEIAEAIRS+GLTILKGVTE+ G+KTW+ FVVEGQ+NR 
Sbjct: 634  ENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGVTEAHGDKTWICFVVEGQDNRI 693

Query: 1764 LHRMDVLWSLMQLLQPK 1714
            +HRMDVLWSL+QLLQ K
Sbjct: 694  MHRMDVLWSLVQLLQSK 710


>ref|XP_004161538.1| PREDICTED: transcription factor EMB1444-like [Cucumis sativus]
          Length = 691

 Score =  501 bits (1291), Expect = e-139
 Identities = 315/725 (43%), Positives = 408/725 (56%), Gaps = 29/725 (4%)
 Frame = -2

Query: 3807 LHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDNDN----PEKNCSSSSADNLHD 3640
            LH+ L+S C ++ WKYAVFWKLKHR RM+LTWED YYDN      PE      + +  +D
Sbjct: 6    LHQILKSFCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHEPPEGKFFRKTLETFYD 65

Query: 3639 GHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHDGWKT 3460
            GH SHD LGLAVAKMSYHVYSL           G+H WI+AD+ + +     E  DGW+T
Sbjct: 66   GHYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTIEYCDGWQT 125

Query: 3459 QFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQD-------PLATSM 3301
            QFSAGIKTI          +QLGSLD++ ED+ +V  IR VF  LQ+       P+ +  
Sbjct: 126  QFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVTRIRNVFLTLQESSAGEIKPMHSCK 185

Query: 3300 ESSCLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPGSY 3121
             S  ++D+ +R+  +        + +  ++ SG + + S+ +   KPD            
Sbjct: 186  SSGYMADIPSRSLATEKGEVASVSKNVGLELSGSEAFESLTT---KPDG----------- 231

Query: 3120 QNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEGQSSGF 2941
                   IN               +NF   +   D ++    EPSG      C+ ++ G 
Sbjct: 232  -------INV--------------ENFKSQVRLLD-DRMCGGEPSG------CKDKAVGL 263

Query: 2940 KSLVECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDSTACNDE--RDSS 2767
            K  +  +++ N ++     C N +       +++   +   PS   D    N    R + 
Sbjct: 264  KQKIN-VQSQNSTMDMVNICGNLLPAEKIMTNDAYFSMNPHPSSAYDGVNHNGMFIRTNH 322

Query: 2766 CVPELPDMHVCKDTTNVL--HMPFRFCAGYELYEALGPAFRKQNTDCGWGAETTETDMAV 2593
                L +     +T  +   +   +F AGYEL+E LGPAF K      W  E      A 
Sbjct: 323  TEMYLQNDMEASETIEMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVLGGKAF 382

Query: 2592 EMSEEMPDSSLLTANSGTEHLLEAVVANI---SSEVDKTDKSFVXXXXXXXXXXXXEPCT 2422
            E+SE M  S L T++S TE LLEAVVA++    S+V                    EP T
Sbjct: 383  ELSEGMSGSQL-TSDSPTERLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTERIPEPST 441

Query: 2421 SDVGSISSAGYSFDR--------ETLNSLNSSGAYGVRYSKGLSSNSCSRGSE---KPQE 2275
            +   S  S GYS  +        +  NSL+SSG  GV   KG SS     GSE   K  E
Sbjct: 442  NVTTSACSEGYSMGQSQTSFTGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLDKSSE 501

Query: 2274 SNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMIFMQS 2095
              K  K+RARPGES RPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLERTIKHM+F+Q 
Sbjct: 502  PAKNSKRRARPGESSRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQG 561

Query: 2094 ITKHAEKLNKCTGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSKVCPIVVENINMNGQML 1915
            ITKHA+KL KC   KL ++     G S  +QGSSWAVEVG   KVC I+VEN+N NGQ+L
Sbjct: 562  ITKHADKLTKCANMKLHQKGSGMLGTSDTDQGSSWAVEVGGQLKVCSIIVENLNKNGQIL 621

Query: 1914 VELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVVEGQNNRSLHRMDVLWSL 1735
            VE+LCEECSHFLEIAEAIRS+GLTILKG+TE+ GEKTW+ FVVEG+NNR++HRMD+LWSL
Sbjct: 622  VEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSL 681

Query: 1734 MQLLQ 1720
            +Q+LQ
Sbjct: 682  VQILQ 686


>gb|ESW16913.1| hypothetical protein PHAVU_007G194600g [Phaseolus vulgaris]
          Length = 741

 Score =  498 bits (1282), Expect = e-137
 Identities = 320/747 (42%), Positives = 413/747 (55%), Gaps = 45/747 (6%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSAD 3652
            MGS LHR LRS CL T WKYA+FWKLKHR RM+LTWEDAYYDN    D+ E     ++ +
Sbjct: 1    MGSNLHRLLRSFCLGTDWKYAIFWKLKHRARMILTWEDAYYDNPNICDSSENKSCQNTWE 60

Query: 3651 NLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHD 3472
             +     SHDPLGLAVAKMSYHVYSL           G+H WI  D  +      FE  D
Sbjct: 61   RIGSADFSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHRWICVDNQVTSSVPSFEFAD 120

Query: 3471 GWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLATSMESS 3292
            GW++QFSAGI+TI          VQLGSL+++AED+ ++  IR +F   QD       + 
Sbjct: 121  GWQSQFSAGIRTIVVIAVVPLGVVQLGSLNKVAEDMGVITCIRSLFLSNQDYTICHAPNQ 180

Query: 3291 CLSDVTTRTSGSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPYNFSIPGSYQNQ 3112
              + +  + S S +  +  +++   +  +       +  +I  P   P N   P +   +
Sbjct: 181  LQNSL--KNSSSVMDSETSESVPAYLQTTEKTMKHELLDNI-MPFQCPGNNDSPHAVYEK 237

Query: 3111 IVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEGQSSGFKSL 2932
                +  +EG      G  I       +S     +QQK      +   + EG SSG +  
Sbjct: 238  TTVDVAKHEGPELNSDGSSI---LLQSMSNMMNVEQQKLLGMRPVNERKFEGNSSGREDT 294

Query: 2931 -VECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDSTACNDER------- 2776
             VE  +  +  L + V   N V D   P+ N+ +      S+  D+  C  E+       
Sbjct: 295  SVESGKKLSSFLHNLVTDNNGVNDLVCPSENAGVNSVSFSSDFLDTVVCESEKFHYVDIN 354

Query: 2775 ----------------DSSCVPELPDMHVCKDTTNVLHMPFRFCAGYELYEALGPAFRKQ 2644
                              + + +       KDTT  L  P    AGYEL+EALGP+F K+
Sbjct: 355  QKGVKNWSRPLDAYSQKDTGMSKFQTEPCSKDTTYTLKFP----AGYELHEALGPSFLKE 410

Query: 2643 NTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANISSEVDKTDKSFVXX 2464
            +    W  +  +   A E+S+E+   S LT+    EHLLEA+VANI    +   K  V  
Sbjct: 411  SKYFDWAVKANQDVKATEISDEI-SCSQLTSEPQREHLLEAMVANIGHNNNVNSKLSVSA 469

Query: 2463 XXXXXXXXXXEPCTS--DVGSISSAGYSFDRETLN------SLNSSGAYGVRYSKGLSS- 2311
                       P  S   V +I+S   S D+  L       SL+SSG  G+   KG SS 
Sbjct: 470  TMQAAIASGGNPEGSIHTVHTINSESCSIDQPHLGREEKHYSLSSSGICGIMSPKGFSST 529

Query: 2310 --NSCSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSIDS 2137
              +SCS   E+  E  K  KKRARPGESCRPRPRDRQLIQDRIKELRELVPNG+KCSIDS
Sbjct: 530  CPSSCSEQFERSSEPTKNSKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDS 589

Query: 2136 LLERTIKHMIFMQSITKHAEKLNKC--TGSKLLEQDPRGRGASVAEQGSSWAVEVGTNSK 1963
            LLE TIKHM+F++++TKHA+KLNK   T +KL        G+S  +QGSSWA+EVG + K
Sbjct: 590  LLECTIKHMLFLKNVTKHADKLNKFGDTKTKLHHIQKDINGSSSYQQGSSWAMEVGGHLK 649

Query: 1962 VCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMRFVV- 1786
            VC I+VEN+N NGQMLVE+LCEECSHFLEIAEAIRSMGLTIL G TE+ GEKT + FVV 
Sbjct: 650  VCSILVENLNKNGQMLVEMLCEECSHFLEIAEAIRSMGLTILNGATEAHGEKTCICFVVE 709

Query: 1785 ---EGQNNRSLHRMDVLWSLMQLLQPK 1714
               EGQNNR+LHR+D+LW L+QLLQ K
Sbjct: 710  GRSEGQNNRNLHRLDILWPLVQLLQSK 736


>ref|XP_006588678.1| PREDICTED: transcription factor EMB1444-like [Glycine max]
          Length = 733

 Score =  494 bits (1272), Expect = e-136
 Identities = 319/751 (42%), Positives = 416/751 (55%), Gaps = 49/751 (6%)
 Frame = -2

Query: 3819 MGSRLHRALRSLCLDTGWKYAVFWKLKHRTRMMLTWEDAYYDN----DNPEKNCSSSSAD 3652
            MGS LHR LRS CL T WKYA+FWKLK R RM+LTWEDAYYDN    ++ E     +S +
Sbjct: 1    MGSNLHRLLRSFCLGTDWKYAIFWKLKQRARMILTWEDAYYDNPSICESSENKSCHNSLE 60

Query: 3651 NLHDGHSSHDPLGLAVAKMSYHVYSLXXXXXXXXXXXGRHLWISADKHMNDPGLLFECHD 3472
             +     SHDPLGLAVAKMSYHVYSL           G+H WI  D H+   G  FE  D
Sbjct: 61   QIGSADFSHDPLGLAVAKMSYHVYSLGEGIIGQVAVTGKHRWICVDNHVTSSGPSFEFAD 120

Query: 3471 GWKTQFSAGIKTIAXXXXXXXXXVQLGSLDQIAEDLKMVNHIRGVFFELQDPLATSMESS 3292
            GW++QFSAGI+TI          VQLGSL+++ ED+ +V+ IR +F   QD   + + + 
Sbjct: 121  GWQSQFSAGIRTIVVVAVVALGVVQLGSLNKVTEDMGVVSCIRSLFLSTQDYTISHVHNQ 180

Query: 3291 CLSDVTTRTS----GSGVSRDCIQNLDRRIDKSGVDFWSSIYSSIEKPDHYPY-NFSIPG 3127
              + V   +S     +  S   + + ++ +    +D        I  P   P  N+S   
Sbjct: 181  VQNSVKNSSSVLDTKTSKSMPALHDTEKTMKHEALD--------ILMPFQCPRKNYSPHA 232

Query: 3126 SYQNQIVEMINTYEGAANLVRGCGIGDNFHLPISRSDAEQQQKHEPSGGIGNTRCEGQSS 2947
             +Q  +V++        N  R   +  +    +S     +QQK      +  ++ EG S 
Sbjct: 233  VHQKMVVDVAKHDFPELNSDRSSILLQS----MSNMMNVEQQKLVGMRPVNESKFEGNSG 288

Query: 2946 GFKSLVECLENNNVSLSDKVCCKNSVCDTAHPASNSAMLIAGLPSEVQDSTACN------ 2785
                 +E  +N +  L + V   N V D A P+ N  +      S   D+  C       
Sbjct: 289  CEDKSLESGKNVSSFLHNLVMDNNGVNDLACPSENVGVDPVSFSSGFLDAAVCVSDKFQY 348

Query: 2784 ---DERDSSCVPELPDMHV--------------CKDTTNVLHMPFRFCAGYELYEALGPA 2656
               +E+    VP   D +                KDT+  +  P    AGYEL+EALGP+
Sbjct: 349  VDINEKGVLNVPRPSDANFQIKSEKSKFQTEPCYKDTSYTMKFP----AGYELHEALGPS 404

Query: 2655 FRKQNTDCGWGAETTETDMAVEMSEEMPDSSLLTANSGTEHLLEAVVANISSEVDKTDKS 2476
            F K +    W AE  +     EMS+E+   S LT+    EHLLEA+VANIS   +  +  
Sbjct: 405  FLKGSKCFNWAAEANQDVKNAEMSDEI-SCSQLTSEFRPEHLLEAMVANISHSNNNVNSE 463

Query: 2475 FVXXXXXXXXXXXXEPCTSDVGSISSAGYSFDR------ETLNSLNSSGAYGVRYSKGLS 2314
                                V +I+S G S D+      +   SL+SSG  GV   KG S
Sbjct: 464  LSFSTSMQAAIASGRNPEGSVHTINSEGCSIDQLPFVKEDKHYSLSSSGICGVMSPKGFS 523

Query: 2313 S---NSCSRGSEKPQESNKVHKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGSKCSI 2143
            S   +SCS   E+  E  K  KKRARPGESCRPRPRDRQLIQDRIKELRELVPNG+KCSI
Sbjct: 524  STCPSSCSEQFERSSEPTKNSKKRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSI 583

Query: 2142 DSLLERTIKHMIFMQSITKHAEKLNKCTGSKL----LEQDPRGRGASVAEQGSSWAVEVG 1975
            DSLLE TIKHM+F+Q+ITKHA+KLNK   +K     +E+D  G      +QGSSWA+EVG
Sbjct: 584  DSLLECTIKHMLFLQNITKHADKLNKFADTKTKLHHMEKDIPG------QQGSSWAMEVG 637

Query: 1974 TNSKVCPIVVENINMNGQMLVELLCEECSHFLEIAEAIRSMGLTILKGVTESCGEKTWMR 1795
             + KV  I+VEN+N NGQM VE++CEECSHFLEIA+AIRS+G+TIL G TE+ GEKT++ 
Sbjct: 638  GHLKVSSILVENLNQNGQMFVEMVCEECSHFLEIADAIRSLGMTILNGATEAHGEKTFVC 697

Query: 1794 FVV----EGQNNRSLHRMDVLWSLMQLLQPK 1714
            FVV    EGQNNR+LHR+D+LWSL+QLLQ K
Sbjct: 698  FVVEAGSEGQNNRNLHRLDILWSLVQLLQSK 728

