BLASTX nr result

ID: Akebia27_contig00018153 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00018153
         (1978 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN61992.1| hypothetical protein VITISV_030445 [Vitis vinifera]   369   4e-99
ref|XP_002281118.2| PREDICTED: transcription factor bHLH110-like...   363   2e-97
ref|XP_007017613.1| Basic helix-loop-helix DNA-binding superfami...   297   1e-77
ref|XP_006383714.1| hypothetical protein POPTR_0005s25240g [Popu...   289   3e-75
ref|XP_002307676.2| hypothetical protein POPTR_0005s25240g [Popu...   289   4e-75
ref|XP_002510430.1| transcription factor, putative [Ricinus comm...   271   8e-70
ref|XP_006386206.1| hypothetical protein POPTR_0002s03380g [Popu...   269   3e-69
ref|XP_002300753.2| hypothetical protein POPTR_0002s03380g [Popu...   260   1e-66
ref|XP_006473548.1| PREDICTED: transcription factor bHLH110-like...   258   7e-66
ref|XP_007017614.1| Basic helix-loop-helix DNA-binding superfami...   252   4e-64
ref|XP_006435050.1| hypothetical protein CICLE_v10001291mg [Citr...   249   4e-63
ref|XP_006383713.1| hypothetical protein POPTR_0005s25240g [Popu...   244   1e-61
ref|XP_007223238.1| hypothetical protein PRUPE_ppa005486mg [Prun...   227   2e-56
emb|CAN70945.1| hypothetical protein VITISV_002869 [Vitis vinifera]   223   3e-55
ref|XP_002890742.1| hypothetical protein ARALYDRAFT_472970 [Arab...   189   3e-45
ref|XP_004291848.1| PREDICTED: transcription factor bHLH110-like...   188   9e-45
ref|XP_006307469.1| hypothetical protein CARUB_v10009095mg [Caps...   187   2e-44
ref|NP_174087.1| transcription factor bHLH110 [Arabidopsis thali...   186   3e-44
ref|XP_006415743.1| hypothetical protein EUTSA_v10007594mg [Eutr...   185   6e-44
ref|XP_007046833.1| Basic helix-loop-helix DNA-binding superfami...   184   1e-43

>emb|CAN61992.1| hypothetical protein VITISV_030445 [Vitis vinifera]
          Length = 512

 Score =  369 bits (946), Expect = 4e-99
 Identities = 204/384 (53%), Positives = 253/384 (65%), Gaps = 12/384 (3%)
 Frame = +1

Query: 535  LITHLGILKSLNKTIMESANLHQQHQLQEQFDGSS---LATPSLYGVAXXXXXXXXXXXX 705
            L+  LG   +    IMESAN H QHQLQ+Q   SS    A PS Y  A            
Sbjct: 11   LLKALGSKAAFKNIIMESANRHHQHQLQDQLVVSSPLLAANPSCYAPAPSNHGWTPNIIL 70

Query: 706  XAGNFNLNINGVYSNSRDFKQNSDNLAPPLNTSMIQDSGFHWTCNTGSFNNQSAHELHLA 885
              G+FN N NG+  N RD +Q +D++  PLN+S++QD GFHW  N GSF +QSAH+LH  
Sbjct: 71   NTGSFNPNFNGILFNPRDSRQKNDSILHPLNSSVVQDLGFHWASNAGSFTSQSAHDLH-- 128

Query: 886  NKIKEELSDS--------NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQ 1041
              IKEELS+S        NSS  + E+ HLP TSY + +  DL+DLSEKL LK+FSSGCQ
Sbjct: 129  PXIKEELSESFPKFTEMINSSSSAVEDLHLPPTSYIRSK--DLNDLSEKLLLKSFSSGCQ 186

Query: 1042 LNGPQVSIGEMYSKPLS-SASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXGMNF 1218
            +NG Q+S GE  +   S +  FGG    S+G+FSQI P+  I               MN 
Sbjct: 187  INGLQLSAGEFXANAQSCNTGFGGVAIPSRGHFSQIFPTINISNLSQPSSTISSSLDMNL 246

Query: 1219 QALDLLNSTKFGGSFVQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGAT 1398
            QALDLL S +F G+F QP+HNN LGLFK+SLSFGL+H+Q+S+  P +S +KIS F NG  
Sbjct: 247  QALDLLTSARFSGTFSQPSHNN-LGLFKDSLSFGLDHLQZSTNRPSNSSSKISPFTNGVA 305

Query: 1399 ETKRSSSFSEPKASHTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTAS 1578
            E KR SSF EPKA+    KK+R+E+R+S  P KVRKEKLGDRIAAL QLV+PFGKTDTAS
Sbjct: 306  EVKRPSSFLEPKATQATPKKSRLESRASCPPIKVRKEKLGDRIAALQQLVAPFGKTDTAS 365

Query: 1579 VLLEAIGYIRFLQSQIEALSSPYL 1650
            VL+EAIGYI+FLQ+Q+E LS PY+
Sbjct: 366  VLMEAIGYIKFLQNQVETLSVPYM 389


>ref|XP_002281118.2| PREDICTED: transcription factor bHLH110-like [Vitis vinifera]
            gi|302142540|emb|CBI19743.3| unnamed protein product
            [Vitis vinifera]
          Length = 427

 Score =  363 bits (931), Expect = 2e-97
 Identities = 200/369 (54%), Positives = 247/369 (66%), Gaps = 12/369 (3%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQFDGSS---LATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYSN 750
            MESAN H QHQLQ+Q   SS    A PS Y  A              G+FN N NG+  N
Sbjct: 1    MESANRHHQHQLQDQLVVSSPLLAANPSCYAPAPSNHGWTPNIILNTGSFNPNFNGILFN 60

Query: 751  SRDFKQNSDNLAPPLNTSMIQDSGFHWTCNTGSFNNQSAHELHLANKIKEELSDS----- 915
             RD +Q +D++  PLN+S++QD GFHW  N GSF +QSAH+LH    IKEELS+S     
Sbjct: 61   PRDSRQKNDSILHPLNSSVVQDLGFHWASNAGSFTSQSAHDLHPT--IKEELSESFPKFT 118

Query: 916  ---NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNGPQVSIGEMYSKP 1086
               NSS  + E+ HLP TSY + +  DL+DLSEKL LK+FSSGCQ+NG Q+S GE  +  
Sbjct: 119  EMINSSSSAVEDLHLPPTSYIRSK--DLNDLSEKLLLKSFSSGCQINGLQLSAGEFCANA 176

Query: 1087 LS-SASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXGMNFQALDLLNSTKFGGSF 1263
             S +  FGG    S+G+FSQI P+  I               MN QALDLL S +F G+F
Sbjct: 177  QSCNTGFGGVAIPSRGHFSQIFPTINISNLSQPSSTISSSLDMNLQALDLLTSARFSGTF 236

Query: 1264 VQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGATETKRSSSFSEPKASH 1443
             QP+HNN LGLFK+SLSFGL+H+Q+S+  P +S +KIS F NG  E KR SSF EPKA+ 
Sbjct: 237  SQPSHNN-LGLFKDSLSFGLDHLQQSTNRPSNSSSKISPFTNGVAEVKRPSSFLEPKATQ 295

Query: 1444 TATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYIRFLQSQ 1623
               KK+R+E+R+S  P KVRKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGYI+FLQ+Q
Sbjct: 296  ATPKKSRLESRASCPPIKVRKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGYIKFLQNQ 355

Query: 1624 IEALSSPYL 1650
            +E LS PY+
Sbjct: 356  VETLSVPYM 364


>ref|XP_007017613.1| Basic helix-loop-helix DNA-binding superfamily protein, putative
            isoform 1 [Theobroma cacao] gi|508722941|gb|EOY14838.1|
            Basic helix-loop-helix DNA-binding superfamily protein,
            putative isoform 1 [Theobroma cacao]
          Length = 425

 Score =  297 bits (760), Expect = 1e-77
 Identities = 194/397 (48%), Positives = 240/397 (60%), Gaps = 17/397 (4%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQFDGSS-LATPSLYGVAXXXXXXXXXXXXXAGN-FNLNINGVYSNS 753
            MES N+H QHQLQ+Q  GSS L  PS YGVA             + + FN N NG   NS
Sbjct: 1    MESENVHHQHQLQDQLVGSSSLPIPSCYGVASTHSWTPTPSFALSSSEFNPNHNGDILNS 60

Query: 754  RDFKQNSDNLAPPLNTSMIQDSGFHWTCNTGSFN-NQSAHELHLANKIKEELSDS----- 915
            R   Q +D LA P N+SMIQD    WT N GSF  +QS ++LHLA KIKEELS+S     
Sbjct: 61   R---QKNDILASPQNSSMIQD----WTDNGGSFTTSQSCYDLHLA-KIKEELSESLTRFT 112

Query: 916  ----NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNGPQVSIGEMYSK 1083
                N+S   +     PS +Y K EQ DLHDLSEKL LKT SSG     P  S GE YS 
Sbjct: 113  DMLSNTSSVGESHQLPPSPNYLKNEQKDLHDLSEKLLLKTISSGF----PMFSAGEFYSA 168

Query: 1084 PLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQALDLLNSTKF--G 1254
              + +  GG    S+  FSQI PS  I                MN +ALDLL+S ++   
Sbjct: 169  TQNCSIPGGTALPSRRNFSQIYPSINISNLNQASSANIPSSFDMNLEALDLLSSARYCRS 228

Query: 1255 GSFVQPAHNNNLGLFKESLSFGLEH-MQESSTWPPSSPNKISSFMNGATETKRSSSFSEP 1431
             S   P+H++NLG++KES  FGL H MQ+S+     SP+K+S F +  +E KR S+  EP
Sbjct: 229  SSLSHPSHDHNLGIYKESPPFGLHHHMQQSNQRAAYSPSKLSPFTSELSEAKRPSTLPEP 288

Query: 1432 KASHTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYIRF 1611
            KA+  ATKK+R+E+R+S  PFKVRKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGYI+F
Sbjct: 289  KATAAATKKSRLESRASCPPFKVRKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGYIKF 348

Query: 1612 LQSQIEALSSPYLGRGSGNL-RQQQSVSNLYNPFPKP 1719
            LQ+Q+E LS PY+     N  R  Q  S + +   +P
Sbjct: 349  LQNQVETLSVPYMKSSRNNASRSNQGGSTMEDGNEEP 385


>ref|XP_006383714.1| hypothetical protein POPTR_0005s25240g [Populus trichocarpa]
            gi|550339707|gb|ERP61511.1| hypothetical protein
            POPTR_0005s25240g [Populus trichocarpa]
          Length = 430

 Score =  289 bits (740), Expect = 3e-75
 Identities = 197/392 (50%), Positives = 247/392 (63%), Gaps = 20/392 (5%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQFDGSS---LATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYSN 750
            MESANLH QHQLQ+QF GSS    ATPS Y  A             + N N + NGV  N
Sbjct: 1    MESANLHHQHQLQDQFVGSSSLTTATPSSYAEAGSARAWTQTITLNSDNSNPSYNGVIFN 60

Query: 751  SRDFKQNSDNLAPPLNTSMIQDSGFH-WTCNTGSFNNQSAHELHLANKIKEELSDSNS-S 924
             R   Q +++    LN++M QD GFH W  N G+F++ SA++L L+ KIKE LS S+S  
Sbjct: 61   QR---QKNESPISSLNSTMFQDLGFHYWNNNAGNFSSHSAYDLQLS-KIKEGLSSSDSFP 116

Query: 925  KFSD---------EEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNG-PQVSIGEM 1074
            KF++         E+ H+ S+SY K E  DL  LSEKL L+T SSG  +NG  Q S  ++
Sbjct: 117  KFTEMLNSPSSTIEDPHVSSSSYIKDELKDL-SLSEKLLLETISSGFPINGHDQFSPRQI 175

Query: 1075 YSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQALDLLNSTKF 1251
             S   + +SFG A   S+G FSQI PS  I                MN QALDLL ST+F
Sbjct: 176  SSSHHNCSSFGSA-IPSRGSFSQIYPSINISNLNQPSSPLISGSFDMNLQALDLLTSTRF 234

Query: 1252 GGSFVQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGATETKR-SSSFSE 1428
             GSF QPA  + L +FK+SLSFGL+ +Q+S+  P  SP+KISS  N  TE KR ++S  E
Sbjct: 235  SGSFPQPASLDPLDMFKDSLSFGLDSIQQSNQRPSCSPSKISS-TNEITEAKRPNNSMME 293

Query: 1429 PKASHTAT-KKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYI 1605
            PKA+  A  KK+R+E+RS   PFKVRKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGYI
Sbjct: 294  PKATQAAAPKKSRLESRSPCPPFKVRKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGYI 353

Query: 1606 RFLQSQIEALSSPYLGRGSGN--LRQQQSVSN 1695
            +FLQ+Q+E LS PY+ + S N   R  Q+ SN
Sbjct: 354  KFLQNQVETLSVPYM-KSSRNKTSRSIQAASN 384


>ref|XP_002307676.2| hypothetical protein POPTR_0005s25240g [Populus trichocarpa]
            gi|550339708|gb|EEE94672.2| hypothetical protein
            POPTR_0005s25240g [Populus trichocarpa]
          Length = 419

 Score =  289 bits (739), Expect = 4e-75
 Identities = 191/375 (50%), Positives = 239/375 (63%), Gaps = 18/375 (4%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQFDGSS---LATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYSN 750
            MESANLH QHQLQ+QF GSS    ATPS Y  A             + N N + NGV  N
Sbjct: 1    MESANLHHQHQLQDQFVGSSSLTTATPSSYAEAGSARAWTQTITLNSDNSNPSYNGVIFN 60

Query: 751  SRDFKQNSDNLAPPLNTSMIQDSGFH-WTCNTGSFNNQSAHELHLANKIKEELSDSNS-S 924
             R   Q +++    LN++M QD GFH W  N G+F++ SA++L L+ KIKE LS S+S  
Sbjct: 61   QR---QKNESPISSLNSTMFQDLGFHYWNNNAGNFSSHSAYDLQLS-KIKEGLSSSDSFP 116

Query: 925  KFSD---------EEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNG-PQVSIGEM 1074
            KF++         E+ H+ S+SY K E  DL  LSEKL L+T SSG  +NG  Q S  ++
Sbjct: 117  KFTEMLNSPSSTIEDPHVSSSSYIKDELKDL-SLSEKLLLETISSGFPINGHDQFSPRQI 175

Query: 1075 YSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQALDLLNSTKF 1251
             S   + +SFG A   S+G FSQI PS  I                MN QALDLL ST+F
Sbjct: 176  SSSHHNCSSFGSA-IPSRGSFSQIYPSINISNLNQPSSPLISGSFDMNLQALDLLTSTRF 234

Query: 1252 GGSFVQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGATETKR-SSSFSE 1428
             GSF QPA  + L +FK+SLSFGL+ +Q+S+  P  SP+KISS  N  TE KR ++S  E
Sbjct: 235  SGSFPQPASLDPLDMFKDSLSFGLDSIQQSNQRPSCSPSKISS-TNEITEAKRPNNSMME 293

Query: 1429 PKASHTAT-KKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYI 1605
            PKA+  A  KK+R+E+RS   PFKVRKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGYI
Sbjct: 294  PKATQAAAPKKSRLESRSPCPPFKVRKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGYI 353

Query: 1606 RFLQSQIEALSSPYL 1650
            +FLQ+Q+E LS PY+
Sbjct: 354  KFLQNQVETLSVPYM 368


>ref|XP_002510430.1| transcription factor, putative [Ricinus communis]
            gi|223551131|gb|EEF52617.1| transcription factor,
            putative [Ricinus communis]
          Length = 436

 Score =  271 bits (693), Expect = 8e-70
 Identities = 192/399 (48%), Positives = 238/399 (59%), Gaps = 28/399 (7%)
 Frame = +1

Query: 580  MESANLHQ--QHQLQEQF-DGSSLATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYSN 750
            MESANLH   QHQLQ Q    SSL+ PS YG                   NLN N V  N
Sbjct: 1    MESANLHHHHQHQLQGQLVRSSSLSAPSNYGAPSPHAWTQNITLSTG---NLNNNEVAIN 57

Query: 751  SRDFKQNSDNLAPPLNTSMIQDSGFHWTCNTGSF------NNQSAHELHLA-NKIKEE-- 903
             R  K  + +++ PLN  MIQD GFHW  N+ +       N+Q++H+  L   KIKEE  
Sbjct: 58   PRQ-KTGTTSISSPLNNPMIQDLGFHWNVNSNNAAAVSLTNHQTSHDHDLQLGKIKEEDE 116

Query: 904  LSDS-----------NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNG 1050
            LSDS           +++  +D++ HL STSY K EQ  + DLSEKL LKT SSG  +NG
Sbjct: 117  LSDSFTKFTEMINSTSAASNTDQDSHLSSTSYIKDEQKYMTDLSEKLLLKTISSGFPING 176

Query: 1051 -PQVSIGEMYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQA 1224
             PQ      +S  L  +SFG +P  S+G FSQI PS  I                MN QA
Sbjct: 177  HPQ------FSPSLICSSFG-SPIPSRGNFSQIYPSINISNLNRSTSPSISGSFDMNLQA 229

Query: 1225 LDLLNSTKFGGSFVQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNG-ATE 1401
            LDLL ST+FGGSF QP+H+N LG++K+++S+  + MQ     P  S +KISS      TE
Sbjct: 230  LDLLTSTRFGGSFGQPSHDN-LGIYKDNISYDFDRMQNHM--PSCSHSKISSITTKETTE 286

Query: 1402 TKR-SSSFSEPKAS-HTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTA 1575
             KR  SS  EPKA+   A KK+R+ETR+S  PFKVRKEKLGDRIAAL QLV+PFGKTDTA
Sbjct: 287  AKRPGSSLMEPKATLQAAPKKSRLETRASCPPFKVRKEKLGDRIAALQQLVAPFGKTDTA 346

Query: 1576 SVLLEAIGYIRFLQSQIEALSSPYLGRGSGNLRQQQSVS 1692
            SVL+EAIGYI+FLQ+Q+E LS PY+ + S N   + S S
Sbjct: 347  SVLMEAIGYIKFLQNQVETLSVPYM-KSSRNKSSRNSQS 384


>ref|XP_006386206.1| hypothetical protein POPTR_0002s03380g [Populus trichocarpa]
            gi|550344193|gb|ERP64003.1| hypothetical protein
            POPTR_0002s03380g [Populus trichocarpa]
          Length = 384

 Score =  269 bits (688), Expect = 3e-69
 Identities = 190/382 (49%), Positives = 229/382 (59%), Gaps = 19/382 (4%)
 Frame = +1

Query: 580  MESANLHQQH-QLQEQFDGSS---LATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYS 747
            MESANLH QH QLQ+QF GSS     TPS    A             +GN + N NGV  
Sbjct: 1    MESANLHHQHDQLQDQFVGSSSLTTTTPSSDAEAGSTHAWTQTITLNSGNLSPNYNGVIF 60

Query: 748  NSRDFKQNSDNLAPPLNTSMIQDSGF-HWTCNTGSFNNQSA-HELHLANKIKEELSDSNS 921
            N R   Q  ++    +N++MIQD GF HW  N G+FN+ SA HEL L+ KIKEELS  + 
Sbjct: 61   NPR---QKYESPVTSVNSTMIQDLGFQHWNNNAGNFNSLSAYHELQLS-KIKEELSSDSF 116

Query: 922  SKFSD---------EEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNG-PQVSIGE 1071
             KF++         E+ H  S+SY K EQ  L  L EKL LKT S G   NG  Q S  E
Sbjct: 117  PKFTEMLYSPSSTIEDPHPSSSSYFKDEQEGL-SLGEKLLLKTISPGFPRNGHDQFSPRE 175

Query: 1072 MYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQALDLLNSTK 1248
            + S   + +SFG A  S +  FSQI PS  I                MN Q LDLL ST+
Sbjct: 176  ISSCHHNGSSFGSAIPSRES-FSQIYPSINISNLNQPSSPLISGSFDMNLQGLDLLTSTR 234

Query: 1249 FGGSFVQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGATETKRSS-SFS 1425
            F GSF QP+ +      K+SLSFGL+ MQ++S  P  SPNKISS  N  TE KR + S  
Sbjct: 235  FSGSFAQPSDDPLAMFNKDSLSFGLDRMQQASQRPSCSPNKISS-NNEMTEAKRPNRSLM 293

Query: 1426 EPKASHTAT-KKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGY 1602
            EPKA+  A  KK+R+E+R S  P K RKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGY
Sbjct: 294  EPKATQAAAPKKSRLESRVSCPPLKARKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGY 353

Query: 1603 IRFLQSQIEALSSPYLGRGSGN 1668
            I+FLQ+Q+E LS PY+ + SGN
Sbjct: 354  IKFLQNQVETLSIPYM-KSSGN 374


>ref|XP_002300753.2| hypothetical protein POPTR_0002s03380g [Populus trichocarpa]
            gi|550344194|gb|EEE80026.2| hypothetical protein
            POPTR_0002s03380g [Populus trichocarpa]
          Length = 423

 Score =  260 bits (665), Expect = 1e-66
 Identities = 184/373 (49%), Positives = 222/373 (59%), Gaps = 19/373 (5%)
 Frame = +1

Query: 580  MESANLHQQH-QLQEQFDGSS---LATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYS 747
            MESANLH QH QLQ+QF GSS     TPS    A             +GN + N NGV  
Sbjct: 1    MESANLHHQHDQLQDQFVGSSSLTTTTPSSDAEAGSTHAWTQTITLNSGNLSPNYNGVIF 60

Query: 748  NSRDFKQNSDNLAPPLNTSMIQDSGF-HWTCNTGSFNNQSA-HELHLANKIKEELSDSNS 921
            N R   Q  ++    +N++MIQD GF HW  N G+FN+ SA HEL L+ KIKEELS  + 
Sbjct: 61   NPR---QKYESPVTSVNSTMIQDLGFQHWNNNAGNFNSLSAYHELQLS-KIKEELSSDSF 116

Query: 922  SKFSD---------EEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNG-PQVSIGE 1071
             KF++         E+ H  S+SY K EQ  L  L EKL LKT S G   NG  Q S  E
Sbjct: 117  PKFTEMLYSPSSTIEDPHPSSSSYFKDEQEGL-SLGEKLLLKTISPGFPRNGHDQFSPRE 175

Query: 1072 MYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQALDLLNSTK 1248
            + S   + +SFG A  S +  FSQI PS  I                MN Q LDLL ST+
Sbjct: 176  ISSCHHNGSSFGSAIPSRES-FSQIYPSINISNLNQPSSPLISGSFDMNLQGLDLLTSTR 234

Query: 1249 FGGSFVQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGATETKRSS-SFS 1425
            F GSF QP+ +      K+SLSFGL+ MQ++S  P  SPNKISS  N  TE KR + S  
Sbjct: 235  FSGSFAQPSDDPLAMFNKDSLSFGLDRMQQASQRPSCSPNKISS-NNEMTEAKRPNRSLM 293

Query: 1426 EPKASHTAT-KKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGY 1602
            EPKA+  A  KK+R+E+R S  P K RKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGY
Sbjct: 294  EPKATQAAAPKKSRLESRVSCPPLKARKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGY 353

Query: 1603 IRFLQSQIEALSS 1641
            I+FLQ+Q+E  S+
Sbjct: 354  IKFLQNQVEVFST 366


>ref|XP_006473548.1| PREDICTED: transcription factor bHLH110-like [Citrus sinensis]
          Length = 431

 Score =  258 bits (659), Expect = 7e-66
 Identities = 186/384 (48%), Positives = 229/384 (59%), Gaps = 27/384 (7%)
 Frame = +1

Query: 580  MESANLHQQHQL-QEQFDGS------SLATPS-LYGVAXXXXXXXXXXXXXA--GNFNLN 729
            MESAN    HQL Q+Q  GS      SL TPS  YGVA                 + N  
Sbjct: 1    MESAN----HQLRQDQLVGSPSSSSSSLPTPSSCYGVASGSTQNAWTPIPNVTLSSGNFI 56

Query: 730  INGVYSNSRDFKQNSDNLAPPLNTSMIQDSG-FHWTCNTGSFNNQSAHELHLANKIKEEL 906
             NGV  NS    +N   L P  N+SMIQ+S   HW       N+QSAHE H A KIK+E 
Sbjct: 57   YNGVILNSTH--KNEILLPPAANSSMIQESAALHW------INSQSAHE-HFA-KIKDEF 106

Query: 907  SDS---------NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKT-FSSGCQLNGPQ 1056
            SDS         + S   +E+  L + SY K EQ +L+DL +KL LK+  SSG  +NG  
Sbjct: 107  SDSFPKFTEMSSSPSSNINEDSDLSTASYLKNEQKNLNDLGDKLLLKSAISSGFPINGNH 166

Query: 1057 VSIGEMYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXGMNFQALDLL 1236
               G++YS   + +S GGA   S+G FSQI PS  I               MN Q LDLL
Sbjct: 167  FPAGDLYSSAHNISSVGGA-MPSRGNFSQIYPSINISNLSQTSSTNSTNFDMNLQFLDLL 225

Query: 1237 NSTKFGGSFVQPAHNNNLGLFKESLSFGLE--HMQESSTWPPSSP-NKISSFMNGA--TE 1401
             S++F G F QP+H+N LGL+KESL FG +  H+Q+SS  P  SP NKI+ F+N +  TE
Sbjct: 226  ASSRFSGDFSQPSHDN-LGLYKESLPFGCDQHHLQQSSRRPSCSPSNKIAHFINNSEITE 284

Query: 1402 -TKRSSSFSEPKASHTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTAS 1578
             TKR     EPKA+  A+KK+R+E+R+S  P KVRKEKLGDRIAAL QLV+PFGKTDTAS
Sbjct: 285  ATKRHGGVMEPKATQFASKKSRLESRASCPPMKVRKEKLGDRIAALQQLVAPFGKTDTAS 344

Query: 1579 VLLEAIGYIRFLQSQIEALSSPYL 1650
            VLLEAIGYI+FLQ+Q+E LS PY+
Sbjct: 345  VLLEAIGYIKFLQNQVETLSVPYM 368


>ref|XP_007017614.1| Basic helix-loop-helix DNA-binding superfamily protein, putative
            isoform 2 [Theobroma cacao] gi|508722942|gb|EOY14839.1|
            Basic helix-loop-helix DNA-binding superfamily protein,
            putative isoform 2 [Theobroma cacao]
          Length = 355

 Score =  252 bits (644), Expect = 4e-64
 Identities = 168/355 (47%), Positives = 209/355 (58%), Gaps = 16/355 (4%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQFDGSS-LATPSLYGVAXXXXXXXXXXXXXAGN-FNLNINGVYSNS 753
            MES N+H QHQLQ+Q  GSS L  PS YGVA             + + FN N NG   NS
Sbjct: 1    MESENVHHQHQLQDQLVGSSSLPIPSCYGVASTHSWTPTPSFALSSSEFNPNHNGDILNS 60

Query: 754  RDFKQNSDNLAPPLNTSMIQDSGFHWTCNTGSFN-NQSAHELHLANKIKEELSDS----- 915
            R   Q +D LA P N+SMIQD    WT N GSF  +QS ++LHLA KIKEELS+S     
Sbjct: 61   R---QKNDILASPQNSSMIQD----WTDNGGSFTTSQSCYDLHLA-KIKEELSESLTRFT 112

Query: 916  ----NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNGPQVSIGEMYSK 1083
                N+S   +     PS +Y K EQ DLHDLSEKL LKT SSG     P  S GE YS 
Sbjct: 113  DMLSNTSSVGESHQLPPSPNYLKNEQKDLHDLSEKLLLKTISSGF----PMFSAGEFYSA 168

Query: 1084 PLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQALDLLNSTKF--G 1254
              + +  GG    S+  FSQI PS  I                MN +ALDLL+S ++   
Sbjct: 169  TQNCSIPGGTALPSRRNFSQIYPSINISNLNQASSANIPSSFDMNLEALDLLSSARYCRS 228

Query: 1255 GSFVQPAHNNNLGLFKESLSFGLEH-MQESSTWPPSSPNKISSFMNGATETKRSSSFSEP 1431
             S   P+H++NLG++KES  FGL H MQ+S+     SP+K+S F +  +E KR S+  EP
Sbjct: 229  SSLSHPSHDHNLGIYKESPPFGLHHHMQQSNQRAAYSPSKLSPFTSELSEAKRPSTLPEP 288

Query: 1432 KASHTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAI 1596
            KA+  ATKK+R+E+R+S  PFKVRKEKLGDRIAAL QLV+PFGK  +    L ++
Sbjct: 289  KATAAATKKSRLESRASCPPFKVRKEKLGDRIAALQQLVAPFGKVISGCFFLSSV 343


>ref|XP_006435050.1| hypothetical protein CICLE_v10001291mg [Citrus clementina]
            gi|557537172|gb|ESR48290.1| hypothetical protein
            CICLE_v10001291mg [Citrus clementina]
          Length = 419

 Score =  249 bits (635), Expect = 4e-63
 Identities = 179/380 (47%), Positives = 220/380 (57%), Gaps = 23/380 (6%)
 Frame = +1

Query: 580  MESANLHQQHQL-QEQFDGS------SLATPS-LYGVAXXXXXXXXXXXXXA--GNFNLN 729
            MESAN    HQL Q+Q  GS      SL TPS  YGVA                 + N  
Sbjct: 1    MESAN----HQLRQDQLVGSPSSSSSSLPTPSSCYGVASSSTQNAWTPIPNVTLSSGNFI 56

Query: 730  INGVYSNSRDFKQNSDNLAPPLNTSMIQDS-GFHWTCNTGSFNNQSAHELHLANKIKEEL 906
             NGV  NS    +N   L P  N+SMIQ+S G HW       N+QSAHE H A KIK+E 
Sbjct: 57   YNGVILNSTH--KNEILLPPAANSSMIQESAGLHW------INSQSAHE-HFA-KIKDEF 106

Query: 907  SDS---------NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLK-TFSSGCQLNGPQ 1056
            SDS         + S   +E+  L + SY K EQ +L+DL +KL LK   SSG  +NG  
Sbjct: 107  SDSFPKFTEMSSSPSSNINEDSDLSTASYLKNEQKNLNDLGDKLLLKGAMSSGFPINGNH 166

Query: 1057 VSIGEMYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXGMNFQALDLL 1236
               G++YS   + +S GGA   S+G FSQI PS  I               MN Q LDLL
Sbjct: 167  FPAGDLYSSAHNISSVGGA-MPSRGNFSQIYPSINISNLSQISSTNSTNFDMNLQFLDLL 225

Query: 1237 NSTKFGGSFVQPAHNNNLGLFKESLSFGLE--HMQESSTWPPSSPNKISSFMNGATETKR 1410
             S++  G F QP+H+N LGL+KESL FG +  H+Q+SS  P  SP+  +        TKR
Sbjct: 226  ASSRVSGDFSQPSHDN-LGLYKESLPFGCDQHHLQQSSRRPSCSPSNKA--------TKR 276

Query: 1411 SSSFSEPKASHTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLE 1590
                 EPKA+  A+KK+R+E+R+S  P KVRKEKLGDRIAAL QLV+PFGKTDTASVLLE
Sbjct: 277  HGGVMEPKATQFASKKSRLESRASCPPMKVRKEKLGDRIAALQQLVAPFGKTDTASVLLE 336

Query: 1591 AIGYIRFLQSQIEALSSPYL 1650
            AIGYI+FLQ+Q+E LS PY+
Sbjct: 337  AIGYIKFLQNQVETLSVPYM 356


>ref|XP_006383713.1| hypothetical protein POPTR_0005s25240g [Populus trichocarpa]
            gi|550339706|gb|ERP61510.1| hypothetical protein
            POPTR_0005s25240g [Populus trichocarpa]
          Length = 355

 Score =  244 bits (622), Expect = 1e-61
 Identities = 164/314 (52%), Positives = 206/314 (65%), Gaps = 17/314 (5%)
 Frame = +1

Query: 805  MIQDSGFH-WTCNTGSFNNQSAHELHLANKIKEELSDSNS-SKFSD---------EEFHL 951
            M QD GFH W  N G+F++ SA++L L+ KIKE LS S+S  KF++         E+ H+
Sbjct: 1    MFQDLGFHYWNNNAGNFSSHSAYDLQLS-KIKEGLSSSDSFPKFTEMLNSPSSTIEDPHV 59

Query: 952  PSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNG-PQVSIGEMYSKPLSSASFGGAPTSSK 1128
             S+SY K E  DL  LSEKL L+T SSG  +NG  Q S  ++ S   + +SFG A   S+
Sbjct: 60   SSSSYIKDELKDL-SLSEKLLLETISSGFPINGHDQFSPRQISSSHHNCSSFGSA-IPSR 117

Query: 1129 GYFSQISPSTYIXXXXXXXXXXXXXX-GMNFQALDLLNSTKFGGSFVQPAHNNNLGLFKE 1305
            G FSQI PS  I                MN QALDLL ST+F GSF QPA  + L +FK+
Sbjct: 118  GSFSQIYPSINISNLNQPSSPLISGSFDMNLQALDLLTSTRFSGSFPQPASLDPLDMFKD 177

Query: 1306 SLSFGLEHMQESSTWPPSSPNKISSFMNGATETKR-SSSFSEPKASHTAT-KKARMETRS 1479
            SLSFGL+ +Q+S+  P  SP+KISS  N  TE KR ++S  EPKA+  A  KK+R+E+RS
Sbjct: 178  SLSFGLDSIQQSNQRPSCSPSKISS-TNEITEAKRPNNSMMEPKATQAAAPKKSRLESRS 236

Query: 1480 SLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSSPYLGRG 1659
               PFKVRKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGYI+FLQ+Q+E LS PY+ + 
Sbjct: 237  PCPPFKVRKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGYIKFLQNQVETLSVPYM-KS 295

Query: 1660 SGN--LRQQQSVSN 1695
            S N   R  Q+ SN
Sbjct: 296  SRNKTSRSIQAASN 309


>ref|XP_007223238.1| hypothetical protein PRUPE_ppa005486mg [Prunus persica]
            gi|462420174|gb|EMJ24437.1| hypothetical protein
            PRUPE_ppa005486mg [Prunus persica]
          Length = 458

 Score =  227 bits (578), Expect = 2e-56
 Identities = 176/415 (42%), Positives = 214/415 (51%), Gaps = 58/415 (13%)
 Frame = +1

Query: 580  MESANLHQQH-QLQEQFDGSS--LATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYSN 750
            MESANLH QH QLQE   GSS   ATPS Y V              +GN           
Sbjct: 1    MESANLHHQHHQLQENLVGSSSLAATPSCYAVGTKHAWTPSATLSSSGN----------- 49

Query: 751  SRDFKQNSDNLAPPLNTSMIQDSGFHWTCNTGSFNNQSAHELHLANKIKEELSDSNSSKF 930
                  +S++   PLN+SM+ D GFHW  N  S  +QS H+L    KIKEEL+ S+SS  
Sbjct: 50   ------SSNSGLDPLNSSMVPDLGFHWLTNITS-EHQSPHDLA---KIKEELTSSSSSDH 99

Query: 931  SDEEFH--------LPSTSYAKREQHD---------------LHDLSEKLFLKTFSSGCQ 1041
                 +        L S + +    HD               ++DLSEKL LKT SSGCQ
Sbjct: 100  HHHHHNSFPKLTEMLTSAAASTSIDHDQYYQFMKNEEKNQLIMNDLSEKLLLKTLSSGCQ 159

Query: 1042 LNG------PQVS-IGEMYSKP----------LSSASFGGAPTSSKGYFSQISPSTYIXX 1170
            +N        Q+S  GE YS            L      G P+ S G+FSQI PS  +  
Sbjct: 160  INSIINPHHHQISSAGEFYSNDDHHHLLHNSNLIGGVPPGMPSRSGGHFSQIYPSINVSN 219

Query: 1171 XXXXXXXXXXXXG---MNFQALDLLN-----STKFGGSF-VQPAHNNNLGLFKESL-SFG 1320
                            MN QA+DLL      ST    SF  QP  ++ LGL+KE+  SF 
Sbjct: 220  LNRSLSSSSISNSSLDMNLQAMDLLGASARFSTGTSSSFSTQPNSHDTLGLYKETHDSFA 279

Query: 1321 LEHMQESSTWPP----SSPNKISSFMNGATETKRSSSFSEPKASH-TATKKARMETRSSL 1485
                   ST P      + NKISSF N  TE KR  S  EPK +  TA KK+R+E+R++ 
Sbjct: 280  TLQQMHQSTDPHRLSCGNNNKISSFDNEITEVKRPGSSIEPKVTQATAPKKSRLESRTAC 339

Query: 1486 APFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSSPYL 1650
             PFKVRKEKLGDRIAAL QLV+PFGKTDTASVL+EAIGYI+FLQ+Q+E LS PY+
Sbjct: 340  PPFKVRKEKLGDRIAALQQLVAPFGKTDTASVLMEAIGYIKFLQNQVETLSVPYM 394


>emb|CAN70945.1| hypothetical protein VITISV_002869 [Vitis vinifera]
          Length = 396

 Score =  223 bits (567), Expect = 3e-55
 Identities = 156/366 (42%), Positives = 198/366 (54%), Gaps = 13/366 (3%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQF---DGSSLATPSLYGVAXXXXXXXXXXXXXAGNFNLNINGVYSN 750
            MES ++H+QHQLQEQF     SSL T ++YGV              + N          N
Sbjct: 1    MESVDVHRQHQLQEQFIINGCSSLDTHAVYGVPTIHGRSPSITMNGS-NHTYGNEIFLPN 59

Query: 751  SRDFKQNSDNLAPPLNTSMIQDSGFHWTCNTGSFNNQSAHELHLANKIKEELSDSNSSKF 930
            SR+ +  +  + PP+  S+IQD GFH   +  SF +QS  E+    KIKEEL +S   KF
Sbjct: 60   SREVRLKNAIMDPPVRASLIQDLGFH---DARSFTHQSPTEVLNFTKIKEELPNS-FPKF 115

Query: 931  SD--------EEFHL-PST-SYAKREQHDLHDLSEKLFLKTFSSGCQLNGPQVSIGEMYS 1080
             +        EE HL PS  SY K  Q    DLSE L   + +S     G Q+  G+ YS
Sbjct: 116  GEMVDNHSNVEELHLVPSIGSYMKHGQQPFRDLSENLCWLSSNSS---EGLQLLAGDSYS 172

Query: 1081 KPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXGMNFQALDLLNSTKFGGS 1260
                S  +G A TSS+  FS   PS  +              G+N Q LDLL S  +G  
Sbjct: 173  NARESEGYGSAYTSSRFNFSHGFPSXNLPNLDFSSSLVSNSLGLNLQTLDLLASANYGXG 232

Query: 1261 FVQPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGATETKRSSSFSEPKAS 1440
              + +H++ L  FKES+    +HMQES   P +S    S+FMNG + TK + S + PKA 
Sbjct: 233  SSKSSHBD-LDPFKESMPLDHDHMQESXHNPSNSSKMTSAFMNGVSRTKVTRSRTAPKAL 291

Query: 1441 HTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYIRFLQS 1620
            H ATK +    RSS  P KVRKEKLGDRIAAL +LV+PFGKTDTASVL EAIGYI+FL  
Sbjct: 292  HAATKMSGFGPRSSYPPLKVRKEKLGDRIAALQRLVAPFGKTDTASVLTEAIGYIQFLHD 351

Query: 1621 QIEALS 1638
            QI+  S
Sbjct: 352  QIQGSS 357


>ref|XP_002890742.1| hypothetical protein ARALYDRAFT_472970 [Arabidopsis lyrata subsp.
            lyrata] gi|297336584|gb|EFH67001.1| hypothetical protein
            ARALYDRAFT_472970 [Arabidopsis lyrata subsp. lyrata]
          Length = 449

 Score =  189 bits (481), Expect = 3e-45
 Identities = 161/400 (40%), Positives = 207/400 (51%), Gaps = 37/400 (9%)
 Frame = +1

Query: 580  MESANLHQ-QHQLQEQFDGSSLAT------PSLYGVAXXXXXXXXXXXXXAGNFNLNING 738
            M+SANLHQ Q QLQ     SS ++      PS YG +             + + + N N 
Sbjct: 1    MDSANLHQLQDQLQLVGSSSSSSSLDNNSDPSCYGASSAHQWSPGGISLNSVSLSHNYNN 60

Query: 739  VYSNSRDFKQNSD---NLAPPLNTSMIQDSGF--HWTCNTGSFNNQSAHELHLANKIKEE 903
               N+RD   N+    +L+   N S+IQ   F   W  +  S+++   HE  L  KIKEE
Sbjct: 61   EMLNTRDHNNNTSECMSLSTIHNHSLIQQQDFPLQWPHDQSSYHH---HEGLL--KIKEE 115

Query: 904  LSDS-------NSSKFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNGPQVS 1062
            LS S         SKF+D       T+Y K  +H   D +EKL LK+ SSG  ++G   S
Sbjct: 116  LSSSAISDHQEGISKFTDMLNSPVITNYLKINEHK--DYTEKLLLKSMSSGFPISGDYCS 173

Query: 1063 IGEMYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXG-------MNFQ 1221
                 S P SS+S   +  S +G FSQI PS  I                      MN Q
Sbjct: 174  -----SLPSSSSSSSPSSQSHRGNFSQIYPSVNISSLSESRKMSMDDMSNIPRPFDMNMQ 228

Query: 1222 ALDLLNSTKFGGSFVQPAHNN----NLGLFKESLS-FGL---EHMQESSTWPPSSP-NKI 1374
              D      F G+ + P  N+    NLG+ + S   FGL    H+Q++   P SSP +++
Sbjct: 229  VFD---GRLFEGNVLVPPLNSQEISNLGMSRGSFPPFGLPFHHHLQQTLPHPSSSPTHQM 285

Query: 1375 SSFMNGA--TETKRSSSFSEPKASHTATKKARMETRSSLAPFKVRKEKLGDRIAALHQLV 1548
              F N +  +E KR +     K    A+KK R+E+RSS  PFKVRKEKLGDRIAAL QLV
Sbjct: 286  EMFSNESQTSEGKRHNFLMATKVGENASKKPRVESRSSCPPFKVRKEKLGDRIAALQQLV 345

Query: 1549 SPFGKTDTASVLLEAIGYIRFLQSQIEALSSPYLGRGSGN 1668
            SPFGKTDTASVL+EAIGYI+FLQSQIE LS PY+ R S N
Sbjct: 346  SPFGKTDTASVLMEAIGYIKFLQSQIETLSVPYM-RASRN 384


>ref|XP_004291848.1| PREDICTED: transcription factor bHLH110-like [Fragaria vesca subsp.
            vesca]
          Length = 468

 Score =  188 bits (477), Expect = 9e-45
 Identities = 162/422 (38%), Positives = 208/422 (49%), Gaps = 55/422 (13%)
 Frame = +1

Query: 580  MESANLHQQH-QLQEQFD-------GSSLAT-PSLYGVAXXXXXXXXXXXXXAGNFNLNI 732
            MESANLH QH QLQE           SSLAT PS YGV              A      I
Sbjct: 1    MESANLHHQHHQLQENLSHLGSSSSSSSLATAPSYYGVGIKH----------AWTQQPTI 50

Query: 733  NGVYSNSRDFKQNSDNLAPPLNTSMIQDSGFH-WTCNTGSFNNQSAHELHLANKIKEELS 909
                +N  +   +S N +  +N  M+ D GFH W  ++ + N   +     ++ IKEELS
Sbjct: 51   TATTTNLSNPSNSSFNSSSSIN--MVPDLGFHCWPPSSNNLNRAGS-----SSSIKEELS 103

Query: 910  DSNSS----KFS----------------DEEFHLP-STSYAKREQHDL--HDLSEKLFLK 1020
             S+S     KF+                D  F  P S    K EQ ++  +DLSEKL LK
Sbjct: 104  SSSSDSTFPKFTQMLTSPSSTSINLDDDDHHFSTPTSLGLIKNEQKEMMMNDLSEKLLLK 163

Query: 1021 TFSSGCQLNGPQVSIGEMYSKPLSSASFGGA-------PTSSKG-YFSQISPSTYIXXXX 1176
            T SS   +N      G+ +     S++           P  S G YFSQI PS  I    
Sbjct: 164  TLSSS-GINHQISLAGDQHHHQFYSSNNNHVQNFTQLMPGRSGGQYFSQIYPSINISNLN 222

Query: 1177 XXXXXXXXXXG-----MNFQALDLLNSTKFGGSFVQPAHNNNLGLFKESL---SFGLEHM 1332
                            MN QA+DLL S++F       +H+  LG++ + +   SFGL+ M
Sbjct: 223  QQSSPSLTISSCSSLNMNLQAMDLLASSRFSTHEPYNSHDT-LGIYNKEIRHNSFGLQQM 281

Query: 1333 QES-----STWPPSSPNKISSFMNGATETKRSSSFSEPKASHTAT-KKARMETRSSLAPF 1494
             +S     S     + +KIS F N  TE KR  S  EPKA+  A  KK+R+E+R+   P 
Sbjct: 282  HQSRANHHSLLSSGANSKISPFENEITEVKRPGSLIEPKATQAAAPKKSRLESRTPCPPL 341

Query: 1495 KVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSSPYLGRGSGNLR 1674
            KVRKEKLGDRIA L QLV+PFGKTDTASVL+EAIGYI+FLQ+Q+E LS PY+     N  
Sbjct: 342  KVRKEKLGDRIATLQQLVAPFGKTDTASVLMEAIGYIKFLQNQVETLSVPYMKSSRDNKS 401

Query: 1675 QQ 1680
             Q
Sbjct: 402  SQ 403


>ref|XP_006307469.1| hypothetical protein CARUB_v10009095mg [Capsella rubella]
            gi|482576180|gb|EOA40367.1| hypothetical protein
            CARUB_v10009095mg [Capsella rubella]
          Length = 455

 Score =  187 bits (475), Expect = 2e-44
 Identities = 156/403 (38%), Positives = 198/403 (49%), Gaps = 40/403 (9%)
 Frame = +1

Query: 580  MESANLHQ-QHQLQEQFDGSSLAT------PSLYGVAXXXXXXXXXXXXXAGNFNLNING 738
            M+SANLHQ Q QLQ     SS ++      PS YG +             + + + N N 
Sbjct: 1    MDSANLHQLQDQLQLVGSSSSSSSLDNNSDPSCYGASSAHQWSPGGISLNSVSLSHNYNN 60

Query: 739  VYSNSRDFKQNSD-----NLAPPLNTSMIQDSGFHWTCNTGSFNNQSAHELHLANKIKEE 903
               N+RD   N++     +L+   N S+IQ   F         +  S H      KIKEE
Sbjct: 61   EMLNTRDHSSNNNTSECMSLSTIHNHSLIQQQDFPLQWPPYHHDQSSYHHHEGLLKIKEE 120

Query: 904  LSDSNSS-------KFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNGPQVS 1062
            LS S  S       KF+D       T+Y K  +H   D +EKL LKT S G   NG    
Sbjct: 121  LSSSTISDQQEGISKFTDMLNSPVITNYLKINEHK--DYTEKLLLKTISPGFPTNGD--- 175

Query: 1063 IGEMYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXG-------MNFQ 1221
                Y   L S+S   +P+S +G FSQI PS  I                      MN Q
Sbjct: 176  ----YCSSLPSSSSSSSPSSRRGNFSQIYPSVNISSLSESRKMSVDMSNNIPRPFDMNMQ 231

Query: 1222 ALDLLNSTKFGGSFVQPAHNN----NLGLFKESLS-FGL---EHMQESSTWPPSSP---- 1365
              D      F G+ + P  N+    NLG+ + S + FGL    H+Q++   P SS     
Sbjct: 232  VFD---GRLFEGNVLVPPLNSQEISNLGMSRGSFTPFGLPFHHHLQQTLHHPSSSSPSTH 288

Query: 1366 --NKISSFMNGATETKRSSSFSEPKASHTATKKARMETRSSLAPFKVRKEKLGDRIAALH 1539
                +S+     +E KR +     K+   A+KK R+E+RSS  PFKVRKEKLGDRIAAL 
Sbjct: 289  QMEMLSNIEPQTSEGKRHNFLMATKSGENASKKPRVESRSSCPPFKVRKEKLGDRIAALQ 348

Query: 1540 QLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSSPYLGRGSGN 1668
            QLVSPFGKTDTASVL+EAIGYI+FLQSQIE LS PY+ R S N
Sbjct: 349  QLVSPFGKTDTASVLMEAIGYIKFLQSQIETLSVPYM-RASRN 390


>ref|NP_174087.1| transcription factor bHLH110 [Arabidopsis thaliana]
            gi|218563530|sp|Q9SFZ3.2|BH110_ARATH RecName:
            Full=Transcription factor bHLH110; AltName: Full=Basic
            helix-loop-helix protein 110; Short=AtbHLH110; Short=bHLH
            110; AltName: Full=Transcription factor EN 59; AltName:
            Full=bHLH transcription factor bHLH110
            gi|332192739|gb|AEE30860.1| transcription factor bHLH110
            [Arabidopsis thaliana]
          Length = 453

 Score =  186 bits (473), Expect = 3e-44
 Identities = 160/401 (39%), Positives = 207/401 (51%), Gaps = 38/401 (9%)
 Frame = +1

Query: 580  MESANLHQ-QHQLQEQFDGSSLAT------PSLYGVAXXXXXXXXXXXXXAGNFNLNING 738
            M+SANLHQ Q QLQ     SS ++      PS YG +             + + + N N 
Sbjct: 1    MDSANLHQLQDQLQLVGSSSSSSSLDNNSDPSCYGASSAHQWSPGGISLNSVSLSHNYNN 60

Query: 739  VYSNSRDFKQNSDN-------LAPPLNTSMIQDSGF--HWTCNTGSFNNQSAHELHLANK 891
               N+R    N++N       L+   N S+IQ   F   W  +  S+ +   HE  L  K
Sbjct: 61   EMLNTRAHNNNNNNNTSECMSLSSIHNHSLIQQQDFPLQWPHDQSSYQH---HEGLL--K 115

Query: 892  IKEELSDSNSS-------KFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNG 1050
            IKEELS S  S       KF+D       T+Y K  +H   D +EKL LK+ SSG  +NG
Sbjct: 116  IKEELSSSTISDHQEGISKFTDMLNSPVITNYLKINEHK--DYTEKLLLKSMSSGFPING 173

Query: 1051 PQVSIGEMYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXGMNFQALD 1230
               S     S P SS+S   +  S +G FSQI PS  I                  +  D
Sbjct: 174  DYGS-----SLPSSSSSSSPSSQSHRGNFSQIYPSVNISSLSESRKMSMDDMSNISRPFD 228

Query: 1231 L----LNSTKFGGSFVQPAHN----NNLGLFKESL-SFGL---EHMQESSTWPPSSP-NK 1371
            +     +   F G+ + P  N    ++LG+ + SL SFGL    H+Q++     SSP ++
Sbjct: 229  INMQVFDGRLFEGNVLVPPFNAQEISSLGMSRGSLPSFGLPFHHHLQQTLPHLSSSPTHQ 288

Query: 1372 ISSFMNG--ATETKRSSSFSEPKASHTATKKARMETRSSLAPFKVRKEKLGDRIAALHQL 1545
            +  F N    +E KR +     KA   A+KK R+E+RSS  PFKVRKEKLGDRIAAL QL
Sbjct: 289  MEMFSNEPQTSEGKRHNFLMATKAGENASKKPRVESRSSCPPFKVRKEKLGDRIAALQQL 348

Query: 1546 VSPFGKTDTASVLLEAIGYIRFLQSQIEALSSPYLGRGSGN 1668
            VSPFGKTDTASVL+EAIGYI+FLQSQIE LS PY+ R S N
Sbjct: 349  VSPFGKTDTASVLMEAIGYIKFLQSQIETLSVPYM-RASRN 388


>ref|XP_006415743.1| hypothetical protein EUTSA_v10007594mg [Eutrema salsugineum]
            gi|557093514|gb|ESQ34096.1| hypothetical protein
            EUTSA_v10007594mg [Eutrema salsugineum]
          Length = 456

 Score =  185 bits (470), Expect = 6e-44
 Identities = 157/403 (38%), Positives = 200/403 (49%), Gaps = 40/403 (9%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQFDGSSLAT--------PSLYGVAXXXXXXXXXXXXXAGNFNLNIN 735
            M+SAN+HQ  Q Q Q  GSS ++        PS Y  +             +   +   N
Sbjct: 1    MDSANMHQLRQDQLQLVGSSSSSSSLDNNSDPSCYVASSAHQWNPGGISLNSERLSQKYN 60

Query: 736  GVYSNSRDFKQNSDN-------LAPPLNTSMIQDSGF--HWTCNTGSFNNQSAHELHLAN 888
                N RD   +++N       L+   N S+IQ   F   W  +  S+++     LH   
Sbjct: 61   IEMLNRRDHNNSNNNNTSECMSLSNIHNHSLIQQQDFPLQWPHDQSSYHHHEG--LH--- 115

Query: 889  KIKEELSDSNSS-------KFSDEEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLN 1047
            KIKEELS S +S       KF+D       T+Y K  +H   D +EKL L T SSG  +N
Sbjct: 116  KIKEELSSSTTSDHQEGLPKFTDMLNSPVITNYLKINEHK--DYTEKLLLNTISSGFPIN 173

Query: 1048 GPQVSIGEMYSKPLSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXG------ 1209
            G   S   + S   SS+S   A  S +G FSQI PS  I                     
Sbjct: 174  GDYTS--SLPSSSSSSSSSLPASQSHRGSFSQIYPSVNISSLSESRGMSMDMSNIPRPFD 231

Query: 1210 MNFQALDLLNSTKFGGSFVQPAHN---NNLGLFKESLS-FGL---EHMQESSTWPPSSPN 1368
            MN Q LD       G   V P ++   +N G+ + S S FGL    H+Q++   P SSP 
Sbjct: 232  MNMQVLD--GRLLEGNVLVPPLNSQEISNFGMSRGSFSPFGLPFHHHLQQTLHHPSSSPT 289

Query: 1369 KISSFMNG---ATETKRSSSFSEPKASHTATKKARMETRSSLAPFKVRKEKLGDRIAALH 1539
              +   +    A+E KR +     KA   A+KK R+E+RSS  PFKVRKEKLGDRIAAL 
Sbjct: 290  HQTEMFSNEPQASEGKRQNFLMATKAGENASKKPRVESRSSCPPFKVRKEKLGDRIAALQ 349

Query: 1540 QLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSSPYLGRGSGN 1668
            QLVSPFGKTDTASVL+EAIGYI FLQ+QIE LS PY+ R S N
Sbjct: 350  QLVSPFGKTDTASVLMEAIGYINFLQNQIETLSVPYM-RASRN 391


>ref|XP_007046833.1| Basic helix-loop-helix DNA-binding superfamily protein, putative
            [Theobroma cacao] gi|508699094|gb|EOX90990.1| Basic
            helix-loop-helix DNA-binding superfamily protein,
            putative [Theobroma cacao]
          Length = 401

 Score =  184 bits (468), Expect = 1e-43
 Identities = 148/385 (38%), Positives = 193/385 (50%), Gaps = 13/385 (3%)
 Frame = +1

Query: 580  MESANLHQQHQLQEQF-DGSSLATPS-LYGVAXXXXXXXXXXXXXAGNFNLNINGVYSNS 753
            MESANLH   ++QEQ+   SSLAT +  + V+                +N N+      S
Sbjct: 1    MESANLHPHPKVQEQYVKYSSLATQTGHHQVSTSDEWNSNLVPNIGSKYNRNLTETIPKS 60

Query: 754  RDFKQNSDNLAPPL-NTSMIQDSGFHWTCNTGSFNNQSAHELHLANKIKEELSDSNSSKF 930
            RD        APPL  TSM QDS          FN QS  E  L N IK+E+SDS   K 
Sbjct: 61   RDL------WAPPLIRTSMNQDS----------FNQQSTSEFLLTN-IKDEMSDS-FPKL 102

Query: 931  SD--------EEFHLPSTSYAKREQHDLHDLSEKLFLKTFSSGCQLNGPQVSIGEMYSKP 1086
            S+        E+ +LP   +    Q    DL   L+   FS    +   Q+S G+ Y   
Sbjct: 103  SEMMYCHSGAEDSYLPFRKHYIYPQSS--DLGGNLWHSNFSIANHMTELQLSSGDSYRNA 160

Query: 1087 LSSASFGGAPTSSKGYFSQISPSTYIXXXXXXXXXXXXXXGMNFQALDLLNSTKFGGSFV 1266
              S   G A  +S+  F+ I PST I               +N +ALDLL ST  GGS  
Sbjct: 161  HQSPCLGTAAATSRYDFNHIFPSTNISTSDLCSTLFSSSLDLNLKALDLLTSTYDGGSCN 220

Query: 1267 QPAHNNNLGLFKESLSFGLEHMQESSTWPPSSPNKISSFMNGAT-ETKRSSSFSEPKASH 1443
            Q   ++  G    S+  G +H++E S  P +S NKIS+ ++G+T  TKR  SFSE K   
Sbjct: 221  QSLLDSP-GKLSRSVLVGHDHIRERSDSPSTSSNKISTLVSGSTTSTKRPGSFSETKEFQ 279

Query: 1444 TATKKARMETRSSLAP-FKVRKEKLGDRIAALHQLVSPFGKTDTASVLLEAIGYIRFLQS 1620
               KK R  T  S  P  KVRKEKLGDR+AAL +LV+PFGKTDTA+VL EAIGYI+FL  
Sbjct: 280  QDAKKHRSSTSRSPCPTLKVRKEKLGDRVAALQKLVAPFGKTDTATVLTEAIGYIQFLHD 339

Query: 1621 QIEALSSPYLGRGSGNLRQQQSVSN 1695
            Q++ LS P++      L +   V +
Sbjct: 340  QVQTLSVPFMKSSQSRLYRTVQVGS 364

