BLASTX nr result

ID: Mentha27_contig00012214 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00012214
         (1944 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU46294.1| hypothetical protein MIMGU_mgv1a008008mg [Mimulus...   301   9e-79
ref|XP_007014950.1| DNA binding protein, putative isoform 2 [The...   243   2e-61
gb|EXB48386.1| hypothetical protein L484_007964 [Morus notabilis]     243   3e-61
ref|XP_006493662.1| PREDICTED: transcription factor bHLH122-like...   243   3e-61
ref|XP_006446031.1| hypothetical protein CICLE_v10015437mg [Citr...   243   3e-61
ref|XP_003592465.1| Transcription factor bHLH130 [Medicago trunc...   238   1e-59
ref|XP_007014949.1| DNA binding protein, putative isoform 1 [The...   237   1e-59
ref|XP_007131988.1| hypothetical protein PHAVU_011G057400g [Phas...   237   2e-59
ref|XP_003543549.1| PREDICTED: transcription factor bHLH122 [Gly...   234   1e-58
ref|XP_003540708.1| PREDICTED: transcription factor bHLH122-like...   229   3e-57
ref|XP_004486946.1| PREDICTED: transcription factor bHLH122-like...   228   6e-57
ref|XP_003597460.1| Transcription factor bHLH122 [Medicago trunc...   227   1e-56
ref|XP_007205257.1| hypothetical protein PRUPE_ppa006295mg [Prun...   227   2e-56
ref|XP_003537954.1| PREDICTED: transcription factor bHLH122-like...   224   8e-56
emb|CAN80884.1| hypothetical protein VITISV_018653 [Vitis vinifera]   224   1e-55
ref|XP_002513457.1| DNA binding protein, putative [Ricinus commu...   223   2e-55
emb|CBI16416.3| unnamed protein product [Vitis vinifera]              221   1e-54
ref|XP_007014951.1| DNA binding protein, putative isoform 3, par...   220   2e-54
ref|XP_002304761.1| basic helix-loop-helix family protein [Popul...   219   5e-54
ref|XP_004295210.1| PREDICTED: transcription factor bHLH122-like...   216   4e-53

>gb|EYU46294.1| hypothetical protein MIMGU_mgv1a008008mg [Mimulus guttatus]
          Length = 388

 Score =  301 bits (770), Expect = 9e-79
 Identities = 206/400 (51%), Positives = 239/400 (59%), Gaps = 36/400 (9%)
 Frame = -3

Query: 1414 MNSHHH--QQFEQNTQV-GSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNP- 1247
            MNS+H+  QQ +QN Q  G+GLTRYRSAPSSYFA+LL+TP            FE+LFN  
Sbjct: 1    MNSNHNNQQQQQQNMQAAGAGLTRYRSAPSSYFANLLNTPAGADGGFGGAGDFEELFNEA 60

Query: 1246 RASSPETQVIFSRFMNSS-------AADXXXXXXXXXXXXXXXEY--------HQQPPQR 1112
            RASSPETQ IFSRFM++S       ++                 Y         +Q  QR
Sbjct: 61   RASSPETQRIFSRFMSNSDDSVRESSSPCVMTDLPSPSPLNPQFYPPTKAEPEFEQRLQR 120

Query: 1111 QR-NDYSPQIMYSEAAVSATDTKVFAPFNTNCAAQNKIER----GGAALIRHTSSPAGFF 947
            QR NDYS  +  S  A+     +V   F+++  A  KIER    GG+ LIRH+SSP G F
Sbjct: 121  QRSNDYSEPVHSS--AMDGEYNRVLGSFSSSHVAHMKIERAGAGGGSGLIRHSSSPPGLF 178

Query: 946  ANINIENEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSSRGIINTISENETVDAQ 767
            ANINI+NEF TMR+ GS G  NN  AEA+ SS + F  QMD         I EN      
Sbjct: 179  ANINIDNEFGTMRTFGS-GNTNNNNAEASISSSSRFKNQMDSIDEIENKGIGEN------ 231

Query: 766  FDEDHHGNDVDFITGLP---WDDSSL--ISDPYLKGLQHN---QKNEAGNQXXXXXXXXX 611
                   N  D+ITG P   WDD  L  ++D   K   +N   Q NE GN          
Sbjct: 232  -------NSSDYITGFPMNAWDDDFLNELADNDNKRFSNNANDQSNEGGNNRPTNRLSHH 284

Query: 610  XXXXXXXA----MERLLEDSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQELVPN 443
                   A    ME+LL+DSVPCKIRAKRG ATHPRSIAERVRRTKISERMRKLQELVPN
Sbjct: 285  LSMPTSSAELSAMEKLLQDSVPCKIRAKRGHATHPRSIAERVRRTKISERMRKLQELVPN 344

Query: 442  MEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSK 323
            MEKQTNTSDMLDLAVDYIKDLQ Q+KTLSDNRAKC+CS+K
Sbjct: 345  MEKQTNTSDMLDLAVDYIKDLQRQVKTLSDNRAKCSCSAK 384


>ref|XP_007014950.1| DNA binding protein, putative isoform 2 [Theobroma cacao]
            gi|508785313|gb|EOY32569.1| DNA binding protein, putative
            isoform 2 [Theobroma cacao]
          Length = 432

 Score =  243 bits (620), Expect = 2e-61
 Identities = 171/449 (38%), Positives = 229/449 (51%), Gaps = 82/449 (18%)
 Frame = -3

Query: 1420 SSMNSHHH------QQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQ 1259
            S +  HHH      Q      Q+ SGL RY+SAPSSYF+S+L                 Q
Sbjct: 3    SDLQHHHHHLIDYHQPQHHQKQMNSGLMRYQSAPSSYFSSILDRDFC------------Q 50

Query: 1258 LFNPRASSPETQVIFSRFMNSSA--------------------------------ADXXX 1175
             F  R SSPET+ I  RF++SS                                      
Sbjct: 51   EFLNRPSSPETERIIERFLSSSGDGGGGNTVNISDQNLCAITQNSPVRETVIKIEEPTQI 110

Query: 1174 XXXXXXXXXXXXEYHQQPPQRQRNDYSP-----------QIMYSEAAVSATDTKVFAPFN 1028
                        +  QQ  Q Q+ +YS            Q + ++ + S  D ++  P +
Sbjct: 111  MTPMNNQTGVMQQQQQQQQQPQQGNYSSASQNFYQSQPQQHLPNQQSGSTMDYRI--PNS 168

Query: 1027 TNCAAQNKIERGG---AALIRHTSSPAGFFANINIENEFKTMRSEGSYGGGNNATAEATF 857
               A   +++ GG   + L+RH+SSPAG F+N+NI+N +  +R  G YGG NN+  EA+F
Sbjct: 169  MGMARPTQMKMGGGNNSNLVRHSSSPAGLFSNLNIDNSYGVVRGMGDYGGVNNSNREASF 228

Query: 856  SS-----PNTFLTQM-DYSSRGIINTISENETVDAQFDEDHHGNDVDFITGLP---WDDS 704
             S     P+  ++ + +  ++ ++   SEN    A F E+ H N   + +G P   W+DS
Sbjct: 229  PSASRPPPSGLMSPIAEMGNKNVVPNSSEN----AGFGENRHNN---YSSGFPVTSWEDS 281

Query: 703  SLISD--PYLKGLQHN--------------QKNEAGNQXXXXXXXXXXXXXXXXAMERL- 575
             +ISD  P +K L+ +              Q  +AGN+                 M  + 
Sbjct: 282  MMISDNMPGVKRLREDDRSLSGLDLDGAETQNTDAGNRPPPILAHHLSLPKSSAEMSAID 341

Query: 574  ----LEDSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLD 407
                 +DSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQ+LVPNM+KQTNT+DMLD
Sbjct: 342  KFLQYQDSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQDLVPNMDKQTNTADMLD 401

Query: 406  LAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            LAVDYIKDLQ Q+KTLSDNRAKC+CS+KQ
Sbjct: 402  LAVDYIKDLQNQVKTLSDNRAKCSCSNKQ 430


>gb|EXB48386.1| hypothetical protein L484_007964 [Morus notabilis]
          Length = 417

 Score =  243 bits (619), Expect = 3e-61
 Identities = 176/430 (40%), Positives = 226/430 (52%), Gaps = 63/430 (14%)
 Frame = -3

Query: 1420 SSMNSHHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRA 1241
            SS   HHH    Q  Q+ SGL RYRSAPSSYF  +L                +Q FN R 
Sbjct: 3    SSDLQHHHHHHHQ--QMNSGLMRYRSAPSSYFTDMLDREFC-----------QQFFN-RP 48

Query: 1240 SSPETQVIFSRFMNSSAA------------DXXXXXXXXXXXXXXXEYHQQPPQRQRNDY 1097
            SSPET+ IF+RFMNS               D                  QQ  Q+Q+   
Sbjct: 49   SSPETERIFARFMNSDGGGSSNNNNTAEVEDLQKVNDNAEAEAAVLRNQQQQQQQQQQQQ 108

Query: 1096 SPQIM----------YSEAAVSATDTKVFAPFNTNCAAQNK-------IERGGAA---LI 977
            S  I+          Y  ++      +  +  NTN  + +        +  GG +   LI
Sbjct: 109  SNNIISGNYSSSSSFYQSSSKPPLPNQGISSGNTNEGSYSMGMNQFPPMRTGGISNSNLI 168

Query: 976  RHTSSPAGFFANINIENE-FKTMRSEGSYGGGNNATAEATFSSPNTF--LTQMDYSSRGI 806
            RH+SSPAG FANINI+   F  MR  G+YG  ++   EA+FS+P+     +    SS G+
Sbjct: 169  RHSSSPAGLFANINIDTSGFGAMRGMGTYGASDSTDEEASFSTPSRLNKFSSGPASSTGL 228

Query: 805  INTISENETVDAQFDEDHHGNDVD-----------FITGLP---WDDSSLISDPY--LKG 674
            ++ I+E +      D+   GN  D           F++  P   WDDS ++S+    LK 
Sbjct: 229  MSPIAEID------DKTMVGNSQDTGAFGDSRSNSFVSSFPMGSWDDSPIMSENITGLKR 282

Query: 673  LQHN----------QKNEAGNQXXXXXXXXXXXXXXXXAMERLLE--DSVPCKIRAKRGC 530
            L+ +          Q  E+G +                A+E+ L+  DSVPCKIRAKRGC
Sbjct: 283  LRDDHDVKQYSSETQNVESGTRPLAHHLSLPKTSSEMAAIEKFLQFQDSVPCKIRAKRGC 342

Query: 529  ATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDN 350
            ATHPRSIAERVRRT+ISERMRKLQELVPNMEKQTNT+DMLDLAV+YIKDL+ Q++TLSD+
Sbjct: 343  ATHPRSIAERVRRTRISERMRKLQELVPNMEKQTNTADMLDLAVEYIKDLKKQVQTLSDS 402

Query: 349  RAKCTCSSKQ 320
            RAKCTCSSKQ
Sbjct: 403  RAKCTCSSKQ 412


>ref|XP_006493662.1| PREDICTED: transcription factor bHLH122-like isoform X1 [Citrus
            sinensis] gi|568881644|ref|XP_006493663.1| PREDICTED:
            transcription factor bHLH122-like isoform X2 [Citrus
            sinensis]
          Length = 408

 Score =  243 bits (619), Expect = 3e-61
 Identities = 169/415 (40%), Positives = 216/415 (52%), Gaps = 53/415 (12%)
 Frame = -3

Query: 1405 HHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPET 1226
            H HQ      Q+ SGLTRY+SAPSSYF+S L                   F  R SSPET
Sbjct: 14   HDHQIQHHQKQLNSGLTRYQSAPSSYFSSFLDKDAFED------------FLMRPSSPET 61

Query: 1225 QVIFSRFMNSSAADXXXXXXXXXXXXXXXEYHQQPPQRQRNDYSPQIMYSEAAVSATDTK 1046
            + IF+RF+++SA +               +  Q   Q+Q+     Q+   +        +
Sbjct: 62   ERIFARFLSNSAGNTDNTENTSNSIIPEQKIIQSDQQQQQQQLM-QLQQQQQQQQQQQQQ 120

Query: 1045 VFAPFNT----------NCAAQN-----------KIERGGAA-------LIRHTSSPAGF 950
              + + T          N  +QN           K+   GA        LIRH+SSPAG 
Sbjct: 121  QQSYYQTQPPQPQLQQQNFQSQNVGMDRFSTQPMKMSGAGAGPGGNNSNLIRHSSSPAGL 180

Query: 949  FANINIENEFKTMRSEGSYGGGNN-----ATAEATFSSPNTFLTQMDYSSRGIINTISEN 785
            F+NI IEN +  M+  G YGGGNN     +T+ A+   P T    M      I+    +N
Sbjct: 181  FSNITIENGYVMMKGMGDYGGGNNTIKRESTSYASSGKPPTGAKPMS----PIVEV--DN 234

Query: 784  ETVDAQFDEDHHGNDVDFITGLP---WDDSSLI-------------SDPYLKGLQ--HNQ 659
            +T +A F E H+GN   + TG P   WDDSS++              D  L GL     Q
Sbjct: 235  KTSNAGFKEGHNGN---YSTGFPMDSWDDSSMMPENIVSEVNRLREDDRTLSGLSATETQ 291

Query: 658  KNEAGNQXXXXXXXXXXXXXXXXAMERLL--EDSVPCKIRAKRGCATHPRSIAERVRRTK 485
              + G +                 +E+LL  +DSVPCKIRAKRGCATHPRSIAERVRRTK
Sbjct: 292  SIDLGTRPPPLLAHHLSLPKDIATIEKLLHYQDSVPCKIRAKRGCATHPRSIAERVRRTK 351

Query: 484  ISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            ISERMRKLQELVPNM+KQTNT+DMLDLAVDY+K+LQ Q+K LSDNRAKCTC+ ++
Sbjct: 352  ISERMRKLQELVPNMDKQTNTADMLDLAVDYVKELQCQVKALSDNRAKCTCAKQE 406


>ref|XP_006446031.1| hypothetical protein CICLE_v10015437mg [Citrus clementina]
            gi|567907437|ref|XP_006446032.1| hypothetical protein
            CICLE_v10015437mg [Citrus clementina]
            gi|557548642|gb|ESR59271.1| hypothetical protein
            CICLE_v10015437mg [Citrus clementina]
            gi|557548643|gb|ESR59272.1| hypothetical protein
            CICLE_v10015437mg [Citrus clementina]
          Length = 408

 Score =  243 bits (619), Expect = 3e-61
 Identities = 169/415 (40%), Positives = 216/415 (52%), Gaps = 53/415 (12%)
 Frame = -3

Query: 1405 HHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPET 1226
            H HQ      Q+ SGLTRY+SAPSSYF+S L                   F  R SSPET
Sbjct: 14   HDHQIQHHQKQLNSGLTRYQSAPSSYFSSFLDKDAFED------------FLMRPSSPET 61

Query: 1225 QVIFSRFMNSSAADXXXXXXXXXXXXXXXEYHQQPPQRQRNDYSPQIMYSEAAVSATDTK 1046
            + IF+RF+++SA +               +  Q   Q+Q+     Q+   +        +
Sbjct: 62   ERIFARFLSNSAGNTDDTENTSNSIIPEQKIIQSDQQQQQQQLM-QLQQQQQQQQQQQQQ 120

Query: 1045 VFAPFNT----------NCAAQN-----------KIERGGAA-------LIRHTSSPAGF 950
              + + T          N  +QN           K+   GA        LIRH+SSPAG 
Sbjct: 121  QQSYYQTQPPQPQLQQQNFQSQNVGMDRFSTQPMKMSGAGAGPGGNNSNLIRHSSSPAGL 180

Query: 949  FANINIENEFKTMRSEGSYGGGNN-----ATAEATFSSPNTFLTQMDYSSRGIINTISEN 785
            F+NI IEN +  M+  G YGGGNN     +T+ A+   P T    M      I+    +N
Sbjct: 181  FSNITIENGYVMMKGMGDYGGGNNTIKRESTSYASSGKPPTGAKPMS----PIVEV--DN 234

Query: 784  ETVDAQFDEDHHGNDVDFITGLP---WDDSSLI-------------SDPYLKGLQ--HNQ 659
            +T +A F E H+GN   + TG P   WDDSS++              D  L GL     Q
Sbjct: 235  KTSNAGFKEGHNGN---YSTGFPMDSWDDSSMMPENIVSEVNRLREDDRTLSGLSATETQ 291

Query: 658  KNEAGNQXXXXXXXXXXXXXXXXAMERLL--EDSVPCKIRAKRGCATHPRSIAERVRRTK 485
              + G +                 +E+LL  +DSVPCKIRAKRGCATHPRSIAERVRRTK
Sbjct: 292  SIDLGTRPPPLLAHHLSLPKDIATIEKLLHYQDSVPCKIRAKRGCATHPRSIAERVRRTK 351

Query: 484  ISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            ISERMRKLQELVPNM+KQTNT+DMLDLAVDY+K+LQ Q+K LSDNRAKCTC+ ++
Sbjct: 352  ISERMRKLQELVPNMDKQTNTADMLDLAVDYVKELQCQVKALSDNRAKCTCAKQE 406


>ref|XP_003592465.1| Transcription factor bHLH130 [Medicago truncatula]
            gi|355481513|gb|AES62716.1| Transcription factor bHLH130
            [Medicago truncatula]
          Length = 433

 Score =  238 bits (606), Expect = 1e-59
 Identities = 173/435 (39%), Positives = 223/435 (51%), Gaps = 72/435 (16%)
 Frame = -3

Query: 1408 SHHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPE 1229
            +HHHQQ     Q+ SGLTR++SAPSSYF++++                E LFN R SSPE
Sbjct: 17   NHHHQQ-----QMNSGLTRFKSAPSSYFSNIIDKEFY-----------EHLFN-RPSSPE 59

Query: 1228 TQVIFSRFM---------------NSSAADXXXXXXXXXXXXXXXEYHQQPPQRQRNDYS 1094
            T+ +F+RFM               ++ +                    Q P  ++  D  
Sbjct: 60   TERVFARFMNSLSGSGGSGGGGGGDAESVSVTASIAPITDSVVDSLTQQLPIVKEEIDQQ 119

Query: 1093 PQIMYS----------EAA----VSATDTKVFAPFNTNCAAQ------------NKIERG 992
             Q M +          EA     +    +   + + ++ A Q            N+++ G
Sbjct: 120  SQTMQTMNNNNNNNNNEAVDLPQLQRQQSNNMSNYGSSAAPQSFYHNSGRPPLPNQMKTG 179

Query: 991  GAALIRHTSSPAGFFANINIENEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSSR 812
             + LIRH SSPAG F+NINI+  F  MR  G+ G  N+ + EA FSS    L      + 
Sbjct: 180  RSNLIRHGSSPAGLFSNINIDTGFAAMRGIGTMGAANSTSKEANFSSSVVRLKNAPNYAS 239

Query: 811  GI-----INTISENETVDAQFDEDHHGNDVDFITGLP----WDDSSLISD---------- 689
             +      N+I +N      F E   GND  FI G P    W+D+++ISD          
Sbjct: 240  ALGAEIGSNSIPQNNLEPEGFAETR-GND--FIPGFPLGTTWEDTAMISDNITGLKRYRD 296

Query: 688  -----PYLKGLQHNQ-KNEAGNQXXXXXXXXXXXXXXXXA----MERLLE--DSVPCKIR 545
                 P+  GL   + KNE G Q                A    +E+ L+  DSVPCKIR
Sbjct: 297  DDDVKPFPPGLNPAETKNETGGQTTSAPLAHQMSMPNTTAELAAIEKFLQFSDSVPCKIR 356

Query: 544  AKRGCATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLK 365
            AKRGCATHPRSIAERVRRTKISERMRKLQ+LVPNM+KQTNTSDMLDLAV+YIKDLQ Q++
Sbjct: 357  AKRGCATHPRSIAERVRRTKISERMRKLQDLVPNMDKQTNTSDMLDLAVEYIKDLQNQVE 416

Query: 364  TLSDNRAKCTCSSKQ 320
            TLSDNRAKCTCS KQ
Sbjct: 417  TLSDNRAKCTCSHKQ 431


>ref|XP_007014949.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
            gi|508785312|gb|EOY32568.1| DNA binding protein, putative
            isoform 1 [Theobroma cacao]
          Length = 434

 Score =  237 bits (605), Expect = 1e-59
 Identities = 171/451 (37%), Positives = 229/451 (50%), Gaps = 84/451 (18%)
 Frame = -3

Query: 1420 SSMNSHHH------QQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQ 1259
            S +  HHH      Q      Q+ SGL RY+SAPSSYF+S+L                 Q
Sbjct: 3    SDLQHHHHHLIDYHQPQHHQKQMNSGLMRYQSAPSSYFSSILDRDFC------------Q 50

Query: 1258 LFNPRASSPETQVIFSRFMNSSA--------------------------------ADXXX 1175
             F  R SSPET+ I  RF++SS                                      
Sbjct: 51   EFLNRPSSPETERIIERFLSSSGDGGGGNTVNISDQNLCAITQNSPVRETVIKIEEPTQI 110

Query: 1174 XXXXXXXXXXXXEYHQQPPQRQRNDYSP-----------QIMYSEAAVSATDTKVFAPFN 1028
                        +  QQ  Q Q+ +YS            Q + ++ + S  D ++  P +
Sbjct: 111  MTPMNNQTGVMQQQQQQQQQPQQGNYSSASQNFYQSQPQQHLPNQQSGSTMDYRI--PNS 168

Query: 1027 TNCAAQNKIERGG---AALIRHTSSPAGFFANINIEN--EFKTMRSEGSYGGGNNATAEA 863
               A   +++ GG   + L+RH+SSPAG F+N+NI+N   +  +R  G YGG NN+  EA
Sbjct: 169  MGMARPTQMKMGGGNNSNLVRHSSSPAGLFSNLNIDNIAGYGVVRGMGDYGGVNNSNREA 228

Query: 862  TFSS-----PNTFLTQM-DYSSRGIINTISENETVDAQFDEDHHGNDVDFITGLP---WD 710
            +F S     P+  ++ + +  ++ ++   SEN    A F E+ H N   + +G P   W+
Sbjct: 229  SFPSASRPPPSGLMSPIAEMGNKNVVPNSSEN----AGFGENRHNN---YSSGFPVTSWE 281

Query: 709  DSSLISD--PYLKGLQHN--------------QKNEAGNQXXXXXXXXXXXXXXXXAMER 578
            DS +ISD  P +K L+ +              Q  +AGN+                 M  
Sbjct: 282  DSMMISDNMPGVKRLREDDRSLSGLDLDGAETQNTDAGNRPPPILAHHLSLPKSSAEMSA 341

Query: 577  L-----LEDSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDM 413
            +      +DSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQ+LVPNM+KQTNT+DM
Sbjct: 342  IDKFLQYQDSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQDLVPNMDKQTNTADM 401

Query: 412  LDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            LDLAVDYIKDLQ Q+KTLSDNRAKC+CS+KQ
Sbjct: 402  LDLAVDYIKDLQNQVKTLSDNRAKCSCSNKQ 432


>ref|XP_007131988.1| hypothetical protein PHAVU_011G057400g [Phaseolus vulgaris]
            gi|561004988|gb|ESW03982.1| hypothetical protein
            PHAVU_011G057400g [Phaseolus vulgaris]
          Length = 416

 Score =  237 bits (604), Expect = 2e-59
 Identities = 171/419 (40%), Positives = 213/419 (50%), Gaps = 58/419 (13%)
 Frame = -3

Query: 1402 HHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPETQ 1223
            HHQQ     Q  SGLTRYRSAPSSYF+S++                E +FN R SSPET+
Sbjct: 15   HHQQ-----QTNSGLTRYRSAPSSYFSSIIDREFY-----------EHVFN-RPSSPETE 57

Query: 1222 VIFSRFMNS--------------SAADXXXXXXXXXXXXXXXEYHQQPPQ---------- 1115
             + +RF+NS               A +               E +QQP            
Sbjct: 58   RMLTRFVNSLGGGDADADAAAAADAEEALPTQNPSTVVAVKEEVNQQPKDMPLPPINNEP 117

Query: 1114 ---RQRNDYSPQIMYSEAAVSATDTKVFAPFNTNCAAQNKIERG---GAALIRHTSSPAG 953
               +Q+  +  Q   +    SA     F          N+IE G    + LIRH SSPAG
Sbjct: 118  LVLQQQQQHQQQSNINNYGSSAPQN--FYQNTGRPPLPNQIETGRRTASNLIRHGSSPAG 175

Query: 952  FFANINIENEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSS------RGIINTIS 791
             F+NINIE+ +   R  G+ G  NN+T EA FS         +YSS        I N  S
Sbjct: 176  LFSNINIESGYAAARGMGTMGAVNNSTEEANFSPVTRMKNAPNYSSGLMSSRAEIGNKSS 235

Query: 790  ENETVDAQFDEDHHGNDVDFITGLP---WDDSSLISDPY-------------LKGLQHNQ 659
                 + +   ++ GN+  FI G P   WDDS+++SD                 GL  ++
Sbjct: 236  TQNNAENEGFAENQGNE--FIPGFPVGSWDDSAIMSDNMTGRKRYREEDVKPFSGLNVSE 293

Query: 658  -KNEAGNQXXXXXXXXXXXXXXXXAMERL-----LEDSVPCKIRAKRGCATHPRSIAERV 497
             +NEAG Q                 M  +     L DSVPCKIRAKRGCATHPRSIAERV
Sbjct: 294  SQNEAGGQPSTALAHQLSLPNTSAEMAAIEKFLHLSDSVPCKIRAKRGCATHPRSIAERV 353

Query: 496  RRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            RRTKISERMRKLQ+LVPNM+KQTNT+DMLDLA++YIKDLQ Q++TLSDNR KCTCS K+
Sbjct: 354  RRTKISERMRKLQDLVPNMDKQTNTADMLDLAIEYIKDLQKQVETLSDNRDKCTCSHKE 412


>ref|XP_003543549.1| PREDICTED: transcription factor bHLH122 [Glycine max]
          Length = 408

 Score =  234 bits (597), Expect = 1e-58
 Identities = 166/414 (40%), Positives = 219/414 (52%), Gaps = 58/414 (14%)
 Frame = -3

Query: 1387 EQNTQVGS-GLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPETQVIFS 1211
            EQ  QV S GLTRYRSAPSSYF++++                E +FN R SSPET+ +FS
Sbjct: 6    EQQPQVNSSGLTRYRSAPSSYFSNIIDREFY-----------EHVFN-RPSSPETERVFS 53

Query: 1210 RFMNSSAADXXXXXXXXXXXXXXXEYHQQPPQRQRNDYSPQIMYSEAAVSA--------- 1058
            RFMNS  ++                      +   N ++  +      V+A         
Sbjct: 54   RFMNSLNSEEEDSLHHHKLSTDSSSSSAAVKEEVVNQHNQSVNEEHVVVAALQQSNNNMN 113

Query: 1057 -------------TDTKVFAPFNTNCAAQNKIERGGAA-----------LIRHTSSPAGF 950
                         + +K   P N N    + +E+G  +           LIRH+SSPAG 
Sbjct: 114  SYNNSASRNFYQSSSSKPPLP-NPNPNLSSGMEQGSFSMGLRHSGNNSNLIRHSSSPAGL 172

Query: 949  FANINIENEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSSRGIINTISE---NET 779
            F+ INIEN +  +R  G+ G  NN+  +A FSS      Q +YSS G +++I+E      
Sbjct: 173  FSQINIENVYAGVRGMGTLGAVNNSIEDAKFSSSRRLKNQPNYSSSGRMSSIAEIGDKGN 232

Query: 778  VDAQFDEDHHGNDVDFITGLP---WDDSSLISDPY--LKGLQHNQ------------KNE 650
             ++  D +   +  DFITG     WDD++++SD    LK  + N             +NE
Sbjct: 233  RESSPDNEAFADGNDFITGFQVGHWDDAAIMSDNVGGLKRFRENDSKPFSGLNAAETQNE 292

Query: 649  AG--NQXXXXXXXXXXXXXXXXAMERLLE--DSVPCKIRAKRGCATHPRSIAERVRRTKI 482
             G  +                 A+E+ L+  DSVPCKIRAKRGCATHPRSIAERVRRTKI
Sbjct: 293  TGQTHAPLAHQLSLPNTSAEIAAIEKFLQFSDSVPCKIRAKRGCATHPRSIAERVRRTKI 352

Query: 481  SERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            SERMRKLQ+LVPNM+KQTNT+DMLDLAVDYIKDLQ Q++TLSD  AKCTCS ++
Sbjct: 353  SERMRKLQDLVPNMDKQTNTADMLDLAVDYIKDLQKQVQTLSDCHAKCTCSHEK 406


>ref|XP_003540708.1| PREDICTED: transcription factor bHLH122-like isoform X1 [Glycine max]
            gi|571492225|ref|XP_006592167.1| PREDICTED: transcription
            factor bHLH122-like isoform X2 [Glycine max]
          Length = 415

 Score =  229 bits (585), Expect = 3e-57
 Identities = 164/420 (39%), Positives = 213/420 (50%), Gaps = 55/420 (13%)
 Frame = -3

Query: 1414 MNSHHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASS 1235
            ++ HHHQQ     Q+ SGLTRYRSAPSSYF+S++                E +FN R SS
Sbjct: 13   LDHHHHQQ-----QMNSGLTRYRSAPSSYFSSIIDREFY-----------EHVFN-RPSS 55

Query: 1234 PETQVIFSRFMNS----SAADXXXXXXXXXXXXXXXEY-------HQQPPQRQRNDYSPQ 1088
            PET+ + +RF++S     AAD                        + QP      +  P 
Sbjct: 56   PETERMLTRFVDSLGGGDAADADAEDSLANTQNPPTTVVAVKEEVNHQPQDVTSMNNEPL 115

Query: 1087 IMYSEAAVSATDTKVFAPFNTNCAAQN----------KIERGGAA-LIRHTSSPAGFFAN 941
            ++  +    + +   +    T    Q+          K  RG ++ LIRH SSPAG F+N
Sbjct: 116  VLQQQQQQQSNNMNNYGSSGTQNFYQSTGRPPLPNQMKTGRGSSSSLIRHGSSPAGLFSN 175

Query: 940  INIENEFKTMRSEGSYGGG--NNATAEATFSSPNTFLTQMDYSSR------GIINTISEN 785
            INI+  +  +R  G+ G    NN T EA FS         ++SS       GI N  +  
Sbjct: 176  INIDTGYAAVRGMGTMGAAAANNTTEEANFSPATRMKNATNFSSGLMSSRPGIGNKSNTQ 235

Query: 784  ETVDAQFDEDHHGND---VDFITGLPWDDSSLISDPYLKGLQH---------------NQ 659
               + +   +  GN+     F  G PWDDS+++SD  + GL+                  
Sbjct: 236  NNAENEGFAESQGNEFIPAGFPVG-PWDDSAIMSDN-MTGLKRFRDEDVKPFSGLNAPES 293

Query: 658  KNEAGNQXXXXXXXXXXXXXXXXAMERL-------LEDSVPCKIRAKRGCATHPRSIAER 500
            +NE G Q                + E         L DSVPCKIRAKRGCATHPRSIAER
Sbjct: 294  QNETGGQQPSSSALAHQLSLPNTSAEMAAIEKFLQLSDSVPCKIRAKRGCATHPRSIAER 353

Query: 499  VRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            VRRTKISERMRKLQ+LVPNM+KQTNT+DMLDLAV+YIKDLQ Q++ LSDNRAKCTC  K+
Sbjct: 354  VRRTKISERMRKLQDLVPNMDKQTNTADMLDLAVEYIKDLQNQVEALSDNRAKCTCLHKK 413


>ref|XP_004486946.1| PREDICTED: transcription factor bHLH122-like [Cicer arietinum]
          Length = 416

 Score =  228 bits (582), Expect = 6e-57
 Identities = 173/428 (40%), Positives = 220/428 (51%), Gaps = 63/428 (14%)
 Frame = -3

Query: 1414 MNSHHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASS 1235
            M+S   QQ + N+   SGLTRYRSAPSSYF +++                E +FN R SS
Sbjct: 1    MDSSDLQQQQVNSS--SGLTRYRSAPSSYFNNIIDREFY-----------EHVFN-RPSS 46

Query: 1234 PETQVIFSRFMNSSAADXXXXXXXXXXXXXXXE---YHQQPPQRQRN----------DYS 1094
            PET+ +FSRFMNS  ++               E     QQ  Q+Q N          D +
Sbjct: 47   PETERVFSRFMNSLGSEEDLLAQKISVDSTVKEEEEQQQQVVQQQSNININNININDDDN 106

Query: 1093 PQIMYSEAAVSAT----DTKVFAPF-NTNCAA-------------QNKIERGG--AALIR 974
                Y+ +A + +     + V  P  N N ++             Q     GG  + LIR
Sbjct: 107  NSNNYNNSAATTSHGFYQSSVMPPLPNQNLSSGMEGNYSMGVNRLQQMKSHGGNNSNLIR 166

Query: 973  HTSSPAGFFANINIENEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSSRGIINTI 794
            H+SSPAG F+ INIEN +  MR  G+ G  NN+  EA FS+  +     +YSSR + +  
Sbjct: 167  HSSSPAGLFSQINIENGYVIMRDMGNLGAVNNSVKEAKFSTTRSLKNSSNYSSRPMSSIA 226

Query: 793  S-------ENETVDAQFDEDHHGNDVDFITGLPWDDSSLISDPY--LKGLQHNQ------ 659
                    EN   +  F E+   + +       WDD+ ++S+    LK  + N       
Sbjct: 227  EIGDKGNRENNQENDVFGENRGNDYISEYQVDTWDDTEMMSENVGGLKRFRDNDSKQQFS 286

Query: 658  --------KNEAG-----NQXXXXXXXXXXXXXXXXAMERLLE--DSVPCKIRAKRGCAT 524
                    +NE G     +                 AME+ L   DSVP KIRAKRGCAT
Sbjct: 287  GLNASSSVQNETGGGHSSSSPLAHQLSMPNTLSEMAAMEKFLHFSDSVPMKIRAKRGCAT 346

Query: 523  HPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRA 344
            HPRSIAERVRRTKISERMRKLQ+LVPNMEKQTNT+DMLDLAVDYIKDLQ Q++TLSD RA
Sbjct: 347  HPRSIAERVRRTKISERMRKLQDLVPNMEKQTNTADMLDLAVDYIKDLQNQVQTLSDCRA 406

Query: 343  KCTCSSKQ 320
            KCTCS KQ
Sbjct: 407  KCTCSHKQ 414


>ref|XP_003597460.1| Transcription factor bHLH122 [Medicago truncatula]
            gi|355486508|gb|AES67711.1| Transcription factor bHLH122
            [Medicago truncatula]
          Length = 412

 Score =  227 bits (579), Expect = 1e-56
 Identities = 168/424 (39%), Positives = 216/424 (50%), Gaps = 59/424 (13%)
 Frame = -3

Query: 1414 MNSHHHQQFEQNTQVGS-GLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRAS 1238
            M S  HQQ  Q  QV S GLTR++SAPSSYF +++                E +FN + S
Sbjct: 1    MESDLHQQ--QQPQVNSSGLTRFKSAPSSYFNNIIDREFY-----------EHVFN-KPS 46

Query: 1237 SPETQVIFSRFMNSSAADXXXXXXXXXXXXXXXE-----YHQQPPQR------------- 1112
            SPET+ +FSRF+NS  +D               E      +QQ  Q+             
Sbjct: 47   SPETERVFSRFINSFGSDDDLLAQKISVDSTVKEEEEVNINQQQQQQDQGLASINNEHVV 106

Query: 1111 -QRNDYSPQIMYSEAAVSATDTKVFAPFNTNCAAQNKIERG-------------GAALIR 974
             Q+++Y+  +  S     ++        N +         G              + LIR
Sbjct: 107  HQQSNYNNSVPSSHGFYQSSMMPPLPNQNVSSGLDGSFSMGVNRLQQVKNHGGNNSNLIR 166

Query: 973  HTSSPAGFFANINIENEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSSRGIINTI 794
            H+SSPAG F+ INIEN + +MR  G+ G  NN+  EA FS+  +   Q +YSS G+++TI
Sbjct: 167  HSSSPAGLFSQINIENGYVSMRGMGTLGAVNNSMKEAKFSTARSLKNQSNYSS-GLMSTI 225

Query: 793  SE--------NETVDAQFDEDHHGNDVDFITGLPWDDSSLISDPY--LKGLQHNQ----- 659
             E        N   +  F E H    +D+     WDDS ++S+    LK  + N      
Sbjct: 226  DEVGDKDNRENNLENEAFGESHGNEYMDYPVDT-WDDSEMMSENVGGLKRFRDNDSKQQF 284

Query: 658  -----KNEAG----NQXXXXXXXXXXXXXXXXAMERLLE--DSVPCKIRAKRGCATHPRS 512
                 +NE G    N                 AME+ L   DSVP KIRAKRGCATHPRS
Sbjct: 285  SGLNVQNETGGGHSNSPLAHQLSMPNTSSEMAAMEKFLHFSDSVPMKIRAKRGCATHPRS 344

Query: 511  IAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTC 332
            IAERVRRTKISERMRKLQ+LVPNM+KQTNT+DMLDLAVDYIKDLQ Q + L D +AKCTC
Sbjct: 345  IAERVRRTKISERMRKLQDLVPNMDKQTNTADMLDLAVDYIKDLQKQAQKLQDCQAKCTC 404

Query: 331  SSKQ 320
              KQ
Sbjct: 405  PHKQ 408


>ref|XP_007205257.1| hypothetical protein PRUPE_ppa006295mg [Prunus persica]
            gi|462400899|gb|EMJ06456.1| hypothetical protein
            PRUPE_ppa006295mg [Prunus persica]
          Length = 419

 Score =  227 bits (578), Expect = 2e-56
 Identities = 163/436 (37%), Positives = 222/436 (50%), Gaps = 70/436 (16%)
 Frame = -3

Query: 1420 SSMNSHHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRA 1241
            S +  HHH+  +    + S L RYRSAPSSYFA+L S               E LFN R 
Sbjct: 3    SDLQQHHHKPQQH---MNSSLMRYRSAPSSYFANLDSD------------FCEPLFN-RP 46

Query: 1240 SSPETQVIFSRFMN-----------------------SSAADXXXXXXXXXXXXXXXEYH 1130
            SSPET+ IF+RF+                        ++  +                  
Sbjct: 47   SSPETERIFARFLTGEGGGNGDGGGGGTEETASHHKVTTQTNNQQTQFMVPKVDNEAVVI 106

Query: 1129 QQPPQRQRNDYSPQIMYSEAAVSATDTKVFAP-----------FNTNCAAQNKIERGGAA 983
            QQ  Q   N+YS     S+    +  +K   P           ++   +    ++ GG  
Sbjct: 107  QQQQQSHLNNYSS---VSQGFYQSPSSKPPLPNQSLNSANEGAYSMGTSQLPSVKTGGVT 163

Query: 982  ---LIRHTSSPAGFFANINIE-NEFKTMRSEGSYGGGNNATAEATFSSPNTF--LTQMDY 821
               LIRH+SSPAG F+++NI+   +  +R  G+YG  N+   EA+FSS +     +    
Sbjct: 164  NSNLIRHSSSPAGLFSHMNIDVTGYAALRGMGNYGASNSTNEEASFSSTSRLKNFSSGPP 223

Query: 820  SSRGIINTISE-------NETVDAQFDEDHHGNDVDFITGLP---WDDSSLISDPYLKGL 671
            S+ G+++ I+E       ++  D++   D  GN+  ++TG P   WDDS+++S    +  
Sbjct: 224  STSGLMSPIAEIGNKRMRSDNQDSRGFGDGSGNN--YVTGFPIDSWDDSAMMSGDITRST 281

Query: 670  QHNQKN---------------EAGNQXXXXXXXXXXXXXXXXAMERL-----LEDSVPCK 551
               + +               EAGN+                 M  +      +DSVPCK
Sbjct: 282  SFREDDIKAFTGLSPSETQDVEAGNRPPTLLAHHLSLPKTSAEMAAIEKFMQFQDSVPCK 341

Query: 550  IRAKRGCATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQ 371
            IRAKRGCATHPRSIAERVRRT+ISERMRKLQELVPNM+KQTNT+DMLDLAV+YIKDLQTQ
Sbjct: 342  IRAKRGCATHPRSIAERVRRTRISERMRKLQELVPNMDKQTNTADMLDLAVEYIKDLQTQ 401

Query: 370  LKTLSDNRAKCTCSSK 323
            ++TLSDNRAKCTCSSK
Sbjct: 402  VQTLSDNRAKCTCSSK 417


>ref|XP_003537954.1| PREDICTED: transcription factor bHLH122-like isoform X1 [Glycine max]
            gi|571488501|ref|XP_006590956.1| PREDICTED: transcription
            factor bHLH122-like isoform X2 [Glycine max]
            gi|571488503|ref|XP_006590957.1| PREDICTED: transcription
            factor bHLH122-like isoform X3 [Glycine max]
            gi|571488505|ref|XP_006590958.1| PREDICTED: transcription
            factor bHLH122-like isoform X4 [Glycine max]
            gi|571488508|ref|XP_006590959.1| PREDICTED: transcription
            factor bHLH122-like isoform X5 [Glycine max]
          Length = 418

 Score =  224 bits (572), Expect = 8e-56
 Identities = 165/420 (39%), Positives = 215/420 (51%), Gaps = 59/420 (14%)
 Frame = -3

Query: 1402 HHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPETQ 1223
            HHQQ     Q+ SGLTRYRSAPSSYF+S++               +E +FN R SSPET+
Sbjct: 15   HHQQ-----QMNSGLTRYRSAPSSYFSSIID-----------HEFYEHVFN-RPSSPETE 57

Query: 1222 VIFSRFMNS-------SAADXXXXXXXXXXXXXXXEYHQQPP-------------QRQRN 1103
             + +RF+NS       +                  E +QQPP             Q+Q+ 
Sbjct: 58   RMLTRFVNSLGGGDADAEDSLATTQNPPTVVAVKEEVNQQPPDVTSMNNEPLVLQQQQQQ 117

Query: 1102 DYSPQIMYSEAAVSATDTKVFAPFNTNCAAQNKIERG---GAALIRHTSSPAGFFANINI 932
                Q   +     ++ T+ F          N+++ G    + LIRH SSPAG F+NINI
Sbjct: 118  QQQQQQSNNMYNYGSSGTQNFYQSTGRPPLPNQMKTGHGSSSNLIRHGSSPAGLFSNINI 177

Query: 931  E--NEFKTMRSEGSYG-GGNNATAEATFSSPNTFLTQMDYSSRGIINTISE--------N 785
            +       +R  G+ G   NN + EA FS            S G++++ +E        N
Sbjct: 178  DITGYAAVVRGMGTMGAAANNTSEEANFSPATRMKNNAPNFSSGLMSSRAEVGNKSNTQN 237

Query: 784  ETVDAQFDEDHHGND---VDFITGLPWDDSSLISD--------------PYLKGLQ-HNQ 659
               + +   +  GN+     F  G PW+DS+++SD              P+  GL     
Sbjct: 238  NNAENEGFAESQGNEFIPAGFPVG-PWNDSAIMSDNVTGLKRFRDEDVKPFSGGLNAPES 296

Query: 658  KNEAGNQXXXXXXXXXXXXXXXXAMERL-------LEDSVPCKIRAKRGCATHPRSIAER 500
            +NE G Q                + E         L DSVPCKIRAKRGCATHPRSIAER
Sbjct: 297  QNETGGQQPSSSALAHQLSLPNTSAEMAAIEKFLQLSDSVPCKIRAKRGCATHPRSIAER 356

Query: 499  VRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            VRRTKISERMRKLQ+LVPNM+KQTNT+DMLDLAV+YIKDLQ Q++TLSDNRAKCTCS K+
Sbjct: 357  VRRTKISERMRKLQDLVPNMDKQTNTADMLDLAVEYIKDLQNQVQTLSDNRAKCTCSHKK 416


>emb|CAN80884.1| hypothetical protein VITISV_018653 [Vitis vinifera]
          Length = 446

 Score =  224 bits (571), Expect = 1e-55
 Identities = 162/437 (37%), Positives = 217/437 (49%), Gaps = 74/437 (16%)
 Frame = -3

Query: 1411 NSHHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFE--QLFNPRAS 1238
            N  HHQ  +Q  Q+ S L RYRSAPSSYF++ +                E  ++F+ R S
Sbjct: 4    NLQHHQHQQQ--QMNSSLMRYRSAPSSYFSNFIDGEDCEEFLQHRPSSPETERIFSSRPS 61

Query: 1237 SPETQVIFSRFMNSSAADXXXXXXXXXXXXXXXEYHQQP--------------------- 1121
            SPET+ IFSRFM S   +                   +                      
Sbjct: 62   SPETERIFSRFMASGGTEDSSSHTVMNMRQNSQAMAPESVVVVSQQNQFMASMKHGAEVL 121

Query: 1120 -PQRQRNDY-----------SPQIMYSEAAVSATDTKVFAPFNTNCAAQNKIERGG---A 986
              Q+Q+N Y           SP   ++ AA    +    A  +       + + GG   +
Sbjct: 122  QQQQQQNGYASGSQMMYQTSSPMPHHNSAAPGTVENSYSAVSSMGMDQSQQXKIGGGNNS 181

Query: 985  ALIRHTSSPAGFFANINIENEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSS--- 815
             LIRH+SSPAG F+++N+EN +  MR  G++G G+    E +FSS +    Q+++SS   
Sbjct: 182  NLIRHSSSPAGLFSHLNVENGYAIMRGMGNFGSGSGTNGEPSFSSASRLKGQINFSSGPP 241

Query: 814  --RGIINTISE--NETV------DAQFDEDHHGNDVDFITGLP---WDDSSLISDPY--L 680
               G++  ISE  N+++      +  F E H  N   FITG P   WDDS+++S+ +  L
Sbjct: 242  SSSGLVTPISEMGNKSMGTGSPDNGSFGEGH-SNSGGFITGFPIGSWDDSAIMSESFSSL 300

Query: 679  KGLQHN-------------QKNEAGNQXXXXXXXXXXXXXXXXAMERL-----LEDSVPC 554
            K ++ +             QK E  N+                 +  +      +DSVPC
Sbjct: 301  KSVRDDEAKTFSGLNASEAQKGEPANRPPVLAHHLSLPTKTSADLTTIEKYLQFQDSVPC 360

Query: 553  KIRAKRGCATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQT 374
            KIRAKRGCATHPRSIAERVRRT+ISERMRKLQELVPNM+KQTNTSDMLDLAVDYIKDLQ 
Sbjct: 361  KIRAKRGCATHPRSIAERVRRTRISERMRKLQELVPNMDKQTNTSDMLDLAVDYIKDLQK 420

Query: 373  QLKTLSDNRAKCTCSSK 323
            Q+K    NR K     K
Sbjct: 421  QVKR---NRKKAAIKCK 434


>ref|XP_002513457.1| DNA binding protein, putative [Ricinus communis]
            gi|223547365|gb|EEF48860.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 418

 Score =  223 bits (569), Expect = 2e-55
 Identities = 162/431 (37%), Positives = 214/431 (49%), Gaps = 64/431 (14%)
 Frame = -3

Query: 1420 SSMNSHHH-----QQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQL 1256
            S +  HHH     QQ     Q+ SGL RY+SAPSSYF+S                     
Sbjct: 3    SDLEHHHHFFQGHQQNHHQKQMNSGLQRYQSAPSSYFSSFQDKDFVDD------------ 50

Query: 1255 FNPRASSPETQVIFSRFMNSSAADXXXXXXXXXXXXXXXEYHQQPP-------------- 1118
            F  R +SPET+ IF+RF+ +S                     Q+ P              
Sbjct: 51   FLNRPTSPETERIFARFLANSGGSSTDNISNQNLGAVIK---QESPVKEAVTQVSQQAHI 107

Query: 1117 ----------------QRQRNDYSPQIMYSEA--------AVSATDTKVFAPFNTNCAAQ 1010
                            Q+Q+++YS     S++        + S+ D ++         +Q
Sbjct: 108  MASMNDSDQTRLHRHQQQQQSNYSSGFYQSQSKPPLPDHGSGSSMDYRIMTSMAMERLSQ 167

Query: 1009 NKIERGGAA-LIRHTSSPAGFFANINIE--NEFKTMRSEGSYGGGNNATAEATFSSPNTF 839
             K   G  + L+RH+SSPAG F+NINIE  N +  +R  G +G G+  T+ +T   P   
Sbjct: 168  MKPSAGNNSNLVRHSSSPAGLFSNINIEVENGYAVIRGMGDFGTGSGETSYSTAGRPLPS 227

Query: 838  LTQMDYSSRGIINTISENETVDAQFDEDHHGNDVDFITGLP---WDDSSLIS-------- 692
              +M   +        +N    A F E    N   ++TG P   WDD++++S        
Sbjct: 228  SGRMSPIAEIGNKNRGKNNPDSAGFGETRSNN---YVTGFPIGSWDDTAVMSAGLKRLTD 284

Query: 691  -DPYLKGLQ--HNQKNEAGNQXXXXXXXXXXXXXXXXA--MERLLE--DSVPCKIRAKRG 533
             D  L GL    N+  E GN                    +E+ L+  DSVPCKIRAKRG
Sbjct: 285  DDRTLSGLNASENESGEVGNHPPMLAHHLSLPKTSAELSAIEKYLQLQDSVPCKIRAKRG 344

Query: 532  CATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSD 353
            CATHPRSIAERVRRT+ISERMRKLQ+LVPNM+KQTNTSDMLDLAVDYIKDLQ Q++TLS+
Sbjct: 345  CATHPRSIAERVRRTRISERMRKLQDLVPNMDKQTNTSDMLDLAVDYIKDLQRQVETLSE 404

Query: 352  NRAKCTCSSKQ 320
            NR+KCTC+SKQ
Sbjct: 405  NRSKCTCASKQ 415


>emb|CBI16416.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  221 bits (562), Expect = 1e-54
 Identities = 134/293 (45%), Positives = 177/293 (60%), Gaps = 39/293 (13%)
 Frame = -3

Query: 1081 YSEAAVSATDTKVFAPFNTNCAAQNKIERGG---AALIRHTSSPAGFFANINIENEFKTM 911
            ++ AA    +    A  +       +I+ GG   + LIRH+SSPAG F+++N+EN +  M
Sbjct: 4    HNSAAPGTVENSYSAVSSMGMDQSQQIKIGGGNNSNLIRHSSSPAGLFSHLNVENGYAIM 63

Query: 910  RSEGSYGGGNNATAEATFSSPNTFLTQMDYSS-----RGIINTISE--NETV------DA 770
            R  G++G G+    E +FSS +    Q+++SS      G++  ISE  N+++      + 
Sbjct: 64   RGMGNFGSGSGTNGEPSFSSASRLKGQINFSSGPPSSSGLVTPISEMGNKSMGTGSPDNG 123

Query: 769  QFDEDHHGNDVDFITGLP---WDDSSLISDPY--LKGLQHN-------------QKNEAG 644
             F E H  N   FITG P   WDDS+++S+ +  LK ++ +             QK E  
Sbjct: 124  SFGEGH-SNSGGFITGFPIGSWDDSAIMSESFSSLKSVRDDEAKTFSGLNASEAQKGEPA 182

Query: 643  NQXXXXXXXXXXXXXXXXAMERL-----LEDSVPCKIRAKRGCATHPRSIAERVRRTKIS 479
            N+                 +  +      +DSVPCKIRAKRGCATHPRSIAERVRRT+IS
Sbjct: 183  NRPPVLAHHLSLPTKTSADLTTIEKYLQFQDSVPCKIRAKRGCATHPRSIAERVRRTRIS 242

Query: 478  ERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRAKCTCSSKQ 320
            ERMRKLQELVPNM+KQTNTSDMLDLAVDYIKDLQ Q+KTLSDNRAKCTCS+KQ
Sbjct: 243  ERMRKLQELVPNMDKQTNTSDMLDLAVDYIKDLQKQVKTLSDNRAKCTCSNKQ 295


>ref|XP_007014951.1| DNA binding protein, putative isoform 3, partial [Theobroma cacao]
            gi|508785314|gb|EOY32570.1| DNA binding protein, putative
            isoform 3, partial [Theobroma cacao]
          Length = 424

 Score =  220 bits (561), Expect = 2e-54
 Identities = 166/445 (37%), Positives = 221/445 (49%), Gaps = 84/445 (18%)
 Frame = -3

Query: 1420 SSMNSHHH------QQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQ 1259
            S +  HHH      Q      Q+ SGL RY+SAPSSYF+S+L                 Q
Sbjct: 3    SDLQHHHHHLIDYHQPQHHQKQMNSGLMRYQSAPSSYFSSILDRDFC------------Q 50

Query: 1258 LFNPRASSPETQVIFSRFMNSSA--------------------------------ADXXX 1175
             F  R SSPET+ I  RF++SS                                      
Sbjct: 51   EFLNRPSSPETERIIERFLSSSGDGGGGNTVNISDQNLCAITQNSPVRETVIKIEEPTQI 110

Query: 1174 XXXXXXXXXXXXEYHQQPPQRQRNDYSP-----------QIMYSEAAVSATDTKVFAPFN 1028
                        +  QQ  Q Q+ +YS            Q + ++ + S  D ++  P +
Sbjct: 111  MTPMNNQTGVMQQQQQQQQQPQQGNYSSASQNFYQSQPQQHLPNQQSGSTMDYRI--PNS 168

Query: 1027 TNCAAQNKIERGG---AALIRHTSSPAGFFANINIEN--EFKTMRSEGSYGGGNNATAEA 863
               A   +++ GG   + L+RH+SSPAG F+N+NI+N   +  +R  G YGG NN+  EA
Sbjct: 169  MGMARPTQMKMGGGNNSNLVRHSSSPAGLFSNLNIDNIAGYGVVRGMGDYGGVNNSNREA 228

Query: 862  TFSS-----PNTFLTQM-DYSSRGIINTISENETVDAQFDEDHHGNDVDFITGLP---WD 710
            +F S     P+  ++ + +  ++ ++   SEN    A F E+ H N   + +G P   W+
Sbjct: 229  SFPSASRPPPSGLMSPIAEMGNKNVVPNSSEN----AGFGENRHNN---YSSGFPVTSWE 281

Query: 709  DSSLISD--PYLKGLQHN--------------QKNEAGNQXXXXXXXXXXXXXXXXAMER 578
            DS +ISD  P +K L+ +              Q  +AGN+                 M  
Sbjct: 282  DSMMISDNMPGVKRLREDDRSLSGLDLDGAETQNTDAGNRPPPILAHHLSLPKSSAEMSA 341

Query: 577  L-----LEDSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDM 413
            +      +DSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQ+LVPNM+KQTNT+DM
Sbjct: 342  IDKFLQYQDSVPCKIRAKRGCATHPRSIAERVRRTKISERMRKLQDLVPNMDKQTNTADM 401

Query: 412  LDLAVDYIKDLQTQLKTLSDNRAKC 338
            LDLAVDYIKDLQ Q  TLSDNRAKC
Sbjct: 402  LDLAVDYIKDLQNQ--TLSDNRAKC 424


>ref|XP_002304761.1| basic helix-loop-helix family protein [Populus trichocarpa]
            gi|222842193|gb|EEE79740.1| basic helix-loop-helix family
            protein [Populus trichocarpa]
          Length = 422

 Score =  219 bits (557), Expect = 5e-54
 Identities = 169/428 (39%), Positives = 209/428 (48%), Gaps = 66/428 (15%)
 Frame = -3

Query: 1405 HHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPET 1226
            + HQQ  Q  Q+ SGLTRY+SAPSSYF+S L                E+  N R +SPET
Sbjct: 16   NQHQQIHQK-QMNSGLTRYQSAPSSYFSSNLDRDFC-----------EEFLN-RPTSPET 62

Query: 1225 QVIFSRFMNSSA--------ADXXXXXXXXXXXXXXXEYHQQPP---------------- 1118
            + IF+RF+ +S         ++               + +QQP                 
Sbjct: 63   ERIFARFLANSGGNTENIPGSNLCEIKQDSPVKESVSQINQQPQMMASMNNHSSDTRLHQ 122

Query: 1117 ----QRQRNDYSPQ--IMYSEAAVSATDTKVFAPFNTNCAAQNKIER---------GGAA 983
                Q Q  +YS       S +     D    +  N        +ER             
Sbjct: 123  HQHQQHQHGNYSASQGFYQSRSKPPLPDHNPGSGMNHRSTNSTGLERMPSMKPSSGNNPN 182

Query: 982  LIRHTSSPAGFFANINIE--NEFKTMRSEGSYGGGNNATAEATFSSPNTFLTQMDYSSRG 809
            L+RH+SSPAG F+NINIE  N +  +R  G  G GN  T  +    P         SS G
Sbjct: 183  LVRHSSSPAGLFSNINIEFENGYAVLRDVGDLGAGNRDTTYSAAGRPP--------SSSG 234

Query: 808  IINTISE--------NETVDAQFDEDHHGNDVDFITGLPWDDSSLIS----------DPY 683
            I +TI+E        N      F E   GN+ D+  G  WDDS+++S          D  
Sbjct: 235  IRSTIAEMGNKNMGENSPDSGGFGETP-GNNYDYPIG-SWDDSAVMSTGSKRYLTDDDRT 292

Query: 682  LKGLQHN---QKNEAGNQXXXXXXXXXXXXXXXXA--MERLLE--DSVPCKIRAKRGCAT 524
            L GL  +   Q  EAGN+                   +E  L+  DSVPCKIRAKRGCAT
Sbjct: 293  LSGLNSSETQQNEEAGNRPPMLAHHLSLPKTSAEMSTIENFLQFQDSVPCKIRAKRGCAT 352

Query: 523  HPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQTQLKTLSDNRA 344
            HPRSIAERVRRT+ISERMRKLQ+LVPNM+KQTNTSDMLDLAVDYIKDLQ Q K LS+NRA
Sbjct: 353  HPRSIAERVRRTRISERMRKLQDLVPNMDKQTNTSDMLDLAVDYIKDLQRQFKALSENRA 412

Query: 343  KCTCSSKQ 320
            +CTC  KQ
Sbjct: 413  RCTCLKKQ 420


>ref|XP_004295210.1| PREDICTED: transcription factor bHLH122-like [Fragaria vesca subsp.
            vesca]
          Length = 431

 Score =  216 bits (549), Expect = 4e-53
 Identities = 166/438 (37%), Positives = 215/438 (49%), Gaps = 76/438 (17%)
 Frame = -3

Query: 1405 HHHQQFEQNTQVGSGLTRYRSAPSSYFASLLSTPXXXXXXXXXXXXFEQLFNPRASSPET 1226
            HH  Q   N+   S L RYRSAPSSYFA++L T              E  FN RA SPET
Sbjct: 9    HHKPQQHVNS---SSLLRYRSAPSSYFANILDTEFN-----------EPFFN-RAPSPET 53

Query: 1225 QVIFSRFMNSSAADXXXXXXXXXXXXXXXEYH--------------------QQPPQRQR 1106
            + I +RFM+S   D                                      QQ  QRQ+
Sbjct: 54   ERILARFMSSQGGDIGGTEEIVPHQKVETTQQPQFLVPKLDNDAAMIQEQEQQQQQQRQQ 113

Query: 1105 NDYSPQIMY-----SEAAVSATDTKVFAPFNTNCAAQNK---------------IERGGA 986
                 Q  +     S+       TK   P N N A+ N+               ++ GG 
Sbjct: 114  QQQQQQQSHMNNYTSQGFYQIPSTKPPLP-NQNLASSNEGAAAAYPMGSNQFPLMKTGGV 172

Query: 985  A---LIRHTSSPAGFFANINIE-NEFKTMRSEGSYGGGNNATAEATFSSPN---TFLTQM 827
                LIRH+SSPAG FANINI+   +  +R  G+ G  N    +++FS  +    F +  
Sbjct: 173  THSNLIRHSSSPAGLFANINIDVAGYAALRGMGNVGASNGTNEDSSFSPASRLKNFSSGP 232

Query: 826  DYSSRGIINTISE-------NETVDAQFDEDHHGNDVDFITGLP---WDDSSLISDPY-- 683
              S+  +++ ISE       +   D        G   ++++G P   W+DSSL++D    
Sbjct: 233  PSSTSSMMSPISEVGEKRMRSNNQDRTGGGFSEGRGNNYVSGFPMGSWEDSSLLADDIIG 292

Query: 682  -------------LKGL--QHNQKNEAGNQXXXXXXXXXXXXXXXXAMERLL--EDSVPC 554
                         + GL     Q  EA  +                A+E  L  +DSVPC
Sbjct: 293  STNFRDDDDDVKTITGLSASETQNMEARGRPLAHHLSLPKTSAEMAAIENFLQFQDSVPC 352

Query: 553  KIRAKRGCATHPRSIAERVRRTKISERMRKLQELVPNMEKQTNTSDMLDLAVDYIKDLQT 374
            KIRAKRGCATHPRSIAERVRRT+ISERMRKLQELVPNM+KQTNT+DMLDLAVDYIK+LQT
Sbjct: 353  KIRAKRGCATHPRSIAERVRRTRISERMRKLQELVPNMDKQTNTADMLDLAVDYIKNLQT 412

Query: 373  QLKTLSDNRAKCTCSSKQ 320
            +++TLS+ RAKCTCS++Q
Sbjct: 413  EVQTLSEARAKCTCSNQQ 430

