BLASTX nr result

ID: Rheum21_contig00005950 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00005950
         (1484 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002511647.1| DNA binding protein, putative [Ricinus commu...   271   5e-70
ref|XP_006386561.1| hypothetical protein POPTR_0002s14430g [Popu...   261   4e-67
gb|ESW12806.1| hypothetical protein PHAVU_008G144300g [Phaseolus...   254   5e-65
ref|XP_002320711.2| hypothetical protein POPTR_0014s06190g [Popu...   254   9e-65
ref|XP_002275629.2| PREDICTED: transcription factor UNE10-like [...   250   1e-63
ref|XP_006591039.1| PREDICTED: transcription factor UNE10-like [...   246   1e-62
gb|EMJ19191.1| hypothetical protein PRUPE_ppa005629mg [Prunus pe...   238   6e-60
gb|EOX96336.1| Basic helix-loop-helix DNA-binding superfamily pr...   234   9e-59
ref|XP_003516808.1| PREDICTED: transcription factor UNE10-like [...   231   5e-58
ref|XP_004514235.1| PREDICTED: transcription factor UNE10-like [...   229   3e-57
ref|XP_004229781.1| PREDICTED: transcription factor UNE10-like [...   225   3e-56
ref|XP_006347913.1| PREDICTED: transcription factor UNE10-like [...   223   2e-55
ref|XP_002301261.2| hypothetical protein POPTR_0002s14430g [Popu...   218   7e-54
ref|NP_191916.3| transcription factor UNE10 [Arabidopsis thalian...   217   1e-53
gb|EOX96337.1| Basic helix-loop-helix DNA-binding superfamily pr...   215   3e-53
gb|AAM10933.1|AF488561_1 putative bHLH transcription factor [Ara...   215   3e-53
gb|EXB37572.1| Transcription factor UNE10 [Morus notabilis]           215   4e-53
ref|XP_002875048.1| hypothetical protein ARALYDRAFT_912247 [Arab...   214   6e-53
ref|XP_004164979.1| PREDICTED: transcription factor UNE10-like [...   213   2e-52
ref|XP_004140610.1| PREDICTED: transcription factor UNE10-like [...   213   2e-52

>ref|XP_002511647.1| DNA binding protein, putative [Ricinus communis]
            gi|223548827|gb|EEF50316.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 465

 Score =  271 bits (693), Expect = 5e-70
 Identities = 186/461 (40%), Positives = 257/461 (55%), Gaps = 44/461 (9%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG-----VPFLGYEMAELTWENGQLSVNGLGPG-- 1241
            M+  VP+W+L+D   PA++ +  ++S      VP L YE+AELTWENGQLS++GLGP   
Sbjct: 1    MTQCVPSWDLEDNPSPAAKHSFRSNSNSSAPDVPMLDYEVAELTWENGQLSMHGLGPPRL 60

Query: 1240 -LTRVSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDK--GDYDDVGVPWLDCRN 1070
             +  + + + +K  W+ PRA  GTLESIV+ AT+ P     D   G   +  VPWL   +
Sbjct: 61   PVKTIPSSSPSKYTWEKPRAG-GTLESIVNQATRLPQQRKTDNITGYGSNEVVPWLGHHH 119

Query: 1069 HDQRSSSAAASVT-DALVPCGTPKEEEREG---SRVFRGKGISTCVVDRSARVKN----- 917
            H  R+++++ ++T DALVPC    ++ R       V  G G   CVV  S RV +     
Sbjct: 120  HHHRAATSSPTMTMDALVPCTKQSDDHRSAHVIDSVPAGIG-GNCVVGSSTRVGSCSAPT 178

Query: 916  -------------RMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGV 776
                         R    RVPV                     H +T DTCE D G +G 
Sbjct: 179  TATQDEEALLAAKRARVARVPVAPEWSSRDQSVSGSATFGRDSHHVTLDTCEMDLG-VGF 237

Query: 775  ATASFCSQGTTLSLSSSPQGDTC---DKEDKKKGKGARSSLLTKKSRTAAIHNQSERKRR 605
             + SF SQ  T + ++  + D+    D +DK+K  G +SS+ TK+SR AAIHNQSERKRR
Sbjct: 238  TSTSFGSQENTKTATAVDENDSVCHSDDDDKQKANG-KSSVSTKRSRAAAIHNQSERKRR 296

Query: 604  DTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMT-------MSRMSIQSX 446
            D INQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQV MM+       M  M++Q  
Sbjct: 297  DKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMMSRMNIQPVMLPMTMQQQ 356

Query: 445  XXXXXXXXXXXXXXXXXXXMSIINP-IMARQNLATLSPLLNPSAAFMPMASWNDATTNPS 269
                               M++++   ++R N+A +SP+L+P+A FMPM SW+ ++    
Sbjct: 357  LQMSMLAPMNMGMGLAGIGMNVMDMNTISRPNIAGISPVLHPTA-FMPMTSWDGSSGGDR 415

Query: 268  PSSA-PLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQ 149
              +A P    DP      CQ+QPM+M+AY+RM A+ QQ QQ
Sbjct: 416  LQTASPTVMHDPLAAFLACQTQPMTMDAYSRMAAIYQQLQQ 456


>ref|XP_006386561.1| hypothetical protein POPTR_0002s14430g [Populus trichocarpa]
            gi|550345013|gb|ERP64358.1| hypothetical protein
            POPTR_0002s14430g [Populus trichocarpa]
          Length = 484

 Score =  261 bits (668), Expect = 4e-67
 Identities = 189/486 (38%), Positives = 260/486 (53%), Gaps = 69/486 (14%)
 Frame = -2

Query: 1399 MSHS-VPNWNLDD--ASIPASRLTATNSSGVPF---LGYEMAELTWENGQLSVNGLGPGL 1238
            MSH  VP+W LDD   + P   L + ++SG P+   L YE+AELTWENGQL+++GLG   
Sbjct: 1    MSHQCVPSWELDDNPTTAPKVSLRSHSNSGAPYMPMLNYEVAELTWENGQLAMHGLGQPR 60

Query: 1237 TR---VSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVG-----VPWL 1082
                 +++ + +K  WD PRA+ GTLESIV+LAT  P     +K  +D+ G     VPW 
Sbjct: 61   VPAKPIASTSPSKYTWDKPRAS-GTLESIVNLATCIPQ---CNKQTFDNSGSDHDFVPWF 116

Query: 1081 DCRNHDQRSSSAAASVTDALVPCGTPKEEEREGSRVFRGK--GIST-CVVDRSARV---- 923
               NH  R+S++A    DALVPC    ++ER  +RV      G+ T CVV  S RV    
Sbjct: 117  ---NH-HRASASATMTMDALVPCSKRSDQERT-TRVIDSSPAGLGTDCVVGCSTRVGSCS 171

Query: 922  -------------KNRMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGL 782
                         + R    RVPV                       +T D+CE++ G  
Sbjct: 172  APTATQNEVGLLTRKREKVARVPVPAEWSRDQSVNRGATFSKKDSQQVTVDSCERELGVG 231

Query: 781  GVATASFCSQGTTLS-----------------LSSSPQGDTCDKEDKKKGKGARSSLLTK 653
              +T SF SQ  T S                   S PQ +  D++DKKKG G +SS+  +
Sbjct: 232  FTSTTSFGSQENTSSGTKPCTKTNTADENDSVCHSRPQREAGDEDDKKKGNG-KSSVSNR 290

Query: 652  KSRTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMT 473
            +SR AA+HNQSERKRRD INQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQV M++
Sbjct: 291  RSRAAAVHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMVS 350

Query: 472  MS---------------RMSIQSXXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLS 338
                             +MS+ +                    +  +N + AR N+  +S
Sbjct: 351  RMNMQPMMLPMALQQQLQMSMMAPISMGMAGMGMGMGMGMGMGVVDMNTLAARSNITGVS 410

Query: 337  PLLNPSAAFMPMASWNDATTNPS-PSSAPLGS--SDPSTYLARCQSQPMSMEAYTRMVAL 167
            P+L+P+A FMPM +W+ + ++   P++AP  +   DP +    CQSQPM+M+AY+RM ++
Sbjct: 411  PVLHPTA-FMPMTTWDGSNSHERLPTAAPSATVMPDPLSAFLACQSQPMTMDAYSRMASM 469

Query: 166  IQQQQQ 149
             QQ  Q
Sbjct: 470  YQQLHQ 475


>gb|ESW12806.1| hypothetical protein PHAVU_008G144300g [Phaseolus vulgaris]
          Length = 478

 Score =  254 bits (650), Expect = 5e-65
 Identities = 193/464 (41%), Positives = 253/464 (54%), Gaps = 46/464 (9%)
 Frame = -2

Query: 1402 KMSHSVPNWNLDDASIPASRLTATNSSG---------VPFLGYEMAELTWENGQLSVNGL 1250
            KMS  VP+W+LDD + P+ RL+  ++S          VP L YE+AELTWENGQLS++GL
Sbjct: 22   KMSQCVPSWDLDD-NPPSPRLSLRSNSNSNSNSTAPDVPMLDYEVAELTWENGQLSMHGL 80

Query: 1249 GPGLTRVSNPTTA--KDAWDNPRAAVGTLESIVDLATQKP----PPPPMDKGDYDDVGVP 1088
            G     V  PT+A  K  W+ PRA+ GTLESIV+ AT  P    P    D G Y +  VP
Sbjct: 81   GLPRVPVKPPTSAANKYTWEKPRAS-GTLESIVNQATSLPHSGKPTLNGDGGVYGNYLVP 139

Query: 1087 WLDCRNHDQRSSSAAASVT-DALVPCGTPKEEEREGSRVFRGKGISTCVVDRSARVKNRM 911
            WLD       S+  A +VT DALVPC + +E+ ++G +       STC+V  S RV +  
Sbjct: 140  WLDPHG----SAGTANTVTMDALVPC-SKREQSKQGMKSVP----STCMVGCSTRVGSCC 190

Query: 910  PATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGL----------GVATASF 761
                   +                   +HV T DTC+++ G              ++A  
Sbjct: 191  GNHGAKGQEMSGRDQSVSGSATFGRDSKHV-TLDTCDREFGVAFTSSSINSLDNTSSAKH 249

Query: 760  CSQGTTL----SLS-SSPQGDTCDKEDKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTI 596
            C+  TT+    S+S S P G+  D+E K++ KG +SS+ TK+SR AAIHNQSERKRRD I
Sbjct: 250  CTNTTTVDDHDSVSHSKPVGENGDEEKKQRAKG-KSSVSTKRSRAAAIHNQSERKRRDKI 308

Query: 595  NQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRMS-----------IQS 449
            NQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQV MM    MS           +Q 
Sbjct: 309  NQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMMNRINMSSMMLPLTMQQQLQM 368

Query: 448  XXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLSPLLNPSAAFMPM-ASWNDATTNP 272
                                   +N  M R N+  + P+L+PS AFMPM ASW+ A    
Sbjct: 369  SMMSPMGMGLGMGMGMGMGMGLDMNS-MNRANIPAIPPVLHPS-AFMPMAASWDAAAAGA 426

Query: 271  S---PSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQ 149
            +     +      DP + L  CQSQPM+M+AY+R+VA+ QQ  Q
Sbjct: 427  ADRFQGNPATVMPDPLSTLFGCQSQPMTMDAYSRLVAMYQQLHQ 470


>ref|XP_002320711.2| hypothetical protein POPTR_0014s06190g [Populus trichocarpa]
            gi|550323629|gb|EEE99026.2| hypothetical protein
            POPTR_0014s06190g [Populus trichocarpa]
          Length = 471

 Score =  254 bits (648), Expect = 9e-65
 Identities = 183/478 (38%), Positives = 255/478 (53%), Gaps = 61/478 (12%)
 Frame = -2

Query: 1399 MSHS-VPNWNLDDASIPASRLTA---TNSSG--VPFLGYEMAELTWENGQLSVNGLGPGL 1238
            MSH  VP+W +DD    A +L+    +NSS   +P L YE+AELTWENGQ++++GLGP  
Sbjct: 1    MSHQCVPSWEVDDNRTTAPKLSLRFHSNSSAPDMPMLDYEVAELTWENGQIAMHGLGPPR 60

Query: 1237 TR---VSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVG------VPW 1085
                 +++ + +K  WD PRA+ GTLESIV+ AT  P     +K  +D+        +PW
Sbjct: 61   VPAKPIASTSPSKYTWDKPRAS-GTLESIVNQATCVPQ---CNKATFDNSTGSDHDLIPW 116

Query: 1084 LDCRNHDQRSSSAAASVTDALVPCGTPKEEEREGSRVFRGK-GISTCVVDRSARVKN--- 917
                NH + S+SA  ++ DALVPC    ++ R    +  G  G+ TCVV  S RV +   
Sbjct: 117  F---NHHKASASATMTM-DALVPCSNRSDQGRTTHVIDSGPAGLGTCVVGCSTRVGSCSA 172

Query: 916  --------------RMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLG 779
                          R    RVPV                       +T D+CE++ G +G
Sbjct: 173  PAATQDEDGLLTGKRARVARVPVPPEWSRDQSVNHSATFGKKDSQQMTVDSCEREFG-VG 231

Query: 778  VATASFCSQGTTLS-----------------LSSSPQGDTCDKEDKKKGKGARSSLLTKK 650
              + SF SQ  T S                   S PQ +   ++DKKKG G +SS+ TK+
Sbjct: 232  FTSTSFGSQENTSSGTNPCTKTLTADENDSVCHSRPQREAGKEDDKKKGNG-KSSVSTKR 290

Query: 649  SRTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTM 470
            SR AAIHNQSERKRRD INQ+MK LQK+VP+SSKTDKASMLDEVI+YLKQLQAQV MM  
Sbjct: 291  SRAAAIHNQSERKRRDKINQRMKTLQKLVPSSSKTDKASMLDEVIEYLKQLQAQVQMM-- 348

Query: 469  SRMSIQSXXXXXXXXXXXXXXXXXXXXMSI-----------INPIMARQNLATLSPLLNP 323
            SRM++Q                     + +           +N I AR N+  + P L+P
Sbjct: 349  SRMNMQPMMLPLALQQQLQMSMMAPMSIGMAGMGMGMGVMDMNTIAARSNMTGIPPALHP 408

Query: 322  SAAFMPMASWNDATTNPSPSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQ 149
            +A F+P+ +W+ ++ +    +    ++DP +    CQ+QPM+M+AY+RM A+ QQ  Q
Sbjct: 409  TA-FIPLTTWDGSSGHDRLQTT---AADPMSAFLACQTQPMTMDAYSRMAAMYQQLHQ 462


>ref|XP_002275629.2| PREDICTED: transcription factor UNE10-like [Vitis vinifera]
          Length = 465

 Score =  250 bits (638), Expect = 1e-63
 Identities = 186/469 (39%), Positives = 246/469 (52%), Gaps = 47/469 (10%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLGPGLTR 1232
            MS  VP+W++DD   P      ++S+     VP L YE+AELTWENGQL+++GLG     
Sbjct: 1    MSQCVPSWDIDDNPTPPRLFLRSHSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 60

Query: 1231 ---VSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVGVPWLDCRNHDQ 1061
               V++   +K  W+ PRA  GTLESIV+ AT+ P   P  +G  DD+ VPWLD   H +
Sbjct: 61   AKPVASAAVSKYPWEKPRAG-GTLESIVNQATRLPHHKPPPEGANDDL-VPWLD---HQR 115

Query: 1060 RSSSAAASVT-----DALVPCGTPKEEEREG--SRVFRG--KGISTCVVDRSARVKN--- 917
              ++AAA+ +     DALVPC            S V      G+  C    S RV +   
Sbjct: 116  AVAAAAAAASVAMTMDALVPCSNNNNTTNNNNPSHVMDSVPAGLGPCGGGSSTRVGSCSG 175

Query: 916  -------------RMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGV 776
                         R    RVP                        +T DTC+  S     
Sbjct: 176  GATKDDDAILPGKRERVARVPSTHDWSSRDQSVTGSATFDLDSQQVTLDTCDLGSPE-NT 234

Query: 775  ATASFCSQGTTLS-----LSSSPQGDTCDKEDKKKGKGARSSLLTKKSRTAAIHNQSERK 611
            ++   C++  T+        S PQ    D+EDKK+G G +SS+ +K+SR AAIHNQSERK
Sbjct: 235  SSGKPCTKTITVDDHDSVCHSRPQRRAGDEEDKKRGTG-KSSVSSKRSRAAAIHNQSERK 293

Query: 610  RRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMT-------MSRMSIQ 452
            RRD INQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQV MM        M  M++Q
Sbjct: 294  RRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMMNRMNMSPMMMPMTLQ 353

Query: 451  SXXXXXXXXXXXXXXXXXXXXMSIINP-IMARQNLAT--LSPLLNPSAAFMPMASWNDAT 281
                                 M +++   +AR N+AT  +SPLL+P+  F+P+ SW D +
Sbjct: 354  QQLQMSLMAQMGMGMGMSPMGMGVVDMNTIARPNVATTGISPLLHPTP-FLPLTSW-DVS 411

Query: 280  TNPSPSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQHLASA 134
             +  P+ AP    DP      CQSQPM+M+AY+RM AL Q   QH AS+
Sbjct: 412  GDRLPA-APTMVPDPLAAFLACQSQPMTMDAYSRMAALYQHLHQHPASS 459


>ref|XP_006591039.1| PREDICTED: transcription factor UNE10-like [Glycine max]
          Length = 465

 Score =  246 bits (629), Expect = 1e-62
 Identities = 184/466 (39%), Positives = 245/466 (52%), Gaps = 49/466 (10%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLGPGLTR 1232
            MS  VP+W+++D   P+     +NS+     VP L YE+AELTWENGQLS++GLG     
Sbjct: 1    MSQCVPSWDVEDNPPPSRVSLRSNSNSTAPDVPMLDYEVAELTWENGQLSMHGLGLPRVP 60

Query: 1231 VSNPTTA--KDAWDNPRAAVGTLESIVDLAT-----QKPPPPPMDKGD----YDDVGVPW 1085
            V  PT A  K  W+ PR + GTLESIV+ AT     +KP P   D G     Y +  VPW
Sbjct: 61   VKPPTAATNKYTWEKPRGS-GTLESIVNQATSFSHQEKPRPLNGDSGGGGGVYGNFMVPW 119

Query: 1084 LDCRNHDQRSSSAAASVT-DALVPCGTPKEEEREGSRVFRGKGISTCVVDRSARVKNRMP 908
             D       +++   ++T DALVPC   ++ +++G       G  TC+V  S RV +   
Sbjct: 120  FDPHAAATTTTTTTNTMTMDALVPCSNREQGKKKGME----SGPGTCMVGCSTRVGSCCG 175

Query: 907  ATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGVAT----------ASFC 758
                                      +HV T DTC+++ G    +T          A  C
Sbjct: 176  GKGAKGHEASGRDQSVSGSATFGRDSKHV-TLDTCDREFGVAFTSTSINSLENTSYAKHC 234

Query: 757  SQGTTL----SLS-SSPQGDTCDKEDKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTIN 593
            ++ TT+    S+S S P G+  D+E KK+  G +SS+ TK+SR AAIHNQSERKRRD IN
Sbjct: 235  TKTTTIEEHDSVSHSKPMGEDGDEEKKKRANG-KSSVSTKRSRAAAIHNQSERKRRDKIN 293

Query: 592  QKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRMS-----------IQSX 446
            Q+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQV MM    MS           +Q  
Sbjct: 294  QRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMMNRINMSSMMLPLTMQQQLQMS 353

Query: 445  XXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLSPLLNPSAAFMPM-ASWNDATTNPS 269
                                  +N  M R N+  + P+L+PS AFMPM ASW+ A    +
Sbjct: 354  MMSPMGMGLGMGMGMGMGMGMDMNS-MNRANIPGIPPVLHPS-AFMPMAASWDAAVAAAA 411

Query: 268  PSSAPLGSS------DPSTYLARCQSQPMSMEAYTRMVALIQQQQQ 149
                 L  +      DP + +  CQSQPM+M+AY+R+ A+ QQ  Q
Sbjct: 412  GGGDRLQGTPASVMPDPLSTIFGCQSQPMTMDAYSRLAAMYQQLHQ 457


>gb|EMJ19191.1| hypothetical protein PRUPE_ppa005629mg [Prunus persica]
          Length = 450

 Score =  238 bits (606), Expect = 6e-60
 Identities = 182/459 (39%), Positives = 238/459 (51%), Gaps = 37/459 (8%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIP-ASRLT---ATNSSGVPFLGYEMAELTWENGQLSVNGLG-PGLT 1235
            M+  VP+W+LDD S P A RL+   +T S+ VP L YE+AELTWENGQ++++GLG P   
Sbjct: 1    MNQCVPSWDLDDISTPPAPRLSLRSSTPSADVPMLDYEVAELTWENGQVAMHGLGLPRPP 60

Query: 1234 RVSNPTTAK-DAWDNPRAAVGTLESIVDLATQK---PPPPPMDKGDYDDVG--VPWLDC- 1076
               + TTAK   WD PRA+ GTLESIV+ AT     P  PP D       G  V W D  
Sbjct: 61   AKPSLTTAKYTTWDKPRAS-GTLESIVNQATSTLPLPSKPPFDSSGGGSNGELVSWFDHH 119

Query: 1075 RNHDQRSSSAAASVT---DALVPCGTPKEEEREGSRVFRGKGISTC-----VVDRSARVK 920
            R    RS+    S T   DALVPC    + +   S +     +        VV  S  V+
Sbjct: 120  RAAAVRSTEVTPSTTMTMDALVPCRN--QSDNSSSHMMESMSMPVVISGSDVVGCSTGVE 177

Query: 919  NRMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGVATASFCSQGTTL 740
            +   AT    +                       T +   +     G AT    S   TL
Sbjct: 178  SCSGATGAATQDDDTMLSGKHGSLSRVPE-----TPEWSSRSQSVSGSATFGMDSHPVTL 232

Query: 739  SLS----------SSPQGDTCDKEDKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTINQ 590
              +          S PQ +  D++D+KK    +SS+ TK+SR AAIHNQSERKRRD INQ
Sbjct: 233  DSTKASDHDSVCHSRPQREAGDEDDRKKRSTGKSSVSTKRSRAAAIHNQSERKRRDKINQ 292

Query: 589  KMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMT-------MSRMSIQSXXXXXX 431
            +MK LQK+VPNSSKTDKASMLDEVI+YLK LQAQ+ M++       M  M++Q       
Sbjct: 293  RMKTLQKLVPNSSKTDKASMLDEVIEYLKNLQAQIQMISRMNMPAMMLPMAMQQQLQMSM 352

Query: 430  XXXXXXXXXXXXXXMSIINPIMARQNLATLSPLLNPSAAFMPMASWNDATTNPSPSSAPL 251
                             +N  M R N+  +SP+L+P AAFMPMASW+ +  + S S+  +
Sbjct: 353  MAAAPRNMGMGMGMGMDMN-TMVRPNIPGISPVLHP-AAFMPMASWDGSGGDRSASATVM 410

Query: 250  GSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQHLASA 134
               DP +    CQSQPM+M+AY+ M A+ QQ  Q  AS+
Sbjct: 411  --PDPLSAFLACQSQPMTMDAYSMMAAMYQQFHQPPASS 447


>gb|EOX96336.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 1
            [Theobroma cacao]
          Length = 470

 Score =  234 bits (596), Expect = 9e-59
 Identities = 187/478 (39%), Positives = 239/478 (50%), Gaps = 56/478 (11%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLGPGLTR 1232
            MS  VP+W+LDD    A     +NS+     VP L YE+AELTWENGQL+++ LGP    
Sbjct: 1    MSQCVPSWDLDDNPAIARHSLRSNSNSTAPDVPMLDYEVAELTWENGQLAMHSLGPPRVP 60

Query: 1231 ---VSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVGVPWLDCRNHDQ 1061
               +++ + +K  WD PRA  GTLESIV+ AT  P       G  D++ VPW D      
Sbjct: 61   AKPLNSTSPSKYTWDKPRAG-GTLESIVNQATSFPYRNVSLDGGRDEL-VPWFDHHRAAV 118

Query: 1060 RS----SSAAASVTDALVPCGTPKEEEREG-SRVFRGKGISTCVVDRSARVKN------- 917
             +    SS+A    DALVPC    E+         RG G  TCVV  S RV +       
Sbjct: 119  AAAAVASSSATMTMDALVPCSNRSEDRTTHVMESIRGLG-GTCVVGCSTRVGSCSGPTGT 177

Query: 916  ----------RMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGVATA 767
                      R    RV V                       +T D+ EKD G +G  + 
Sbjct: 178  QDDGVLLTGKRAREARVSVAPEWSSKDQNASASATFGTDSQHVTVDSYEKDFG-VGFTST 236

Query: 766  SFCSQGTTLSLSSSPQGDTCD---------------KEDKKKGKGARSSLLTKKSRTAAI 632
            S  S   T S     +  T D               +EDK+K  G +SS+ TK+SR AAI
Sbjct: 237  SLGSPENTSSPRPCTKATTADDHDSVCHSRPQRKAGEEDKRKETG-KSSVSTKRSRAAAI 295

Query: 631  HNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRMSI- 455
            HNQSERKRRD INQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQVHM  MSRM+I 
Sbjct: 296  HNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVHM--MSRMNIP 353

Query: 454  ---------QSXXXXXXXXXXXXXXXXXXXXMSIIN-PIMARQNLATLSPLL-NPSAAFM 308
                     Q                     M +++   M R N+  +SP+L NP   F+
Sbjct: 354  PMMFPMTMQQQLQMSMMAPMGMGMGMGMGIGMGVMDMSTMGRPNITGISPVLPNP---FV 410

Query: 307  PMASWNDATTNPSPSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQHLASA 134
             M  W+ +      +SA +     S +LA CQSQP++M+AY+RM A+ QQ Q   AS+
Sbjct: 411  TMTPWDGSGDRLQAASAAVMPDPLSAFLA-CQSQPITMDAYSRMAAMYQQMQHPPASS 467


>ref|XP_003516808.1| PREDICTED: transcription factor UNE10-like [Glycine max]
          Length = 458

 Score =  231 bits (590), Expect = 5e-58
 Identities = 178/462 (38%), Positives = 242/462 (52%), Gaps = 45/462 (9%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLGPGLTR 1232
            MS  VP+W+++D   P+     +NS+     VP L YE+AELTWENGQLS++GLG     
Sbjct: 1    MSQCVPSWDVEDNPPPSRVSLRSNSNSTAPDVPMLDYEVAELTWENGQLSMHGLGLPRVP 60

Query: 1231 VSNPT--TAKDAWDNPRAAVGTLESIVDLATQKP---PPPPMDKGD----YDDVGVPWLD 1079
            V  PT  T K  W+ PRA+ GTLESIV+  T  P    P P++ G     Y +  VPW D
Sbjct: 61   VKPPTAVTNKYTWEKPRAS-GTLESIVNQVTSFPHRGKPTPLNGGGGGGVYGNFRVPWFD 119

Query: 1078 CRNHDQRSSSAAASVT-DALVPCGTPKEEEREGSRVFRGKGISTCVVDRSARVKNRMPAT 902
                   +++   +VT DALVPC   +E+ ++G     G    TC+V  S RV +     
Sbjct: 120  ----PHATATTTNTVTMDALVPCSN-REQSKQGMESVPG---GTCMVGCSTRVGSCCGGK 171

Query: 901  RVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSG-GL---------GVATASFCSQ 752
                                    +HV T DTC+++ G G            ++A  C++
Sbjct: 172  GAKGHEATGRDQSVSGSATFGRDSKHV-TLDTCDREFGVGFTSTSINSLENTSSAKHCTK 230

Query: 751  GTTL----SLS-SSPQGDTCDKEDKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTINQK 587
             TT+    S+S S P G+  D+  KK+  G +SS+ TK+SR AAIHNQSERKRRD INQ+
Sbjct: 231  TTTVDDHDSVSHSKPVGEDQDEGKKKRANG-KSSVSTKRSRAAAIHNQSERKRRDKINQR 289

Query: 586  MKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRMS-----------IQSXXX 440
            MK LQK+VPNSSK+DKASMLDEVI+YLKQLQAQ+ M+    MS           +Q    
Sbjct: 290  MKTLQKLVPNSSKSDKASMLDEVIEYLKQLQAQLQMINRINMSSMMLPLTMQQQLQMSMM 349

Query: 439  XXXXXXXXXXXXXXXXXMSIINPIMARQNLATLSPLLNPSAAFMPMASWNDATTNPSPSS 260
                                +N  M R ++  + P+L+PS AFMPMA+  DA        
Sbjct: 350  SPMGMGLGMGMGMGMGMGMDMNS-MNRAHIPGIPPVLHPS-AFMPMAASWDAAAAAGGGD 407

Query: 259  APLGS-----SDPSTYLARCQSQPMSMEAYTRMVALIQQQQQ 149
               G+      DP +    CQSQPM+++AY+R+ A+ QQ  Q
Sbjct: 408  RLQGTPANVMPDPLSTFFGCQSQPMTIDAYSRLAAMYQQLHQ 449


>ref|XP_004514235.1| PREDICTED: transcription factor UNE10-like [Cicer arietinum]
          Length = 488

 Score =  229 bits (583), Expect = 3e-57
 Identities = 181/486 (37%), Positives = 235/486 (48%), Gaps = 69/486 (14%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLGPGLTR 1232
            MS  VP+W++++   P      +NS+     VP L Y++AELTWENGQ+S++GLG     
Sbjct: 1    MSQCVPSWDVEENPEPPRVTLRSNSNSTNPDVPMLDYDVAELTWENGQISMHGLGLPRVP 60

Query: 1231 VSNPTTA-----KDAWDNPRAAVGTLESIVDLATQKP----PPPPMDKGDYDDVGVPWLD 1079
            V + T A     KD W+ PRA+ GTLESIV+ AT  P     P     G Y +V VPWLD
Sbjct: 61   VKHSTNAATTPNKDTWEKPRAS-GTLESIVNQATTIPHRGKSPFFAGGGMYGNVLVPWLD 119

Query: 1078 CRNHDQRSSSAAASVTDALVPCGTPKEEER-----EGSRVFRGKGISTCVVDRSARVKNR 914
             +     S++      DALVPC  P +E+R       SRV    GI T +V     V + 
Sbjct: 120  PQRAAAISATTNGMTMDALVPCSNPTKEQRIQTMDSISRV----GIGTYMVGGPTPVGSC 175

Query: 913  MPATRVPVEGXXXXXXXXXXXXXXXXXXR-----------------HVLTADTCEKDSGG 785
              A     E                                       +T DT E++ G 
Sbjct: 176  SAAPAATQEEGALVVAAAVKRGRVAHVVGSGRGQSVSGSGTFGRQSEQVTLDTYEREFGM 235

Query: 784  LGVATASFCSQGTTLSLSSSPQGDTCDKE--------------DKKKGKGARSSLLTKKS 647
             G  + S  S   T S     +    D +              D KK +  +SS+ TK+S
Sbjct: 236  GGFTSTSIASLDNTSSEKQCTKTTVDDHDSVCHSRPTMEDADVDAKKRENRKSSVSTKRS 295

Query: 646  RTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVH----- 482
            R AAIHNQSERKRRD INQ+MK LQK+VPNS+KTDKASMLDEVI+YLK LQAQV      
Sbjct: 296  RAAAIHNQSERKRRDKINQRMKTLQKLVPNSNKTDKASMLDEVIEYLKNLQAQVQIVNRF 355

Query: 481  -----MMTMS-----RMSIQSXXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLS-- 338
                 MM M+     +MSI +                    M +    M R N+  +   
Sbjct: 356  NMSSMMMPMNMQQQLQMSIMNQMGMGMGMGMTGMPMGIGMGMGMDMNTMNRANIPNIPGM 415

Query: 337  -PLLNPSAAFMPMASWNDATTNPSPSSAP--LGSSDPSTYLARCQSQPMSMEAYTRMVAL 167
             P+L+PSA FMPMASW+   +       P   G +DP + L  CQSQPMSM+AY+R+ A+
Sbjct: 416  PPVLHPSA-FMPMASWDVGGSCDRLQGPPGATGMTDPLSTLLGCQSQPMSMDAYSRIAAM 474

Query: 166  IQQQQQ 149
             QQ QQ
Sbjct: 475  CQQMQQ 480


>ref|XP_004229781.1| PREDICTED: transcription factor UNE10-like [Solanum lycopersicum]
          Length = 464

 Score =  225 bits (574), Expect = 3e-56
 Identities = 174/469 (37%), Positives = 226/469 (48%), Gaps = 52/469 (11%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLGPGLTR 1232
            M+  VP+W+LDD+++P   L  T S+     VP L YE+AELTWENGQL+++GLGP   R
Sbjct: 1    MNQCVPSWDLDDSTVPRKNLIQTQSNSLAVDVPSLDYEVAELTWENGQLAMHGLGP--PR 58

Query: 1231 VSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPM----------DKGDYDDVGVPWL 1082
             +N   +           GTLESIV+ AT+     P+          +K   D+V VPW 
Sbjct: 59   ANNKPISSYG--------GTLESIVNQATRCNDDVPLHLHGKSTVDRNKQSGDEV-VPWF 109

Query: 1081 DCRN---HDQRSSSAAASVTDALVPCGTPKEEEREGSRVFRGKGI---------STCVVD 938
            +  N   +   ++   A   DALVPC        +  R     GI         S     
Sbjct: 110  NNHNAVAYAPPATGLVAMTKDALVPCSR-NTSNSDNQRSVHVPGIDGSTHVGSCSGATNS 168

Query: 937  RSARVKNRMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGVATASFC 758
            R   V  RM       E                      LT DT +++ G     + S  
Sbjct: 169  RDWTVAPRMRVRPTRREWSSRADMISVSGSETCGGDSRQLTVDTFDREFGTTMYTSTSMG 228

Query: 757  SQGTTLS------LSSSPQGDTCDKEDKKKG----------KGAR-SSLLTKKSRTAAIH 629
            S   T S       +       C   D+K+G          KG++ SS  TK+ R AAIH
Sbjct: 229  SPENTSSDKQCTNRTGDDHDSVCHSRDQKEGGDDEDDNDNKKGSKNSSSSTKRKRAAAIH 288

Query: 628  NQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRMS--- 458
            NQSERKRRD INQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQVHMM+   MS   
Sbjct: 289  NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVHMMSRMNMSPAM 348

Query: 457  -----IQSXXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLSPLLNPSAAFM-PMAS 296
                 +Q                            ++R N+  L   L+PSAAFM P+ S
Sbjct: 349  MLPLAMQQQLQMSMMGMGMGMGMGMGVAGVFDINNLSRPNIPGLPSFLHPSAAFMQPITS 408

Query: 295  WNDATTNPSPSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQ 149
            W+++ + PSP SA +   DP   L  CQSQP++M+AY+RM AL  Q QQ
Sbjct: 409  WDNSNSAPSPPSAAM--PDPLAALLACQSQPINMDAYSRMAALYLQFQQ 455


>ref|XP_006347913.1| PREDICTED: transcription factor UNE10-like [Solanum tuberosum]
          Length = 464

 Score =  223 bits (567), Expect = 2e-55
 Identities = 178/470 (37%), Positives = 226/470 (48%), Gaps = 53/470 (11%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLGPGLTR 1232
            M+  VP+W+LDD+++P      T S+     VP L YE+AELTWENGQL+++GLGP   R
Sbjct: 1    MNQCVPSWDLDDSTVPRKNPIQTQSNSLAADVPSLNYEVAELTWENGQLAMHGLGP--PR 58

Query: 1231 VSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPP---------MDKGDYDDVGVPWLD 1079
             +N   +           GTLESIV+ AT+    PP          +K   D+V VPW +
Sbjct: 59   ANNKPISSYG--------GTLESIVNQATRCNDVPPHLHGKSTVDRNKNGGDEV-VPWFN 109

Query: 1078 CRNHDQRSSSAAASVT---DALVPCGTPKEEEREGSRVFRGKGI--STCVVDRSARVKNR 914
              N    +  A   VT   DALVPC +      +  R     GI  ST V   S    +R
Sbjct: 110  NHNAVAYAPPATGLVTMTKDALVPC-SRNTSNSDNHRSVHVPGIDGSTHVGSCSGATNSR 168

Query: 913  --MPATRVPV-----EGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGVATASFCS 755
              M A R+ V     E                      LT DT +++ G     + S  S
Sbjct: 169  DWMVAPRMRVRPTKREWNSRTDMISVSGSETCGGDSRQLTVDTFDREFGTTMYTSTSMGS 228

Query: 754  QGTTLS---------------LSSSPQGDTCDKEDKKKGKGAR-SSLLTKKSRTAAIHNQ 623
               T S                 S  + +  D ED    KG++ SS  TK+ R AAIHNQ
Sbjct: 229  PENTSSDKQCTNRTGDDHDSVCHSRDEREAGDDEDDNNKKGSKNSSCFTKRKRAAAIHNQ 288

Query: 622  SERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRMS----- 458
            SERKRRD INQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQVHMM+   MS     
Sbjct: 289  SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVHMMSRMNMSPAMML 348

Query: 457  ---IQSXXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLSPLLNPSAA----FMPMA 299
               +Q                            ++R N+  L   L+PSAA      PM 
Sbjct: 349  PLAMQQQLQMSMMGMGMGMGMGMGVAGVFDINNLSRPNIPGLPSFLHPSAATAAFMQPMT 408

Query: 298  SWNDATTNPSPSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQ 149
            SW++++  P P   P    DP   L  CQSQP++M+AY RM AL QQ QQ
Sbjct: 409  SWDNSSAAPPP---PPAMPDPLAALLACQSQPINMDAYRRMAALYQQFQQ 455


>ref|XP_002301261.2| hypothetical protein POPTR_0002s14430g [Populus trichocarpa]
            gi|550345012|gb|EEE80534.2| hypothetical protein
            POPTR_0002s14430g [Populus trichocarpa]
          Length = 432

 Score =  218 bits (554), Expect = 7e-54
 Identities = 159/421 (37%), Positives = 220/421 (52%), Gaps = 60/421 (14%)
 Frame = -2

Query: 1231 VSNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVG-----VPWLDCRNH 1067
            +++ + +K  WD PRA+ GTLESIV+LAT  P     +K  +D+ G     VPW    NH
Sbjct: 14   IASTSPSKYTWDKPRAS-GTLESIVNLATCIPQ---CNKQTFDNSGSDHDFVPWF---NH 66

Query: 1066 DQRSSSAAASVTDALVPCGTPKEEEREGSRVFRGK--GIST-CVVDRSARV--------- 923
              R+S++A    DALVPC    ++ER  +RV      G+ T CVV  S RV         
Sbjct: 67   -HRASASATMTMDALVPCSKRSDQERT-TRVIDSSPAGLGTDCVVGCSTRVGSCSAPTAT 124

Query: 922  --------KNRMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLTADTCEKDSGGLGVATA 767
                    + R    RVPV                       +T D+CE++ G    +T 
Sbjct: 125  QNEVGLLTRKREKVARVPVPAEWSRDQSVNRGATFSKKDSQQVTVDSCERELGVGFTSTT 184

Query: 766  SFCSQGTTLS-----------------LSSSPQGDTCDKEDKKKGKGARSSLLTKKSRTA 638
            SF SQ  T S                   S PQ +  D++DKKKG G +SS+  ++SR A
Sbjct: 185  SFGSQENTSSGTKPCTKTNTADENDSVCHSRPQREAGDEDDKKKGNG-KSSVSNRRSRAA 243

Query: 637  AIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMS--- 467
            A+HNQSERKRRD INQ+MK LQK+VPNSSKTDKASMLDEVI+YLKQLQAQV M++     
Sbjct: 244  AVHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMVSRMNMQ 303

Query: 466  ------------RMSIQSXXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLSPLLNP 323
                        +MS+ +                    +  +N + AR N+  +SP+L+P
Sbjct: 304  PMMLPMALQQQLQMSMMAPISMGMAGMGMGMGMGMGMGVVDMNTLAARSNITGVSPVLHP 363

Query: 322  SAAFMPMASWNDATTNPS-PSSAPLGS--SDPSTYLARCQSQPMSMEAYTRMVALIQQQQ 152
            +A FMPM +W+ + ++   P++AP  +   DP +    CQSQPM+M+AY+RM ++ QQ  
Sbjct: 364  TA-FMPMTTWDGSNSHERLPTAAPSATVMPDPLSAFLACQSQPMTMDAYSRMASMYQQLH 422

Query: 151  Q 149
            Q
Sbjct: 423  Q 423


>ref|NP_191916.3| transcription factor UNE10 [Arabidopsis thaliana]
            gi|75299638|sp|Q8GZ38.1|UNE10_ARATH RecName:
            Full=Transcription factor UNE10; AltName: Full=Basic
            helix-loop-helix protein 16; Short=AtbHLH16; Short=bHLH
            16; AltName: Full=Protein UNFERTILIZED EMBRYO SAC 10;
            AltName: Full=Transcription factor EN 108; AltName:
            Full=bHLH transcription factor bHLH016
            gi|26449558|dbj|BAC41905.1| putative bHLH transcription
            factor bHLH016 [Arabidopsis thaliana]
            gi|109134123|gb|ABG25060.1| At4g00050 [Arabidopsis
            thaliana] gi|332656418|gb|AEE81818.1| transcription
            factor UNE10 [Arabidopsis thaliana]
          Length = 399

 Score =  217 bits (552), Expect = 1e-53
 Identities = 174/441 (39%), Positives = 219/441 (49%), Gaps = 22/441 (4%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASR-LTATNSSGVPFLGYEMAELTWENGQLSVNGLGPGLTRVSN 1223
            MS  VPN ++DD    A+  + +T ++ +P L YE+AELTWENGQL ++GLGP       
Sbjct: 1    MSQCVPNCHIDDTPAAATTTVRSTTAADIPILDYEVAELTWENGQLGLHGLGP------- 53

Query: 1222 PTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVGVPWLDCRNHDQRSSSAA 1043
            P     +      A GTLESIVD AT+ P P P D+       VPW   R      SS A
Sbjct: 54   PRVTASSTKYSTGAGGTLESIVDQATRLPNPKPTDEL------VPWFHHR------SSRA 101

Query: 1042 ASVTDALVPCGTPKEEEREGSRVFRGKGISTCVVDRSARVKNRMPATRVPVEGXXXXXXX 863
            A   DALVPC     E++          + +C   R+     R    RV  E        
Sbjct: 102  AMAMDALVPCSNLVHEQQSKPGGVGSTRVGSCSDGRTMGGGKR---ARVAPEWSGGGSQR 158

Query: 862  XXXXXXXXXXXRHVLTADTCEKDSGGLGVATASFCSQGTTLS-----LSSSPQGDTCDKE 698
                          LT DT +     +G  + S  S   T+        S PQ +  D+E
Sbjct: 159  --------------LTMDTYD-----VGFTSTSMGSHDNTIDDHDSVCHSRPQME--DEE 197

Query: 697  DKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEV 518
            +KK G   +SS+ TK+SR AAIHNQSERKRRD INQ+MK LQK+VPNSSKTDKASMLDEV
Sbjct: 198  EKKAG--GKSSVSTKRSRAAAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEV 255

Query: 517  IDYLKQLQAQVHMMTMSRMSIQSXXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLS 338
            I+YLKQLQAQV M  MSRM++ S                       +   M    L  L 
Sbjct: 256  IEYLKQLQAQVSM--MSRMNMPSMMLPMAMQQQQQLQMSLMSNPMGLGMGMGMPGLGLLD 313

Query: 337  -PLLNPSAA-------------FMPM--ASWNDATTNPSPSSAPLGSSDPSTYLARCQSQ 206
               +N +AA             F+PM   SW DA++N S   +PL     S +LA C +Q
Sbjct: 314  LNSMNRAAASAPNIHANMMPNPFLPMNCPSW-DASSNDSRFQSPLIPDPMSAFLA-CSTQ 371

Query: 205  PMSMEAYTRMVALIQQQQQHL 143
            P +MEAY+RM  L QQ QQ L
Sbjct: 372  PTTMEAYSRMATLYQQMQQQL 392


>gb|EOX96337.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 2
            [Theobroma cacao]
          Length = 478

 Score =  215 bits (548), Expect = 3e-53
 Identities = 174/444 (39%), Positives = 222/444 (50%), Gaps = 52/444 (11%)
 Frame = -2

Query: 1309 LGYEMAELTWENGQLSVNGLGPGLTR---VSNPTTAKDAWDNPRAAVGTLESIVDLATQK 1139
            L YE+AELTWENGQL+++ LGP       +++ + +K  WD PRA  GTLESIV+ AT  
Sbjct: 43   LDYEVAELTWENGQLAMHSLGPPRVPAKPLNSTSPSKYTWDKPRAG-GTLESIVNQATSF 101

Query: 1138 PPPPPMDKGDYDDVGVPWLDCRNHDQRS----SSAAASVTDALVPCGTPKEEEREG-SRV 974
            P       G  D++ VPW D       +    SS+A    DALVPC    E+        
Sbjct: 102  PYRNVSLDGGRDEL-VPWFDHHRAAVAAAAVASSSATMTMDALVPCSNRSEDRTTHVMES 160

Query: 973  FRGKGISTCVVDRSARVKN-----------------RMPATRVPVEGXXXXXXXXXXXXX 845
             RG G  TCVV  S RV +                 R    RV V               
Sbjct: 161  IRGLG-GTCVVGCSTRVGSCSGPTGTQDDGVLLTGKRAREARVSVAPEWSSKDQNASASA 219

Query: 844  XXXXXRHVLTADTCEKDSGGLGVATASFCSQGTTLSLSSSPQGDTCD------------- 704
                    +T D+ EKD G +G  + S  S   T S     +  T D             
Sbjct: 220  TFGTDSQHVTVDSYEKDFG-VGFTSTSLGSPENTSSPRPCTKATTADDHDSVCHSRPQRK 278

Query: 703  --KEDKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASM 530
              +EDK+K  G +SS+ TK+SR AAIHNQSERKRRD INQ+MK LQK+VPNSSKTDKASM
Sbjct: 279  AGEEDKRKETG-KSSVSTKRSRAAAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASM 337

Query: 529  LDEVIDYLKQLQAQVHMMTMSRMSI----------QSXXXXXXXXXXXXXXXXXXXXMSI 380
            LDEVI+YLKQLQAQVHM  MSRM+I          Q                     M +
Sbjct: 338  LDEVIEYLKQLQAQVHM--MSRMNIPPMMFPMTMQQQLQMSMMAPMGMGMGMGMGIGMGV 395

Query: 379  IN-PIMARQNLATLSPLL-NPSAAFMPMASWNDATTNPSPSSAPLGSSDPSTYLARCQSQ 206
            ++   M R N+  +SP+L NP   F+ M  W+ +      +SA +     S +LA CQSQ
Sbjct: 396  MDMSTMGRPNITGISPVLPNP---FVTMTPWDGSGDRLQAASAAVMPDPLSAFLA-CQSQ 451

Query: 205  PMSMEAYTRMVALIQQQQQHLASA 134
            P++M+AY+RM A+ QQ Q   AS+
Sbjct: 452  PITMDAYSRMAAMYQQMQHPPASS 475


>gb|AAM10933.1|AF488561_1 putative bHLH transcription factor [Arabidopsis thaliana]
          Length = 399

 Score =  215 bits (548), Expect = 3e-53
 Identities = 173/441 (39%), Positives = 219/441 (49%), Gaps = 22/441 (4%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASR-LTATNSSGVPFLGYEMAELTWENGQLSVNGLGPGLTRVSN 1223
            MS  VPN ++DD    A+  + +T ++ +P L YE+AELTWENGQL ++GLGP       
Sbjct: 1    MSQCVPNCHIDDTPAAATTTVRSTTAADIPILDYEVAELTWENGQLGLHGLGP------- 53

Query: 1222 PTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVGVPWLDCRNHDQRSSSAA 1043
            P     +      A GTLESIVD AT+ P P P D+       VPW   R      SS A
Sbjct: 54   PRVTASSTKYSTGAGGTLESIVDQATRLPNPKPTDEL------VPWFHHR------SSRA 101

Query: 1042 ASVTDALVPCGTPKEEEREGSRVFRGKGISTCVVDRSARVKNRMPATRVPVEGXXXXXXX 863
            A   DALVPC     E++          + +C   R+     R    RV  E        
Sbjct: 102  AMAMDALVPCSNLVHEQQSKPGGVGSTRVGSCSDGRTMGGGKR---ARVAPEWSGGGSQR 158

Query: 862  XXXXXXXXXXXRHVLTADTCEKDSGGLGVATASFCSQGTTLS-----LSSSPQGDTCDKE 698
                          LT DT +     +G  + S  S   T+        S PQ +  D+E
Sbjct: 159  --------------LTMDTYD-----VGFTSTSMGSHDNTIDDHDSVCHSRPQME--DEE 197

Query: 697  DKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEV 518
            +KK G   +SS+ TK+SR AAIHNQSERKRRD INQ+MK LQK+VPNSSKTDKASMLDEV
Sbjct: 198  EKKAG--GKSSVSTKRSRAAAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEV 255

Query: 517  IDYLKQLQAQVHMMTMSRMSIQSXXXXXXXXXXXXXXXXXXXXMSIINPIMARQNLATLS 338
            I+YLKQLQAQV M  MSRM++ S                       +   M    L  L 
Sbjct: 256  IEYLKQLQAQVSM--MSRMNMPSMMLPMAMQQQQQLQMSLMSNPMGLGMGMGMPGLGLLD 313

Query: 337  -PLLNPSAA-------------FMPM--ASWNDATTNPSPSSAPLGSSDPSTYLARCQSQ 206
               +N +AA             F+PM   SW DA++N S   +PL     S +LA C +Q
Sbjct: 314  LNSMNRAAASAPNIHANMMPNPFLPMNCPSW-DASSNDSRFQSPLIPDPMSAFLA-CSTQ 371

Query: 205  PMSMEAYTRMVALIQQQQQHL 143
            P +MEAY+RM  L QQ Q+ L
Sbjct: 372  PTTMEAYSRMATLYQQMQRQL 392


>gb|EXB37572.1| Transcription factor UNE10 [Morus notabilis]
          Length = 493

 Score =  215 bits (547), Expect = 4e-53
 Identities = 183/501 (36%), Positives = 238/501 (47%), Gaps = 79/501 (15%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNSS------GVP-FLGYEMAELTWENGQLSVNGLGPG 1241
            MS  VP+W+LDD+  P +RL   + S       VP  L YE+AELTWENGQ++++GLGP 
Sbjct: 1    MSQCVPSWDLDDS--PPARLPLRSRSHSIAPHDVPALLDYEVAELTWENGQIAMHGLGPR 58

Query: 1240 L-----------TRVSNPTT-----------AKDAWDNPRAAVGTLESIVDLATQK--PP 1133
                        T  + PTT           A  AW+ P A  GTLESIV+ AT+   P 
Sbjct: 59   RVPNKLLTNTTSTTTATPTTNSHACKYTTATATTAWEKPSAG-GTLESIVNQATRSSFPH 117

Query: 1132 PPPMDKGDYDDVGVPWLDCRNHDQRSSSAAASVTDALVPCGTPKEEE-----REGSRVFR 968
             PP      +   VPW D  +H   +++A  +  DA VPC            R  S    
Sbjct: 118  KPPSSA---NAELVPWFD--HHHNAAAAAMTTNMDAQVPCSNRHHHHNDVVSRRPSARAG 172

Query: 967  GKGI---------STCVVDRSARVKNRMPATRVPVEGXXXXXXXXXXXXXXXXXXRHVLT 815
            G G+         S+C    + R +N   A    ++                    H +T
Sbjct: 173  GGGVNIGSLYTQVSSCS-GAATRDENNNAAAGQQMKRARVAARVPPEWSVSGTSGSHQVT 231

Query: 814  AD--TCEKDSG-GLGVAT-------ASFCSQGTTLSLSSSPQGDTCD------------- 704
             D  +CE+D G G   +T       AS     T  + +++   D  D             
Sbjct: 232  MDQYSCERDFGVGFMTSTSLGSPENASSGKPSTKAATATTTAADDHDSVCHSRLQASDEE 291

Query: 703  --KEDKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASM 530
              +EDKKKG G +SS+ TK+SR AAIHNQSERKRRD INQ+MK LQK+VPNSSKTDKASM
Sbjct: 292  EEEEDKKKGSG-KSSVSTKRSRAAAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASM 350

Query: 529  LDEVIDYLKQLQAQVHMMTMSRM-------SIQSXXXXXXXXXXXXXXXXXXXXMSIINP 371
            LDEVIDYLKQLQAQV MM+   M       ++Q                        +N 
Sbjct: 351  LDEVIDYLKQLQAQVQMMSRMNMPAMMLPVAMQQQLQLSMMSHMGMGMGMGMGMGMDMNT 410

Query: 370  IMARQNLATLSPLLNPSAAFMPMASWNDATTNPSPSSAPLGS--SDPSTYLARCQSQPMS 197
             M R NL  +SP +     FM MA W+ + T        + +   DP +    CQSQPM+
Sbjct: 411  -MGRPNLHGISPAVLHPNPFMTMAQWDGSGTVDGRLQPQMAAVIPDPLSQFFACQSQPMT 469

Query: 196  MEAYTRMVALIQQQQQHLASA 134
             EAY+RMVA+ QQ  Q  AS+
Sbjct: 470  TEAYSRMVAMYQQLHQPPASS 490


>ref|XP_002875048.1| hypothetical protein ARALYDRAFT_912247 [Arabidopsis lyrata subsp.
            lyrata] gi|297320885|gb|EFH51307.1| hypothetical protein
            ARALYDRAFT_912247 [Arabidopsis lyrata subsp. lyrata]
          Length = 403

 Score =  214 bits (546), Expect = 6e-53
 Identities = 174/446 (39%), Positives = 219/446 (49%), Gaps = 27/446 (6%)
 Frame = -2

Query: 1399 MSHSVPNWNLDDASIPASRLTATNS---SGVPFLGYEMAELTWENGQLSVNGLGPGLTRV 1229
            MS  VPN ++DD +  A+  T   S   + +P L YE+AELTWENGQL ++GLGP     
Sbjct: 1    MSQCVPNCHIDDTTAAAAATTTVRSITAADIPILDYEVAELTWENGQLGLHGLGP----- 55

Query: 1228 SNPTTAKDAWDNPRAAVGTLESIVDLATQKPPPPPMDKGDYDDVGVPWLDCRNHDQRSSS 1049
              P     +      A GTLESIVD AT+ P   P D+       VPW   R      SS
Sbjct: 56   --PRVTASSTKYSTGAGGTLESIVDQATRLPNHKPTDEL------VPWFHHR------SS 101

Query: 1048 AAASVTDALVPCGTPKEEEREGSRVFRGKGISTCVVDRSARVKNRMPATRVPVEGXXXXX 869
             AA   DALVPC    +E++          + +C   R+     R    RV  E      
Sbjct: 102  RAAMAMDALVPCSKLVQEQQSKPGGVGSTRVGSCSDGRTMAGGKR---ARVAPEWSGGGS 158

Query: 868  XXXXXXXXXXXXXRHVLTADTCEKDSGGLGVATASFCSQGTTLS-----LSSSPQGDTCD 704
                            LT DT +     +G  + S  SQ  T+        S PQ +  D
Sbjct: 159  QR--------------LTMDTYD-----VGFTSTSMGSQDNTIDDHDSVCHSRPQME--D 197

Query: 703  KEDKKKGKGARSSLLTKKSRTAAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLD 524
            +E+KK G   +SS+ TK+SR AAIHNQSERKRRD INQ+MK+LQK+VPNSSKTDKASMLD
Sbjct: 198  EEEKKAG--GKSSVSTKRSRAAAIHNQSERKRRDKINQRMKILQKLVPNSSKTDKASMLD 255

Query: 523  EVIDYLKQLQAQVHMMTMSRMSIQSXXXXXXXXXXXXXXXXXXXXMSI---INPIMARQN 353
            EVI+YLKQLQAQV M  MSRM++ S                      +   I   M    
Sbjct: 256  EVIEYLKQLQAQVSM--MSRMNMPSMMLPMAMQQQQQQLQMSLMSNPMGLGIGMGMPGLG 313

Query: 352  LATLSPLLNPSAA--------------FMPMA--SWNDATTNPSPSSAPLGSSDPSTYLA 221
            L  L+ +   +AA              F PM   SW DA++N +   +PL   DP     
Sbjct: 314  LLDLNSMNRAAAAATAPNIHANMMPNPFAPMTCPSW-DASSNDARFQSPL-IPDPMAAFL 371

Query: 220  RCQSQPMSMEAYTRMVALIQQQQQHL 143
             C +QP +MEAY+RM AL QQ QQ L
Sbjct: 372  ACSTQPTTMEAYSRMAALYQQMQQQL 397


>ref|XP_004164979.1| PREDICTED: transcription factor UNE10-like [Cucumis sativus]
          Length = 478

 Score =  213 bits (542), Expect = 2e-52
 Identities = 172/477 (36%), Positives = 225/477 (47%), Gaps = 59/477 (12%)
 Frame = -2

Query: 1399 MSHSVPNWNLDD---ASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLG-P 1244
            MS  VPNW+L +   +S  A R    +SS     VP   YE+AELTWENGQLS++GLG P
Sbjct: 1    MSQCVPNWDLSEPPPSSAAAGRPPFQSSSSADDVVPLFEYEVAELTWENGQLSMHGLGLP 60

Query: 1243 GLT-RVSNP-----TTAKDAWDN-PRAAVGTLESIVDLATQKPPPPPMDKGDYDDVG--- 1094
             +T ++ N        +K  WDN P  A GTLES+V+  T+          + DD     
Sbjct: 61   RVTGKIQNSGGGGGVGSKYTWDNKPARASGTLESLVNQGTRHGKNNISFDINTDDTSHGG 120

Query: 1093 ----VPWLDCRNHDQRSSSAAASVTDALVPCGTPKE------------------EEREGS 980
                VPW      D    +  AS  DA+VPC   K                   +E E  
Sbjct: 121  ANDLVPWFS----DHHRQTPTASTADAMVPCDGEKSATVGGGGDKSSDIPVAARKEDEDC 176

Query: 979  RVFRGKGISTCVVDRSARVKNRMPATR--VPVEGXXXXXXXXXXXXXXXXXXRHV----- 821
            RV  GK     VV R    +    + R  + V G                    V     
Sbjct: 177  RVIHGKRGK--VVARVVHAEGEWSSCRNQISVSGNRESGQKVTLNSSRDRNFVAVDVSVG 234

Query: 820  LTADTCEKDSGGLGVATASFCSQGTTLSLSSSPQGDTCDKEDKKKGKGARSSLLTKKSRT 641
             TA T          ++   C + TT++ +          E  +K + A+SS+ TK+SR 
Sbjct: 235  FTATTATSQGSLDNTSSDKPCVKNTTVTTTDDHDSVCHKDEGDRKKENAKSSVSTKRSRA 294

Query: 640  AAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRM 461
            AAIHNQSERKRRD INQ+MK LQK+VPNS+KTDKASMLDEVI+YLKQLQAQV MM+   M
Sbjct: 295  AAIHNQSERKRRDKINQRMKTLQKLVPNSNKTDKASMLDEVIEYLKQLQAQVQMMSRMNM 354

Query: 460  SIQSXXXXXXXXXXXXXXXXXXXXMSI-----------INPIMARQNL-ATLSPLLNPSA 317
             +                      M +           +N +  R  L A +SPLL+P+A
Sbjct: 355  PMMLPIAMQQQLSMAPLMAPMGLGMGMGGMGMPLGMDHLNMMAGRSGLTAGMSPLLHPTA 414

Query: 316  AFMPMASWNDATTNPSPSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQH 146
             FMP+ +W+  T     S   + +   ST+LA CQ QPM+MEAY R+  + QQ  QH
Sbjct: 415  -FMPIPTWDGGTDQLQHSPTTMVADPFSTFLA-CQQQPMTMEAYNRIATMFQQLHQH 469


>ref|XP_004140610.1| PREDICTED: transcription factor UNE10-like [Cucumis sativus]
          Length = 478

 Score =  213 bits (542), Expect = 2e-52
 Identities = 172/477 (36%), Positives = 225/477 (47%), Gaps = 59/477 (12%)
 Frame = -2

Query: 1399 MSHSVPNWNLDD---ASIPASRLTATNSSG----VPFLGYEMAELTWENGQLSVNGLG-P 1244
            MS  VPNW+L +   +S  A R    +SS     VP   YE+AELTWENGQLS++GLG P
Sbjct: 1    MSQCVPNWDLSEPPPSSAAAGRPPFQSSSSADDVVPLFEYEVAELTWENGQLSMHGLGLP 60

Query: 1243 GLT-RVSNP-----TTAKDAWDN-PRAAVGTLESIVDLATQKPPPPPMDKGDYDDVG--- 1094
             +T ++ N        +K  WDN P  A GTLES+V+  T+          + DD     
Sbjct: 61   RVTGKIQNSGGGGGVGSKYTWDNKPARASGTLESLVNQGTRHGKNNISFDINTDDTSHGG 120

Query: 1093 ----VPWLDCRNHDQRSSSAAASVTDALVPCGTPKE------------------EEREGS 980
                VPW      D    +  AS  DA+VPC   K                   +E E  
Sbjct: 121  ANDLVPWFS----DHHRQTPTASTADAMVPCDGEKSATVGGGGDKSSDIPVAARKEDEDC 176

Query: 979  RVFRGKGISTCVVDRSARVKNRMPATR--VPVEGXXXXXXXXXXXXXXXXXXRHV----- 821
            RV  GK     VV R    +    + R  + V G                    V     
Sbjct: 177  RVIHGKRGK--VVARVVHAEGEWSSCRNQISVSGNRESGQKVTLNSSRDRNFVAVDVSVG 234

Query: 820  LTADTCEKDSGGLGVATASFCSQGTTLSLSSSPQGDTCDKEDKKKGKGARSSLLTKKSRT 641
             TA T          ++   C + TT++ +          E  +K + A+SS+ TK+SR 
Sbjct: 235  FTATTATSQGSLDNTSSDKPCVKNTTVTTTDDHDSVCHKDEGDRKKENAKSSVSTKRSRA 294

Query: 640  AAIHNQSERKRRDTINQKMKMLQKMVPNSSKTDKASMLDEVIDYLKQLQAQVHMMTMSRM 461
            AAIHNQSERKRRD INQ+MK LQK+VPNS+KTDKASMLDEVI+YLKQLQAQV MM+   M
Sbjct: 295  AAIHNQSERKRRDKINQRMKTLQKLVPNSNKTDKASMLDEVIEYLKQLQAQVQMMSRMNM 354

Query: 460  SIQSXXXXXXXXXXXXXXXXXXXXMSI-----------INPIMARQNL-ATLSPLLNPSA 317
             +                      M +           +N +  R  L A +SPLL+P+A
Sbjct: 355  PMMLPMAMQQQLSMAPLMAPMGLGMGMGGMGMPLGMDHLNMMAGRSGLTAGMSPLLHPTA 414

Query: 316  AFMPMASWNDATTNPSPSSAPLGSSDPSTYLARCQSQPMSMEAYTRMVALIQQQQQH 146
             FMP+ +W+  T     S   + +   ST+LA CQ QPM+MEAY R+  + QQ  QH
Sbjct: 415  -FMPIPTWDGGTDQLQHSPTTMVADPFSTFLA-CQQQPMTMEAYNRIATMFQQLHQH 469

