BLASTX nr result

ID: Akebia24_contig00004268 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00004268
         (1445 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248...   233   1e-58
gb|EXC30127.1| TGACG-sequence-specific DNA-binding protein TGA-1...   179   3e-42
emb|CBI32817.3| unnamed protein product [Vitis vinifera]              178   5e-42
ref|XP_007162048.1| hypothetical protein PHAVU_001G1193000g, par...   174   9e-41
ref|XP_004493333.1| PREDICTED: uncharacterized protein LOC101504...   171   1e-39
ref|XP_006411360.1| hypothetical protein EUTSA_v10016317mg [Eutr...   167   8e-39
ref|XP_007028261.1| Transcription factor hy5, putative [Theobrom...   167   1e-38
ref|XP_003554104.2| PREDICTED: uncharacterized protein LOC100127...   166   2e-38
ref|XP_007220226.1| hypothetical protein PRUPE_ppa002181mg [Prun...   166   2e-38
gb|AGO05993.1| bZIP transcription factor family protein 9 [Camel...   164   7e-38
ref|XP_004136623.1| PREDICTED: uncharacterized protein LOC101215...   163   2e-37
ref|XP_002526200.1| transcription factor hy5, putative [Ricinus ...   160   1e-36
ref|XP_006836241.1| hypothetical protein AMTR_s00101p00121930 [A...   158   5e-36
ref|XP_004299018.1| PREDICTED: uncharacterized protein LOC101299...   157   9e-36
gb|AGO05994.1| bZIP transcription factor family protein 10 [Came...   157   1e-35
ref|XP_006482041.1| PREDICTED: uncharacterized protein LOC102629...   154   7e-35
ref|XP_002308867.2| hypothetical protein POPTR_0006s03300g [Popu...   154   1e-34
ref|XP_006293762.1| hypothetical protein CARUB_v10022722mg [Caps...   153   2e-34
ref|NP_565946.1| transcription factor BZIP17 [Arabidopsis thalia...   153   2e-34
gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding...   153   2e-34

>ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248184 [Vitis vinifera]
          Length = 768

 Score =  233 bits (595), Expect = 1e-58
 Identities = 175/463 (37%), Positives = 218/463 (47%), Gaps = 33/463 (7%)
 Frame = +3

Query: 99   NPNLSIEFDSLQCPSLDMDFLS---NDIFLPEDLMEELGF-GNEXXXXXXXXXXPPINEG 266
            NPN S + + L  P LD DF S   ND  L E  M +LG  G +          P  +E 
Sbjct: 12   NPNPSADSEPLAVPPLDPDFFSDNSNDAALHETFMSDLGLDGVDFDFTFDDLYFPSESED 71

Query: 267  FLAEGSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNS 446
            FLA   D  L    +   D +DR S +VS+VLNS  P+ GNCG++SS     Q SG  NS
Sbjct: 72   FLA---DFPLPEEGSGGHDSADR-SFDVSKVLNSPSPESGNCGVESS--LPCQVSGDRNS 125

Query: 447  VVAERIPNVCQNSGDPPSG---------DISRVFNSSSPDSGNCVRDSSGPVSDQDSGGC 599
             V+      C     PP           D +RV N  SP+SG+C R  SGP S Q SG  
Sbjct: 126  DVSSIELGCCDQKLSPPVASQSSSDQNLDGARVLNVPSPESGSCDRGFSGPESSQGSGNG 185

Query: 600  RSAIAGFLNSASPNSVIVSELSGSPNSCNVSPDAMAMDERKECLLKRKNQD-----ENXX 764
             S + G +N                  C V       D  K  + KRK +      E+  
Sbjct: 186  GSGVPGAVN------------------CVVDQKVKLEDSGKNSVPKRKKEQDDSTTESRS 227

Query: 765  XXXXXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIA 944
                              EE+K+KARL+RNRESAQLSR+RKK+YV+ELE K+RSMHSTI 
Sbjct: 228  SKFRRSSICSETANASNDEEEKKKARLMRNRESAQLSRQRKKHYVEELEEKIRSMHSTIQ 287

Query: 945  SLNSKISFIMAENASLHHQLGQLAV-----GDVFPPSMAAPMHYPWIPCSSYTMRPQ-SQ 1106
             L  KIS IMAENA+L  Q G   +       ++P    APM YPW+PC+ Y ++PQ SQ
Sbjct: 288  DLTGKISIIMAENANLRQQFGGGGMCPPPHAGMYPHPSMAPMAYPWVPCAPYVVKPQGSQ 347

Query: 1107 VPLVPIPRLKPQQP-----IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVPFVNVR 1271
            VPLVPIPRLKPQ P     +                                 VPFVN++
Sbjct: 348  VPLVPIPRLKPQAPVSAPKVKKTENKKNETKSKKVVSVSLLGMLSFMFLMGCLVPFVNIK 407

Query: 1272 YKEKKEMVPNRLGLITNSFDGQPRGSVLTV----NSSDQSVGV 1388
            Y   KE VP R   I+N F    R  +LTV    N S+  +GV
Sbjct: 408  YGGIKETVPGRSDYISNRFSDMHRRRILTVKDDLNGSNYGMGV 450


>gb|EXC30127.1| TGACG-sequence-specific DNA-binding protein TGA-1B [Morus notabilis]
          Length = 797

 Score =  179 bits (454), Expect = 3e-42
 Identities = 160/467 (34%), Positives = 208/467 (44%), Gaps = 47/467 (10%)
 Frame = +3

Query: 105  NLSIEFDSLQCPSLDMDFLSND-IFLPEDLMEELGFGNEXXXXXXXXXX--------PPI 257
            + S EF+ L  P LD  F S+D   L ED   +LG G E                  P  
Sbjct: 21   DFSAEFEPLSIPPLDHQFFSSDDAALREDFFSDLGLGLEENCDYDFTFDDIGDDLYLPSE 80

Query: 258  NEGFLA-EGSDVLLSS-NPNLCEDFSD-RPSGEVSRVLNSSFPDY------GNCGLDSSG 410
             E FL  +G D+  +S +PN      D  P  E      S+ P+       G    D +G
Sbjct: 81   TEEFLIPDGLDIGPNSLSPNGTNSDRDVNPISEADVAAKSASPESESSTVSGVRDYDVAG 140

Query: 411  TFSVQ--DSGGCNSVVAERIPNVCQNSGDPPSGDISRVFNSSSPDSGNCVRDSSG-PVSD 581
              + Q  +SGGCNS  +       +N  D  S  I  V +S SPD GNC ++ SG  VS 
Sbjct: 141  FLNCQSSESGGCNSEYS-------RNLADRKS-KIDGVMDSPSPDCGNCDQECSGEAVSS 192

Query: 582  QDSGGCRSAIAGFLNSASPNSVIVSELSGSPNSCNVSPDAMAMDE-RKECLLKRKNQDE- 755
            Q SG C S ++   NS + +     ++S    SC      + ++E  K  + KRK + E 
Sbjct: 193  QGSGNCGSGVSEGANSPAHSGNSDKDVS----SCVFVDQKVKVEEVGKNYMSKRKKEPEE 248

Query: 756  ----------NXXXXXXXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQE 905
                                           EE+KRKARL+RNRESAQLSR+RKK+YV+E
Sbjct: 249  GNAESRTPKYRRSSAPAENTHSQSTLNPLSDEEEKRKARLMRNRESAQLSRQRKKHYVEE 308

Query: 906  LEHKVRSMHSTIASLNSKISFIMAENASLHHQLGQLAVGDVFPPSMA-------APMHYP 1064
            LE K+RSM+STI  LNS+IS+IM ENASL  QL    +    PP+          PM YP
Sbjct: 309  LEDKLRSMNSTITDLNSRISYIMVENASLRQQLSGGGICPPPPPTPGMYPHPPMGPMPYP 368

Query: 1065 WIPCSSYTMRPQ-SQVPLVPIPRLKPQQPI------XXXXXXXXXXXXXXXXXXXXXXXX 1223
            W+P + Y ++PQ SQVPLVPIPRLKPQQ +                              
Sbjct: 369  WVPYAPYVVKPQGSQVPLVPIPRLKPQQTVSASKAKKSEGKKSEGGKTKKVASISFLGLL 428

Query: 1224 XXXXXXXXXVPFVNVRYKEKKEMVPNRLGLITNSFDGQPRGSVLTVN 1364
                     VP VNV +       P  L   +     Q RGSVLT +
Sbjct: 429  FFVFLFGGLVPMVNVNFGGLTNNAPGGLVYTSGRLYDQHRGSVLTAD 475


>emb|CBI32817.3| unnamed protein product [Vitis vinifera]
          Length = 680

 Score =  178 bits (452), Expect = 5e-42
 Identities = 157/460 (34%), Positives = 196/460 (42%), Gaps = 30/460 (6%)
 Frame = +3

Query: 99   NPNLSIEFDSLQCPSLDMDFLS---NDIFLPEDLMEELGF-GNEXXXXXXXXXXPPINEG 266
            NPN S + + L  P LD DF S   ND  L E  M +LG  G +          P  +E 
Sbjct: 12   NPNPSADSEPLAVPPLDPDFFSDNSNDAALHETFMSDLGLDGVDFDFTFDDLYFPSESED 71

Query: 267  FLAEGSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNS 446
            FLA               DF                P+ G+ G DS+   S   SG  NS
Sbjct: 72   FLA---------------DFP--------------LPEEGSGGHDSADR-SFDVSGDRNS 101

Query: 447  VVAERIPNVCQNSGDPP-----SGDISRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSAI 611
             V+      C     PP     S D +   NS   DSGN    S  P             
Sbjct: 102  DVSSIELGCCDQKLSPPVASQSSSDQNLDVNSPLLDSGNSDHSSWVP------------- 148

Query: 612  AGFLNSASPNSVIVSELSGSPNSCNVSPDAMAM-DERKECLLKRKNQD-----ENXXXXX 773
                  +SPN         + NS  V    + + D  K  + KRK +      E+     
Sbjct: 149  ------SSPNL--------ADNSWGVVDQKVKLEDSGKNSVPKRKKEQDDSTTESRSSKF 194

Query: 774  XXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLN 953
                           EE+K+KARL+RNRESAQLSR+RKK+YV+ELE K+RSMHSTI  L 
Sbjct: 195  RRSSICSETANASNDEEEKKKARLMRNRESAQLSRQRKKHYVEELEEKIRSMHSTIQDLT 254

Query: 954  SKISFIMAENASLHHQLGQLAV-----GDVFPPSMAAPMHYPWIPCSSYTMRPQ-SQVPL 1115
             KIS IMAENA+L  Q G   +       ++P    APM YPW+PC+ Y ++PQ SQVPL
Sbjct: 255  GKISIIMAENANLRQQFGGGGMCPPPHAGMYPHPSMAPMAYPWVPCAPYVVKPQGSQVPL 314

Query: 1116 VPIPRLKPQQP-----IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVPFVNVRYKE 1280
            VPIPRLKPQ P     +                                 VPFVN++Y  
Sbjct: 315  VPIPRLKPQAPVSAPKVKKTENKKNETKSKKVVSVSLLGMLSFMFLMGCLVPFVNIKYGG 374

Query: 1281 KKEMVPNRLGLITNSFDGQPRGSVLTV----NSSDQSVGV 1388
             KE VP R   I+N F    R  +LTV    N S+  +GV
Sbjct: 375  IKETVPGRSDYISNRFSDMHRRRILTVKDDLNGSNYGMGV 414


>ref|XP_007162048.1| hypothetical protein PHAVU_001G1193000g, partial [Phaseolus vulgaris]
            gi|561035512|gb|ESW34042.1| hypothetical protein
            PHAVU_001G1193000g, partial [Phaseolus vulgaris]
          Length = 779

 Score =  174 bits (441), Expect = 9e-41
 Identities = 143/390 (36%), Positives = 188/390 (48%), Gaps = 28/390 (7%)
 Frame = +3

Query: 60   TVPY----DFIGEIDQSNPNLSIEFDSLQ---CPSLDMDFLSNDIFLPEDLMEELGFGNE 218
            T+P+    +F  + D +N    I FD L     PS   DFL  D   P++    LG    
Sbjct: 47   TLPFASDLEFGMDFDDNNGEFEITFDDLDDICIPSDAEDFLLTDACNPDNT-SVLG---- 101

Query: 219  XXXXXXXXXXPPINEGFLAEGSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCGL 398
                       PI E   A+ SD   S   +      DR SG VSR  NS   D      
Sbjct: 102  -----------PIEESS-AKNSD---SPRSDASVVSGDRSSG-VSRFFNSQASD------ 139

Query: 399  DSSGTFSVQDSGGCNSVVAERIPNVCQNSGDPPSGDISRVFNSSSPDSGNCVRD--SSGP 572
                  SV +   C               G   + D+ RV N  SP+S  C R+  SSGP
Sbjct: 140  ------SVSEGNSCKE-------------GSLDAVDV-RVSNIPSPESEFCDREESSSGP 179

Query: 573  VSDQDSGGCRSAIAGFLNSASPNSVIVSELSGSPNSCNVSPDAMAMDERKECLLKRKNQD 752
            VS Q SG   S +   +NS SP+SV       S ++  V    + ++E   C LKRK + 
Sbjct: 180  VSSQGSGNAGSGVYEAINSPSPDSVSFERDITSSHAHEVMDKGVKLEEISGCDLKRKKES 239

Query: 753  -----------ENXXXXXXXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYV 899
                        +                    +++KRKARL+RNRESAQLSR+RKK+YV
Sbjct: 240  CEGSATKHRRFSSSSVDTKTEKQTPSDVNAIDDDDEKRKARLMRNRESAQLSRQRKKHYV 299

Query: 900  QELEHKVRSMHSTIASLNSKISFIMAENASLHHQLGQLAV-------GDVFPPSMAAPMH 1058
            +ELE KVRSM+S IA L+SKIS+++AENA+L  Q+G   +         ++P    APM 
Sbjct: 300  EELEEKVRSMNSIIADLSSKISYMVAENATLRQQVGAGVMCAPPPPAPGIYPHPPMAPMP 359

Query: 1059 YPWIPCSSYTMRPQ-SQVPLVPIPRLKPQQ 1145
            YPW+PC+ Y ++PQ SQVPLVPIPRLKPQQ
Sbjct: 360  YPWMPCAPYVVKPQGSQVPLVPIPRLKPQQ 389


>ref|XP_004493333.1| PREDICTED: uncharacterized protein LOC101504999 [Cicer arietinum]
          Length = 786

 Score =  171 bits (432), Expect = 1e-39
 Identities = 140/391 (35%), Positives = 188/391 (48%), Gaps = 43/391 (10%)
 Frame = +3

Query: 105  NLSIEFDSLQCPSLDMDFLSNDIFLPEDLMEELGFGNEXXXXXXXXXXPPINEGFLAEGS 284
            + S +F++L  PS+D  F   D F P DL   LG              P   + FL   +
Sbjct: 22   DFSGQFNNLPIPSIDAFFNDVDTF-PSDLDLPLGDFEITFDDLDTLCIPSDTDDFLLPDA 80

Query: 285  DVLLSSNPN------LCEDFSDRPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNS 446
                  NPN      L ++  D          NS   DYG    DS          G + 
Sbjct: 81   -----WNPNGLPISPLTDNHGDYNGDGDCSAKNS---DYGVANFDSP-------ESGASV 125

Query: 447  VVAERIPNVCQN------SGDPPSGDISRVFNSSSPDSGNCVRD--SSGPVSDQDSGGCR 602
            V +++ P+V +       S D  S D+ ++ +  SP++ +  R+  S+GP+S Q SG   
Sbjct: 126  VSSDQSPDVSRFFNSESVSADDNSVDV-KISSMPSPETESSDREESSNGPISSQGSGNGG 184

Query: 603  SAIAGFLNSASPNSVIVS-ELSGSPNSCNVSPDAMAMDERKECLLKRKNQD--------- 752
            S +   +NS SP+S     ++S S     V          K C LKRK ++         
Sbjct: 185  SGVYEAMNSPSPDSGRYERDISSSHKHAIVEEGVKLEGIVKGCDLKRKKENCIESAENRT 244

Query: 753  ----------ENXXXXXXXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQ 902
                      EN                    E++KRKARL+RNRESAQLSR+RKK+YV+
Sbjct: 245  PKCSRRSSSMENKTQQQLQQQQAQSGFDGIEDEDEKRKARLMRNRESAQLSRQRKKHYVE 304

Query: 903  ELEHKVRSMHSTIASLNSKISFIMAENASLHHQLG--------QLAVGDVFPPSMAAPMH 1058
            ELE KVRSMHSTIA L+SKI+F+MAENA+L  QLG          A   ++P     PM 
Sbjct: 305  ELEEKVRSMHSTIADLSSKITFVMAENATLRQQLGGGMMCPPPPPAGSGMYPHPPMPPMP 364

Query: 1059 YPWIPCSSYTMRPQ-SQVPLVPIPRLKPQQP 1148
            YPW+P + Y ++PQ SQVPLVPIPRLKPQQP
Sbjct: 365  YPWMPYAPYVVKPQGSQVPLVPIPRLKPQQP 395


>ref|XP_006411360.1| hypothetical protein EUTSA_v10016317mg [Eutrema salsugineum]
            gi|557112529|gb|ESQ52813.1| hypothetical protein
            EUTSA_v10016317mg [Eutrema salsugineum]
          Length = 722

 Score =  167 bits (424), Expect = 8e-39
 Identities = 128/364 (35%), Positives = 174/364 (47%), Gaps = 13/364 (3%)
 Frame = +3

Query: 99   NPNLSI---EFDSLQCPSLDMDFLSNDIFLP-EDLMEELGFGNEXXXXXXXXXXPPINEG 266
            +PN ++   +FDS+  P  D  + S    +P  +LM +LGF  +             +  
Sbjct: 14   DPNSTLAPPDFDSIPIPPFDQFYHSGSDQVPIGELMSDLGFPVDADGEFELTFDGMDDLY 73

Query: 267  FLAEGSDVLLSSNPNLCEDFSD-RPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCN 443
            F AE    L+  N +  E F D  P  E S +   S P          G      SG CN
Sbjct: 74   FPAENETFLIPVNASNQEQFGDFTPESEGSGISGDSLP---------KGDADKSTSGCCN 124

Query: 444  SVVAERIPNVCQNSGDPPSGDISRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSAIAGFL 623
                    +  ++SGD  SG              +   D   P+S Q SG C S ++   
Sbjct: 125  R-------DSPRDSGDRCSG-------------ADRTLDLPTPLSSQGSGNCGSDVSEAT 164

Query: 624  NSASPNSVIVSELSGSPNSCNVSPDAMAMDERKECLLKRKNQDENXXXXXXXXXXXXXXX 803
            N +SP SV V           V   A A   +++  ++    DE+               
Sbjct: 165  NESSPKSVNVVV----DQKVKVEEAATASITKRKKEIEEDMSDESRSSKYRRSGEDADAS 220

Query: 804  XXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISFIMAEN 983
                +E++K++ARL+RNRESAQLSR+RKK+YV+ELE KVR+MHSTI  LN KIS+ MAEN
Sbjct: 221  AVTGEEDEKKRARLMRNRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAEN 280

Query: 984  ASLHHQLGQLAVGDVF--PPSMA-----APMHYPWIPCSSYTMRPQ-SQVPLVPIPRLKP 1139
            A+L  QLG   +      PP M      APM YPW+PC  Y ++ Q SQVPL+PIPRLKP
Sbjct: 281  ATLRQQLGGNGMCPPHHPPPPMGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKP 340

Query: 1140 QQPI 1151
            Q P+
Sbjct: 341  QNPL 344


>ref|XP_007028261.1| Transcription factor hy5, putative [Theobroma cacao]
            gi|508716866|gb|EOY08763.1| Transcription factor hy5,
            putative [Theobroma cacao]
          Length = 687

 Score =  167 bits (423), Expect = 1e-38
 Identities = 106/245 (43%), Positives = 137/245 (55%), Gaps = 22/245 (8%)
 Frame = +3

Query: 480  NSGDPPSGDISRVFNSSSPDSGNCVR-DSSG----PVSDQDSGGCRSAIAGFLNSASPNS 644
            +S   P  D+ R  NSSSP+ G+C   DSSG    P+S   SG C SA++  +N+ SP+S
Sbjct: 64   DSSTTPDSDVERYLNSSSPELGSCNGPDSSGNSHSPLSSSGSGNCASAVSEAMNATSPDS 123

Query: 645  VIVSELSGSPNSC---NVSPDAMAMDERKECLLKRKNQDEN-XXXXXXXXXXXXXXXXXX 812
              + +   S        VS      +E      +R +   +                   
Sbjct: 124  ENIVDQKISVEEIGKRRVSKRKKDREETDSSKCRRSSLTPSVNNSNSNSDNNNNNNSNAP 183

Query: 813  XQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISFIMAENASL 992
             +EE+KR+ARL+RNRESAQLSR+RKK+YV+ELE KVR+MHSTIA LN+KI++ MAENA+L
Sbjct: 184  SEEEEKRRARLMRNRESAQLSRQRKKHYVEELEDKVRTMHSTIADLNNKIAYFMAENATL 243

Query: 993  HHQLGQLAVG-----------DVFPPSMAAPMHYPWIPCS-SYTMRPQ-SQVPLVPIPRL 1133
              QL     G              P  M  PM YPW+PC+  Y M+P  SQVPLVPIPRL
Sbjct: 244  RQQLSTAGGGGGGGGAVMCPPQPLPMPMYPPMAYPWVPCAPPYVMKPPGSQVPLVPIPRL 303

Query: 1134 KPQQP 1148
            KPQQP
Sbjct: 304  KPQQP 308


>ref|XP_003554104.2| PREDICTED: uncharacterized protein LOC100127362 [Glycine max]
          Length = 784

 Score =  166 bits (420), Expect = 2e-38
 Identities = 130/378 (34%), Positives = 184/378 (48%), Gaps = 30/378 (7%)
 Frame = +3

Query: 105  NLSIEFDSLQCPSLDMDFLSNDIF-LPEDLMEELGFGNEXXXXXXXXXXPPINEGFL-AE 278
            + S  F++   PS+D  F + D      DL   + F N             +++ F+ ++
Sbjct: 44   DFSSNFNAFLIPSMDSLFNTTDALPFASDLEFGMDFDNNGEFEITFDDLDELDDIFIPSD 103

Query: 279  GSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNSVVAE 458
              D LL   P++C    D  S  +    NS  PD                          
Sbjct: 104  AEDFLL---PDVCNSNYDSASPPID-AKNSDSPD-------------------------- 133

Query: 459  RIPNVCQNSGDPPSGDISRVFNSSSPDSGNCVRD--SSGPVSDQDSGGCRSAIAGFLNSA 632
               +V   SG+  S D  RV +  SP++  C R+  S+GPVS Q SG   S +   ++S 
Sbjct: 134  --SDVSAVSGEGDSADNVRVSSVPSPEAEFCDREESSNGPVSSQGSGNGGSGVYEAMHSP 191

Query: 633  SPNSVIVSELSGSPNSCNVSPDAMAMDERKECLLKRKNQD---------------ENXXX 767
            SP+S        S ++  V+ + + M+E     LKRK +                EN   
Sbjct: 192  SPDSGPYERDITSSHAHAVTNNGVKMEETPAFDLKRKKESCDGSATKHRRFSSSVENNNN 251

Query: 768  XXXXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIAS 947
                             E++KRKARL+RNRESAQLSR+RKK+YV+ELE KVRS++S IA 
Sbjct: 252  NTEKQSQSGLNGID--DEDEKRKARLMRNRESAQLSRQRKKHYVEELEEKVRSLNSIIAD 309

Query: 948  LNSKISFIMAENASLHHQLGQLAVGDVFPPSMA----------APMHYPWIPCSSYTMRP 1097
            ++SK+S+++AENA+L  Q+G   V    PP+ A          APM YPW+PC+ Y ++P
Sbjct: 310  MSSKMSYVVAENATLRQQVGAAGVMCPPPPAPAPGMYPHHPPMAPMPYPWMPCAPYVVKP 369

Query: 1098 Q-SQVPLVPIPRLKPQQP 1148
            Q SQVPLVPIPRLKPQQP
Sbjct: 370  QGSQVPLVPIPRLKPQQP 387


>ref|XP_007220226.1| hypothetical protein PRUPE_ppa002181mg [Prunus persica]
            gi|462416688|gb|EMJ21425.1| hypothetical protein
            PRUPE_ppa002181mg [Prunus persica]
          Length = 704

 Score =  166 bits (420), Expect = 2e-38
 Identities = 149/468 (31%), Positives = 208/468 (44%), Gaps = 33/468 (7%)
 Frame = +3

Query: 87   IDQSNPNLSIEFDSLQCPSLDMDFLSND---IFLPED-LMEELGFG--NEXXXXXXXXXX 248
            +D  +   + E D L  P LD  F S+D     +P D  M +LGFG  ++          
Sbjct: 16   LDHGDFKFNAELDGLAIPPLDPQFFSSDDGMATVPSDTFMSDLGFGFGSDDNCDFELTFD 75

Query: 249  PPINEGFLAEGSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQD 428
               N    +E  D L+    +        P G     LNS  P+ G+  +  SG     D
Sbjct: 76   DLDNLYLPSEADDFLIPDGLD--------PGGTA---LNSGSPESGSSAISISG----DD 120

Query: 429  SGGCNSVVAERIPNVCQNSGDPPSGDISRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSA 608
             GG +         V +    P S + S   NS+ P++     +S G VS Q SG   + 
Sbjct: 121  KGGSD---------VSRFLNCPSSNESSE--NSNGPENSGGPENSGGAVSSQGSGISEAV 169

Query: 609  IAGFLNSASPNSVIVSELSGSPNSCNVSPDAMAMDERKECLLKRKNQDENXXXXXXXXXX 788
             + + +  S NSV  + +S + +      D +     K CL+KRK   +           
Sbjct: 170  NSTWHSGNSGNSVSSNAISDADDEKVKMEDEIT----KNCLVKRKKVSDEGNVESRSAKY 225

Query: 789  XXXXXXXXX--------QEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIA 944
                              EE+KRKARL+RNRESAQLSR+RKK+YV+ELE KVR+MHSTIA
Sbjct: 226  RRSDNNNASVDANANGNDEEEKRKARLMRNRESAQLSRQRKKHYVEELEDKVRAMHSTIA 285

Query: 945  SLNSKISFIMAENASLHHQLGQLAVGDVFPPSMAAPMH---------YPWIPCSSYTMRP 1097
             LN++IS++MAENA+L  QL     G + PP   A MH         YPW+P S Y ++P
Sbjct: 286  DLNTRISYVMAENATLKQQLCS-GSGAMCPPPPHAGMHPHPPMPPMAYPWMPYSPYVVKP 344

Query: 1098 Q-SQVPLVPIPRLKPQQPI-----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVPF 1259
            Q SQ  LVPIPRLK QQP+                                      VP 
Sbjct: 345  QGSQGLLVPIPRLKSQQPVAAPKSKKSETKKTEGKTKKVASISFLGLLFFILLFGGLVPM 404

Query: 1260 VNVRYKEKKEMVPNRLGLITNSFDGQPRGSVLTV----NSSDQSVGVV 1391
            VNV +    +  P     +++ F  + R  VLTV    N S++++GV+
Sbjct: 405  VNVYFGGVTDRGPGGSAYVSDRFYDKSRVRVLTVHSNLNGSEENIGVI 452


>gb|AGO05993.1| bZIP transcription factor family protein 9 [Camellia sinensis]
          Length = 708

 Score =  164 bits (416), Expect = 7e-38
 Identities = 151/480 (31%), Positives = 213/480 (44%), Gaps = 24/480 (5%)
 Frame = +3

Query: 48   MANPTVPYDFIGEIDQSNPNLSIEFDSLQCPSLDMDFLSNDIF----LPEDL-MEELGFG 212
            MA+ +   D I      NPN + +FD+L  P LD  FLS+  F    LP D   ++L F 
Sbjct: 1    MADQSAAVDLI----PPNPNPT-DFDALAIPPLDSAFLSDSFFSDLALPFDADFDDLDF- 54

Query: 213  NEXXXXXXXXXXPPINEGFLAEGSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNC 392
                           ++ +L   S+  L+S P+    FS  PS + S +LNS+       
Sbjct: 55   -------------TFDDLYLPSDSEDFLNSFPS---QFSSDPSPDASTILNSA------- 91

Query: 393  GLDSSGTFSVQDSGGCNSVVAERIPNVCQNSGDPPSGDISRVFNSSSPDSGNCVRDSSGP 572
                    S Q SG          P + + SG   S   SRV N SSP+S          
Sbjct: 92   -----DQTSSQVSGD---------PEISEESGIKGSDVGSRVLNYSSPES---------- 127

Query: 573  VSDQDSGGCRSAIAGFLNSA------SPNSVIVSELSGSPNSCNVSPDAMAMDERKECLL 734
               ++SG   S     ++          N + +    GS +        +  + R+    
Sbjct: 128  -ETRNSGSAESGNFAIVDQKIEFEGEGKNFLSLKRKKGSED--------VNFESRRMGKY 178

Query: 735  KRKNQDENXXXXXXXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEH 914
            +R + + N                   +E++K+KARLIRNRESAQLSR+R+K+YV ELE 
Sbjct: 179  RRSSSEGNANSPCGLNGNN--------EEDEKKKARLIRNRESAQLSRQRRKHYVGELED 230

Query: 915  KVRSMHSTIASLNSKISFIMAENASLHHQLGQLAV---GDVFPPSMAAPMHYPWIPCSSY 1085
            KVR MHSTI  LN++IS+++AENASL  QLG         ++P    AP+ YPW+PC  Y
Sbjct: 231  KVRLMHSTIQDLNTRISYVIAENASLRQQLGGAMCPPPPGMYPHPPLAPLGYPWMPCPPY 290

Query: 1086 TMRPQ-SQVPLVPIPRLKPQQ-----PIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1247
             ++PQ SQ PLVPIP+LKPQQ                                       
Sbjct: 291  FVKPQGSQAPLVPIPKLKPQQSAPAPKAKKVESKKSESKTKKVASVSFLGLLLFILLFGG 350

Query: 1248 XVPFVNVRYKEKKEMVPNRLGLITNSFDGQPRGSVLTV----NSSDQSVGVVGLCSGKPG 1415
             VP +NV++   ++ VP     + N F     G VL V    N+SD ++G  GLCSG+ G
Sbjct: 351  LVPMINVKFGGMRDRVPGGSDYLGNRFYDHHGGRVLPVDGNLNNSDPTIG-TGLCSGRLG 409


>ref|XP_004136623.1| PREDICTED: uncharacterized protein LOC101215342 [Cucumis sativus]
            gi|449521537|ref|XP_004167786.1| PREDICTED:
            uncharacterized protein LOC101224129 [Cucumis sativus]
          Length = 768

 Score =  163 bits (412), Expect = 2e-37
 Identities = 144/401 (35%), Positives = 190/401 (47%), Gaps = 39/401 (9%)
 Frame = +3

Query: 66   PYDFIGEIDQSNPN---LSIEFDSLQCPSLDMDFLSN-------DIFLPEDLMEELGFGN 215
            P+  +   DQ NPN    + EFDSL  P LD  F S+       D FL    ++ LGF +
Sbjct: 4    PFHPVSPSDQ-NPNSTSYASEFDSLPIPPLDSLFFSDPNHDGPGDPFLYSTALD-LGFDD 61

Query: 216  EXXXXXXXXXXPPINEGFLAEGSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCG 395
                         +     +E  D L+S N +   +    P  +V    +SS P     G
Sbjct: 62   NDDFELTFDDLDDLC--LPSEADDFLISDNLDHPTNSPHLPP-DVPLEDDSSVPVCSPAG 118

Query: 396  LDSSGTFSVQ---DSGGCNSVVAER-----IPNVCQNSGDPP-SGDISRVFNSSSPDSGN 548
               SG+ +V        C  +  E        + C ++G        SR+ NS SP+ G+
Sbjct: 119  SPGSGSSAVSCHPSPHDCKFLNYESSKLGTADSECFSTGSGGWDSKGSRMVNSHSPELGD 178

Query: 549  CVRDSSGPVSDQDSGGCRSAIAGFLNSASPNSVIVSELSGSPNSCNVSPDAMAMDERKEC 728
                S GP S Q SG   S ++  +N  S N+     +        V     + +  K C
Sbjct: 179  H-EFSGGPASSQGSG---SGVSEGMNCPSSNAECYDVI--------VDQKVKSEEMGKNC 226

Query: 729  LLKRKN-QDENXXXXXXXXXXXXXXXXXXX----------QEEDKRKARLIRNRESAQLS 875
            + KRK  QDE                              ++++KRKARL+RNRESAQLS
Sbjct: 227  MTKRKKEQDEGNADFRSAKYQRSSVSTEATNPQLDPCSINEDDEKRKARLMRNRESAQLS 286

Query: 876  RERKKNYVQELEHKVRSMHSTIASLNSKISFIMAENASLHHQLGQLAVGDVFPPSM---- 1043
            R+RKK+YV+ELE KVR+MHSTIA LNSKIS+IMAENA L  QL    +    PP M    
Sbjct: 287  RQRKKHYVEELEDKVRNMHSTIAELNSKISYIMAENAGLRQQLSGSGMCQPPPPGMFPHP 346

Query: 1044 ----AAPMHYPWIPCSSYTMRPQ-SQVPLVPIPRLKPQQPI 1151
                  PM Y W+PC+ Y ++PQ SQVPLVPIPRLKPQQPI
Sbjct: 347  SMPPMPPMPYSWMPCAPYVVKPQGSQVPLVPIPRLKPQQPI 387


>ref|XP_002526200.1| transcription factor hy5, putative [Ricinus communis]
            gi|223534478|gb|EEF36179.1| transcription factor hy5,
            putative [Ricinus communis]
          Length = 702

 Score =  160 bits (405), Expect = 1e-36
 Identities = 119/316 (37%), Positives = 155/316 (49%), Gaps = 19/316 (6%)
 Frame = +3

Query: 498  SGD--ISRVFNSSSPDSGNCVRDSSG-------PVSDQDSGGCRSAIAGFLNSASPNSVI 650
            SGD  ++   NSS   S +    SSG       PVS Q SG   S ++  +N      V 
Sbjct: 99   SGDHHVATYLNSSPSASNSTTTCSSGDQLNVSSPVSSQGSGNGGSGVSDSVNFVVDQKVK 158

Query: 651  VSELSGSPNSCNVSPDAMAMDERKECLLKRKNQDENXXXXXXXXXXXXXXXXXXXQEEDK 830
            + E   +  + N S     + +RK    K    ++                     E++K
Sbjct: 159  LEEEGSNSKNKNGS-----LSKRK----KENGSEDTRNQKYRRSENSNANTQCVSDEDEK 209

Query: 831  RKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISFIMAENASLHHQLGQ 1010
            RKARL+RNRESAQLSR+RKK+YV+ELE KV++MHSTIA LNSKISF MAENA+L  QL  
Sbjct: 210  RKARLMRNRESAQLSRQRKKHYVEELEDKVKTMHSTIADLNSKISFFMAENATLRQQLS- 268

Query: 1011 LAVGDVFPPSMAAPMHYPWIPCSSYTMRPQ-SQVPLVPIPRLKPQQPI-----XXXXXXX 1172
                 + PP M APM YPW+PC+ Y ++ Q SQVPLVPIPRLK QQP+            
Sbjct: 269  -GGNGMCPPPMYAPMPYPWVPCAPYVVKAQGSQVPLVPIPRLKSQQPVSAAKSKKSDPKK 327

Query: 1173 XXXXXXXXXXXXXXXXXXXXXXXXXXVPFVNVRYKEKKEMVPNRLGLITNSFDGQPRGSV 1352
                                      VP VNV++    E   N  G +++ F  + RG V
Sbjct: 328  AEGKTKKVASVSFLGLLFFVLLFGGLVPIVNVKFGGVGENGAN--GFVSDKFYNRHRGRV 385

Query: 1353 LTV----NSSDQSVGV 1388
            L V    N S ++V V
Sbjct: 386  LRVDGHSNGSHENVDV 401


>ref|XP_006836241.1| hypothetical protein AMTR_s00101p00121930 [Amborella trichopoda]
            gi|548838741|gb|ERM99094.1| hypothetical protein
            AMTR_s00101p00121930 [Amborella trichopoda]
          Length = 772

 Score =  158 bits (400), Expect = 5e-36
 Identities = 132/384 (34%), Positives = 182/384 (47%), Gaps = 36/384 (9%)
 Frame = +3

Query: 108  LSIEFDSLQCPSLD---MDFLSNDIFLPEDLMEELGFGNEXXXXXXXXXXPPINEGFLAE 278
            L  +F++LQ P LD    D L ND+  P+ LM+ + F ++           P + G    
Sbjct: 18   LPSDFEALQIPPLDPAYTDGLFNDMDFPQTLMDGIEFSDDDLNFDLDDILLPSSPGS-GN 76

Query: 279  GSDVLLSSNPNLCEDFSDRPSGEVSR--------VLNSSFPDYGNCGLDSSGTFSVQDSG 434
             S VL  S  N     +   SG +S         V  S  P      L S  T    +  
Sbjct: 77   NSGVLPDSGQNCQSLMNSGDSGYLSNGDSSINIEVQGSQIP----ASLSSDHTNGDPEVD 132

Query: 435  GCNSVVAERIPNVCQNSGDPPSGDISRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSAIA 614
               SV+A   P++   +    +     +      D+ N +   SG  S   +G C +   
Sbjct: 133  SSGSVLASCDPSIQSVTNSETATFFPEI--RVCTDNKNQLLIDSGNTSSM-TGFCGTRDP 189

Query: 615  GFLNSASPNSVIVSELSG------SPNSCNVSPDAMAMD------ERKECLL----KRKN 746
            G L S SP S+  S++S       SP+S + S      D       + E L+    KRK 
Sbjct: 190  GVLGSESPQSLQCSQVSADYKDRPSPDSGHGSSSMNFGDTFTSGLSKSEPLVSFQGKRKK 249

Query: 747  QDENXXXXXXXXXXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRS 926
              E+                   +EE+K+KARL+RNRESAQLSR+RKK+YV ELE KVR+
Sbjct: 250  AREDGSPSSNN------------EEEEKKKARLMRNRESAQLSRQRKKHYVDELEDKVRA 297

Query: 927  MHSTIASLNSKISFIMAENASLHHQLGQLA-VGDVFPPSMAAPMHYPWIPCSSYTMRPQ- 1100
            MHSTIA LNS++SF  AEN +L HQL  L+   + +    + P H+PW+PCSSY M PQ 
Sbjct: 298  MHSTIAELNSRLSFATAENMNLRHQLIALSPTSNAYAQPSSLPSHFPWVPCSSYAMNPQN 357

Query: 1101 -------SQVPLVPIPRLKPQQPI 1151
                   SQVPL+PIPRL+ QQ +
Sbjct: 358  GLGFPPGSQVPLLPIPRLRTQQTV 381


>ref|XP_004299018.1| PREDICTED: uncharacterized protein LOC101299380 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 711

 Score =  157 bits (398), Expect = 9e-36
 Identities = 128/365 (35%), Positives = 172/365 (47%), Gaps = 21/365 (5%)
 Frame = +3

Query: 117  EFDSLQCPSLDMDFLSNDIFLP----EDLMEELGFG--NEXXXXXXXXXXPPINEGFLAE 278
            +F+SL  P LD  F S+D  +     +  M +LGFG  ++             N    +E
Sbjct: 27   DFESLPIPPLDPQFFSSDAGMATMAADSFMSDLGFGFGSDDNCDYELTFDDLDNLYIPSE 86

Query: 279  GSDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCGLDSSG--------TFSVQDSG 434
              D LL        D + +PS + S +L S  P+ G+ G+             +   +SG
Sbjct: 87   ADDFLLPEG----FDPAAQPSSDSSVILKSESPESGSSGVSKGSDGVVSGFLNYPSSESG 142

Query: 435  GCNSVVAERIPNVCQNSGDPPSGDISRVFNS--SSPDSGNCVRDSSGPVSDQDSGGCRSA 608
            G +   +E       NSG P S   S +  +  S   SGN  RD S  V+  D       
Sbjct: 143  GHDQEFSE-------NSGGPLSSQGSGIPEAANSPTHSGNSDRDVSSNVTTADE------ 189

Query: 609  IAGFLNSASPNSVIVSELSGSPNSCNVSPDAMAMDERKECLLKRKNQDENXXXXXXXXXX 788
                        V + E         V+        +KE     +   E+          
Sbjct: 190  -----------KVKIEE--------EVTRSGFVAKRKKESGGGEEGNMESRSSKFRRSES 230

Query: 789  XXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISF 968
                      E+++RKARL+RNRESAQLSR+RKK+YV+ELE KVR+MH+TIA LN+K+S+
Sbjct: 231  SGGSGGCLDDEDERRKARLMRNRESAQLSRQRKKHYVEELEDKVRAMHTTIADLNNKMSY 290

Query: 969  IMAENASLHHQL--GQLAVGDVFPPSM--AAPMHYPWIPCSSYTMRPQ-SQVPLVPIPRL 1133
            IMAENA+L  QL  G        PP M    PM YPW+P S Y ++PQ SQVPLVPIPRL
Sbjct: 291  IMAENATLKQQLSSGSGICPPPPPPGMYPMPPMGYPWMPYSPYVVKPQGSQVPLVPIPRL 350

Query: 1134 KPQQP 1148
            KPQQP
Sbjct: 351  KPQQP 355


>gb|AGO05994.1| bZIP transcription factor family protein 10 [Camellia sinensis]
          Length = 718

 Score =  157 bits (397), Expect = 1e-35
 Identities = 127/344 (36%), Positives = 164/344 (47%), Gaps = 45/344 (13%)
 Frame = +3

Query: 510  SRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSAIAGFLN--------SASPNSVIVSELS 665
            SRVFNS   D  +   + S P S  +S    S +A  L+        S +  SV+   L+
Sbjct: 84   SRVFNSD--DLISDFLNVSSPESSHESANKASIVARVLDPEVSSSQGSGNSGSVVSEPLN 141

Query: 666  -GSPNSCN------VSPDAMAMDERKECLLKRKNQDE--------------NXXXXXXXX 782
              SP+S N      V       +E   CLLKRK + E              +        
Sbjct: 142  YTSPDSANNSIHDFVDQKIELKEEGTNCLLKRKKESEEDVNSEFRTSKYQRSNSGENPNQ 201

Query: 783  XXXXXXXXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKI 962
                       ++++K+KARL+RNRESAQLSR+RKK+YV+ELE K+R+MHST+  LNSKI
Sbjct: 202  SYGYTSNTGISEDDEKKKARLMRNRESAQLSRQRKKHYVEELEDKLRTMHSTVQDLNSKI 261

Query: 963  SFIMAENASLHHQL--GQLAVGDVFPPSM-----AAPMHYPWIPCSSYTMRPQ-SQVPLV 1118
            S+IMAENASL  QL  G +    V PP M      APM YPW+PC  Y ++PQ SQVPLV
Sbjct: 262  SYIMAENASLRQQLSGGAMCPPPVPPPGMYPHPPMAPMGYPWMPCPPYVVKPQGSQVPLV 321

Query: 1119 PIPRLKPQQPI---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVPFVNVRYKE-KK 1286
            PIPRLK Q P                                     VP VNV +   ++
Sbjct: 322  PIPRLKSQNPSPAPKAKKVESKKTKTKKVASVSFLGLLFFILFFGGLVPMVNVNFGGIRR 381

Query: 1287 EMVPNRLGLITNSFDGQPRGSVLTV----NSSDQSVGVVGLCSG 1406
            + V        N F  Q  G V+TV    N SDQ +G +GL +G
Sbjct: 382  DTVLGGSNYFGNGFYDQHHGRVVTVNGHLNGSDQKIG-MGLSNG 424


>ref|XP_006482041.1| PREDICTED: uncharacterized protein LOC102629395 [Citrus sinensis]
          Length = 719

 Score =  154 bits (390), Expect = 7e-35
 Identities = 159/472 (33%), Positives = 215/472 (45%), Gaps = 33/472 (6%)
 Frame = +3

Query: 102  PNLSIEFDSLQCPSLDMDFLSNDIFLPEDLMEELGFGNEXXXXXXXXXXPPINEGFLAEG 281
            P  S +FD+L  P LD  +L++ I  P    ++L F  +            I++ + A  
Sbjct: 10   PPPSNDFDALSIPPLDPPYLNSQIPHPCASSDDLDFVLDDNCDFDFT----IDDLYFASE 65

Query: 282  SDVLLSSNPNLCEDFSDRPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNSVVAER 461
             D     +    ED  D   G+ S       PD         G  +V    G + ++   
Sbjct: 66   DDTFFLPS----EDPHDGQFGDFS-------PDV------DGGAAAVSPGSGSSGIL--- 105

Query: 462  IPNVCQNSGDPPSGDISRVFN-SSSP-DSGNCVRDS-----SGPVSDQDSGGCRSAIAGF 620
                    G+P S D+    N SSSP +SGN +        SG  S+    G  S     
Sbjct: 106  --------GNPASLDVESYLNYSSSPQNSGNRISHLNYIGVSGGRSENSGSGVSSD---- 153

Query: 621  LNSASPNSVIVSELSGSPNSCNVSPDA-MAMDE-RKECLLKRKNQDENXXXXXXXXXXXX 794
             N+  P          SP+S N+  D  + M+E  K+ + KRK   E             
Sbjct: 154  -NTDDP----------SPDSGNLVVDQKIKMEEVSKKGIFKRKKDIEETNNESRSNKYRK 202

Query: 795  XXXXXXXQ---------EEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIAS 947
                   +         EE KRKARL+RNRESAQLSR+RKK+YV+ELE KVR+MHSTIA 
Sbjct: 203  SSSLSVNEADNDHNLGEEEMKRKARLMRNRESAQLSRQRKKHYVEELEDKVRNMHSTIAD 262

Query: 948  LNSKISFIMAENASLHHQL-GQLAVG---DVFPP---SMAAPMHYPWIPCSS-YTMRPQ- 1100
            LNSKISF MAENASL  QL G  A+     ++PP     AAPM Y W+PC++ Y ++PQ 
Sbjct: 263  LNSKISFFMAENASLKQQLSGSNAMPPPLGMYPPPPHMAAAPMPYGWMPCAAPYMVKPQG 322

Query: 1101 SQVPLVPIPRLKPQ--QPIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVPFVNVRY 1274
            SQVPLVPIPRLKPQ    +                                 VP V+V+Y
Sbjct: 323  SQVPLVPIPRLKPQAAAAVPPRTKKSDGSKTKKVASVSFLGLLFFILLFGGLVPLVDVKY 382

Query: 1275 KEKKEMVPNRLGLITNSFDGQPRGSVLTV----NSSDQSVGVVGLCSGKPGF 1418
               ++ V    G  ++ F  Q RG VLT+    N S +S+G +G  +G+ GF
Sbjct: 383  GGIRDGVSG--GYFSSGFYNQHRGRVLTINGYSNGSGESMG-IGFPNGRVGF 431


>ref|XP_002308867.2| hypothetical protein POPTR_0006s03300g [Populus trichocarpa]
            gi|550335363|gb|EEE92390.2| hypothetical protein
            POPTR_0006s03300g [Populus trichocarpa]
          Length = 729

 Score =  154 bits (388), Expect = 1e-34
 Identities = 117/299 (39%), Positives = 148/299 (49%), Gaps = 36/299 (12%)
 Frame = +3

Query: 363  NSSFPDYGNCGLDSSGTFSVQ--DSGGCNSVVAERIPNVCQNSGDPPSGD-ISRVFNSSS 533
            N+  PD G  G  +S T +++  DSGG         P  C + G       + +  N S 
Sbjct: 96   NTVNPDPGCFGDFASNTVNLESTDSGG---------PGTCGDHGGLEVDKYVDKYLNPSP 146

Query: 534  PDSGNCVRDSSG---------PVSDQDSGGCRSAIAGFLNSASPNSVIVSELSGSPNSCN 686
             ++ +C  DS G         PVS   SG   S   G L++ SP S        + N CN
Sbjct: 147  SEAESC--DSGGSDYRSSVLSPVSSHGSGNSGS---GVLSAGSPES------GTNVNPCN 195

Query: 687  VSPDAMAMDERKECLLKRKN-------------QDENXXXXXXXXXXXXXXXXXXXQ--- 818
               D   +    E   KRK+              +EN                       
Sbjct: 196  FVVDKKFVKTETESAKKRKSAKIAVAKRKKEMGDEENGEIMRNLKSRKAESENVSVNVSG 255

Query: 819  ------EEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISFIMAE 980
                  EED+RKARL+RNRESAQLSR+RKK+YV+ELE KVR MHSTIA LN K+S+ MAE
Sbjct: 256  SASLSGEEDRRKARLMRNRESAQLSRQRKKHYVEELEDKVRMMHSTIAQLNGKVSYFMAE 315

Query: 981  NASLHHQLGQLAVGDVFPPSMAAPM-HYPWIPCSSYTMRPQ-SQVPLVPIPRLKPQQPI 1151
            NA+L     QL+     PP M APM  YPW+PC+ Y ++PQ SQVPLVPIPRLKPQQ +
Sbjct: 316  NATLRR---QLSGNGACPPPMYAPMAPYPWVPCAPYVVKPQGSQVPLVPIPRLKPQQTV 371


>ref|XP_006293762.1| hypothetical protein CARUB_v10022722mg [Capsella rubella]
            gi|482562470|gb|EOA26660.1| hypothetical protein
            CARUB_v10022722mg [Capsella rubella]
          Length = 725

 Score =  153 bits (387), Expect = 2e-34
 Identities = 121/358 (33%), Positives = 170/358 (47%), Gaps = 13/358 (3%)
 Frame = +3

Query: 117  EFDSLQCPSLDMDFL--SNDIFLPEDLMEELGFGNEXXXXXXXXXXPPINEGFLAEGSDV 290
            +FDS+  P  D  F    +D     +LM +LGF +                 F AE    
Sbjct: 27   DFDSISIPPFDDQFYHPGSDQTPIGELMSDLGFPDGEFELTFDGMDDLY---FPAENESF 83

Query: 291  LLSSNPNLCEDFSD-RPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNSVVAERIP 467
            L+  N +  E F D  P  E S +             D    F    + GC++  + R  
Sbjct: 84   LIPVNTSSQEQFGDFTPDSEGSGISG-----------DPKDVFKNITTSGCSNRESPRDS 132

Query: 468  NVCQNSGDPPSGDISRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSAIAGFLNSASPNS- 644
            +  + SG  PS D+                    P+S Q SG C S ++   N +SP S 
Sbjct: 133  DD-RCSGADPSLDLPT------------------PLSSQGSGNCASDVSEATNESSPKSR 173

Query: 645  -VIVSELSGSPNSCNVSPDAMAMDERKECLLKRKNQDENXXXXXXXXXXXXXXXXXXXQE 821
             V+V +      +   +    ++ +RK+ + +  + +                     +E
Sbjct: 174  NVVVDQKVKVEEAATTT----SITKRKKEIEEDLSGESRSSKYRRSGEEDIDASAVTGEE 229

Query: 822  EDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISFIMAENASLHHQ 1001
            ++K+KARL+RNRESAQLSR+RKK+YV+ELE KVR+MHSTI  LN KIS+ MAENA+L  Q
Sbjct: 230  DEKKKARLMRNRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQ 289

Query: 1002 LGQLAVGDVF--PPSMA-----APMHYPWIPCSSYTMRPQ-SQVPLVPIPRLKPQQPI 1151
            LG   +      PP M      APM YPW+PC  Y ++ Q SQVPL+PIPRLKPQ P+
Sbjct: 290  LGGNGMCPPHHPPPPMGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLKPQNPL 347


>ref|NP_565946.1| transcription factor BZIP17 [Arabidopsis thaliana]
            gi|20196934|gb|AAB86455.2| bZIP family transcription
            factor [Arabidopsis thaliana] gi|330254811|gb|AEC09905.1|
            Basic-leucine zipper (bZIP) transcription factor family
            protein [Arabidopsis thaliana]
          Length = 721

 Score =  153 bits (387), Expect = 2e-34
 Identities = 127/365 (34%), Positives = 169/365 (46%), Gaps = 20/365 (5%)
 Frame = +3

Query: 117  EFDSLQCPSLDMDFLSNDIFLPEDLMEELGFGNEXXXXXXXXXXPPINEGFLAEGSDVLL 296
            +FDS+  P LD D  S+   + E LM +LGF +                 F AE    L+
Sbjct: 26   DFDSISIPPLD-DHFSDQTPIGE-LMSDLGFPDGEFELTFDGMDDLY---FPAENESFLI 80

Query: 297  SSNPNLCEDFSD-RPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNSVVAERIPNV 473
              N +  E F D  P  E S +        G+C +      ++  SG             
Sbjct: 81   PINTSNQEQFGDFTPESESSGIS-------GDCIVPKDADKTITTSG------------- 120

Query: 474  CQNSGDPPSGDISRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSAIAGFLNSASPNS--- 644
            C N   P   D       S  D      D   P+S Q SG C S ++   N +SP S   
Sbjct: 121  CINRESPRDSDD----RCSGADHN---LDLPTPLSSQGSGNCGSDVSEATNESSPKSRNV 173

Query: 645  -----VIVSELSGSPNSCNVSP---DAMAMDERKECLLKRKNQDENXXXXXXXXXXXXXX 800
                 V V E + +  S        D    DE +    +R  +D +              
Sbjct: 174  AVDQKVKVEEAATTTTSITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTG-------- 225

Query: 801  XXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISFIMAE 980
                 +E++K++ARL+RNRESAQLSR+RKK+YV+ELE KVR+MHSTI  LN KIS+ MAE
Sbjct: 226  -----EEDEKKRARLMRNRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAE 280

Query: 981  NASLHHQLG--QLAVGDVFPPSMA-----APMHYPWIPCSSYTMRPQ-SQVPLVPIPRLK 1136
            NA+L  QLG   +    + PP M      APM YPW+PC  Y ++ Q SQVPL+PIPRLK
Sbjct: 281  NATLRQQLGGNGMCPPHLPPPPMGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLK 340

Query: 1137 PQQPI 1151
            PQ  +
Sbjct: 341  PQNTL 345


>gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding protein
            [Arabidopsis thaliana] gi|23198400|gb|AAN15727.1|
            putative TGACG-sequence-specific bZIP DNA-binding protein
            [Arabidopsis thaliana]
          Length = 721

 Score =  153 bits (387), Expect = 2e-34
 Identities = 127/365 (34%), Positives = 169/365 (46%), Gaps = 20/365 (5%)
 Frame = +3

Query: 117  EFDSLQCPSLDMDFLSNDIFLPEDLMEELGFGNEXXXXXXXXXXPPINEGFLAEGSDVLL 296
            +FDS+  P LD D  S+   + E LM +LGF +                 F AE    L+
Sbjct: 26   DFDSISIPPLD-DHFSDQTPIGE-LMSDLGFPDGEFELTFDGMDDLY---FPAENESFLI 80

Query: 297  SSNPNLCEDFSD-RPSGEVSRVLNSSFPDYGNCGLDSSGTFSVQDSGGCNSVVAERIPNV 473
              N +  E F D  P  E S +        G+C +      ++  SG             
Sbjct: 81   PINTSNQEQFGDFTPESESSGIS-------GDCIVPKDADKTITTSG------------- 120

Query: 474  CQNSGDPPSGDISRVFNSSSPDSGNCVRDSSGPVSDQDSGGCRSAIAGFLNSASPNS--- 644
            C N   P   D       S  D      D   P+S Q SG C S ++   N +SP S   
Sbjct: 121  CINRESPRDSDD----RCSGADHN---LDLPTPLSSQGSGNCGSDVSEATNESSPKSRNV 173

Query: 645  -----VIVSELSGSPNSCNVSP---DAMAMDERKECLLKRKNQDENXXXXXXXXXXXXXX 800
                 V V E + +  S        D    DE +    +R  +D +              
Sbjct: 174  AVDQKVKVEEAATTTTSITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTG-------- 225

Query: 801  XXXXXQEEDKRKARLIRNRESAQLSRERKKNYVQELEHKVRSMHSTIASLNSKISFIMAE 980
                 +E++K++ARL+RNRESAQLSR+RKK+YV+ELE KVR+MHSTI  LN KIS+ MAE
Sbjct: 226  -----EEDEKKRARLMRNRESAQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAE 280

Query: 981  NASLHHQLG--QLAVGDVFPPSMA-----APMHYPWIPCSSYTMRPQ-SQVPLVPIPRLK 1136
            NA+L  QLG   +    + PP M      APM YPW+PC  Y ++ Q SQVPL+PIPRLK
Sbjct: 281  NATLRQQLGGNGMCPPHLPPPPMGMYPPMAPMPYPWMPCPPYMVKQQGSQVPLIPIPRLK 340

Query: 1137 PQQPI 1151
            PQ  +
Sbjct: 341  PQNTL 345

