BLASTX nr result

ID: Sinomenium22_contig00019165 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00019165
         (1276 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266577.2| PREDICTED: ZF-HD homeobox protein At4g24660-...   211   4e-52
emb|CBI17508.3| unnamed protein product [Vitis vinifera]              187   1e-44
ref|XP_006425664.1| hypothetical protein CICLE_v10025926mg [Citr...   182   4e-43
emb|CAN72985.1| hypothetical protein VITISV_009036 [Vitis vinifera]   180   1e-42
ref|XP_007204025.1| hypothetical protein PRUPE_ppa020272mg [Prun...   178   4e-42
ref|XP_007046919.1| Homeobox protein 24, putative [Theobroma cac...   173   2e-40
ref|XP_007224511.1| hypothetical protein PRUPE_ppa023369mg [Prun...   164   8e-38
ref|XP_003520309.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   163   2e-37
ref|XP_002521573.1| transcription factor, putative [Ricinus comm...   162   4e-37
ref|XP_007156052.1| hypothetical protein PHAVU_003G254200g [Phas...   161   5e-37
ref|XP_007017558.1| Homeobox protein 33 isoform 1 [Theobroma cac...   160   1e-36
ref|XP_004165400.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   160   1e-36
ref|XP_004152776.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   160   1e-36
ref|XP_004232414.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   159   2e-36
ref|XP_007156949.1| hypothetical protein PHAVU_002G031000g [Phas...   159   3e-36
ref|XP_006383190.1| hypothetical protein POPTR_0005s12420g [Popu...   158   5e-36
ref|XP_004293203.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   156   2e-35
ref|XP_002281371.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   156   2e-35
ref|XP_006380765.1| hypothetical protein POPTR_0007s12970g [Popu...   155   4e-35
ref|XP_004287891.1| PREDICTED: uncharacterized protein LOC101298...   155   5e-35

>ref|XP_002266577.2| PREDICTED: ZF-HD homeobox protein At4g24660-like [Vitis vinifera]
          Length = 345

 Score =  211 bits (538), Expect = 4e-52
 Identities = 136/320 (42%), Positives = 166/320 (51%), Gaps = 22/320 (6%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKISLSS------------TGNGISVISPP 524
            M+LRGQ+K +GMP+SLG S  N     E  SK+S +S              +G +V+SP 
Sbjct: 1    MELRGQDKEIGMPSSLGYSPPNR----ESPSKVSPASIVLPVGDRRRDGAASGTTVLSP- 55

Query: 525  HSTTVDXXXXXXXRPKNRXXXXXXXXXXXXXERDPDPVSIAV-VTASIAPIITVGSNPRT 701
             S T+D       +                 + DPDPVS  + V+ + A  IT GSNP+ 
Sbjct: 56   -SQTLDHRHLHHHQ--FNLQQQTQHGEVGDPDPDPDPVSATIAVSGATATPITGGSNPKV 112

Query: 702  ----PKTKITQASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHR 869
                P      A+S+RYRECLKNHAAS+GGH  DGCGEFMPSGE+GT EALKCAACDCHR
Sbjct: 113  AAAPPHPPPQSAASIRYRECLKNHAASMGGHVFDGCGEFMPSGEEGTLEALKCAACDCHR 172

Query: 870  NFHRKEAEGESQSHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKF 1049
            NFHRKE +GESQ         NC+Y  NPN+ N                    + QH K+
Sbjct: 173  NFHRKEIDGESQP------TANCYYTCNPNT-NSSRRNTIAPQLPPSHAPLPHLHQHHKY 225

Query: 1050 ---XXXXXXXXXXXXXXXAFGGSGATXXXXXXP--TVYHSNAGAAAVGLSQFAISKKRFR 1214
                              AFGG G            ++ SN G        FA+SKKRFR
Sbjct: 226  SHGLSGSPLMSPIPPMMMAFGGGGGAPAESSSEDLNMFQSNVGMHLQPQPAFALSKKRFR 285

Query: 1215 TKFSQEQKDRMLEFAEKVGW 1274
            TKFSQEQKD+M EFAEK+GW
Sbjct: 286  TKFSQEQKDKMQEFAEKLGW 305


>emb|CBI17508.3| unnamed protein product [Vitis vinifera]
          Length = 410

 Score =  187 bits (474), Expect = 1e-44
 Identities = 105/222 (47%), Positives = 123/222 (55%), Gaps = 5/222 (2%)
 Frame = +3

Query: 624  DPDPVSIAV-VTASIAPIITVGSNPRT----PKTKITQASSVRYRECLKNHAASLGGHAL 788
            DPDPVS  + V+ + A  IT GSNP+     P      A+S+RYRECLKNHAAS+GGH  
Sbjct: 48   DPDPVSATIAVSGATATPITGGSNPKVAAAPPHPPPQSAASIRYRECLKNHAASMGGHVF 107

Query: 789  DGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQSHHQHGGIGNCFYCYNPNSKN 968
            DGCGEFMPSGE+GT EALKCAACDCHRNFHRKE +GESQ         NC+Y  NPN+ N
Sbjct: 108  DGCGEFMPSGEEGTLEALKCAACDCHRNFHRKEIDGESQP------TANCYYTCNPNT-N 160

Query: 969  XXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXXXXAFGGSGATXXXXXXPTVY 1148
                                + QH K+                               ++
Sbjct: 161  SSRRNTIAPQLPPSHAPLPHLHQHHKY------------------SHAPAESSSEDLNMF 202

Query: 1149 HSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
             SN G        FA+SKKRFRTKFSQEQKD+M EFAEK+GW
Sbjct: 203  QSNVGMHLQPQPAFALSKKRFRTKFSQEQKDKMQEFAEKLGW 244


>ref|XP_006425664.1| hypothetical protein CICLE_v10025926mg [Citrus clementina]
            gi|568824849|ref|XP_006466804.1| PREDICTED:
            ras-interacting protein RIP3-like [Citrus sinensis]
            gi|557527654|gb|ESR38904.1| hypothetical protein
            CICLE_v10025926mg [Citrus clementina]
          Length = 363

 Score =  182 bits (461), Expect = 4e-43
 Identities = 113/329 (34%), Positives = 159/329 (48%), Gaps = 31/329 (9%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKISLS-----STGNGISVISPPHSTT--- 536
            M+L+G+EK +GM +S+  + +++ ++    + I+        T +G ++ + P +     
Sbjct: 1    MELQGKEKEIGMTSSMRYNRDSSSTVSTPINSIAGEMIRDQGTVHGEAIFNLPQTLDQHQ 60

Query: 537  --------VDXXXXXXXRPKNRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVGSN 692
                    ++       + +                + PDPV ++V   +        SN
Sbjct: 61   HPPYRHHQLNSQQQQQPQTQQNLQNKPSAGSSNPEAQHPDPVPVSVANTTTNTKEANRSN 120

Query: 693  PRTP------KTKITQASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAA 854
             R+P       T I  ASS+RYRECLKNHAA++G H +DGCGEFMPSGEDGT E LKCAA
Sbjct: 121  QRSPAQAPTTSTAIITASSIRYRECLKNHAANMGNHVIDGCGEFMPSGEDGTPEGLKCAA 180

Query: 855  CDCHRNFHRKEAEGESQSHHQHGGIGNCFYCYN-PNSKNXXXXXXXXXXXXXXXXXXXXI 1031
            CDCHRNFHRKE +G+SQS  Q+    +  Y YN P+  N                    +
Sbjct: 181  CDCHRNFHRKEIDGDSQSQSQYA--AHSLYPYNYPSRNNSTQRNHHHQQQQQPPPPFHHL 238

Query: 1032 QQHQKFXXXXXXXXXXXXXXXAFGGSGAT--------XXXXXXPTVYHSNAGAAAVGLSQ 1187
            QQH +                 FGG G                  ++HS+AG    G + 
Sbjct: 239  QQHHRISYTSPQTASIAPMMMTFGGGGGAGGSSGGLDESSSEDLNMFHSSAG----GQTS 294

Query: 1188 FAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
                KKRFRTKFSQEQKD+M+EFAE +GW
Sbjct: 295  MQAKKKRFRTKFSQEQKDKMMEFAETLGW 323


>emb|CAN72985.1| hypothetical protein VITISV_009036 [Vitis vinifera]
          Length = 250

 Score =  180 bits (456), Expect = 1e-42
 Identities = 101/215 (46%), Positives = 118/215 (54%), Gaps = 8/215 (3%)
 Frame = +3

Query: 654  TASIAPIITVGSNPRT----PKTKITQASSVRYRECLKNHAASLGGHALDGCGEFMPSGE 821
            + + A  IT GSNP+     P      A+S+RYRECLKNHAAS+GGH  DGCGEFMPSGE
Sbjct: 2    SGATATPITGGSNPKVAAAPPHPPPQSAASIRYRECLKNHAASMGGHVFDGCGEFMPSGE 61

Query: 822  DGTAEALKCAACDCHRNFHRKEAEGESQSHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXX 1001
            +GT EALKCAACDCHRNFHRKE +GESQ         NC+Y  NPN+ +           
Sbjct: 62   EGTLEALKCAACDCHRNFHRKEIDGESQP------TANCYYTCNPNTNSSRRNTIAPQLP 115

Query: 1002 XXXXXXXXXIQQHQKFXXXXXXXXXXXXXXX--AFGGSGATXXXXXXP--TVYHSNAGAA 1169
                      Q H+                   AFGG G            ++ SN G  
Sbjct: 116  PSHAPLPHLHQXHKYSHGLSGSPLMSPIPPMMMAFGGGGGAPAESSSEDLNMFQSNVGMH 175

Query: 1170 AVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
                  FA+SKKRFRTKFSQEQKD+M EFAEK+GW
Sbjct: 176  LQPQPAFALSKKRFRTKFSQEQKDKMQEFAEKLGW 210


>ref|XP_007204025.1| hypothetical protein PRUPE_ppa020272mg [Prunus persica]
            gi|462399556|gb|EMJ05224.1| hypothetical protein
            PRUPE_ppa020272mg [Prunus persica]
          Length = 331

 Score =  178 bits (452), Expect = 4e-42
 Identities = 114/304 (37%), Positives = 149/304 (49%), Gaps = 6/304 (1%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISY-NNNPSIGEQTSKISLSSTGNGISVISPPHSTTVDXXXXX 557
            M++RGQ+K++GMP +LG +  N + S  + +S  +L ST N I  I+P    T+D     
Sbjct: 1    MEVRGQDKVIGMPTTLGYNPPNRDSSSSKLSSSPALPSTANNIIFINPLQ--TLDPHPSP 58

Query: 558  XXRPKNRXXXXXXXXXXXXXERDPDPVSIAVV-----TASIAPIITVGSNPRTPKTKITQ 722
                 ++             E++PDP+S  +V     T + A  I  GSN + P  +   
Sbjct: 59   HRHQPHQLNLSPHKSSRRDSEQNPDPISSPIVVTPSATTTTATSIPGGSNFKAPPAQPPP 118

Query: 723  ASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGES 902
               VRYRECL+NHAAS GGH LDGCGEFMPSGE+   EALKCAAC+CHRNFHRKE EG+ 
Sbjct: 119  PQKVRYRECLRNHAASSGGHVLDGCGEFMPSGEEDIPEALKCAACECHRNFHRKEIEGDH 178

Query: 903  QSHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXX 1082
                    + N +Y  N                         +  H              
Sbjct: 179  --------LPNNYYVVNHQKHTISRRDSETRVFQLPPPPLPPV--HHSAAGGPVPQTMMA 228

Query: 1083 XXXXAFGGSGATXXXXXXPTVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEFAE 1262
                  GG GA         + +      A G  Q A SKKRFRTKFSQEQK++M+E AE
Sbjct: 229  FGGRGGGGGGADESSSEDLNMNNLFRATYAAG-QQAAGSKKRFRTKFSQEQKEKMMEVAE 287

Query: 1263 KVGW 1274
            K+GW
Sbjct: 288  KLGW 291


>ref|XP_007046919.1| Homeobox protein 24, putative [Theobroma cacao]
            gi|508699180|gb|EOX91076.1| Homeobox protein 24, putative
            [Theobroma cacao]
          Length = 385

 Score =  173 bits (438), Expect = 2e-40
 Identities = 98/237 (41%), Positives = 120/237 (50%), Gaps = 20/237 (8%)
 Frame = +3

Query: 624  DPDPVSIAVVTASIAPIITVGSNPRTPK-------------------TKITQASSVRYRE 746
            DPDP  ++  TA+ +  +T  +N  + K                   T I+    +RYRE
Sbjct: 103  DPDPDPVSAPTATTSATVTASANRSSLKSPQQQPPTSQPPPVAAASPTTISSTPLIRYRE 162

Query: 747  CLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQSHHQHGG 926
            C+KNHAAS+G H +DGCGEFMPSGE+GT EALKCAAC+CHRNFHRKE  GE+Q       
Sbjct: 163  CMKNHAASMGSHVMDGCGEFMPSGEEGTPEALKCAACECHRNFHRKEINGETQY------ 216

Query: 927  IGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXXXXA-FG 1103
              +C+Y YNPN  N                     QQ                     F 
Sbjct: 217  APSCYYSYNPNKNNNRRDTTHPPSQLHPQQPIPLHQQRFSLGLSTSPTAMPIAPVMMNFR 276

Query: 1104 GSGATXXXXXXPTVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
            G G          ++HSNAG       Q   SKKRFRTKFSQEQKD+M+EFAEK+GW
Sbjct: 277  GGGPAESSSEDLNMFHSNAGGQISAQPQ--SSKKRFRTKFSQEQKDKMMEFAEKLGW 331


>ref|XP_007224511.1| hypothetical protein PRUPE_ppa023369mg [Prunus persica]
            gi|462421447|gb|EMJ25710.1| hypothetical protein
            PRUPE_ppa023369mg [Prunus persica]
          Length = 310

 Score =  164 bits (415), Expect = 8e-38
 Identities = 91/219 (41%), Positives = 113/219 (51%), Gaps = 1/219 (0%)
 Frame = +3

Query: 621  RDPDPVSIAVVTASIAPIITVGSNPRTPKTKITQASSVRYRECLKNHAASLGGHALDGCG 800
            RDPDP      T   + ++  G    T K        +RYRECLKNHAA++GG+  DGCG
Sbjct: 70   RDPDPDRALAGTPVPSTVLASGGPKSTSKI-------IRYRECLKNHAANIGGNVFDGCG 122

Query: 801  EFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQSHHQHGGIGNCFYCYNPNSKNXXXX 980
            EFMPSGE+GT EALKCAACDCHRNFHRKE +GE+ +             ++  S+     
Sbjct: 123  EFMPSGEEGTLEALKCAACDCHRNFHRKEVDGETTA-------------FSHGSRRSSIM 169

Query: 981  XXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXXXXAFG-GSGATXXXXXXPTVYHSN 1157
                            +  H                  AFG G G T        V+ SN
Sbjct: 170  LSPLQLPPPLPSPSSALHHHHHHHQKFSMAPIIQPMNVAFGSGGGGTESSSEDLNVFQSN 229

Query: 1158 AGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
                 + +  FA+SKKRFRTKF+QEQK+RM+EFAEKVGW
Sbjct: 230  NAEGGLPMPPFAMSKKRFRTKFTQEQKERMMEFAEKVGW 268


>ref|XP_003520309.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Glycine max]
          Length = 334

 Score =  163 bits (412), Expect = 2e-37
 Identities = 115/321 (35%), Positives = 148/321 (46%), Gaps = 23/321 (7%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPS-------IGEQTSKISLSSTGNGISVISPPHSTTV 539
            MD+R Q+K++ MP++LG  YNN+ S       IGE++S        + +    PP +++ 
Sbjct: 1    MDMREQDKVIEMPSTLG--YNNSSSGSKLSSPIGERSSDQLPPHQSHTLVFTDPPQTSSH 58

Query: 540  DXXXXXXXRPKNRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVGSNPRTPKTKIT 719
                     P N               RDPDP SI      I+P I   +    P    T
Sbjct: 59   HHNLYPPSLPPN---PLQLPQPHHRPRRDPDPSSI------ISPPIISTTPTTAPPQPHT 109

Query: 720  QASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEG- 896
              +  RYRECLKNHAAS+GGH  DGCGEFMP+GE+GT E+LKCAAC+CHRNFHRKE    
Sbjct: 110  TTTLFRYRECLKNHAASMGGHVTDGCGEFMPNGEEGTPESLKCAACECHRNFHRKEPHQG 169

Query: 897  ---ESQSHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXX 1067
               ESQ  H            N N++N                       H         
Sbjct: 170  VLVESQLQH---------VLLNKNNRNINTIIHSPDSHHHLQFPTPHSHLH-------GG 213

Query: 1068 XXXXXXXXXAFGGSGATXXXXXXPTVYHSN---AGAAAVGLSQF---------AISKKRF 1211
                      FGGSG          ++ +N    G   + LS           + SKKRF
Sbjct: 214  PPVVQPVMLGFGGSGPAESSSEDLNMFQTNDHGGGGNNLLLSSVQQQPPLLSSSSSKKRF 273

Query: 1212 RTKFSQEQKDRMLEFAEKVGW 1274
            RTKF+Q+QKDRM+EFAEK+GW
Sbjct: 274  RTKFTQQQKDRMMEFAEKLGW 294


>ref|XP_002521573.1| transcription factor, putative [Ricinus communis]
            gi|223539251|gb|EEF40844.1| transcription factor,
            putative [Ricinus communis]
          Length = 333

 Score =  162 bits (409), Expect = 4e-37
 Identities = 111/308 (36%), Positives = 143/308 (46%), Gaps = 10/308 (3%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKIS--LSSTGNGISVISPPHSTTVDXXXX 554
            M++R Q+K +GMP+SL      NP   + +SK    +S+ G   +   P  S T +    
Sbjct: 1    MEVRSQDKEIGMPSSLDC----NPPKRDSSSKFPPMISALGERTTDHQPAISQTHEQHHP 56

Query: 555  XXXRPK-NRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVGSN------PRTPKTK 713
               +PK N                DP P       A+  P +    +      P +  T 
Sbjct: 57   LYDQPKMNLHQQSLKPIRDLDLIPDPAPAPAPATGATNRPPVPSSRSMSRSPPPASAITT 116

Query: 714  ITQASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAE 893
               A SVRYRECLKNHAAS GG  +DGCGEFMPSG++GT EA+KCAAC+CHRNFHRKE  
Sbjct: 117  TASAPSVRYRECLKNHAASTGGLIVDGCGEFMPSGQEGTLEAMKCAACECHRNFHRKEIH 176

Query: 894  GESQSHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXX 1073
            GESQ         NC YC N + +N                    I Q + F        
Sbjct: 177  GESQC------AANC-YCKNNSQRNNTVPPPYHHLSHSLASAQPPIHQRRTFPHGFSSAV 229

Query: 1074 XXXXXXXAFGGSGATXXXXXXP-TVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRML 1250
                    FG  GA          ++  N    + G       KKR+RTKFSQEQKD+M+
Sbjct: 230  LTAPVLMTFGSGGAAAESSSEDLDMFQPN----SQGHGCMQQLKKRYRTKFSQEQKDKMM 285

Query: 1251 EFAEKVGW 1274
            EFAE++ W
Sbjct: 286  EFAERLEW 293


>ref|XP_007156052.1| hypothetical protein PHAVU_003G254200g [Phaseolus vulgaris]
            gi|561029406|gb|ESW28046.1| hypothetical protein
            PHAVU_003G254200g [Phaseolus vulgaris]
          Length = 321

 Score =  161 bits (408), Expect = 5e-37
 Identities = 109/306 (35%), Positives = 139/306 (45%), Gaps = 8/306 (2%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPS--------IGEQTSKISLSSTGNGISVISPPHSTT 536
            MD+R Q+K++ MP +LG +  N  S        IGE++ +   S T     V S P  T 
Sbjct: 1    MDMREQDKVIEMPGTLGYNLPNTNSSSSKLSSLIGERSDQPPQSHT----LVFSDPPQTN 56

Query: 537  VDXXXXXXXRPKNRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVGSNPRTPKTKI 716
                      P +               RD DP +I+       PI+T       P +  
Sbjct: 57   -SHHHRRLNPPNSLPPNPLQLPHPHRPRRDLDPTAIS------PPIVTTSRTQ--PHSTG 107

Query: 717  TQASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEG 896
            T  ++VRYRECLKNHAA +GGH  DGCGEFMPSGE+GT E+ KCAAC+CHRNFHRKE EG
Sbjct: 108  TFTATVRYRECLKNHAAIMGGHVTDGCGEFMPSGEEGTPESFKCAACECHRNFHRKEPEG 167

Query: 897  ESQSHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXX 1076
            ES  H  +      ++   PN  N                       H            
Sbjct: 168  ESSQHVLN------YHLTYPNKTNRNIVIHSPQSHLQLP------THHLHGVVATPSGGS 215

Query: 1077 XXXXXXAFGGSGATXXXXXXPTVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEF 1256
                   FGG+                AG         + SKKRFRTKFSQ+QKD+M+EF
Sbjct: 216  VQPAVLGFGGTPTESSSEDLNMFQTDEAGQLLSVQPPLSSSKKRFRTKFSQQQKDQMMEF 275

Query: 1257 AEKVGW 1274
            A+K+GW
Sbjct: 276  ADKLGW 281


>ref|XP_007017558.1| Homeobox protein 33 isoform 1 [Theobroma cacao]
            gi|590593411|ref|XP_007017559.1| Homeobox protein 33
            isoform 1 [Theobroma cacao] gi|508722886|gb|EOY14783.1|
            Homeobox protein 33 isoform 1 [Theobroma cacao]
            gi|508722887|gb|EOY14784.1| Homeobox protein 33 isoform 1
            [Theobroma cacao]
          Length = 296

 Score =  160 bits (405), Expect = 1e-36
 Identities = 111/300 (37%), Positives = 149/300 (49%), Gaps = 2/300 (0%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKISLSSTGNGISVISPPHSTTVDXXXXXX 560
            M++RGQE  +  P S G  +++ P    +          NG +V++   + T+D      
Sbjct: 1    MEVRGQEHDIKAPGSSGFGHHS-PGADRRRD-----GNHNGTAVLTC--TETLDHVH--- 49

Query: 561  XRPKNRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVGSNPRTPKTKITQASSVRY 740
             RP+ +              R P P  +    A++API +V SN +        +S +RY
Sbjct: 50   -RPQRQQSLGQG--------RSPHPDRVTASGAAVAPI-SVSSNTKP-------SSVIRY 92

Query: 741  RECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQSHHQH 920
            RECLKNHAAS+GG+  DGCGEFMPSGE+GT EALKCAACDCHRNFHRKE +GE+Q     
Sbjct: 93   RECLKNHAASIGGNVYDGCGEFMPSGEEGTLEALKCAACDCHRNFHRKEVDGETQ----- 147

Query: 921  GGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKF-XXXXXXXXXXXXXXXA 1097
                     + PNS +                    +  HQ++                A
Sbjct: 148  ---------FGPNS-SRRSLMLNPLQLPPPLPSPTMLHHHQRYSVHTSPSSAMVAPMNVA 197

Query: 1098 FG-GSGATXXXXXXPTVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
            FG G G          ++ SNA         + +SKKRFRTKF+QEQKD+MLEFAEK+GW
Sbjct: 198  FGSGGGCGTESSSEDLMFQSNAEGMPPP-PPYVLSKKRFRTKFTQEQKDKMLEFAEKLGW 256


>ref|XP_004165400.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Cucumis sativus]
          Length = 320

 Score =  160 bits (404), Expect = 1e-36
 Identities = 91/188 (48%), Positives = 105/188 (55%), Gaps = 5/188 (2%)
 Frame = +3

Query: 726  SSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQ 905
            S VRYRECLKNHAAS+GG+  DGCGEFMPSGEDGT EALKCAAC+CHRNFHRKE +GE+Q
Sbjct: 90   SGVRYRECLKNHAASVGGNIYDGCGEFMPSGEDGTLEALKCAACECHRNFHRKEIDGETQ 149

Query: 906  SHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKF-----XXXXXXX 1070
             +             +PN +                     +  H KF            
Sbjct: 150  LN------------ISPNYRR--GLMLNHLQLPPPLPSPSALHGHHKFSMALNLHSSPTA 195

Query: 1071 XXXXXXXXAFGGSGATXXXXXXPTVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRML 1250
                    AF G G          V+HSN  A  +  S F++SKKRFRTKF+QEQKDRML
Sbjct: 196  PIIAPMNVAFAGGGGNESSSEDLNVFHSN--AEVMPPSSFSLSKKRFRTKFTQEQKDRML 253

Query: 1251 EFAEKVGW 1274
            EFAEKVGW
Sbjct: 254  EFAEKVGW 261


>ref|XP_004152776.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Cucumis sativus]
          Length = 276

 Score =  160 bits (404), Expect = 1e-36
 Identities = 91/188 (48%), Positives = 105/188 (55%), Gaps = 5/188 (2%)
 Frame = +3

Query: 726  SSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQ 905
            S VRYRECLKNHAAS+GG+  DGCGEFMPSGEDGT EALKCAAC+CHRNFHRKE +GE+Q
Sbjct: 46   SGVRYRECLKNHAASVGGNIYDGCGEFMPSGEDGTLEALKCAACECHRNFHRKEIDGETQ 105

Query: 906  SHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKF-----XXXXXXX 1070
             +             +PN +                     +  H KF            
Sbjct: 106  LN------------ISPNYRR--GLMLNHLQLPPPLPSPSALHGHHKFSMALNLHSSPTA 151

Query: 1071 XXXXXXXXAFGGSGATXXXXXXPTVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRML 1250
                    AF G G          V+HSN  A  +  S F++SKKRFRTKF+QEQKDRML
Sbjct: 152  PIIAPMNVAFAGGGGNESSSEDLNVFHSN--AEVMPPSSFSLSKKRFRTKFTQEQKDRML 209

Query: 1251 EFAEKVGW 1274
            EFAEKVGW
Sbjct: 210  EFAEKVGW 217


>ref|XP_004232414.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Solanum
            lycopersicum]
          Length = 293

 Score =  159 bits (402), Expect = 2e-36
 Identities = 110/301 (36%), Positives = 141/301 (46%), Gaps = 3/301 (0%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKISLSSTGNGISVISPPHSTTVDXXXXXX 560
            M+ RGQEK +G+PN   +SYN +  + +Q S  S ++       ++ P+ TT +      
Sbjct: 1    MEHRGQEKDMGLPNPNPMSYNPS-QLNQQESSSSAAN-----KFLTAPNRTTNEHENTIF 54

Query: 561  XRPKNRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVGSNPRTPKTKITQASSVRY 740
               +                 DPDPV     +++    IT                 VRY
Sbjct: 55   SPNQT------LDQHNITQNSDPDPVRQLSTSSASERNIT----------------PVRY 92

Query: 741  RECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQS---H 911
            +ECLKNHAA+LGG+ LDGCGEFMPSGE+ T E LKCAACDCHRNFHRKE E ESQ+   H
Sbjct: 93   KECLKNHAANLGGYVLDGCGEFMPSGEEETLEYLKCAACDCHRNFHRKETEDESQTPGVH 152

Query: 912  HQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXXX 1091
              +  I N      P S                       QQH                 
Sbjct: 153  RNNHRIPN----QTPPS----------------LPAVPTQQQHHHKYPHSYPRGHMAPVM 192

Query: 1092 XAFGGSGATXXXXXXPTVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVG 1271
             +FGG+           +   + G   +    F+ SKKRFRTKFSQ+QKDRMLEFAEK+G
Sbjct: 193  MSFGGNTGVAAESSSEDLNMFHGGQGVIQPCNFSASKKRFRTKFSQQQKDRMLEFAEKLG 252

Query: 1272 W 1274
            W
Sbjct: 253  W 253


>ref|XP_007156949.1| hypothetical protein PHAVU_002G031000g [Phaseolus vulgaris]
            gi|561030364|gb|ESW28943.1| hypothetical protein
            PHAVU_002G031000g [Phaseolus vulgaris]
          Length = 353

 Score =  159 bits (401), Expect = 3e-36
 Identities = 113/336 (33%), Positives = 147/336 (43%), Gaps = 38/336 (11%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKISLSSTGNGIS----------------- 509
            M++ GQ+K + +P SLG +  N  S    +SK+S  + G   +                 
Sbjct: 1    MEMEGQDKEIEIPTSLGYNLPNRDS-SSSSSKLSSPTVGERSTTHHDHGHDHGDHDHGHD 59

Query: 510  -VISPPHSTTVDXXXXXXXRPKNRXXXXXXXXXXXXXER-DPDPVSIAVVTASIAPIITV 683
             +  PPH T           P +               R  PDP     +  +  P+ T 
Sbjct: 60   QLHQPPHQTHT---LIFNEPPHHNLYQPPPPLAPRQPHRLTPDPDLSTPIAPTSNPLRTA 116

Query: 684  GSNPRT----PKTKITQASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCA 851
                 T      T  T   S+RYRECL+NHAAS+G H +DGCGEFM SGE+GT E+L+CA
Sbjct: 117  HPQTTTIAAAAATTTTSTPSIRYRECLRNHAASMGSHVVDGCGEFMASGEEGTPESLRCA 176

Query: 852  ACDCHRNFHRKEAEGE-------------SQSHHQHGGIGNCFYCYNPNSKNXXXXXXXX 992
            AC+CHRNFHRKE EGE              Q   QH      ++ Y PN+ N        
Sbjct: 177  ACECHRNFHRKEVEGELQPQQPPPLSLLPQQQQQQH---APNYHSYYPNNHNGHLHYPTP 233

Query: 993  XXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXXXXAFGGSGATXXXXXXPTVYHSNAGAAA 1172
                           H +                AFGG   +        ++ SN G A 
Sbjct: 234  SPS----------SLHHRLVGSSGTPSLVPPVMMAFGGPAESSSEDL--NMFQSNTGGAH 281

Query: 1173 VGLSQFA--ISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
              LS  A   SKKRFRTKFS++QKDRM+EFAEK+GW
Sbjct: 282  AQLSVQAPVSSKKRFRTKFSKQQKDRMMEFAEKIGW 317


>ref|XP_006383190.1| hypothetical protein POPTR_0005s12420g [Populus trichocarpa]
            gi|550338772|gb|ERP60987.1| hypothetical protein
            POPTR_0005s12420g [Populus trichocarpa]
          Length = 339

 Score =  158 bits (399), Expect = 5e-36
 Identities = 108/313 (34%), Positives = 145/313 (46%), Gaps = 15/313 (4%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKISLSSTGNGISVISP--PHSTTVDXXXX 554
            M+LRGQEK   MP S      N P+  + +S+I  + T           PH+  ++    
Sbjct: 1    MELRGQEKETVMPRSF-----NPPNNRDSSSRIPSAPTRRDHRHTDTVLPHTLDLEHQSL 55

Query: 555  XXXRPK-----NRXXXXXXXXXXXXXERDPDPVSIAVVTASIA-----PIITVGSNPRTP 704
               + +     N                DP   +  V T S       P  ++  +P  P
Sbjct: 56   YQQQQQQQKQLNPQHQACKPTRDLDLTPDPTQATTPVATTSATNTAPTPSRSISRSPPPP 115

Query: 705  KTKITQASSVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRK 884
             T  + AS +RYRECLKNHAAS+GGH LDGCGEFMP GE+GT E  KCAAC+CHR+FHR+
Sbjct: 116  PTSASSAS-IRYRECLKNHAASMGGHVLDGCGEFMPGGEEGTPETFKCAACECHRSFHRR 174

Query: 885  EAEGESQSHHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKF--XXX 1058
            E +G  Q       + N     N N K                     +  HQ++     
Sbjct: 175  EIDGAPQC------VANSTCYKNSNGKRNILPFPQQLVTSHAPPQSASLHPHQRYHHGTL 228

Query: 1059 XXXXXXXXXXXXAFGGSGATXXXXXXP-TVYHSNAGAAAVGLSQFAISKKRFRTKFSQEQ 1235
                        +FGG GA          +Y S+    +   +Q  ISKKRFRT+FS+EQ
Sbjct: 229  STYTTPIAPMMMSFGGGGAAAESSSEDLNMYQSDLQGQS--SAQPLISKKRFRTRFSEEQ 286

Query: 1236 KDRMLEFAEKVGW 1274
            KD+M+EFAEK+GW
Sbjct: 287  KDKMMEFAEKLGW 299


>ref|XP_004293203.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Fragaria vesca
            subsp. vesca]
          Length = 342

 Score =  156 bits (395), Expect = 2e-35
 Identities = 94/221 (42%), Positives = 120/221 (54%), Gaps = 3/221 (1%)
 Frame = +3

Query: 621  RDPDPVSIAVVTASIAPIITVGSNPRTPKTKITQASSVRYRECLKNHAASLGGHALDGCG 800
            RDPDP  + + + ++ P   V  + +TP   +  AS+VRYRECLKNHAA++GG+  DGCG
Sbjct: 73   RDPDPDRV-IASNALVPSSAVARS-KTPT--LATASNVRYRECLKNHAANIGGNVFDGCG 128

Query: 801  EFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQSHHQHGGIGNCFYCYNPNSKNXXXX 980
            EFMP GE+GT EALKCAACDCHRNFHRKE +GE+ +   HG            S++    
Sbjct: 129  EFMPCGEEGTLEALKCAACDCHRNFHRKEVDGETMTPFGHGS----------RSRSIMLS 178

Query: 981  XXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXXXXAFGGSGATXXXXXXP---TVYH 1151
                              +HQKF               AFGGSG             V+ 
Sbjct: 179  PIQLPPPLPSPH-----HRHQKF-------SIVQPMSVAFGGSGGGGGGESSSEDLNVFD 226

Query: 1152 SNAGAAAVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
            +  G        +++SKKRFRTKF+ EQK RM+EFAEKVGW
Sbjct: 227  NADGIGGGVAPPYSLSKKRFRTKFTAEQKVRMVEFAEKVGW 267


>ref|XP_002281371.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Vitis vinifera]
          Length = 316

 Score =  156 bits (394), Expect = 2e-35
 Identities = 110/295 (37%), Positives = 137/295 (46%), Gaps = 4/295 (1%)
 Frame = +3

Query: 402  KILGMPNSLGISYNNNPSIGEQTSKISLSSTGNGISVISPPHSTTVDXXXXXXXRPKNRX 581
            K +G P S G  YN   S G         +  NG +V  PP             +  N  
Sbjct: 5    KEMGFPPSSG--YNPLASAGSGGGGDDHHNIDNGTTVFKPPQIPH-HHPLLQQQQELNPQ 61

Query: 582  XXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVGSNPRTPKTKITQASSVRYRECLKNH 761
                        + DPDPV +A V A      T+G +           +SVRYRECLKNH
Sbjct: 62   QQSLGQGCDPDPDPDPDPVHVAGVLAGATIASTIGGS--------NSKASVRYRECLKNH 113

Query: 762  AASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQSHHQHGGIGNCF 941
            AA++GG+ +DGCGEFMP GE+GT EAL CAAC+CHRNFHRKE +GE+        IG   
Sbjct: 114  AANIGGNVVDGCGEFMPDGEEGTLEALMCAACNCHRNFHRKEVDGET--------IGRSA 165

Query: 942  YCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXXXXAFGGS-GAT 1118
              ++P                         Q+  K                AFG S GAT
Sbjct: 166  PHFHPLPPTLASPPYLHR------------QKFPKAFHAPPSTIIIPPMSMAFGTSIGAT 213

Query: 1119 XXXXXXPTVYHSNAGAA---AVGLSQFAISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
                     + SNAGAA          ++SKKRFRTKF+QEQK++MLE+AEKVGW
Sbjct: 214  ESSSEDLRAFDSNAGAAPPPPPPPPPSSLSKKRFRTKFTQEQKEKMLEYAEKVGW 268


>ref|XP_006380765.1| hypothetical protein POPTR_0007s12970g [Populus trichocarpa]
            gi|550334765|gb|ERP58562.1| hypothetical protein
            POPTR_0007s12970g [Populus trichocarpa]
          Length = 331

 Score =  155 bits (392), Expect = 4e-35
 Identities = 108/309 (34%), Positives = 141/309 (45%), Gaps = 11/309 (3%)
 Frame = +3

Query: 381  MDLRGQEKILGMPNSLGISYNNNPSIGEQTSKISLSSTGNGISVISPPHSTTVDXXXXXX 560
            M+LRGQ+K + MP SL     N P   + +SK+  S+           H+  V       
Sbjct: 1    MELRGQDKGIVMPKSLNY---NPPDNRDSSSKVPNSAPARR----DHHHAAAVLPHALGH 53

Query: 561  XRPKNRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITVG----SNPRTPKTKITQAS 728
                 +             +  PDPV      A+   I T      S  R+P      ++
Sbjct: 54   QSLYQQQQQQQAQKPTTDLDLTPDPVQATTPIATTGAINTAQTPSRSLSRSPPPTPASSA 113

Query: 729  SVRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCAACDCHRNFHRKEAEGESQS 908
            S RYRECLKNHAAS+GGH LDGCGEFMP GE+GT E+ KCAAC+CHRNFHR+E +GE Q 
Sbjct: 114  STRYRECLKNHAASMGGHVLDGCGEFMPGGEEGTLESFKCAACECHRNFHRREIDGEPQC 173

Query: 909  HHQHGGIGNCFYCYNPNSKNXXXXXXXXXXXXXXXXXXXXIQQHQKFXXXXXXXXXXXXX 1088
                    +  Y  +   +N                      QH ++             
Sbjct: 174  -----VANSTLYKISNGQRNILPPQHLVTSCAPRQPFP---HQHHRYHQGTLSAYTTPIA 225

Query: 1089 XXAF------GGSGATXXXXXXPTVYHSN-AGAAAVGLSQFAISKKRFRTKFSQEQKDRM 1247
                      GG  A         +Y SN  G A+V   Q ++S+KRFRTKFSQ+QKD+M
Sbjct: 226  PMIMSFGRGDGGGAAAESSSEDLNMYQSNLQGQASV---QPSMSRKRFRTKFSQDQKDKM 282

Query: 1248 LEFAEKVGW 1274
             EFAEK+GW
Sbjct: 283  TEFAEKLGW 291


>ref|XP_004287891.1| PREDICTED: uncharacterized protein LOC101298828 [Fragaria vesca
            subsp. vesca]
          Length = 358

 Score =  155 bits (391), Expect = 5e-35
 Identities = 115/328 (35%), Positives = 147/328 (44%), Gaps = 30/328 (9%)
 Frame = +3

Query: 381  MDLR-GQEKILGMPNSLGIS-YNNNPSIGEQTSKISLSSTGNGISVISPPHSTTVDXXXX 554
            M++R GQ+KILGMP +LG +  N   S   + S  SL    +  + +   H  T+D    
Sbjct: 1    MEIRAGQDKILGMPTTLGFNPQNRESSSSSRLSSPSLHHHHHVNNTLIFNHPQTLDPFYQ 60

Query: 555  XXXRPKNRXXXXXXXXXXXXXERDPDPVSIAVVTASIAPIITV----GSNPRTPKTKITQ 722
               + ++               RDPDPV       S A + T      S PR        
Sbjct: 61   P--QTQHHHHHHQPQQSNPYKPRDPDPVPNPDPNLSPAAVCTTPRATSSTPRGANHSFKA 118

Query: 723  ASS-----------------VRYRECLKNHAASLGGHALDGCGEFMPSGEDGTAEALKCA 851
            A S                 VRYRECLKNHAA+ GGH LDGCGEFMPSGE+ +   LKCA
Sbjct: 119  APSAAQAQVPVPEPVPASKAVRYRECLKNHAATTGGHVLDGCGEFMPSGEEDSPGGLKCA 178

Query: 852  ACDCHRNFHRKEAEGESQSHHQHGGIGNCFYCYNPNSKN--XXXXXXXXXXXXXXXXXXX 1025
            ACDCHRNFHRKE EGE+Q  H    + N ++  N  +KN                     
Sbjct: 179  ACDCHRNFHRKEIEGETQLVH----VPNNYHVLNHPNKNSHSSRRNASSAPVVPSLPAPP 234

Query: 1026 XIQQHQKF-----XXXXXXXXXXXXXXXAFGGSGATXXXXXXPTVYHSNAGAAAVGLSQF 1190
             +  H ++                     FGG G          + H N    +   +Q 
Sbjct: 235  PVHHHHQYHHFPATSPNVAGSFPPGSMMTFGGGGGA-AESSSEDLNHMNMYDQS---NQA 290

Query: 1191 AISKKRFRTKFSQEQKDRMLEFAEKVGW 1274
              S+KRFRTKFSQEQKD+M+E AEK+GW
Sbjct: 291  GSSRKRFRTKFSQEQKDKMMEVAEKLGW 318

