BLASTX nr result

ID: Papaver25_contig00037101 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00037101
         (1431 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXC01084.1| hypothetical protein L484_025452 [Morus notabilis]     178   5e-42
ref|XP_002534484.1| transcription factor, putative [Ricinus comm...   178   6e-42
ref|XP_006291497.1| hypothetical protein CARUB_v10017644mg [Caps...   173   2e-40
ref|XP_004165400.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   169   3e-39
ref|XP_004152776.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   169   3e-39
ref|XP_007156052.1| hypothetical protein PHAVU_003G254200g [Phas...   167   1e-38
ref|XP_007046919.1| Homeobox protein 24, putative [Theobroma cac...   165   5e-38
ref|XP_003520078.1| PREDICTED: transcription factor HB29-like [G...   164   7e-38
ref|XP_006349273.1| PREDICTED: ZF-HD homeobox protein At4g24660-...   163   2e-37
ref|XP_002324056.2| hypothetical protein POPTR_0017s11920g [Popu...   163   2e-37
ref|XP_006373320.1| hypothetical protein POPTR_0017s11920g [Popu...   161   6e-37
ref|XP_007017558.1| Homeobox protein 33 isoform 1 [Theobroma cac...   161   8e-37
ref|XP_007224511.1| hypothetical protein PRUPE_ppa023369mg [Prun...   155   3e-35
ref|XP_006376247.1| hypothetical protein POPTR_0013s11330g [Popu...   154   7e-35
ref|NP_568570.1| homeobox protein 23 [Arabidopsis thaliana] gi|7...   113   2e-22
gb|AAM64462.1| unknown [Arabidopsis thaliana]                         112   4e-22
dbj|BAE99230.1| hypothetical protein [Arabidopsis thaliana]           111   7e-22
ref|XP_002870762.1| hypothetical protein ARALYDRAFT_494014 [Arab...   110   2e-21
ref|XP_006395326.1| hypothetical protein EUTSA_v10004599mg [Eutr...   108   4e-21
gb|ABK22835.1| unknown [Picea sitchensis]                             108   4e-21

>gb|EXC01084.1| hypothetical protein L484_025452 [Morus notabilis]
          Length = 302

 Score =  178 bits (452), Expect = 5e-42
 Identities = 117/288 (40%), Positives = 143/288 (49%), Gaps = 23/288 (7%)
 Frame = +2

Query: 380  VVKP----VTSNNSNMKQHHHHSSKFQ-VKYKECMKNHAASIGGHANDGCGEFMQ---EA 535
            VV+P      +NN+N+   +  +S    VKYKECM+NHAASIGGHANDGCGEFM    EA
Sbjct: 28   VVEPKNNAANNNNNNLNNKNILNSNCNDVKYKECMRNHAASIGGHANDGCGEFMPSSAEA 87

Query: 536  DQDNNNARSLRCAACGCHRNFHRKEIHGSGF-----VDHLQQQVIVNYNGTNPRYEK--- 691
            ++ + +  SL CAACGCHRNFHR+EI G        V     Q ++ YN   P   +   
Sbjct: 88   EKSDMSPSSLTCAACGCHRNFHRREIIGCEQNLLYPVHSPPPQPVLLYNAVGPATPRWDP 147

Query: 692  -------TKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDA 850
                    ++H  H   R G                 + +V+ R              D 
Sbjct: 148  KMSVVGPPRRHQIHSLHRNG------------SGGGGRGIVEAREGRAECGAEVREYYDR 195

Query: 851  LFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKD 1030
                              R SETPE                      KRFRTKF+ EQKD
Sbjct: 196  ------------------RSSETPERE-----DVSAPSAMAGSGGGGKRFRTKFTLEQKD 232

Query: 1031 KMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTLRK 1174
            +ML FAD+IGWRIQ+QDD A+NQFC EIGVKR VLKVWMHNNKN  R+
Sbjct: 233  RMLAFADRIGWRIQRQDDVAINQFCSEIGVKRNVLKVWMHNNKNAHRR 280


>ref|XP_002534484.1| transcription factor, putative [Ricinus communis]
            gi|223525213|gb|EEF27897.1| transcription factor,
            putative [Ricinus communis]
          Length = 245

 Score =  178 bits (451), Expect = 6e-42
 Identities = 107/242 (44%), Positives = 123/242 (50%)
 Frame = +2

Query: 449  VKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHRKEIHGSGF 628
            VKYKECMKNHAASIGGHANDGCGEFM  AD +N     L CAACGCHRNFHR+E      
Sbjct: 35   VKYKECMKNHAASIGGHANDGCGEFMPCADDNN-----LTCAACGCHRNFHRRE------ 83

Query: 629  VDHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQFLVQHRXX 808
                         GT+      ++HH  H      +  P  +   V  + +  +  H   
Sbjct: 84   -------------GTSAA-SSARQHHTLHFEHLLLSPPPLAAAKSVTVSKKHLITSH--- 126

Query: 809  XXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXX 988
                                            RRSETPE      R              
Sbjct: 127  --------------------DHSDDPEDDDHDRRSETPE------RGEVNHVGGLGSRAK 160

Query: 989  XKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTL 1168
             KRFRTKF+QEQKD+ML+FA+KIGWRI K DD ALNQFC E+GVKR VLKVWMHNNKN  
Sbjct: 161  NKRFRTKFTQEQKDRMLEFAEKIGWRINKNDDMALNQFCDEVGVKRNVLKVWMHNNKNAH 220

Query: 1169 RK 1174
            R+
Sbjct: 221  RR 222


>ref|XP_006291497.1| hypothetical protein CARUB_v10017644mg [Capsella rubella]
            gi|482560204|gb|EOA24395.1| hypothetical protein
            CARUB_v10017644mg [Capsella rubella]
          Length = 323

 Score =  173 bits (438), Expect = 2e-40
 Identities = 101/264 (38%), Positives = 132/264 (50%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSL 565
            KP++ +N  +K+HHHH+    V YKEC+KNHAA+IGGHA DGCGEFM       ++  SL
Sbjct: 29   KPISFSNGFIKRHHHHN--VTVTYKECLKNHAAAIGGHALDGCGEFMPSPSSTPSDPTSL 86

Query: 566  RCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLP 745
            +CAACGCHRNFHR++   S           +  + T     + + HH HH         P
Sbjct: 87   KCAACGCHRNFHRRDPDDSSAA---LPPSSLPLSSTTTAAIEYQPHHRHHPLPPPLPRSP 143

Query: 746  SLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPE 925
            + S P   +++   L                   A                    S TP 
Sbjct: 144  NSSSPPPISSSYMLLALSGNNNNKTVPFSDLNFSAAAAAAGRHNNNI--------STTPG 195

Query: 926  ERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFC 1105
             R                    KRFRTKFS  QK+KM +FADKIGW+IQK+D++ + +FC
Sbjct: 196  SR--------------------KRFRTKFSSSQKEKMHEFADKIGWKIQKRDEDNVREFC 235

Query: 1106 LEIGVKRQVLKVWMHNNKNTLRKP 1177
             EIGV + VLKVWMHNNKNT + P
Sbjct: 236  REIGVDKGVLKVWMHNNKNTFKFP 259


>ref|XP_004165400.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Cucumis sativus]
          Length = 320

 Score =  169 bits (428), Expect = 3e-39
 Identities = 93/245 (37%), Positives = 129/245 (52%), Gaps = 3/245 (1%)
 Frame = +2

Query: 449  VKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHRKEIHGSGF 628
            V+Y+EC+KNHAAS+GG+  DGCGEFM   +  +    +L+CAAC CHRNFHRKEI G   
Sbjct: 92   VRYRECLKNHAASVGGNIYDGCGEFMPSGE--DGTLEALKCAACECHRNFHRKEIDGETQ 149

Query: 629  VD---HLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQFLVQH 799
            ++   + ++ +++N+    P        H HH+     N   S + P +   N  F    
Sbjct: 150  LNISPNYRRGLMLNHLQLPPPLPSPSALHGHHKFSMALNLHSSPTAPIIAPMNVAFA--- 206

Query: 800  RXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXX 979
                                                 +E+  E + +             
Sbjct: 207  ---------------------------------GGGGNESSSEDLNVFHSNAEVMPPSSF 233

Query: 980  XXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNK 1159
                KRFRTKF+QEQKD+ML+FA+K+GWRIQKQD+  + +FC E+GVKRQVLKVWMHNNK
Sbjct: 234  SLSKKRFRTKFTQEQKDRMLEFAEKVGWRIQKQDEEEVERFCTEVGVKRQVLKVWMHNNK 293

Query: 1160 NTLRK 1174
            NT++K
Sbjct: 294  NTVKK 298


>ref|XP_004152776.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Cucumis sativus]
          Length = 276

 Score =  169 bits (428), Expect = 3e-39
 Identities = 93/245 (37%), Positives = 129/245 (52%), Gaps = 3/245 (1%)
 Frame = +2

Query: 449  VKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHRKEIHGSGF 628
            V+Y+EC+KNHAAS+GG+  DGCGEFM   +  +    +L+CAAC CHRNFHRKEI G   
Sbjct: 48   VRYRECLKNHAASVGGNIYDGCGEFMPSGE--DGTLEALKCAACECHRNFHRKEIDGETQ 105

Query: 629  VD---HLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQFLVQH 799
            ++   + ++ +++N+    P        H HH+     N   S + P +   N  F    
Sbjct: 106  LNISPNYRRGLMLNHLQLPPPLPSPSALHGHHKFSMALNLHSSPTAPIIAPMNVAFA--- 162

Query: 800  RXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXX 979
                                                 +E+  E + +             
Sbjct: 163  ---------------------------------GGGGNESSSEDLNVFHSNAEVMPPSSF 189

Query: 980  XXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNK 1159
                KRFRTKF+QEQKD+ML+FA+K+GWRIQKQD+  + +FC E+GVKRQVLKVWMHNNK
Sbjct: 190  SLSKKRFRTKFTQEQKDRMLEFAEKVGWRIQKQDEEEVERFCTEVGVKRQVLKVWMHNNK 249

Query: 1160 NTLRK 1174
            NT++K
Sbjct: 250  NTVKK 254


>ref|XP_007156052.1| hypothetical protein PHAVU_003G254200g [Phaseolus vulgaris]
            gi|561029406|gb|ESW28046.1| hypothetical protein
            PHAVU_003G254200g [Phaseolus vulgaris]
          Length = 321

 Score =  167 bits (423), Expect = 1e-38
 Identities = 99/274 (36%), Positives = 135/274 (49%), Gaps = 3/274 (1%)
 Frame = +2

Query: 368  LEPIVVKPVTSNNSNMKQHHHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDN 547
            L+P  + P     S  + H   +    V+Y+EC+KNHAA +GGH  DGCGEFM   ++  
Sbjct: 86   LDPTAISPPIVTTSRTQPHSTGTFTATVRYRECLKNHAAIMGGHVTDGCGEFMPSGEE-- 143

Query: 548  NNARSLRCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRG 727
                S +CAAC CHRNFHRKE  G         Q ++NY+ T P   KT ++   H    
Sbjct: 144  GTPESFKCAACECHRNFHRKEPEGES------SQHVLNYHLTYPN--KTNRNIVIHS--- 192

Query: 728  GFNSLPSLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXR 907
                      PQ H      L  H              + A+ G                
Sbjct: 193  ----------PQSHLQ----LPTHHLHGVVATPSGGSVQPAVLGFGGTP----------- 227

Query: 908  RSETPEERVGMMRXXXXXXXXXXXXXXX---KRFRTKFSQEQKDKMLDFADKIGWRIQKQ 1078
             +E+  E + M +                  KRFRTKFSQ+QKD+M++FADK+GW+IQKQ
Sbjct: 228  -TESSSEDLNMFQTDEAGQLLSVQPPLSSSKKRFRTKFSQQQKDQMMEFADKLGWKIQKQ 286

Query: 1079 DDNALNQFCLEIGVKRQVLKVWMHNNKNTLRKPQ 1180
            D+  L+QFC ++GVKRQ+ KVWMHN+K  ++K Q
Sbjct: 287  DEQELHQFCSQVGVKRQIFKVWMHNSKQAMKKKQ 320


>ref|XP_007046919.1| Homeobox protein 24, putative [Theobroma cacao]
            gi|508699180|gb|EOX91076.1| Homeobox protein 24, putative
            [Theobroma cacao]
          Length = 385

 Score =  165 bits (417), Expect = 5e-38
 Identities = 97/249 (38%), Positives = 122/249 (48%)
 Frame = +2

Query: 434  SSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHRKEI 613
            SS   ++Y+ECMKNHAAS+G H  DGCGEFM   ++    A  L+CAAC CHRNFHRKEI
Sbjct: 153  SSTPLIRYRECMKNHAASMGSHVMDGCGEFMPSGEEGTPEA--LKCAACECHRNFHRKEI 210

Query: 614  HGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQFLV 793
            +G        Q     Y   NP     ++   H  S             Q+H      L 
Sbjct: 211  NGE------TQYAPSCYYSYNPNKNNNRRDTTHPPS-------------QLHPQQPIPLH 251

Query: 794  QHRXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXX 973
            Q R                +                   +E+  E + M           
Sbjct: 252  QQRFSLGLSTSPTAMPIAPVMMNFRGGGP----------AESSSEDLNMFHSNAGGQISA 301

Query: 974  XXXXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHN 1153
                  KRFRTKFSQEQKDKM++FA+K+GWRIQKQD+  + QFC ++GVKRQV KVWMHN
Sbjct: 302  QPQSSKKRFRTKFSQEQKDKMMEFAEKLGWRIQKQDEQEVQQFCAQVGVKRQVFKVWMHN 361

Query: 1154 NKNTLRKPQ 1180
            NK  ++K Q
Sbjct: 362  NKQAMKKKQ 370


>ref|XP_003520078.1| PREDICTED: transcription factor HB29-like [Glycine max]
          Length = 291

 Score =  164 bits (416), Expect = 7e-38
 Identities = 99/269 (36%), Positives = 125/269 (46%), Gaps = 9/269 (3%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKFQ---------VKYKECMKNHAASIGGHANDGCGEFMQEAD 538
            +P T+ N ++K HHHH +            V YKEC+KNHAASIGGHA DGCGEFM  + 
Sbjct: 14   QPNTTTNGSLKHHHHHPTTVSPPQQPPSTTVFYKECLKNHAASIGGHALDGCGEFMPSSS 73

Query: 539  QDNNNARSLRCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHHQ 718
             + N  RSL CAACGCHRNFHR+            +    N++ +N R      +H    
Sbjct: 74   SNPNEPRSLTCAACGCHRNFHRR------------RDTQENHHRSNSRPNFISFYHSPPL 121

Query: 719  SRGGFNSLPSLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXX 898
            SR G    P+ S P    +     + H                 L G             
Sbjct: 122  SRHGPGLSPTPS-PMSSPSPSPPPISHHFPPSSHHFQGPIPAHGLLGLGNENHHHHMSFN 180

Query: 899  XXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQ 1078
                S + +                      KR RTKFS EQK KM +FA+K+GWR+QK 
Sbjct: 181  FNSSSHSTQGNTS----------------GKKRHRTKFSHEQKQKMYNFAEKLGWRMQKA 224

Query: 1079 DDNALNQFCLEIGVKRQVLKVWMHNNKNT 1165
            ++  +  FC EIGV R V KVWMHNNKNT
Sbjct: 225  EEGLVQDFCNEIGVSRGVFKVWMHNNKNT 253


>ref|XP_006349273.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Solanum tuberosum]
          Length = 287

 Score =  163 bits (412), Expect = 2e-37
 Identities = 97/250 (38%), Positives = 126/250 (50%)
 Frame = +2

Query: 425  HHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHR 604
            HH   K  ++YKEC+KNHAAS+GG+A DGCGEFM   ++    A  L C+AC CHRNFHR
Sbjct: 57   HHVPFKKMIRYKECLKNHAASMGGNATDGCGEFMPSGEEGTIEA--LICSACNCHRNFHR 114

Query: 605  KEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQ 784
            KEI G        QQ+         +   +    HH  +RGG      L     HN+++ 
Sbjct: 115  KEIEGD-------QQL---------QLPSSCDCFHHLNNRGGSTKKVYLG----HNHHKT 154

Query: 785  FLVQHRXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXX 964
             L                    +                   SE+ E             
Sbjct: 155  SLGPEPFGTIIPTRPIIPPHHQMI--------MSYNMGSLPNSESEEHDQDHHHIGGIMA 206

Query: 965  XXXXXXXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVW 1144
                     KRFRTKF+QEQKDKML+FA+K+GW+IQKQ++  + QFC E+GVKR+VLKVW
Sbjct: 207  MARPLHHVKKRFRTKFTQEQKDKMLNFAEKVGWKIQKQEEGVVQQFCQEVGVKRRVLKVW 266

Query: 1145 MHNNKNTLRK 1174
            MHNNK++L K
Sbjct: 267  MHNNKHSLAK 276


>ref|XP_002324056.2| hypothetical protein POPTR_0017s11920g [Populus trichocarpa]
            gi|566212828|ref|XP_006373321.1| hypothetical protein
            POPTR_0017s11920g [Populus trichocarpa]
            gi|550320081|gb|EEF04189.2| hypothetical protein
            POPTR_0017s11920g [Populus trichocarpa]
            gi|550320082|gb|ERP51118.1| hypothetical protein
            POPTR_0017s11920g [Populus trichocarpa]
          Length = 340

 Score =  163 bits (412), Expect = 2e-37
 Identities = 99/280 (35%), Positives = 135/280 (48%), Gaps = 15/280 (5%)
 Frame = +2

Query: 386  KPVTSNNSNMKQH-----HHHSSKFQ-----VKYKECMKNHAASIGGHANDGCGEFMQEA 535
            KP++  N  +K+H     HHH   F      + YKEC+KNHAA+IGGHA DGCGEFM   
Sbjct: 40   KPLSFTNGVLKRHPPHHHHHHHHHFSPSPIVIAYKECLKNHAATIGGHALDGCGEFMPSP 99

Query: 536  DQDNNNARSLRCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHH 715
               + N  SL+CAACGCHRNFHR+E   S    H      + Y          + HH HH
Sbjct: 100  IATHTNPTSLKCAACGCHRNFHRREPEDS--PPHTATTTTIQY----------QSHHRHH 147

Query: 716  -----QSRGGFNSLPSLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXXX 880
                 Q++   N  P+ + P   +++      H                AL G       
Sbjct: 148  PLPPPQAQPLHNGSPNSASPPPISSSYYPSGPHMLL-------------ALSGGVSGLNE 194

Query: 881  XXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKIG 1060
                        +P +R                      FRTKFSQ QK++M  FA+++G
Sbjct: 195  NANINVPPPVGSSPRKR----------------------FRTKFSQSQKERMYQFAERVG 232

Query: 1061 WRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTLRKPQ 1180
            W++QK+D++ + +FC E+GV R VLKVWMHNNKN+L K +
Sbjct: 233  WKMQKRDEDLVQEFCNEVGVDRGVLKVWMHNNKNSLGKKE 272


>ref|XP_006373320.1| hypothetical protein POPTR_0017s11920g [Populus trichocarpa]
            gi|550320080|gb|ERP51117.1| hypothetical protein
            POPTR_0017s11920g [Populus trichocarpa]
          Length = 286

 Score =  161 bits (408), Expect = 6e-37
 Identities = 98/276 (35%), Positives = 133/276 (48%), Gaps = 15/276 (5%)
 Frame = +2

Query: 386  KPVTSNNSNMKQH-----HHHSSKFQ-----VKYKECMKNHAASIGGHANDGCGEFMQEA 535
            KP++  N  +K+H     HHH   F      + YKEC+KNHAA+IGGHA DGCGEFM   
Sbjct: 40   KPLSFTNGVLKRHPPHHHHHHHHHFSPSPIVIAYKECLKNHAATIGGHALDGCGEFMPSP 99

Query: 536  DQDNNNARSLRCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHH 715
               + N  SL+CAACGCHRNFHR+E   S    H      + Y          + HH HH
Sbjct: 100  IATHTNPTSLKCAACGCHRNFHRREPEDS--PPHTATTTTIQY----------QSHHRHH 147

Query: 716  -----QSRGGFNSLPSLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXXX 880
                 Q++   N  P+ + P   +++      H                AL G       
Sbjct: 148  PLPPPQAQPLHNGSPNSASPPPISSSYYPSGPHMLL-------------ALSGGVSGLNE 194

Query: 881  XXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKIG 1060
                        +P +R                      FRTKFSQ QK++M  FA+++G
Sbjct: 195  NANINVPPPVGSSPRKR----------------------FRTKFSQSQKERMYQFAERVG 232

Query: 1061 WRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTL 1168
            W++QK+D++ + +FC E+GV R VLKVWMHNNKN+L
Sbjct: 233  WKMQKRDEDLVQEFCNEVGVDRGVLKVWMHNNKNSL 268


>ref|XP_007017558.1| Homeobox protein 33 isoform 1 [Theobroma cacao]
            gi|590593411|ref|XP_007017559.1| Homeobox protein 33
            isoform 1 [Theobroma cacao] gi|508722886|gb|EOY14783.1|
            Homeobox protein 33 isoform 1 [Theobroma cacao]
            gi|508722887|gb|EOY14784.1| Homeobox protein 33 isoform 1
            [Theobroma cacao]
          Length = 296

 Score =  161 bits (407), Expect = 8e-37
 Identities = 94/247 (38%), Positives = 121/247 (48%), Gaps = 3/247 (1%)
 Frame = +2

Query: 449  VKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHRKEIHGSGF 628
            ++Y+EC+KNHAASIGG+  DGCGEFM   ++    A  L+CAAC CHRNFHRKE+ G   
Sbjct: 90   IRYRECLKNHAASIGGNVYDGCGEFMPSGEEGTLEA--LKCAACDCHRNFHRKEVDGETQ 147

Query: 629  VDHLQQQVIVNYNGTN---PRYEKTKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQFLVQH 799
                  +  +  N      P    T  HHH   S              VH +    +V  
Sbjct: 148  FGPNSSRRSLMLNPLQLPPPLPSPTMLHHHQRYS--------------VHTSPSSAMVA- 192

Query: 800  RXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXX 979
                           +  FG                 +E+  E +               
Sbjct: 193  -------------PMNVAFGSGGGCG-----------TESSSEDLMFQSNAEGMPPPPPY 228

Query: 980  XXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNK 1159
                KRFRTKF+QEQKDKML+FA+K+GWRI KQD+  + +FC E+GVKRQV KVWMHNNK
Sbjct: 229  VLSKKRFRTKFTQEQKDKMLEFAEKLGWRINKQDEEEVEKFCAEVGVKRQVFKVWMHNNK 288

Query: 1160 NTLRKPQ 1180
            N  ++PQ
Sbjct: 289  NVKKQPQ 295


>ref|XP_007224511.1| hypothetical protein PRUPE_ppa023369mg [Prunus persica]
            gi|462421447|gb|EMJ25710.1| hypothetical protein
            PRUPE_ppa023369mg [Prunus persica]
          Length = 310

 Score =  155 bits (393), Expect = 3e-35
 Identities = 88/245 (35%), Positives = 125/245 (51%), Gaps = 3/245 (1%)
 Frame = +2

Query: 449  VKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHRKEIHGSGF 628
            ++Y+EC+KNHAA+IGG+  DGCGEFM   ++    A  L+CAAC CHRNFHRKE+ G   
Sbjct: 100  IRYRECLKNHAANIGGNVFDGCGEFMPSGEEGTLEA--LKCAACDCHRNFHRKEVDGETT 157

Query: 629  V---DHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLPSLSLPQVHNNNQQFLVQH 799
                   +  ++++     P         HHH                 H+++Q+F +  
Sbjct: 158  AFSHGSRRSSIMLSPLQLPPPLPSPSSALHHH-----------------HHHHQKFSMA- 199

Query: 800  RXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXX 979
                           +  FG                +S   E  + M             
Sbjct: 200  ---------PIIQPMNVAFGSGGGGTESSSEDLNVFQSNNAEGGLPM----------PPF 240

Query: 980  XXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNK 1159
                KRFRTKF+QEQK++M++FA+K+GWRIQKQD+  + +FC E+GVKRQVL+VWMHNNK
Sbjct: 241  AMSKKRFRTKFTQEQKERMMEFAEKVGWRIQKQDEEEVERFCAEVGVKRQVLRVWMHNNK 300

Query: 1160 NTLRK 1174
            N+ +K
Sbjct: 301  NSAKK 305


>ref|XP_006376247.1| hypothetical protein POPTR_0013s11330g [Populus trichocarpa]
            gi|550325522|gb|ERP54044.1| hypothetical protein
            POPTR_0013s11330g [Populus trichocarpa]
          Length = 254

 Score =  154 bits (390), Expect = 7e-35
 Identities = 107/263 (40%), Positives = 127/263 (48%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSL 565
            KP+T+  SN K       K  VKYKECM+NHAASIGGHANDGCGEFM   D    +   L
Sbjct: 31   KPITTMISNPKATKD-PCKNVVKYKECMRNHAASIGGHANDGCGEFMPRGDDGTRDW--L 87

Query: 566  RCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSLP 745
             CAACGC    HR                  N++    R E + K  H  Q         
Sbjct: 88   TCAACGC----HR------------------NFH----RRESSTKRQHQQQLL------- 114

Query: 746  SLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETPE 925
             LS P +    QQFL                    L+G               R  +  +
Sbjct: 115  -LSPPPLQP--QQFL--------------------LYGAPTTKNMNPVHDFMSRPHDEDD 151

Query: 926  ERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFC 1105
            +  G M                KRFRTKF+QEQK++ML+FA+KIGWRIQK DD ALNQFC
Sbjct: 152  DDDGFM-------VKSTSGSSNKRFRTKFTQEQKERMLEFAEKIGWRIQKHDDMALNQFC 204

Query: 1106 LEIGVKRQVLKVWMHNNKNTLRK 1174
             E+G+KR VLKVWMHNNKN  R+
Sbjct: 205  NEVGIKRNVLKVWMHNNKNAHRR 227


>ref|NP_568570.1| homeobox protein 23 [Arabidopsis thaliana]
            gi|75333929|sp|Q9FIW9.1|ZHD10_ARATH RecName:
            Full=Zinc-finger homeodomain protein 10; Short=AtZHD10;
            AltName: Full=Homeobox protein 23; Short=AtHB-23
            gi|10177976|dbj|BAB11382.1| unnamed protein product
            [Arabidopsis thaliana] gi|20259470|gb|AAM13855.1| unknown
            protein [Arabidopsis thaliana] gi|21436443|gb|AAM51422.1|
            unknown protein [Arabidopsis thaliana]
            gi|332007089|gb|AED94472.1| homeobox protein 23
            [Arabidopsis thaliana]
          Length = 334

 Score =  113 bits (282), Expect = 2e-22
 Identities = 83/279 (29%), Positives = 110/279 (39%), Gaps = 16/279 (5%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSL 565
            KP++ +N  +K+HHHH       YKEC+KNHAA++GGHA DGCGEFM      +++  SL
Sbjct: 33   KPISFSNGIIKRHHHHHHPLLFTYKECLKNHAAALGGHALDGCGEFMPSPSSISSDPTSL 92

Query: 566  RCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHH--------QS 721
            +CAACGCHRNFHR++   +          I     T   Y+    HH HH          
Sbjct: 93   KCAACGCHRNFHRRDPDNNN-----DSSQIPPPPSTAVEYQ---PHHRHHPPPPPPPPPP 144

Query: 722  RGGFNSLPS--------LSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXX 877
            R   ++ P         LSL   +NNN                     +  L G      
Sbjct: 145  RSPNSASPPPISSSYMLLSLSGTNNNNNNLASFSDLNFSAGNNHHHHHQHTLHGSRKRFR 204

Query: 878  XXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKI 1057
                     +  E   ERVG                       K  +  +D + DF  +I
Sbjct: 205  TKFSQFQKEKMHEF-AERVGW----------------------KMQKRDEDDVRDFCRQI 241

Query: 1058 GWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTLRK 1174
                               GV + VLKVWMHNNKNT  +
Sbjct: 242  -------------------GVDKSVLKVWMHNNKNTFNR 261


>gb|AAM64462.1| unknown [Arabidopsis thaliana]
          Length = 333

 Score =  112 bits (280), Expect = 4e-22
 Identities = 80/279 (28%), Positives = 109/279 (39%), Gaps = 16/279 (5%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSL 565
            KP++ +N  +K+HHHH       YKEC+KNHAA++GGHA DGCGEFM      +++  SL
Sbjct: 32   KPISFSNGIIKRHHHHHHPLLFTYKECLKNHAAALGGHALDGCGEFMPSPSSISSDPTSL 91

Query: 566  RCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHH--------QS 721
            +CAACGCHRNFHR++   +     +                + + HH HH          
Sbjct: 92   KCAACGCHRNFHRRDPDNNNDSSQIPPPPSTXV--------ENQPHHRHHPPPPPPPPPP 143

Query: 722  RGGFNSLPS--------LSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXX 877
            R   ++ P         LSL   +NNN                     +  L G      
Sbjct: 144  RSPNSASPPPISSSYMLLSLSGTNNNNNNLASFSDLNFSAGNNHHHHHQHTLHGSRKRFR 203

Query: 878  XXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKI 1057
                     +  E   ERVG                       K  +  +D + DF  +I
Sbjct: 204  TKFSQFQKEKMHEF-AERVGW----------------------KMQKRDZDDVRDFCRQI 240

Query: 1058 GWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTLRK 1174
                               GV + VLKVWMHNNKNT  +
Sbjct: 241  -------------------GVDKSVLKVWMHNNKNTFNR 260


>dbj|BAE99230.1| hypothetical protein [Arabidopsis thaliana]
          Length = 334

 Score =  111 bits (278), Expect = 7e-22
 Identities = 83/279 (29%), Positives = 109/279 (39%), Gaps = 16/279 (5%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSL 565
            KP++ +N  +K+HHHH       YKEC+KNHAA++GGHA DGCGEFM      +++  SL
Sbjct: 33   KPISFSNGIIKRHHHHHHPLLFTYKECLKNHAAALGGHALDGCGEFMPSPSSISSDPTSL 92

Query: 566  RCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHH--------QS 721
            +CAACGCHRNFHR +   +          I     T   Y+    HH HH          
Sbjct: 93   KCAACGCHRNFHRLDPDNNN-----DSSQIPPPPSTAVEYQ---PHHRHHPPPPPPPPPP 144

Query: 722  RGGFNSLPS--------LSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXX 877
            R   ++ P         LSL   +NNN                     +  L G      
Sbjct: 145  RSPNSASPPPISSSYMLLSLSGTNNNNNNLASFSDLNFSAGNNHHHHHQHTLHGSRKRFR 204

Query: 878  XXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKI 1057
                     +  E   ERVG                       K  +  +D + DF  +I
Sbjct: 205  TKFSQFQKEKMHEF-AERVGW----------------------KMQKRDEDDVRDFCRQI 241

Query: 1058 GWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTLRK 1174
                               GV + VLKVWMHNNKNT  +
Sbjct: 242  -------------------GVDKSVLKVWMHNNKNTFNR 261


>ref|XP_002870762.1| hypothetical protein ARALYDRAFT_494014 [Arabidopsis lyrata subsp.
            lyrata] gi|297316598|gb|EFH47021.1| hypothetical protein
            ARALYDRAFT_494014 [Arabidopsis lyrata subsp. lyrata]
          Length = 327

 Score =  110 bits (275), Expect = 2e-21
 Identities = 83/281 (29%), Positives = 110/281 (39%), Gaps = 18/281 (6%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSL 565
            KP++ +N  +K+HHHH   F   YKEC+KNHAA++GGHA DGCGEFM      +++  SL
Sbjct: 31   KPISFSNGIIKRHHHHPLLFT--YKECLKNHAAALGGHALDGCGEFMPSPSSISSDPTSL 88

Query: 566  RCAACGCHRNFHRKEIHGSGFVD--HLQQQVIVNYNGTNPRYEKTKKHHHHH-------- 715
            +CAACGCHRNFHR++   +      H      V Y          + HH HH        
Sbjct: 89   KCAACGCHRNFHRRDPDNNNDSSPIHPPPSTAVEY----------QPHHRHHPPPPLPPP 138

Query: 716  QSRGGFNSLPS--------LSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXX 871
              R   ++ P         LSL   +NNN   L                 +  L G    
Sbjct: 139  PPRSPNSASPPPISSSYMLLSLSGTNNNNNN-LASFSDLNFPGGNNHHHHQHTLHGSRKR 197

Query: 872  XXXXXXXXXXXRRSETPEERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFAD 1051
                       +  E  E                       R   K  +  +D + DF  
Sbjct: 198  FRTKFSQFQKEKMHEFAE-----------------------RLGWKMQKRDEDDVRDFCR 234

Query: 1052 KIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTLRK 1174
            +IG                   V + VLKVWMHNNKNT  +
Sbjct: 235  QIG-------------------VDKSVLKVWMHNNKNTFNR 256


>ref|XP_006395326.1| hypothetical protein EUTSA_v10004599mg [Eutrema salsugineum]
            gi|557091965|gb|ESQ32612.1| hypothetical protein
            EUTSA_v10004599mg [Eutrema salsugineum]
          Length = 323

 Score =  108 bits (271), Expect = 4e-21
 Identities = 78/261 (29%), Positives = 106/261 (40%), Gaps = 1/261 (0%)
 Frame = +2

Query: 386  KPVTSNNSNMKQHHHHSSKF-QVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARS 562
            KP++ +N  +K+HHHH      V YKEC+KNHAA+IGGHA DGCGEFM       ++  S
Sbjct: 22   KPISFSNGIIKRHHHHHHPIIAVTYKECLKNHAAAIGGHALDGCGEFMPSPSSTPSDPTS 81

Query: 563  LRCAACGCHRNFHRKEIHGSGFVDHLQQQVIVNYNGTNPRYEKTKKHHHHHQSRGGFNSL 742
            L+CAACGCHRNFHR++   S     +    +   + T    E    H HH          
Sbjct: 82   LKCAACGCHRNFHRRDPDDSSLSSAVPPPSLPP-SSTTAAIEYQPHHRHHPPP------- 133

Query: 743  PSLSLPQVHNNNQQFLVQHRXXXXXXXXXXXXXRDALFGXXXXXXXXXXXXXXXRRSETP 922
            P+  LP+  N++    +                 + L                     TP
Sbjct: 134  PAPPLPRSPNSSSPPPISSSYMLLALSGTNKSAGNNL----PFSDLNFAANNLSTHHHTP 189

Query: 923  EERVGMMRXXXXXXXXXXXXXXXKRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQF 1102
              R    R                R   K  +  ++++ DF  +IG              
Sbjct: 190  GSR-KRFRTKFSPIQKEKMHEFADRIGWKIQKRDEEEVRDFCREIG-------------- 234

Query: 1103 CLEIGVKRQVLKVWMHNNKNT 1165
                 V + VLKVWMHNNKNT
Sbjct: 235  -----VDKGVLKVWMHNNKNT 250


>gb|ABK22835.1| unknown [Picea sitchensis]
          Length = 249

 Score =  108 bits (271), Expect = 4e-21
 Identities = 48/61 (78%), Positives = 56/61 (91%)
 Frame = +2

Query: 992  KRFRTKFSQEQKDKMLDFADKIGWRIQKQDDNALNQFCLEIGVKRQVLKVWMHNNKNTLR 1171
            KRFRTKF+QEQKD+MLDFA+K+GWRIQK D+ A+ QFC +IGVKR+VLKVWMHNNKNTL 
Sbjct: 186  KRFRTKFTQEQKDRMLDFAEKVGWRIQKHDEQAVQQFCQDIGVKRRVLKVWMHNNKNTLG 245

Query: 1172 K 1174
            K
Sbjct: 246  K 246



 Score = 84.3 bits (207), Expect = 1e-13
 Identities = 36/66 (54%), Positives = 47/66 (71%)
 Frame = +2

Query: 437 SKFQVKYKECMKNHAASIGGHANDGCGEFMQEADQDNNNARSLRCAACGCHRNFHRKEIH 616
           SK  V+Y+ECMKNHAA++GG A DGCGEFM   ++      +L+C+AC CHRNFHR+E+ 
Sbjct: 49  SKKTVRYRECMKNHAAAMGGSATDGCGEFMPSGEE--GTLEALKCSACECHRNFHRREVE 106

Query: 617 GSGFVD 634
           G    D
Sbjct: 107 GEPSCD 112

