BLASTX nr result

ID: Mentha24_contig00016429 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00016429
         (805 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU19874.1| hypothetical protein MIMGU_mgv1a000669mg [Mimulus...   167   3e-39
gb|ADY38784.1| sequence-specific DNA-binding transcription facto...   159   1e-36
gb|EPS74161.1| hypothetical protein M569_00592, partial [Genlise...   149   2e-33
ref|XP_006351031.1| PREDICTED: uncharacterized protein LOC102601...   147   6e-33
gb|ADZ55295.1| sequence-specific DNA binding protein [Coffea ara...   145   2e-32
gb|ABZ89177.1| putative protein [Coffea canephora]                    145   2e-32
ref|XP_004250459.1| PREDICTED: uncharacterized protein LOC101266...   142   2e-31
emb|CBI24184.3| unnamed protein product [Vitis vinifera]               84   6e-14
ref|XP_002263797.2| PREDICTED: uncharacterized protein LOC100241...    84   8e-14
emb|CAN63605.1| hypothetical protein VITISV_019128 [Vitis vinifera]    84   8e-14
ref|XP_007049489.1| Homeodomain-like transcriptional regulator i...    80   1e-12
ref|XP_007049488.1| Homeodomain-like transcriptional regulator i...    80   1e-12
ref|XP_007049487.1| Homeodomain-like transcriptional regulator i...    80   1e-12
ref|XP_002524572.1| hypothetical protein RCOM_1211540 [Ricinus c...    80   1e-12
ref|XP_006469383.1| PREDICTED: uncharacterized protein LOC102620...    74   8e-11
ref|XP_006447893.1| hypothetical protein CICLE_v10014094mg [Citr...    74   8e-11
ref|XP_003579233.1| PREDICTED: uncharacterized protein LOC100825...    74   8e-11
ref|XP_007214909.1| hypothetical protein PRUPE_ppa000565mg [Prun...    72   3e-10
gb|EEE67601.1| hypothetical protein OsJ_25152 [Oryza sativa Japo...    71   4e-10
gb|EEC82457.1| hypothetical protein OsI_26894 [Oryza sativa Indi...    69   2e-09

>gb|EYU19874.1| hypothetical protein MIMGU_mgv1a000669mg [Mimulus guttatus]
          Length = 1024

 Score =  167 bits (424), Expect = 3e-39
 Identities = 112/268 (41%), Positives = 129/268 (48%), Gaps = 1/268 (0%)
 Frame = +3

Query: 3   NGNGVQNSRGRAVNQKTKSKRNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGD 182
           N  G+ NSR      K KSK  Q+++MN++DYR RLQEVLYTPE I  +IFRK+GP LGD
Sbjct: 22  NSGGIHNSRSSGAELKGKSK--QRLFMNDSDYRLRLQEVLYTPEHIVTKIFRKEGPQLGD 79

Query: 183 QFDPLPSNAFPGGPRRSRLEDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTA 362
           QFD LPSNAF                                             G MT 
Sbjct: 80  QFDSLPSNAFSA-------------------------------------------GLMTQ 96

Query: 363 HGAPAKKYGMGKGLMIQSSVPXXXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQKKKK 542
            G   K +G+GKGLM  +                      A  F Y A       QKKK 
Sbjct: 97  KGVNGKTHGIGKGLMTAAR----------------GTNPDASDFPYVAYCRQSATQKKK- 139

Query: 543 RVQPRESILKKLADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTENT-G 719
           RVQPRESI++KLA +E+AK+K  LR RKVE Q            C+LALEDVKC ENT  
Sbjct: 140 RVQPRESIMRKLASKEKAKRKAPLRSRKVECQKVQKRKKPRNENCELALEDVKCLENTEQ 199

Query: 720 FAXXXXXXXXXXXXXQAGPNPLSCSAHF 803
           FA             QAGPNPLSCSAHF
Sbjct: 200 FAMLQEDEELELRELQAGPNPLSCSAHF 227


>gb|ADY38784.1| sequence-specific DNA-binding transcription factor [Coffea arabica]
          Length = 1116

 Score =  159 bits (401), Expect = 1e-36
 Identities = 110/281 (39%), Positives = 140/281 (49%), Gaps = 16/281 (5%)
 Frame = +3

Query: 6   GNGVQNSRGRAVNQKTKSKRNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQ 185
           GNG  NS  R   +  K ++ QQ +MNENDYR RLQEVL+  + I  +IFRKDGPALG +
Sbjct: 24  GNG--NSNHRNCKKAGKRQQQQQKFMNENDYRLRLQEVLFNSDYILQKIFRKDGPALGVE 81

Query: 186 FDPLPSNAF----PGGPRRSRL--EDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGK 347
           FD LP NAF    PG  +  R   E+ R  KR+KVS     DY+ C E       +G+GK
Sbjct: 82  FDSLPENAFRYCRPGSRKSHRTCQENQRTFKRQKVSTPL--DYQACPEPRSTTIKHGIGK 139

Query: 348 GPMTAHGAPAKKYGMGKGLMIQSSVPXXXXXXXXXXXXXXANTHHARGFKYDACSSGRV- 524
           G M  +G P K++G+GKGLM + S P                T+   G       S    
Sbjct: 140 GLMAKNGTPVKRHGIGKGLMTKKSAPMKKHGIGKGLMTVWRVTNPDGGDFPTGIGSSTFS 199

Query: 525 ---IQKKKKRVQPRESILKKLADRERAKKKNSLRCRKV-----EPQXXXXXXXXXXXXCD 680
              +  KKK +Q R+S+++KL  R + KKK S+RCRK                     C+
Sbjct: 200 NFSLLAKKKSLQRRQSLMRKLGKRLQEKKKASVRCRKEIHGMGASGRFEQRKQARKEKCE 259

Query: 681 LALEDVKCTENTG-FAXXXXXXXXXXXXXQAGPNPLSCSAH 800
           LALE + C EN                  QAGPNPLSCSAH
Sbjct: 260 LALEGLTCEENLDQLVNLVDDEELELKELQAGPNPLSCSAH 300


>gb|EPS74161.1| hypothetical protein M569_00592, partial [Genlisea aurea]
          Length = 1036

 Score =  149 bits (375), Expect = 2e-33
 Identities = 105/271 (38%), Positives = 137/271 (50%), Gaps = 18/271 (6%)
 Frame = +3

Query: 45  QKTKSKRNQ-QIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNA---- 209
           +K KS++NQ Q++ N+ DYR RLQE +Y  E I A++FRKDGP LGDQFD LPSNA    
Sbjct: 1   RKRKSEQNQRQVFTNDKDYRLRLQEYMYDSEYILAKVFRKDGPPLGDQFDALPSNAAVVN 60

Query: 210 -----FPGGPRRSRLEDNR--HRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHG 368
                F      S+L+      ++ + VSMHA+ DYE C   S   R YG GKGP+TA+G
Sbjct: 61  LLICSFLLDCSTSQLKKKPVCVKRSKVVSMHAVVDYEACITSSSSMR-YGPGKGPITANG 119

Query: 369 APAKKYGMGKGLMIQSSV--PXXXXXXXXXXXXXXANTHHARG---FKYDACSSGRVIQK 533
           +  KK+GMGKGL++Q                        H  G       A      I+K
Sbjct: 120 STLKKHGMGKGLILQRDTLWKNHGVGKGPMTLKGDRGVRHRIGKGLMTLKAMRDNSTIRK 179

Query: 534 KKKRVQPRESILKKLADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTEN 713
           KKK    RES++KKLA +E AK+  SLR +K++ +            C L ++DVK  EN
Sbjct: 180 KKKLT--RESVVKKLAKKELAKRNVSLRNKKMKGRHVEKQNLLRKDKCKLGIDDVKRIEN 237

Query: 714 T-GFAXXXXXXXXXXXXXQAGPNPLSCSAHF 803
              FA             Q G   LSC  HF
Sbjct: 238 NEQFAKLLDDEELELRESQLGARILSCCPHF 268


>ref|XP_006351031.1| PREDICTED: uncharacterized protein LOC102601165 [Solanum tuberosum]
          Length = 1079

 Score =  147 bits (370), Expect = 6e-33
 Identities = 93/268 (34%), Positives = 131/268 (48%), Gaps = 18/268 (6%)
 Frame = +3

Query: 54  KSKRNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAFPGGPRRS 233
           K K+ QQ +++E+DYR RLQE LY+P+ I A+IFRKDGP LGD+FD LPSNAF    + S
Sbjct: 15  KKKQQQQQFLSEDDYRLRLQEGLYSPDYILAKIFRKDGPTLGDEFDLLPSNAFSSHKKGS 74

Query: 234 RL------EDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMG 395
           R+      E+    KRRKVS+ A    +  CE + P + +G GKG +T      KK+  G
Sbjct: 75  RISGQARQENQGATKRRKVSVPATMHLQALCESNPPVKKHGTGKGLIT-KDVSVKKHSAG 133

Query: 396 KGLMIQSSVPXXXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESILKK 575
           K LM + S                  T+   G        G   +++KK++  R+SIL+K
Sbjct: 134 KRLMTEKSATLRNHGMGKGLMTVWRATNPHAGDIPSGVGFGESAEERKKKLLQRQSILRK 193

Query: 576 LADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTEN------------TG 719
           +  + + KK+  ++CRK E +            C+LALE  KC E             T 
Sbjct: 194 IEKKLQDKKRIGVKCRKAENKRIEKQKMPRKEKCELALEWSKCQEGLPIKKRKCQHEFTQ 253

Query: 720 FAXXXXXXXXXXXXXQAGPNPLSCSAHF 803
                          +AGPN L+C  HF
Sbjct: 254 LGSLVDDEELELMEMEAGPNSLTCCTHF 281


>gb|ADZ55295.1| sequence-specific DNA binding protein [Coffea arabica]
          Length = 1156

 Score =  145 bits (366), Expect = 2e-32
 Identities = 107/311 (34%), Positives = 138/311 (44%), Gaps = 46/311 (14%)
 Frame = +3

Query: 6   GNGVQNSRGRAVNQKTKSKRNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQ 185
           GNG  NS  R   +  K ++ QQ +MNENDYR RLQEVL+  + I  +IFRKDGPALG +
Sbjct: 24  GNG--NSNHRNCKKAGKRQQQQQKFMNENDYRLRLQEVLFNSDYILQKIFRKDGPALGFE 81

Query: 186 FDPLPSNAF------------------------------------PGGPRRSRLEDNRHR 257
           FD LP NAF                                    P    +  L+  R+R
Sbjct: 82  FDSLPENAFRYCRPVYVNVDIYRCAYLTRVIDLLMCDQAPESLTAPAKRTKEHLKGKRYR 141

Query: 258 KRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPXXXX 437
           K+  VS     DY+ C E       +G+GKG M  +G P K++G+GKGLM + S P    
Sbjct: 142 KKFWVSTPL--DYQACPEPRSTTIKHGIGKGLMAKNGTPVKRHGIGKGLMTKKSAPMKKH 199

Query: 438 XXXXXXXXXXANTHHARGFKYDACSSGRV----IQKKKKRVQPRESILKKLADRERAKKK 605
                       T+   G       S       +  KKK +Q R+S+++KL  R + KKK
Sbjct: 200 GIGKGLMTVWRVTNPDGGDFPTGIGSSTFSNFSLLAKKKSLQRRQSLMRKLGKRLQEKKK 259

Query: 606 NSLRCRKV-----EPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQ 767
            S+RCRK                     C+LALE + C EN                  Q
Sbjct: 260 ASVRCRKEIHGMGASGRFEQRKQARKEKCELALEGLTCEENLDQLVNLEDDEELELKELQ 319

Query: 768 AGPNPLSCSAH 800
           AGPNPLSCSAH
Sbjct: 320 AGPNPLSCSAH 330


>gb|ABZ89177.1| putative protein [Coffea canephora]
          Length = 1156

 Score =  145 bits (366), Expect = 2e-32
 Identities = 107/311 (34%), Positives = 138/311 (44%), Gaps = 46/311 (14%)
 Frame = +3

Query: 6   GNGVQNSRGRAVNQKTKSKRNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQ 185
           GNG  NS  R   +  K ++ QQ +MNENDYR RLQEVL+  + I  +IFRKDGPALG +
Sbjct: 24  GNG--NSNHRNCKKAGKRQQQQQKFMNENDYRLRLQEVLFNSDYILQKIFRKDGPALGVE 81

Query: 186 FDPLPSNAF------------------------------------PGGPRRSRLEDNRHR 257
           FD LP NAF                                    P    +  L+  R+R
Sbjct: 82  FDSLPENAFRYCRPVYVNVDIYRCAYLTRVIDLLMCDQAPESLTAPAKRTKEHLKGKRYR 141

Query: 258 KRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPXXXX 437
           K+  VS     DY+ C E       +G+GKG M  +G P K++G+GKGLM + S P    
Sbjct: 142 KKFWVSTPL--DYQACPEPRSTTIKHGIGKGLMAKNGTPVKRHGIGKGLMTKKSAPMKKH 199

Query: 438 XXXXXXXXXXANTHHARGFKYDACSSGRV----IQKKKKRVQPRESILKKLADRERAKKK 605
                       T+   G       S       +  KKK +Q R+S+++KL  R + KKK
Sbjct: 200 GIGKGLMTVWRVTNPDGGDFPTGIGSSTFSNFSLLAKKKSLQRRQSLMRKLGKRLQEKKK 259

Query: 606 NSLRCRKV-----EPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQ 767
            S+RCRK                     C+LALE + C EN                  Q
Sbjct: 260 ASVRCRKEIHGMGASGRFEQRKQARKEKCELALEGLTCEENLDQLVNLEDDEELELKELQ 319

Query: 768 AGPNPLSCSAH 800
           AGPNPLSCSAH
Sbjct: 320 AGPNPLSCSAH 330


>ref|XP_004250459.1| PREDICTED: uncharacterized protein LOC101266687 [Solanum
           lycopersicum]
          Length = 1080

 Score =  142 bits (357), Expect = 2e-31
 Identities = 92/268 (34%), Positives = 129/268 (48%), Gaps = 18/268 (6%)
 Frame = +3

Query: 54  KSKRNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAFPGGPRRS 233
           K K+ QQ++++E+DYR RLQE LY+P+ I A+IFRKDGP LGD+FD LPSNAF    + S
Sbjct: 17  KKKQQQQLFLSEDDYRLRLQEGLYSPDYILAKIFRKDGPTLGDEFDILPSNAFSHLKKGS 76

Query: 234 RL------EDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMG 395
           R+      E+    KRRKVS+ A       CE + P + +G GKG +T      KK+  G
Sbjct: 77  RISGQARQENQGATKRRKVSVPATMHCRALCESNPPVKKHGTGKGLIT-KDVSVKKHSAG 135

Query: 396 KGLMIQSSVPXXXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESILKK 575
           K LM +                    T+   G        G   +++KK++  R+SIL+K
Sbjct: 136 KRLMTEKRATLRNHGMGKGLMTVWRATNPHSGDIPVGVDFGESAEERKKKLLQRQSILRK 195

Query: 576 LADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTEN------------TG 719
           +  + + KKK  ++CRK E +            C+LALE  KC E             T 
Sbjct: 196 IEKKLQDKKKVGVKCRKAENKRIEKQKMPRKEKCELALEWRKCQEGLPIKKRNYQQEFTQ 255

Query: 720 FAXXXXXXXXXXXXXQAGPNPLSCSAHF 803
                          + GPN L+C  HF
Sbjct: 256 LGSLVDDEELELMELEEGPNSLTCCTHF 283


>emb|CBI24184.3| unnamed protein product [Vitis vinifera]
          Length = 1188

 Score = 84.0 bits (206), Expect = 6e-14
 Identities = 75/240 (31%), Positives = 101/240 (42%), Gaps = 8/240 (3%)
 Frame = +3

Query: 108 LQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAF-----PGGPRRSRLEDNRHRKRRK- 269
           L E L T + I  ++FRKDGP LG +FD LPS++F          R+  E+    KRRK 
Sbjct: 119 LNEDLSTTDYILKKVFRKDGPPLGVEFDSLPSSSFCHCTDSRNSHRTCQENQTSSKRRKV 178

Query: 270 -VSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPXXXXXXX 446
            VS  A+   + C   S PA+ +G+GKG MT   A     G           P       
Sbjct: 179 VVSKPAVLHQQFCNNKSAPAKIHGIGKGLMTVWRATNPGAG---------DFPTGIDFAD 229

Query: 447 XXXXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESILKKLADRERAKKKNSLRCRK 626
                    +         +     +I+KKK R Q   +  K +  +   KKK S +  K
Sbjct: 230 GQVAAVSPTS--------TSILRKSLIKKKKPRKQSSVTKWKSVGGKLNDKKKPSRKRGK 281

Query: 627 VEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQAGPNPLSCSAHF 803
           VE              C+LALE+ K  E+   FA             QAGPNP++CSAHF
Sbjct: 282 VECNKDVNQKKPNKEKCELALEEGKSQEHLDQFAMLMDDEELELQESQAGPNPVTCSAHF 341


>ref|XP_002263797.2| PREDICTED: uncharacterized protein LOC100241125 [Vitis vinifera]
          Length = 1154

 Score = 83.6 bits (205), Expect = 8e-14
 Identities = 75/241 (31%), Positives = 101/241 (41%), Gaps = 9/241 (3%)
 Frame = +3

Query: 108 LQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAF-----PGGPRRSRLEDNRHRKRRK- 269
           L E L T + I  ++FRKDGP LG +FD LPS++F          R+  E+    KRRK 
Sbjct: 117 LNEDLSTTDYILKKVFRKDGPPLGVEFDSLPSSSFCHCTDSRNSHRTCQENQTSSKRRKV 176

Query: 270 --VSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPXXXXXX 443
             VS  A+   + C   S PA+ +G+GKG MT   A     G           P      
Sbjct: 177 VVVSKPAVLHQQFCNNKSAPAKIHGIGKGLMTVWRATNPGAG---------DFPTGIDFA 227

Query: 444 XXXXXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESILKKLADRERAKKKNSLRCR 623
                     +         +     +I+KKK R Q   +  K +  +   KKK S +  
Sbjct: 228 DGQVAAVSPTS--------TSILRKSLIKKKKPRKQSSVTKWKSVGGKLNDKKKPSRKRG 279

Query: 624 KVEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQAGPNPLSCSAH 800
           KVE              C+LALE+ K  E+   FA             QAGPNP++CSAH
Sbjct: 280 KVECNKDVNQKKPNKEKCELALEEGKSQEHLDQFAMLMDDEELELQESQAGPNPVTCSAH 339

Query: 801 F 803
           F
Sbjct: 340 F 340


>emb|CAN63605.1| hypothetical protein VITISV_019128 [Vitis vinifera]
          Length = 494

 Score = 83.6 bits (205), Expect = 8e-14
 Identities = 75/241 (31%), Positives = 101/241 (41%), Gaps = 9/241 (3%)
 Frame = +3

Query: 108 LQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAF-----PGGPRRSRLEDNRHRKRRK- 269
           L E L T + I  ++FRKDGP LG +FD LPS++F          R+  E+    KRRK 
Sbjct: 117 LNEDLSTTDYILKKVFRKDGPPLGVEFDSLPSSSFCHCTDSRNSHRTCQENQTSSKRRKV 176

Query: 270 --VSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPXXXXXX 443
             VS  A+   + C   S PA+ +G+GKG MT   A     G           P      
Sbjct: 177 VVVSKPAVLHQQFCNNKSAPAKIHGIGKGLMTVWRATNPGAG---------DFPTGIDFA 227

Query: 444 XXXXXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESILKKLADRERAKKKNSLRCR 623
                     +         +     +I+KKK R Q   +  K +  +   KKK S +  
Sbjct: 228 DGQVAAVSPTS--------TSILRKSLIKKKKPRKQSSVTKWKSVGGKLNDKKKPSRKRG 279

Query: 624 KVEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQAGPNPLSCSAH 800
           KVE              C+LALE+ K  E+   FA             QAGPNP++CSAH
Sbjct: 280 KVECNKDVNQKKPNKEKCELALEEGKSQEHLDQFAMLMDDEELELQESQAGPNPVTCSAH 339

Query: 801 F 803
           F
Sbjct: 340 F 340


>ref|XP_007049489.1| Homeodomain-like transcriptional regulator isoform 3 [Theobroma
           cacao] gi|508701750|gb|EOX93646.1| Homeodomain-like
           transcriptional regulator isoform 3 [Theobroma cacao]
          Length = 1085

 Score = 79.7 bits (195), Expect = 1e-12
 Identities = 71/250 (28%), Positives = 106/250 (42%), Gaps = 11/250 (4%)
 Frame = +3

Query: 84  NENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAF--PGGPRRSR---LEDN 248
           N+   +  L + L +P+ I  ++FRKDGP LG +FD LPS AF    G + S     ED 
Sbjct: 115 NKRKKKMLLLQDLSSPQYILKKVFRKDGPPLGVEFDSLPSQAFCHCKGSKNSHPADQEDQ 174

Query: 249 RHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPX 428
           R  +RR VS     DY+  C  S P + +G+GKG MT       + G          +P 
Sbjct: 175 RATRRRTVSELTTIDYQNNCNESAPVKKHGIGKGLMTVWRVVNPEGG---------DIPT 225

Query: 429 XXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQK---KKKRVQPRESILKK--LADRER 593
                                      +S  V++K   + KR QP  S++K+  L  + +
Sbjct: 226 GVDFSNKQIIAPPQ-------------TSSPVVRKPPARNKRRQPLVSLMKQRSLEKKLQ 272

Query: 594 AKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQA 770
            KK+ S++ R+++              C+LALE     ++                  QA
Sbjct: 273 EKKRPSIKRREMKSNKDDSNRQLHKEKCELALEGSTSNKSLDQLLMLVDDEELELRELQA 332

Query: 771 GPNPLSCSAH 800
           GPNPL+CS H
Sbjct: 333 GPNPLTCSDH 342


>ref|XP_007049488.1| Homeodomain-like transcriptional regulator isoform 2 [Theobroma
           cacao] gi|508701749|gb|EOX93645.1| Homeodomain-like
           transcriptional regulator isoform 2 [Theobroma cacao]
          Length = 1158

 Score = 79.7 bits (195), Expect = 1e-12
 Identities = 71/250 (28%), Positives = 106/250 (42%), Gaps = 11/250 (4%)
 Frame = +3

Query: 84  NENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAF--PGGPRRSR---LEDN 248
           N+   +  L + L +P+ I  ++FRKDGP LG +FD LPS AF    G + S     ED 
Sbjct: 115 NKRKKKMLLLQDLSSPQYILKKVFRKDGPPLGVEFDSLPSQAFCHCKGSKNSHPADQEDQ 174

Query: 249 RHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPX 428
           R  +RR VS     DY+  C  S P + +G+GKG MT       + G          +P 
Sbjct: 175 RATRRRTVSELTTIDYQNNCNESAPVKKHGIGKGLMTVWRVVNPEGG---------DIPT 225

Query: 429 XXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQK---KKKRVQPRESILKK--LADRER 593
                                      +S  V++K   + KR QP  S++K+  L  + +
Sbjct: 226 GVDFSNKQIIAPPQ-------------TSSPVVRKPPARNKRRQPLVSLMKQRSLEKKLQ 272

Query: 594 AKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQA 770
            KK+ S++ R+++              C+LALE     ++                  QA
Sbjct: 273 EKKRPSIKRREMKSNKDDSNRQLHKEKCELALEGSTSNKSLDQLLMLVDDEELELRELQA 332

Query: 771 GPNPLSCSAH 800
           GPNPL+CS H
Sbjct: 333 GPNPLTCSDH 342


>ref|XP_007049487.1| Homeodomain-like transcriptional regulator isoform 1 [Theobroma
           cacao] gi|508701748|gb|EOX93644.1| Homeodomain-like
           transcriptional regulator isoform 1 [Theobroma cacao]
          Length = 1164

 Score = 79.7 bits (195), Expect = 1e-12
 Identities = 71/250 (28%), Positives = 106/250 (42%), Gaps = 11/250 (4%)
 Frame = +3

Query: 84  NENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAF--PGGPRRSR---LEDN 248
           N+   +  L + L +P+ I  ++FRKDGP LG +FD LPS AF    G + S     ED 
Sbjct: 115 NKRKKKMLLLQDLSSPQYILKKVFRKDGPPLGVEFDSLPSQAFCHCKGSKNSHPADQEDQ 174

Query: 249 RHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPX 428
           R  +RR VS     DY+  C  S P + +G+GKG MT       + G          +P 
Sbjct: 175 RATRRRTVSELTTIDYQNNCNESAPVKKHGIGKGLMTVWRVVNPEGG---------DIPT 225

Query: 429 XXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQK---KKKRVQPRESILKK--LADRER 593
                                      +S  V++K   + KR QP  S++K+  L  + +
Sbjct: 226 GVDFSNKQIIAPPQ-------------TSSPVVRKPPARNKRRQPLVSLMKQRSLEKKLQ 272

Query: 594 AKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQA 770
            KK+ S++ R+++              C+LALE     ++                  QA
Sbjct: 273 EKKRPSIKRREMKSNKDDSNRQLHKEKCELALEGSTSNKSLDQLLMLVDDEELELRELQA 332

Query: 771 GPNPLSCSAH 800
           GPNPL+CS H
Sbjct: 333 GPNPLTCSDH 342


>ref|XP_002524572.1| hypothetical protein RCOM_1211540 [Ricinus communis]
           gi|223536125|gb|EEF37780.1| hypothetical protein
           RCOM_1211540 [Ricinus communis]
          Length = 1120

 Score = 79.7 bits (195), Expect = 1e-12
 Identities = 79/261 (30%), Positives = 108/261 (41%), Gaps = 8/261 (3%)
 Frame = +3

Query: 36  AVNQKTKSKRNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAFP 215
           A    +K+KR ++  +   D        L TP+ +  +IFRKDGP LG +FD LPS AF 
Sbjct: 81  ATRMISKTKRKKKKLIPSQD--------LLTPDYVLCKIFRKDGPPLGVEFDSLPSKAFL 132

Query: 216 GG--PRRSRL---EDNRHRKRRKVSMHALSDYETCCE--GSLPARGYGMGKGPMTAHGAP 374
                R S L   E+ R  ++RKVS     D  TC +   S PA  +G+GKG MT   A 
Sbjct: 133 NSIDSRNSNLASQENQRANRKRKVSK---QDTSTCQDYNNSDPAMKHGIGKGLMTVWRAT 189

Query: 375 AKKYGMGKGLMIQSSVPXXXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQKKKKRVQP 554
               G          +P                       +    +  + + +KKK+   
Sbjct: 190 NPTAG-----HFPPRIPFSQKEIVP---------------QVPTPTPRKSLCRKKKQQLV 229

Query: 555 RESILKKLADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXX 731
                K+L ++   K+K S++ R VE Q            C+LALE V   E    FA  
Sbjct: 230 SIMKQKRLENKTHHKRKPSVKQRVVESQRDEFQKLPLKERCELALEGVISQERINQFAML 289

Query: 732 XXXXXXXXXXXQAGPNPLSCS 794
                      QAGPNPLSCS
Sbjct: 290 ADDEELELRELQAGPNPLSCS 310


>ref|XP_006469383.1| PREDICTED: uncharacterized protein LOC102620965 isoform X1 [Citrus
           sinensis] gi|568830180|ref|XP_006469384.1| PREDICTED:
           uncharacterized protein LOC102620965 isoform X2 [Citrus
           sinensis]
          Length = 1155

 Score = 73.6 bits (179), Expect = 8e-11
 Identities = 71/239 (29%), Positives = 101/239 (42%), Gaps = 8/239 (3%)
 Frame = +3

Query: 108 LQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAFPGGPRRSR----LEDNRHRKR-RKV 272
           LQ++L TP+ I  ++FRKDGP+LG +FD LPS AF            L++N+  KR RKV
Sbjct: 117 LQDLL-TPDYILKKVFRKDGPSLGVEFDSLPSKAFFHSKDSINSCPPLQENQTAKRKRKV 175

Query: 273 SMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPXXXXXXXXX 452
           S+H   D++ CC  +   R +GMGKG MTA        G         +VP         
Sbjct: 176 SIHDELDHQECCTNTDHVRKHGMGKGLMTAWRVMNPNGG---------TVP--------- 217

Query: 453 XXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESILK--KLADRERAKKKNSLRCRK 626
                            A    +    +KKR Q   S+LK  +LA+  + K+K   + R+
Sbjct: 218 -TGIDVADRQVTVVPQMATPLSQKPPLRKKRAQQIVSLLKQRRLANNLQNKRKPVAKGRQ 276

Query: 627 VEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQAGPNPLSCSAH 800
           V+              C+LA + V   E     A             + GPNP +C  H
Sbjct: 277 VKLDKGERLRQPNKEKCELAPDSVISQERLDQIAMLVDDEELELRELEVGPNPPTCCDH 335


>ref|XP_006447893.1| hypothetical protein CICLE_v10014094mg [Citrus clementina]
           gi|557550504|gb|ESR61133.1| hypothetical protein
           CICLE_v10014094mg [Citrus clementina]
          Length = 1127

 Score = 73.6 bits (179), Expect = 8e-11
 Identities = 71/239 (29%), Positives = 101/239 (42%), Gaps = 8/239 (3%)
 Frame = +3

Query: 108 LQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAFPGGPRRSR----LEDNRHRKR-RKV 272
           LQ++L TP+ I  ++FRKDGP+LG +FD LPS AF            L++N+  KR RKV
Sbjct: 89  LQDLL-TPDYILKKVFRKDGPSLGVEFDSLPSKAFFHSKDSINSCPPLQENQTAKRKRKV 147

Query: 273 SMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLMIQSSVPXXXXXXXXX 452
           S+H   D++ CC  +   R +GMGKG MTA        G         +VP         
Sbjct: 148 SIHDELDHQECCTNTDHVRKHGMGKGLMTAWRVMNPNGG---------TVP--------- 189

Query: 453 XXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESILK--KLADRERAKKKNSLRCRK 626
                            A    +    +KKR Q   S+LK  +LA+  + K+K   + R+
Sbjct: 190 -TGIDVADRQVTVVPQMATPLSQKPPLRKKRAQQIVSLLKQRRLANNLQNKRKPVAKGRQ 248

Query: 627 VEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXXXXXXXQAGPNPLSCSAH 800
           V+              C+LA + V   E     A             + GPNP +C  H
Sbjct: 249 VKLDKGERLRQPNKEKCELAPDSVISQERLDQIAMLVDDEELELRELEVGPNPPTCCDH 307


>ref|XP_003579233.1| PREDICTED: uncharacterized protein LOC100825161 [Brachypodium
           distachyon]
          Length = 1111

 Score = 73.6 bits (179), Expect = 8e-11
 Identities = 74/259 (28%), Positives = 112/259 (43%), Gaps = 2/259 (0%)
 Frame = +3

Query: 30  GRAVNQKTKSKRNQQIYMNENDYRHRLQ-EVLYTPEDIFARIFRKDGPALGDQFDPLPSN 206
           G  +  +  ++ N    M+    +H L+ +VLY  + I A++FRKDGP+LG +FDPLP +
Sbjct: 64  GALMETQVSARSNGPRSMSLVGEKHALRPQVLYPKDYILAKVFRKDGPSLGSEFDPLPKS 123

Query: 207 AFPGGPRRSRLEDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKY 386
           A   G  R   E +  + +R V    + +   C +     +G+ +   P  ++G P +K+
Sbjct: 124 AH--GHVRDTTEYHSDQDQRVVKKRKIVE---CTD-----QGFTL---PCQSNG-PVRKH 169

Query: 387 GMGKGLMIQSSVPXXXXXXXXXXXXXXANTHHARGFKYDACSSGRVIQKKKKRVQPRESI 566
           GMGKGLM                      T   R  +    S G++IQK      PR+ +
Sbjct: 170 GMGKGLMTVWHAMYSKNAEVQDVSNFIDETGCLRSLRPFDDSDGKLIQKF---FLPRKKV 226

Query: 567 LKKLADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKCTE-NTGFAXXXXXX 743
            KK   R    K      RKV               C L++++ + +E  T  A      
Sbjct: 227 DKK--SRPPPSK------RKVPRGRVTVLKEHPAMECHLSVDESESSELQTEQATLVDDE 278

Query: 744 XXXXXXXQAGPNPLSCSAH 800
                  QAGPNPL CSAH
Sbjct: 279 ELELSELQAGPNPLRCSAH 297


>ref|XP_007214909.1| hypothetical protein PRUPE_ppa000565mg [Prunus persica]
           gi|462411059|gb|EMJ16108.1| hypothetical protein
           PRUPE_ppa000565mg [Prunus persica]
          Length = 1095

 Score = 71.6 bits (174), Expect = 3e-10
 Identities = 75/258 (29%), Positives = 99/258 (38%), Gaps = 11/258 (4%)
 Frame = +3

Query: 63  RNQQIYMNENDYRHRLQEVLYTPEDIFARIFRKDGPALGDQFDPLPSNAF-----PGGPR 227
           R +Q  MN N     +QE+L TP+ I  ++FRKDGP LG +FD LPS A      P    
Sbjct: 62  RYKQTKMNGN----HIQELL-TPDYILKKVFRKDGPPLGVEFDSLPSRALFHSTDPEDLH 116

Query: 228 RSRLEDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYGMGKGLM 407
               E+ R  KRRKV+ HA+  ++ C E                   AP KK+G+GKGLM
Sbjct: 117 PPCKENQRETKRRKVTEHAVIGHQNCDE------------------SAPVKKHGVGKGLM 158

Query: 408 IQSSVPXXXXXXXXXXXXXXANTHHARGFKYD-ACSSGRVIQKKKKRVQPRESILKKLAD 584
                               A    AR F  D   ++G V       + P     K +  
Sbjct: 159 ----------------TVWRATNPDARDFPVDMGFANGGVTSVS---LIPTPVSRKPVTQ 199

Query: 585 RERAKKKNSL----RCRKVEPQXXXXXXXXXXXXCDLALEDVKCTENTG-FAXXXXXXXX 749
             R ++K  +    R R                 C+LALE     E++   A        
Sbjct: 200 NRRLQQKKCVPKQGRVRNKVESNNENQTLPSKEKCELALEGAGSQEHSDKIAMLVDDEEL 259

Query: 750 XXXXXQAGPNPLSCSAHF 803
                Q  PN L CS HF
Sbjct: 260 ELRELQGRPNALGCSDHF 277


>gb|EEE67601.1| hypothetical protein OsJ_25152 [Oryza sativa Japonica Group]
          Length = 1173

 Score = 71.2 bits (173), Expect = 4e-10
 Identities = 69/265 (26%), Positives = 107/265 (40%), Gaps = 9/265 (3%)
 Frame = +3

Query: 33  RAVNQKTKSKRNQQIYMNENDYRHRLQEVLYTPED-IFARIFRKDGPALGDQFDPLPSNA 209
           R VN + +  R     M+    +H L+  +  P+D I  ++FRKDGP LG +FDPLP +A
Sbjct: 110 RTVNLQPEDDRYVDKGMSFTGEKHTLRSQVLFPKDYILRKVFRKDGPPLGSEFDPLPHSA 169

Query: 210 FPGGPRRSRLEDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYG 389
              G  R   +D+ ++ +R +    + +  T    SLP    G           P +K+G
Sbjct: 170 --PGHLRDTTDDHFYQNQRVIKKRKIVE-PTTQRSSLPCGDNG-----------PVRKHG 215

Query: 390 MGKGLMIQSSVPXXXXXXXXXXXXXXANTHHARGF-------KYDACSSGRVIQKKKKRV 548
            GKGLM                      T   R         + + C  G++IQKK   V
Sbjct: 216 AGKGLMTVWHAMYSHSSKIQDGSNFIDETGCLRSLRPLDDCGRIEDCDDGKLIQKK---V 272

Query: 549 QPRESILKKLADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKC-TENTGFA 725
             R+ ++K+   R  + K+     R  +P+            C L++++ +         
Sbjct: 273 LARKKVVKR--TRPPSNKRKVPSSRVTDPK------KHPPMECHLSVDESQSPVLQANQV 324

Query: 726 XXXXXXXXXXXXXQAGPNPLSCSAH 800
                        QAGPNPL CSAH
Sbjct: 325 TLVDDEELELRELQAGPNPLRCSAH 349


>gb|EEC82457.1| hypothetical protein OsI_26894 [Oryza sativa Indica Group]
          Length = 1173

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 68/265 (25%), Positives = 107/265 (40%), Gaps = 9/265 (3%)
 Frame = +3

Query: 33  RAVNQKTKSKRNQQIYMNENDYRHRLQEVLYTPED-IFARIFRKDGPALGDQFDPLPSNA 209
           R VN + +  R     M+    +H L+  +  P+D I  ++FRKDGP LG +FDPLP +A
Sbjct: 110 RTVNLQPEDDRYVDKGMSFTGEKHTLRSQVLFPKDYILRKVFRKDGPPLGSEFDPLPHSA 169

Query: 210 FPGGPRRSRLEDNRHRKRRKVSMHALSDYETCCEGSLPARGYGMGKGPMTAHGAPAKKYG 389
              G  R   +++ ++ +R +    + +  T    SLP    G           P +K+G
Sbjct: 170 --PGHLRDTTDNHFYQNQRVIKKRKIVE-PTTQRSSLPCGDNG-----------PVRKHG 215

Query: 390 MGKGLMIQSSVPXXXXXXXXXXXXXXANTHHARGF-------KYDACSSGRVIQKKKKRV 548
            GKGLM                      T   R         + + C  G++IQKK   V
Sbjct: 216 AGKGLMTVWHAMYSHSSKIQDGSNFIDETGCLRSLRPLDDCGRIEDCDDGKLIQKK---V 272

Query: 549 QPRESILKKLADRERAKKKNSLRCRKVEPQXXXXXXXXXXXXCDLALEDVKC-TENTGFA 725
             R+ ++K+   R  + K+     R  +P+            C L++++ +         
Sbjct: 273 LARKKVVKR--TRPPSNKRKVPSSRVTDPK------KHPPMECHLSVDESQSPVLQANQV 324

Query: 726 XXXXXXXXXXXXXQAGPNPLSCSAH 800
                        QAGPNPL CSAH
Sbjct: 325 TLVDDEELELRELQAGPNPLRCSAH 349

