BLASTX nr result

ID: Mentha25_contig00044326 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00044326
         (829 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006385642.1| DNA-binding family protein [Populus trichoca...   267   4e-69
gb|EPS58236.1| hypothetical protein M569_16579, partial [Genlise...   264   3e-68
ref|XP_002281340.1| PREDICTED: uncharacterized protein LOC100245...   264   3e-68
ref|XP_002519830.1| DNA binding protein, putative [Ricinus commu...   263   6e-68
gb|EYU23823.1| hypothetical protein MIMGU_mgv1a0132321mg, partia...   249   7e-64
gb|EXB99734.1| Putative DNA-binding protein ESCAROLA [Morus nota...   248   2e-63
ref|XP_006368415.1| hypothetical protein POPTR_0001s02600g [Popu...   248   2e-63
ref|XP_007039521.1| AT hook motif DNA-binding family protein iso...   242   1e-61
ref|XP_007209253.1| hypothetical protein PRUPE_ppa007231mg [Prun...   242   1e-61
ref|XP_004301686.1| PREDICTED: uncharacterized protein LOC101304...   242   1e-61
ref|XP_007039522.1| AT hook motif DNA-binding family protein iso...   236   8e-60
gb|EYU34143.1| hypothetical protein MIMGU_mgv1a023359mg [Mimulus...   235   2e-59
ref|XP_006436724.1| hypothetical protein CICLE_v10031852mg [Citr...   234   2e-59
ref|XP_004148734.1| PREDICTED: uncharacterized protein LOC101204...   233   5e-59
gb|AGE46020.1| putative AT-hook DNA-binding protein [Elaeis guin...   227   4e-57
gb|EXB56269.1| Putative DNA-binding protein ESCAROLA [Morus nota...   224   2e-56
ref|XP_006847725.1| hypothetical protein AMTR_s00149p00085280 [A...   222   1e-55
ref|XP_002275328.1| PREDICTED: uncharacterized protein LOC100263...   221   2e-55
emb|CAN64876.1| hypothetical protein VITISV_030792 [Vitis vinifera]   221   2e-55
ref|XP_007155774.1| hypothetical protein PHAVU_003G230500g [Phas...   221   3e-55

>ref|XP_006385642.1| DNA-binding family protein [Populus trichocarpa]
           gi|550342773|gb|ERP63439.1| DNA-binding family protein
           [Populus trichocarpa]
          Length = 375

 Score =  267 bits (682), Expect = 4e-69
 Identities = 158/273 (57%), Positives = 181/273 (66%), Gaps = 4/273 (1%)
 Frame = -1

Query: 808 LPNNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGG---GGPNPADHLQPDGSPS 638
           +P +SYP + +++ +N+P  +          GFPFN+M+G      P  A       S S
Sbjct: 35  VPTSSYP-STTSHLINNPNISPQNAA--LGGGFPFNTMSGNRLQSKPEGAFDGSSPTSSS 91

Query: 637 GGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXXXXXXXXXXXX 458
           G  FSI PA+KKRGRPRKY+PD +I LGLSP PV   PS ++  H D             
Sbjct: 92  GMRFSIEPAKKKRGRPRKYTPDGNIALGLSPTPV---PSGISAGHADSGGGGVTHDAASE 148

Query: 457 XXXXXXSKRNRGRPPGSVKRQLDALG-VPGVGFTPHVITVNAGEDIASKIMAFSQQGPRT 281
                  K+NRGRPPGS K+QLDALG V GVGFTPHVITV AGEDIASKIMAFSQQGPRT
Sbjct: 149 HPS----KKNRGRPPGSGKKQLDALGGVGGVGFTPHVITVKAGEDIASKIMAFSQQGPRT 204

Query: 280 VCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXXXXXXXXXXXX 101
           VC+LSANGAI NVTLRQ AMSGG+VTYEGRFEIISLSGSF                    
Sbjct: 205 VCILSANGAICNVTLRQPAMSGGSVTYEGRFEIISLSGSFLLSESNGSRSRSGGLSVSLA 264

Query: 100 GPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
           G DG+VLGGGVAGMLTAASPVQVI+GSFIA+GK
Sbjct: 265 GSDGRVLGGGVAGMLTAASPVQVIVGSFIADGK 297


>gb|EPS58236.1| hypothetical protein M569_16579, partial [Genlisea aurea]
          Length = 344

 Score =  264 bits (675), Expect = 3e-68
 Identities = 153/281 (54%), Positives = 178/281 (63%), Gaps = 7/281 (2%)
 Frame = -1

Query: 829 HSQSQHHLPNNS-YPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGG--GGPNPADHL 659
           HSQ+QH+  N+S Y L   +N+V    T++A +MHQQNP FPFNSM      GP P ++ 
Sbjct: 32  HSQTQHYSSNSSGYGLPGGSNSVAS-ATSNAGVMHQQNPRFPFNSMPAAVAPGPKPVENQ 90

Query: 658 QPDGSPS---GGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXX 488
             DGSPS   G    I PA+KKRGRPRKYSPDNSIGLGLSPA   +I S +    +    
Sbjct: 91  YSDGSPSASPGAWLGIEPAKKKRGRPRKYSPDNSIGLGLSPAAGGQISSAVGHVDSSGGT 150

Query: 487 XXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDAL-GVPGVGFTPHVITVNAGEDIASKI 311
                            KRNRGRPPGS KRQL+AL G+PGVGFTPHVI VN+GEDI SKI
Sbjct: 151 PSSETPL----------KRNRGRPPGSGKRQLNALAGLPGVGFTPHVIMVNSGEDIISKI 200

Query: 310 MAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXX 131
           MAFS+QGPRTVC+LSA GA+ NV L Q AM    VTYEGRFEIISLSGS           
Sbjct: 201 MAFSRQGPRTVCILSATGAVCNVALHQTAMPTSVVTYEGRFEIISLSGSVASSGSSGGQG 260

Query: 130 XXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAE 8
                       DG+VLGGGV  +L AAS VQ+I+GSF+ E
Sbjct: 261 QTGGLTVSLASSDGRVLGGGVGEILKAASSVQIIVGSFMTE 301


>ref|XP_002281340.1| PREDICTED: uncharacterized protein LOC100245362 [Vitis vinifera]
           gi|297742130|emb|CBI33917.3| unnamed protein product
           [Vitis vinifera]
          Length = 353

 Score =  264 bits (675), Expect = 3e-68
 Identities = 162/285 (56%), Positives = 183/285 (64%), Gaps = 11/285 (3%)
 Frame = -1

Query: 823 QSQHHLPN------NSYP--LANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGPNPA 668
           Q Q H P+      NSY   +AN++  +N     SA IM  QN  F F SM       P 
Sbjct: 10  QQQQHPPHGMMMGPNSYHTNMANTSPMMN---PNSAAIM--QNNRFSFTSMVAS---KPV 61

Query: 667 DHLQPDGSPSG---GGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHND 497
           D    DGS +G    GF+I PA+KKRGRPRKY+PD +I LGL+P P   IPS    AH D
Sbjct: 62  DSPYGDGSSTGLRPCGFNIEPAKKKRGRPRKYAPDGNIALGLAPTP---IPS--TAAHGD 116

Query: 496 XXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALGVPGVGFTPHVITVNAGEDIAS 317
                               KRNRGRPPGS K+QLDALG  GVGFTPHVITVN GEDIAS
Sbjct: 117 ATGTPSSEPPA---------KRNRGRPPGSGKKQLDALGAAGVGFTPHVITVNVGEDIAS 167

Query: 316 KIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXX 137
           KIMAFSQQGPRTVC+LSANGAI NVTLRQ AMSGGT++YEGRF+IISLSGSF        
Sbjct: 168 KIMAFSQQGPRTVCILSANGAICNVTLRQPAMSGGTISYEGRFDIISLSGSFLLSEDNGS 227

Query: 136 XXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                       G DG+VLGGGVAGMLTAA+PVQV++GSFIA+GK
Sbjct: 228 RHRTGGLSVSLAGSDGRVLGGGVAGMLTAATPVQVVVGSFIADGK 272


>ref|XP_002519830.1| DNA binding protein, putative [Ricinus communis]
           gi|223540876|gb|EEF42434.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 376

 Score =  263 bits (672), Expect = 6e-68
 Identities = 162/291 (55%), Positives = 180/291 (61%), Gaps = 17/291 (5%)
 Frame = -1

Query: 823 QSQHHLPNNSYPLANSNNAVNHPTTTSATIMHQQNP-----GFPFNSMTGGGGPNPADHL 659
           Q QH  P +      SN  +   +  +   M   NP     GFPFNS+   G P      
Sbjct: 10  QHQHQQPPHPQQQQQSNMMLGGYSNNAHPAMTMINPNIPPSGFPFNSV---GPPRTQPSK 66

Query: 658 QP-------DGS--PSGGG--FSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMA 512
           QP       DGS  PS  G  FS+ PA+KKRGRPRKY+PD +I LGLSP P+S   + + 
Sbjct: 67  QPSSDGGLFDGSSPPSSSGMRFSMDPAKKKRGRPRKYTPDGNIALGLSPTPISSSATSLP 126

Query: 511 QAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALG-VPGVGFTPHVITVNA 335
               D                   SKRNRGRPPGS K+QLDALG V GVGFTPHVITV A
Sbjct: 127 PHVADSGSGVGVGIGTPAIASDPPSKRNRGRPPGSGKKQLDALGGVGGVGFTPHVITVKA 186

Query: 334 GEDIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXX 155
           GEDIASKIMAFSQQGPRTVC+LSANGAI NVTLRQ AMSGGTVTYEGR+EIISLSGSF  
Sbjct: 187 GEDIASKIMAFSQQGPRTVCILSANGAICNVTLRQPAMSGGTVTYEGRYEIISLSGSFLL 246

Query: 154 XXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                             G DG+VLGGGVAGML AASPVQVI+GSFIA+GK
Sbjct: 247 SENNGNRSRSGGLSVSLAGSDGRVLGGGVAGMLMAASPVQVIVGSFIADGK 297


>gb|EYU23823.1| hypothetical protein MIMGU_mgv1a0132321mg, partial [Mimulus
           guttatus]
          Length = 210

 Score =  249 bits (637), Expect = 7e-64
 Identities = 139/205 (67%), Positives = 146/205 (71%), Gaps = 13/205 (6%)
 Frame = -1

Query: 736 IMHQQ--NPGFPFNSMTGGGGP--------NPADHLQPDGSPSGGG---FSIVPARKKRG 596
           +MHQQ  N  FPFNSM               P DH   DGSPSG G   F+I PARKKRG
Sbjct: 1   MMHQQQQNARFPFNSMAAAAAAAAAAAASQKPLDHQYSDGSPSGSGGGWFNIEPARKKRG 60

Query: 595 RPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRP 416
           RPRKYSPDNSIGLGLSPAPV++I S     H D                    KRNRGRP
Sbjct: 61  RPRKYSPDNSIGLGLSPAPVNQITSAGGGGHADSGGGGGGGGGGTPSSETSA-KRNRGRP 119

Query: 415 PGSVKRQLDALGVPGVGFTPHVITVNAGEDIASKIMAFSQQGPRTVCVLSANGAISNVTL 236
           PGSVK+QLDALGVPGVGFTPHVITV +GEDIASKIMAFSQQGPRTVC+LSA GAI NVTL
Sbjct: 120 PGSVKKQLDALGVPGVGFTPHVITVESGEDIASKIMAFSQQGPRTVCILSAYGAICNVTL 179

Query: 235 RQVAMSGGTVTYEGRFEIISLSGSF 161
           RQ AMSGGTVTYEGRFEIISLSGSF
Sbjct: 180 RQPAMSGGTVTYEGRFEIISLSGSF 204


>gb|EXB99734.1| Putative DNA-binding protein ESCAROLA [Morus notabilis]
          Length = 391

 Score =  248 bits (633), Expect = 2e-63
 Identities = 152/289 (52%), Positives = 178/289 (61%), Gaps = 14/289 (4%)
 Frame = -1

Query: 829 HSQSQHHLPNNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGG--GGPNPADHLQ 656
           HS + +H  NNS   A S+   ++   ++  +       FPFNS+T        P D L 
Sbjct: 36  HSHNHNHTNNNS---AASSMMGSNSIGSAQMLGGGGGARFPFNSVTPPPPSASKPLDSLS 92

Query: 655 P---DGSPS--------GGGFSIVP-ARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMA 512
               DGS S        GGGFSI   ++KKRGRPRKYSPD +I LGLSP P+    + + 
Sbjct: 93  ANPYDGSSSPGLRPCVGGGGFSIDSGSKKKRGRPRKYSPDGNIALGLSPTPIPS-STAVG 151

Query: 511 QAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALGVPGVGFTPHVITVNAG 332
             H D                    K++RGRPPGS KRQLDALG  GVGFTPHVI V AG
Sbjct: 152 GGHGDSSGTTPSSEASG--------KKHRGRPPGSSKRQLDALGAGGVGFTPHVIMVKAG 203

Query: 331 EDIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXX 152
           EDIASK+MAFSQQGPRTVC+LSANGAI NV+LRQ A+SGGTVTYEGR+EIISLSGSF   
Sbjct: 204 EDIASKVMAFSQQGPRTVCILSANGAICNVSLRQPALSGGTVTYEGRYEIISLSGSFFIS 263

Query: 151 XXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEG 5
                            GPDG+VLGGGVAG+L AASPVQVI+GSFI +G
Sbjct: 264 DNSGSRSRIGGLSVSLAGPDGRVLGGGVAGILMAASPVQVIVGSFIVDG 312


>ref|XP_006368415.1| hypothetical protein POPTR_0001s02600g [Populus trichocarpa]
           gi|550346328|gb|ERP64984.1| hypothetical protein
           POPTR_0001s02600g [Populus trichocarpa]
          Length = 377

 Score =  248 bits (633), Expect = 2e-63
 Identities = 155/280 (55%), Positives = 179/280 (63%), Gaps = 17/280 (6%)
 Frame = -1

Query: 790 PLANSN---NAVNHPTTTSATIMHQQNPGFPFNSMTGGGGP---NPADHLQ--PDG---- 647
           P + SN     +++P T S  +++ ++   P N+  GGG P     A  LQ  P+G    
Sbjct: 25  PQSQSNMIPGPISYPATASPHLINNRSIS-PQNAAIGGGFPFNQMSAQRLQSKPEGAFDG 83

Query: 646 ----SPSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXXXXX 479
               S SG  FSI PA+KKRGRPRKY+PD +I LGLSP P+    S M+    D      
Sbjct: 84  SSPTSSSGMRFSIEPAKKKRGRPRKYTPDGNIALGLSPTPIH---SGMSAGQADSSGGAG 140

Query: 478 XXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALG-VPGVGFTPHVITVNAGEDIASKIMAF 302
                         K++RGRPPGS K+QLDALG   GVGFTPHVITV AGEDIASKIMAF
Sbjct: 141 SGVMPDVASEHPS-KKHRGRPPGSGKKQLDALGGTGGVGFTPHVITVKAGEDIASKIMAF 199

Query: 301 SQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXXXXX 122
           SQQGPRTVC+LSANGAI NVTLRQ AMSGG+VTYEGRFEIISLSGSF             
Sbjct: 200 SQQGPRTVCILSANGAICNVTLRQPAMSGGSVTYEGRFEIISLSGSFLLSESNGSRSRTG 259

Query: 121 XXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                  G DG+VLGGGVAGMLTAAS VQVI+GSFIA+GK
Sbjct: 260 GLSVSLAGSDGRVLGGGVAGMLTAASAVQVILGSFIADGK 299


>ref|XP_007039521.1| AT hook motif DNA-binding family protein isoform 1 [Theobroma
           cacao] gi|508776766|gb|EOY24022.1| AT hook motif
           DNA-binding family protein isoform 1 [Theobroma cacao]
          Length = 386

 Score =  242 bits (618), Expect = 1e-61
 Identities = 155/286 (54%), Positives = 173/286 (60%), Gaps = 19/286 (6%)
 Frame = -1

Query: 802 NNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGP---------------NPA 668
           ++SYP   SN+ +  P  T A I     P FPFNS++    P                P 
Sbjct: 28  SSSYP---SNSGMISPNPTPA-IPPSSTPRFPFNSLSSPPPPPHHQHHQHHQHQQQPKPL 83

Query: 667 DHLQP---DGSPSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHND 497
           D L     DGSP     +    +KKRGRPRKY+PD +I L L  AP + I S  A  H  
Sbjct: 84  DSLNSVGFDGSPQLRYNTEPAMKKKRGRPRKYAPDGNIAL-LQLAPTTPIASNSAN-HGG 141

Query: 496 XXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALG-VPGVGFTPHVITVNAGEDIA 320
                              +KRNRGRPPGS KRQ+DALG V GVGFTPHVITV AGEDIA
Sbjct: 142 GDSVGLGSSSGGGAASEPPAKRNRGRPPGSGKRQMDALGGVGGVGFTPHVITVKAGEDIA 201

Query: 319 SKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXX 140
           +KIMAFSQQGPRTVC+LSANGAI NVTLRQ AMSGGTVTYEGRFEIISLSGSF       
Sbjct: 202 AKIMAFSQQGPRTVCILSANGAICNVTLRQPAMSGGTVTYEGRFEIISLSGSFLLSENNG 261

Query: 139 XXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                        G DG+VLGGGVAGML AASPVQVI+GSFIA+GK
Sbjct: 262 SRSRSGGLSVSLAGSDGRVLGGGVAGMLQAASPVQVIVGSFIADGK 307


>ref|XP_007209253.1| hypothetical protein PRUPE_ppa007231mg [Prunus persica]
           gi|462404988|gb|EMJ10452.1| hypothetical protein
           PRUPE_ppa007231mg [Prunus persica]
          Length = 377

 Score =  242 bits (618), Expect = 1e-61
 Identities = 149/281 (53%), Positives = 168/281 (59%), Gaps = 6/281 (2%)
 Frame = -1

Query: 826 SQSQHHLPNNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGPNPAD-HLQPD 650
           S    +L  NS P+    N    P         QQ        M     P+P D  L+P 
Sbjct: 31  SMPNSNLNPNSGPMMGGPNPARFPFNAVPQPQQQQQQPTSKPQMDSLS-PSPYDGSLRPC 89

Query: 649 GSPSGGGFSI-----VPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXXX 485
           GS  GGGFSI       A+KKRGRPRKYSPD +I LGL+P  +    S  A   +     
Sbjct: 90  GS--GGGFSIDSSSASAAKKKRGRPRKYSPDGNIALGLAPTQMPSTASTAAAGPHGESSG 147

Query: 484 XXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALGVPGVGFTPHVITVNAGEDIASKIMA 305
                           K+NRGRPPGS K+QLDALG  GVGFTPHVI V AGEDIA+K+M+
Sbjct: 148 TMSSDPPA--------KKNRGRPPGSGKKQLDALGAGGVGFTPHVIMVQAGEDIAAKVMS 199

Query: 304 FSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXXXX 125
           FSQQGPRTVC+LSANGAI NVTLRQ AMSGGTVTYEGRFEIISLSGS+            
Sbjct: 200 FSQQGPRTVCILSANGAICNVTLRQPAMSGGTVTYEGRFEIISLSGSYLFSENNGNRSRS 259

Query: 124 XXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                   G DG+VLGGGVAGML AASPVQVI+GSFIA+GK
Sbjct: 260 GGLSVSLAGSDGQVLGGGVAGMLVAASPVQVIVGSFIADGK 300


>ref|XP_004301686.1| PREDICTED: uncharacterized protein LOC101304880 [Fragaria vesca
           subsp. vesca]
          Length = 383

 Score =  242 bits (617), Expect = 1e-61
 Identities = 149/286 (52%), Positives = 172/286 (60%), Gaps = 20/286 (6%)
 Frame = -1

Query: 799 NSYPLANSNN---AVNHPTTTSATIMHQQNPG-FPFNSMTGGG-GPNPADHLQPDGSP-- 641
           NSY     NN   A N+ + +SA ++   N G F +N +        P D + P  SP  
Sbjct: 27  NSYTSPIPNNTATATNNNSNSSAAMIGGPNSGRFQYNPVAQQPPASKPLDAMSPSPSPFD 86

Query: 640 ------SGGGFSIVPA-----RKKRGRPRKYSPDNSIGLGLSPAPV--SRIPSLMAQAHN 500
                   GGFSI  +     +KKRGRPRKYSPD +I LGL+P  V  S  P   A  H 
Sbjct: 87  GSLRPCGSGGFSIDSSTASAGKKKRGRPRKYSPDGNIALGLAPTQVAASAAPVAAAGPHG 146

Query: 499 DXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALGVPGVGFTPHVITVNAGEDIA 320
           +                    K+NRGRPPGS K+QLDALG  GVGFTPHVI+V AGEDIA
Sbjct: 147 ESSVTMSSDPPA---------KKNRGRPPGSGKKQLDALGAGGVGFTPHVISVQAGEDIA 197

Query: 319 SKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXX 140
           +K+M FSQQGPRT+C+LSANG ISNVTLRQ +MSGGTVTYEGRFEIISLSGS+       
Sbjct: 198 TKVMNFSQQGPRTICILSANGPISNVTLRQPSMSGGTVTYEGRFEIISLSGSYMFSENNG 257

Query: 139 XXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                        G DG VLGGGVAGML AA PVQVI+GSFIAEGK
Sbjct: 258 NRSRSGGLSVSLAGSDGSVLGGGVAGMLVAAGPVQVIVGSFIAEGK 303


>ref|XP_007039522.1| AT hook motif DNA-binding family protein isoform 2 [Theobroma
           cacao] gi|508776767|gb|EOY24023.1| AT hook motif
           DNA-binding family protein isoform 2 [Theobroma cacao]
          Length = 391

 Score =  236 bits (602), Expect = 8e-60
 Identities = 155/291 (53%), Positives = 173/291 (59%), Gaps = 24/291 (8%)
 Frame = -1

Query: 802 NNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGP---------------NPA 668
           ++SYP   SN+ +  P  T A I     P FPFNS++    P                P 
Sbjct: 28  SSSYP---SNSGMISPNPTPA-IPPSSTPRFPFNSLSSPPPPPHHQHHQHHQHQQQPKPL 83

Query: 667 DHLQP---DGSPSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHND 497
           D L     DGSP     +    +KKRGRPRKY+PD +I L L  AP + I S  A  H  
Sbjct: 84  DSLNSVGFDGSPQLRYNTEPAMKKKRGRPRKYAPDGNIAL-LQLAPTTPIASNSAN-HGG 141

Query: 496 XXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALG-VPGVGFTPHVITVNAGE--- 329
                              +KRNRGRPPGS KRQ+DALG V GVGFTPHVITV AGE   
Sbjct: 142 GDSVGLGSSSGGGAASEPPAKRNRGRPPGSGKRQMDALGGVGGVGFTPHVITVKAGESFG 201

Query: 328 --DIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXX 155
             DIA+KIMAFSQQGPRTVC+LSANGAI NVTLRQ AMSGGTVTYEGRFEIISLSGSF  
Sbjct: 202 LQDIAAKIMAFSQQGPRTVCILSANGAICNVTLRQPAMSGGTVTYEGRFEIISLSGSFLL 261

Query: 154 XXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                             G DG+VLGGGVAGML AASPVQVI+GSFIA+GK
Sbjct: 262 SENNGSRSRSGGLSVSLAGSDGRVLGGGVAGMLQAASPVQVIVGSFIADGK 312


>gb|EYU34143.1| hypothetical protein MIMGU_mgv1a023359mg [Mimulus guttatus]
          Length = 288

 Score =  235 bits (599), Expect = 2e-59
 Identities = 144/276 (52%), Positives = 164/276 (59%)
 Frame = -1

Query: 829 HSQSQHHLPNNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGPNPADHLQPD 650
           HSQSQ +L  N             P T +  +M QQN GFPFN+   G      DHLQ D
Sbjct: 4   HSQSQQNLSIN-------------PNTMNMNMMQQQNHGFPFNNSMSG--QKTVDHLQSD 48

Query: 649 GSPSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXXXXXXXX 470
           G   GGG    P+RKKRGRPRK      IG+  +PA   R+                   
Sbjct: 49  GGGGGGGGG-EPSRKKRGRPRK-----CIGVSETPAAAKRL------------------- 83

Query: 469 XXXXXXXXXXSKRNRGRPPGSVKRQLDALGVPGVGFTPHVITVNAGEDIASKIMAFSQQG 290
                         RGRPPGSVK+QL++LGVPGVGFTPHVITVNAGED+ASKIMAFS+QG
Sbjct: 84  --------------RGRPPGSVKKQLNSLGVPGVGFTPHVITVNAGEDVASKIMAFSKQG 129

Query: 289 PRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXXXXXXXXX 110
            RTVC+LSANG ISNVTLRQ +MSGGTVTYEG+FEII LSGS                  
Sbjct: 130 CRTVCILSANGTISNVTLRQASMSGGTVTYEGQFEIICLSGS---------TSGGGGLSV 180

Query: 109 XXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
              G DG VLGGGVAG+L AAS VQV++GSFIA+GK
Sbjct: 181 SLAGSDGMVLGGGVAGLLKAASQVQVVVGSFIADGK 216


>ref|XP_006436724.1| hypothetical protein CICLE_v10031852mg [Citrus clementina]
           gi|568864368|ref|XP_006485573.1| PREDICTED:
           uncharacterized protein LOC102612198 [Citrus sinensis]
           gi|557538920|gb|ESR49964.1| hypothetical protein
           CICLE_v10031852mg [Citrus clementina]
          Length = 376

 Score =  234 bits (598), Expect = 2e-59
 Identities = 151/298 (50%), Positives = 171/298 (57%), Gaps = 24/298 (8%)
 Frame = -1

Query: 823 QSQHHLPNNSY-PLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGPNPAD-----H 662
           Q QH  PN    P +   NA+  P   +          F FN ++     + +       
Sbjct: 14  QHQHQQPNIMMGPTSYHTNAMMPPNAAAGAAAR-----FSFNPLSSSQSQSQSQSESQSQ 68

Query: 661 LQP-------------DGSPS----GGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVS 533
           LQP             DGSPS    GG FSI PA+KKRGRPRKY+PD +I L L+     
Sbjct: 69  LQPKQPLDSLPHGGVFDGSPSLRTGGGSFSIDPAKKKRGRPRKYTPDGNIALRLATT--- 125

Query: 532 RIPSLMAQAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALG-VPGVGFTP 356
                 AQ+                      +KR+RGRPPGS K+QLDALG V GVGFTP
Sbjct: 126 ------AQSPGSLADSGGGGGGAAGSASEPSAKRHRGRPPGSGKKQLDALGGVGGVGFTP 179

Query: 355 HVITVNAGEDIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIIS 176
           HVITV AGEDI+SKI AFSQQGPRTVC+LSA+GAI NVTLRQ  MSGGTVTYEGRFEIIS
Sbjct: 180 HVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEGRFEIIS 239

Query: 175 LSGSFXXXXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
           LSGSF                    G DG+VLGG VAGML AASPVQVI+GSFIAEGK
Sbjct: 240 LSGSFLLSDNNGNRSRSGGLSVSLAGSDGRVLGGLVAGMLMAASPVQVIVGSFIAEGK 297


>ref|XP_004148734.1| PREDICTED: uncharacterized protein LOC101204243 [Cucumis sativus]
           gi|449511145|ref|XP_004163876.1| PREDICTED:
           uncharacterized LOC101204243 [Cucumis sativus]
          Length = 362

 Score =  233 bits (595), Expect = 5e-59
 Identities = 147/299 (49%), Positives = 179/299 (59%), Gaps = 24/299 (8%)
 Frame = -1

Query: 826 SQSQHH---------LPNNSYPLANSNNAVN-----HPTTTSATIMHQQNPGFPFNSMTG 689
           S  QHH         +PNN+   AN  N+ N     +P + +A +M   +  FPFNSM G
Sbjct: 10  SVHQHHQQSTPPNRMIPNNASYSANMPNSNNTSPLINPNSAAAQMMSSASR-FPFNSMMG 68

Query: 688 GGG-----PNPADHLQPDGSPSG---GGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPV- 536
                   PN A +   DGS S    GGF+I   +KKRGRPRKYSPD +I LGLSP P+ 
Sbjct: 69  SSSKPSESPNAASY---DGSQSELRTGGFNIDSGKKKRGRPRKYSPDGNIALGLSPTPIT 125

Query: 535 -SRIPSLMAQAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQLDALGVPGVGFT 359
            S +P+  A  H+                     K+NRGRPPG+ KRQ+DALG  GVGFT
Sbjct: 126 SSAVPADSAGMHSPDPRP----------------KKNRGRPPGTGKRQMDALGTGGVGFT 169

Query: 358 PHVITVNAGEDIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEII 179
           PHVI V  GEDIASK+MAFSQQGPRTVC+LSA+GA+ NVTL Q A+S G+V+YEGR+EII
Sbjct: 170 PHVILVKPGEDIASKVMAFSQQGPRTVCILSAHGAVCNVTL-QPALSSGSVSYEGRYEII 228

Query: 178 SLSGSFXXXXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
           SLSGSF                      DG+VL GG+  MLTAAS VQVI+GSF+ +GK
Sbjct: 229 SLSGSFLISENNGNRSRSGGLSVSLASADGQVL-GGITNMLTAASTVQVIVGSFLVDGK 286


>gb|AGE46020.1| putative AT-hook DNA-binding protein [Elaeis guineensis]
          Length = 362

 Score =  227 bits (579), Expect = 4e-57
 Identities = 135/275 (49%), Positives = 163/275 (59%), Gaps = 1/275 (0%)
 Frame = -1

Query: 823 QSQHHLPNNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGPNPADHLQPDGS 644
           QSQ  + +     A    A+  P TTS+        G    S  GG GP+PA  + P G 
Sbjct: 26  QSQPSMQSMRLAFAPDGTAIYKPITTSSPPPPPYQGGGGAGSTGGGDGPSPAA-ITPHGL 84

Query: 643 PSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXXXXXXXXXX 464
               G    P ++KRGRPRKY PD ++ L L+    +   + ++                
Sbjct: 85  NINVG---EPVKRKRGRPRKYGPDGTMSLALTTVSPT---AAVSPGSGGFSPSSAGAGNP 138

Query: 463 XXXXXXXXSKRNRGRPPGSVKRQ-LDALGVPGVGFTPHVITVNAGEDIASKIMAFSQQGP 287
                    K+ RGRPPGS K+Q L ALG  G+GFTPHVITV AGED++SKIM+FSQ GP
Sbjct: 139 ASSASAEAMKKARGRPPGSGKKQQLAALGSAGIGFTPHVITVKAGEDVSSKIMSFSQHGP 198

Query: 286 RTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXXXXXXXXXX 107
           R VC+LSANGAISNVTLRQ A SGGTVTYEGRFEI+SLSGSF                  
Sbjct: 199 RAVCILSANGAISNVTLRQAATSGGTVTYEGRFEILSLSGSFLLSESGGQRSRTGGLSVS 258

Query: 106 XXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
             GPDG+VLGGGVAG+LTAASPVQV++GSFIA+GK
Sbjct: 259 LAGPDGRVLGGGVAGLLTAASPVQVVVGSFIADGK 293


>gb|EXB56269.1| Putative DNA-binding protein ESCAROLA [Morus notabilis]
          Length = 351

 Score =  224 bits (572), Expect = 2e-56
 Identities = 134/279 (48%), Positives = 162/279 (58%), Gaps = 3/279 (1%)
 Frame = -1

Query: 829 HSQSQHHLPNNSYPLANSNNAVNHPTTTSATIMHQQNPGFPFNSMTGGGGPNPADHLQPD 650
           H Q+   +P      A +  AV  P  T+AT     +P +  +     GG      + P 
Sbjct: 14  HQQNNIRIPFTPPDSAAAAAAVYKPNITTAT-----SPSYQPSGDASSGGV-----MVPM 63

Query: 649 GSPSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVS-RIPSLMAQAHNDXXXXXXXX 473
            + SGGG      ++KRGRPRKY PD ++ LGLSP P S  +        +         
Sbjct: 64  AAASGGGGGEPMVKRKRGRPRKYGPDGTMALGLSPNPPSVGVTQSSGGGFSSPPPTAAIS 123

Query: 472 XXXXXXXXXXXSKRNRGRPPGSV--KRQLDALGVPGVGFTPHVITVNAGEDIASKIMAFS 299
                       K+ RGRPPGS   K+Q DA G  G GFTPHVITV AGED++SKIM+FS
Sbjct: 124 GGGGGGPTSASLKKARGRPPGSTGKKQQFDAFGSAGFGFTPHVITVKAGEDVSSKIMSFS 183

Query: 298 QQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXXXXXX 119
           Q GPR VCVLSANGAISNVTLRQ A SGGTVTYEGR+EI+SLSGSF              
Sbjct: 184 QHGPRAVCVLSANGAISNVTLRQPATSGGTVTYEGRYEILSLSGSFLLSENGGQRSRTGG 243

Query: 118 XXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
                 G DG+VLGGGVAG+LTAASPVQV++GSFIA+G+
Sbjct: 244 LSVSLSGTDGRVLGGGVAGLLTAASPVQVVVGSFIADGR 282


>ref|XP_006847725.1| hypothetical protein AMTR_s00149p00085280 [Amborella trichopoda]
           gi|548850994|gb|ERN09306.1| hypothetical protein
           AMTR_s00149p00085280 [Amborella trichopoda]
          Length = 346

 Score =  222 bits (566), Expect = 1e-55
 Identities = 128/247 (51%), Positives = 154/247 (62%), Gaps = 6/247 (2%)
 Frame = -1

Query: 724 QNPGFPFNSMTGGG--GPNPADHLQPDGS--PSGGGFSIVPARKKRGRPRKYSPDNSIGL 557
           QN   PFN++         P ++  P G+  P G   S  P +KKRGRPRKY PD S+ L
Sbjct: 35  QNMRLPFNTVVSKQTEANAPLNYPNPSGAIVPHGASMS-EPIKKKRGRPRKYGPDGSVSL 93

Query: 556 GLSPAPVSRIPSLMAQAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSV--KRQLDAL 383
            L+ +P+S +P                             KRNRGRP G+   K+Q+ AL
Sbjct: 94  ALA-SPISSVPGYSTTPS---------------------YKRNRGRPAGAGGRKQQMAAL 131

Query: 382 GVPGVGFTPHVITVNAGEDIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVT 203
           G  GVGFTPH+I + AGED+ASKIM+FSQQGPR +C+LSANGAISNVTLRQ A SGGTVT
Sbjct: 132 GTAGVGFTPHIIAIMAGEDVASKIMSFSQQGPRAICILSANGAISNVTLRQAATSGGTVT 191

Query: 202 YEGRFEIISLSGSFXXXXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIG 23
           YEGRFEIISLSGS+                    GPDG+VLGGGVAG+L AA+PVQV++G
Sbjct: 192 YEGRFEIISLSGSYLLTERDGILSRTGGLSVSLAGPDGRVLGGGVAGLLVAATPVQVVVG 251

Query: 22  SFIAEGK 2
           SFIAEGK
Sbjct: 252 SFIAEGK 258


>ref|XP_002275328.1| PREDICTED: uncharacterized protein LOC100263332 [Vitis vinifera]
           gi|297745600|emb|CBI40765.3| unnamed protein product
           [Vitis vinifera]
          Length = 353

 Score =  221 bits (564), Expect = 2e-55
 Identities = 128/238 (53%), Positives = 151/238 (63%), Gaps = 2/238 (0%)
 Frame = -1

Query: 709 PFNSMTGGGGP-NPADHLQPDGSPSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVS 533
           P+ S  G GG  +    + P G     G    P ++KRGRPRKY PD ++ L LSPAP  
Sbjct: 54  PYQSSGGTGGDGSTGGAIIPHGLNMNMGSE--PLKRKRGRPRKYGPDGTMALALSPAPSG 111

Query: 532 RIPSLMAQAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQ-LDALGVPGVGFTP 356
              S    A +                     K+ RGRPPGS K+Q ++ALG  GVGFTP
Sbjct: 112 VNVSQSGGAFSSPPASAGSASPSSL-------KKARGRPPGSSKKQQMEALGSAGVGFTP 164

Query: 355 HVITVNAGEDIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIIS 176
           HVITV AGED++SKIM+FSQ GPR VC+LSANGAISNVTLRQ A SGGTVTYEGRFEI+S
Sbjct: 165 HVITVKAGEDVSSKIMSFSQHGPRAVCILSANGAISNVTLRQPATSGGTVTYEGRFEILS 224

Query: 175 LSGSFXXXXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
           LSGSF                    GPDG+VLGGGVAG+LTAASPVQV++GSFIA+G+
Sbjct: 225 LSGSFLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFIADGR 282


>emb|CAN64876.1| hypothetical protein VITISV_030792 [Vitis vinifera]
          Length = 390

 Score =  221 bits (564), Expect = 2e-55
 Identities = 128/238 (53%), Positives = 151/238 (63%), Gaps = 2/238 (0%)
 Frame = -1

Query: 709 PFNSMTGGGGP-NPADHLQPDGSPSGGGFSIVPARKKRGRPRKYSPDNSIGLGLSPAPVS 533
           P+ S  G GG  +    + P G     G    P ++KRGRPRKY PD ++ L LSPAP  
Sbjct: 54  PYQSSGGTGGDGSTGGAIIPHGLNMNMGSE--PLKRKRGRPRKYGPDGTMALALSPAPSG 111

Query: 532 RIPSLMAQAHNDXXXXXXXXXXXXXXXXXXXSKRNRGRPPGSVKRQ-LDALGVPGVGFTP 356
              S    A +                     K+ RGRPPGS K+Q ++ALG  GVGFTP
Sbjct: 112 VNVSQSGGAFSSPPASAGSASPSSL-------KKARGRPPGSSKKQQMEALGSAGVGFTP 164

Query: 355 HVITVNAGEDIASKIMAFSQQGPRTVCVLSANGAISNVTLRQVAMSGGTVTYEGRFEIIS 176
           HVITV AGED++SKIM+FSQ GPR VC+LSANGAISNVTLRQ A SGGTVTYEGRFEI+S
Sbjct: 165 HVITVKAGEDVSSKIMSFSQHGPRAVCILSANGAISNVTLRQPATSGGTVTYEGRFEILS 224

Query: 175 LSGSFXXXXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGMLTAASPVQVIIGSFIAEGK 2
           LSGSF                    GPDG+VLGGGVAG+LTAASPVQV++GSFIA+G+
Sbjct: 225 LSGSFLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFIADGR 282


>ref|XP_007155774.1| hypothetical protein PHAVU_003G230500g [Phaseolus vulgaris]
           gi|561029128|gb|ESW27768.1| hypothetical protein
           PHAVU_003G230500g [Phaseolus vulgaris]
          Length = 368

 Score =  221 bits (563), Expect = 3e-55
 Identities = 133/259 (51%), Positives = 155/259 (59%), Gaps = 7/259 (2%)
 Frame = -1

Query: 757 PTTTSATIMHQQNPGFPFNSMTGGGG-PNPADH---LQPDGSPSGGGFSIVP---ARKKR 599
           P ++ A +M      FPF  +      P PA     + P  +  G    + P   A+KKR
Sbjct: 28  PNSSGAVMMAPATARFPFGVVPQQQQQPPPASEPFPVSPAAAYDGSSSPMKPCSLAKKKR 87

Query: 598 GRPRKYSPDNSIGLGLSPAPVSRIPSLMAQAHNDXXXXXXXXXXXXXXXXXXXSKRNRGR 419
           GRPRKYSPD +I LGL+P   S  P     A N                    +K++RGR
Sbjct: 88  GRPRKYSPDGNIALGLAPTHASPPPP----ASNAASGGGIGGDSAGTASADAPAKKHRGR 143

Query: 418 PPGSVKRQLDALGVPGVGFTPHVITVNAGEDIASKIMAFSQQGPRTVCVLSANGAISNVT 239
           PPGS K+QLDALG  GVGFTPHVI V +GEDI +KIMAFSQQGPRTVC+LSA GAI NVT
Sbjct: 144 PPGSGKKQLDALGAGGVGFTPHVILVESGEDITAKIMAFSQQGPRTVCILSAIGAICNVT 203

Query: 238 LRQVAMSGGTVTYEGRFEIISLSGSFXXXXXXXXXXXXXXXXXXXXGPDGKVLGGGVAGM 59
           LRQ A+SGGT TYEGRFEIISLSG+                     G DG+VLGGGVAG 
Sbjct: 204 LRQPALSGGTATYEGRFEIISLSGAMQQSESNGERSRTCTLNVTLAGSDGRVLGGGVAGT 263

Query: 58  LTAASPVQVIIGSFIAEGK 2
           LTAAS VQVI+GSFI +GK
Sbjct: 264 LTAASTVQVIVGSFIVDGK 282

