BLASTX nr result

ID: Forsythia22_contig00004706

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00004706
         (1036 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABL63120.1| AT-hook DNA-binding protein [Catharanthus roseus]      259   2e-66
ref|XP_009757660.1| PREDICTED: putative DNA-binding protein ESCA...   248   7e-63
ref|XP_006338072.1| PREDICTED: putative DNA-binding protein ESCA...   244   1e-61
ref|XP_009591426.1| PREDICTED: putative DNA-binding protein ESCA...   242   4e-61
ref|XP_004237987.1| PREDICTED: putative DNA-binding protein ESCA...   237   9e-60
ref|XP_002285689.1| PREDICTED: putative DNA-binding protein ESCA...   237   1e-59
emb|CAN74013.1| hypothetical protein VITISV_003550 [Vitis vinifera]   237   1e-59
emb|CDP11496.1| unnamed protein product [Coffea canephora]            229   3e-57
ref|XP_010107687.1| hypothetical protein L484_007706 [Morus nota...   226   2e-56
ref|XP_007226323.1| hypothetical protein PRUPE_ppa022234mg [Prun...   221   5e-55
ref|XP_007018209.1| AT-hook DNA-binding family protein [Theobrom...   221   9e-55
ref|XP_004148420.1| PREDICTED: AT-hook motif nuclear-localized p...   221   9e-55
ref|XP_010277472.1| PREDICTED: putative DNA-binding protein ESCA...   220   1e-54
ref|XP_004299724.1| PREDICTED: putative DNA-binding protein ESCA...   220   1e-54
ref|XP_008219285.1| PREDICTED: putative DNA-binding protein ESCA...   219   2e-54
ref|XP_003522748.1| PREDICTED: putative DNA-binding protein ESCA...   219   3e-54
ref|XP_010266082.1| PREDICTED: putative DNA-binding protein ESCA...   218   6e-54
ref|XP_012838878.1| PREDICTED: AT-hook motif nuclear-localized p...   218   6e-54
ref|XP_008444938.1| PREDICTED: putative DNA-binding protein ESCA...   218   7e-54
ref|XP_003526530.1| PREDICTED: putative DNA-binding protein ESCA...   217   1e-53

>gb|ABL63120.1| AT-hook DNA-binding protein [Catharanthus roseus]
          Length = 335

 Score =  259 bits (662), Expect = 2e-66
 Identities = 151/328 (46%), Positives = 191/328 (58%), Gaps = 50/328 (15%)
 Frame = +1

Query: 49  MKREYAEENQDRSNIKMISKFHQTKKFHQNPQPPFSTPVEENRXXXXXXXXXXXXXX--- 219
           MK EY ++ +D  +  M +K HQT+KFH +P PP  + +  +                  
Sbjct: 1   MKGEYVKDEKDNHS-SMFAKLHQTQKFHHHPSPPHPSALHHHNNHHHNSFQVPRECQNSE 59

Query: 220 --------------------------------DGDTIEVSXXXXXXXXXSKNKLKPPVII 303
                                           DG +IEV          SKNK KPPVII
Sbjct: 60  EVDSHGHHSPTTKRDLSIQPVILSAPLSSGGNDGASIEVVRRPRGRPPGSKNKPKPPVII 119

Query: 304 TQNAEPSMAPYVLELPPGIDVIEAITRFCRKRKMGLCVLNGNGAVSNVTLKQPSTT-GAT 480
           T++AEPSM+PYVLELP GID++E+IT FCRKR MGLC+LNG+G V+NVTL+QPSTT GA+
Sbjct: 120 TRDAEPSMSPYVLELPGGIDIVESITSFCRKRNMGLCILNGSGTVTNVTLRQPSTTPGAS 179

Query: 481 VTFHGRFNILSLSATILPVNV--PTSLTTALANGFAISLAXXXXXXXXXXXXXXXXSAGT 654
           VTFHGRF+ILSLSAT++P N     +L+  +ANGF ISLA                SAGT
Sbjct: 180 VTFHGRFDILSLSATVIPSNTLSAIALSNGIANGFTISLAGPQGQVVGGAVVGSLFSAGT 239

Query: 655 IYLISATFNSPSYHRLPMEDDATDSAS-GGEANDGRHQSPPPAVSCADSGHPPA------ 813
           +YLI+A+FN+P YHRLP+EDD  +S S GG   +G HQS P A S  D G  PA      
Sbjct: 240 VYLIAASFNNPQYHRLPLEDDQRNSGSAGGTGQEGHHQS-PSATSGGDDGRSPAAPVGGS 298

Query: 814 ----ESCGISIYS-YQPSDVIWAPTARQ 882
               +SCG+S++S + PSDVIWAPTARQ
Sbjct: 299 SAGMDSCGVSLFSCHLPSDVIWAPTARQ 326


>ref|XP_009757660.1| PREDICTED: putative DNA-binding protein ESCAROLA [Nicotiana
           sylvestris]
          Length = 330

 Score =  248 bits (632), Expect = 7e-63
 Identities = 152/322 (47%), Positives = 192/322 (59%), Gaps = 44/322 (13%)
 Frame = +1

Query: 49  MKREYA--EENQDRSNIK----MISKFHQTKKFHQNPQ------PPF------------- 153
           MK EY   EE +D S+      M  K H  + F QNP       P F             
Sbjct: 1   MKGEYVQLEEKKDHSSNSSRNSMFGKLHHPQNFQQNPSHHHFHHPSFQISRECQNSEEAD 60

Query: 154 STPVEEN--------RXXXXXXXXXXXXXXDGDTIEVSXXXXXXXXXSKNKLKPPVIITQ 309
           ST  ++N                       DG TIEV          SKN+ KPPVIIT+
Sbjct: 61  STTADKNDALTPQPVSAVAPPPPPPPPPSSDGATIEVVRRPRGRPPGSKNRPKPPVIITR 120

Query: 310 NAEPSMAPYVLELPPGIDVIEAITRFCRKRKMGLCVLNGNGAVSNVTLKQPSTTG-ATVT 486
           +AEPSM+PY+LE+P G+D+I +IT+FCRKR MGLCVLNG+G ++NVTL+QPSTT  +TVT
Sbjct: 121 DAEPSMSPYILEIPIGVDIINSITKFCRKRNMGLCVLNGSGTITNVTLRQPSTTPVSTVT 180

Query: 487 FHGRFNILSLSATILPVNVPT-SLTTALANGFAISLAXXXXXXXXXXXXXXXXSAGTIYL 663
           FHGRF+ILS+SATI+  N    S    +ANGF ISLA                +AGT+YL
Sbjct: 181 FHGRFDILSISATIVQPNASVPSNNNGIANGFTISLAGPQGQVVGGGVVGPLLTAGTVYL 240

Query: 664 ISATFNSPSYHRLPMEDDATDSASGGEANDGR-HQSPP-PAVSC--ADSGHPP----AES 819
           ++ATFNSP++H+LP+E++   ++ GG  N+G  HQSPP PAVS    DSGHPP     ES
Sbjct: 241 VAATFNSPTFHKLPVEEELARNSGGGGGNEGSGHQSPPQPAVSVGGGDSGHPPQTTAPES 300

Query: 820 CGISIYS-YQPSDVIWAPTARQ 882
           CG+S+YS + PSDVIWAPTARQ
Sbjct: 301 CGMSMYSCHLPSDVIWAPTARQ 322


>ref|XP_006338072.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Solanum
           tuberosum]
          Length = 354

 Score =  244 bits (622), Expect = 1e-61
 Identities = 130/230 (56%), Positives = 160/230 (69%), Gaps = 9/230 (3%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT++AEPSM+PY+LE+P G+D+I +IT+FCRKR
Sbjct: 119 DGATIEVVRRPRGRPPGSKNKPKPPVIITRDAEPSMSPYILEIPTGVDIINSITKFCRKR 178

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTTG-ATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            MGLCVLNG+G V+NVTL+QPSTT  +TVTFHGRF+ILS+SAT++  N        +ANG
Sbjct: 179 NMGLCVLNGSGTVTNVTLRQPSTTPVSTVTFHGRFDILSISATVVQPNASVPSNNGIANG 238

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+YLI+ATFN PSYH+LP E++   + SGG   DG
Sbjct: 239 FTISLAGPQGQVVGGGVIGPLVTAGTVYLIAATFNGPSYHQLPAEEELARNNSGGGNEDG 298

Query: 757 RHQSPPPAVSCA---DSGHPPA----ESCGISIYS-YQPSDVIWAPTARQ 882
              SPPP    +   DSGHPP+    E+CG+SIYS + PSDVIWAPTARQ
Sbjct: 299 ---SPPPHAEVSGGGDSGHPPSTTAPETCGMSIYSCHLPSDVIWAPTARQ 345


>ref|XP_009591426.1| PREDICTED: putative DNA-binding protein ESCAROLA [Nicotiana
           tomentosiformis]
          Length = 328

 Score =  242 bits (617), Expect = 4e-61
 Identities = 149/320 (46%), Positives = 188/320 (58%), Gaps = 42/320 (13%)
 Frame = +1

Query: 49  MKREYA--EENQDRSNIK----MISKFHQTKKFHQNPQ------PPF------------- 153
           MK EY   EE +D S+      M  K H  + F QNP       P F             
Sbjct: 1   MKGEYVQLEEKKDHSSNSSRNSMFGKLHPPQNFQQNPSHHHFHHPSFQISRECQNSEEAD 60

Query: 154 STPVEENRXXXXXXXXXXXXXX------DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNA 315
           ST  ++N                     DG TIEV          SKN+ KPPVIIT++A
Sbjct: 61  STTADKNDALTPQPVSAIAPPPPPPPSSDGATIEVVRRPRGRPPGSKNRPKPPVIITRDA 120

Query: 316 EPSMAPYVLELPPGIDVIEAITRFCRKRKMGLCVLNGNGAVSNVTLKQPSTTG-ATVTFH 492
           EPSM+PY+LE+P G+D+I +IT+FCR R MGLCVLNG+G V+NVTL+QPSTT  +TVTFH
Sbjct: 121 EPSMSPYILEIPIGVDIINSITKFCRTRNMGLCVLNGSGTVTNVTLRQPSTTPVSTVTFH 180

Query: 493 GRFNILSLSATILPVNVPT-SLTTALANGFAISLAXXXXXXXXXXXXXXXXSAGTIYLIS 669
           GRF+ILS+SATI+  N    S    +ANGF ISLA                +AGT+YL++
Sbjct: 181 GRFDILSISATIVQPNASVPSNNNGIANGFTISLAGPQGQVVGGGVVGPLVTAGTVYLVA 240

Query: 670 ATFNSPSYHRLPMEDDATDSASGGEANDGR-HQSPPPAVSCA---DSGHPPA----ESCG 825
           ATFNSPS+H+LP+E++   ++ GG  N+G   QSPPP    +   DSGHPP     ESCG
Sbjct: 241 ATFNSPSFHKLPVEEELGRNSGGGGGNEGSGRQSPPPHTVVSGGEDSGHPPTTTAPESCG 300

Query: 826 ISIYS-YQPSDVIWAPTARQ 882
           +S+YS + PSDVIWAPTARQ
Sbjct: 301 MSMYSCHLPSDVIWAPTARQ 320


>ref|XP_004237987.1| PREDICTED: putative DNA-binding protein ESCAROLA [Solanum
           lycopersicum]
          Length = 336

 Score =  237 bits (605), Expect = 9e-60
 Identities = 126/228 (55%), Positives = 156/228 (68%), Gaps = 7/228 (3%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT++AEPSM+PY+LE+P G+D+I ++T+FCRKR
Sbjct: 103 DGATIEVVRRPRGRPPGSKNKPKPPVIITRDAEPSMSPYILEIPTGVDIINSVTKFCRKR 162

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTTG-ATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            MGLCVLNG+G V+NVTL+QPSTT  +TVTFHGRF+ILS+SAT++  N        +ANG
Sbjct: 163 NMGLCVLNGSGTVTNVTLRQPSTTPVSTVTFHGRFDILSISATVVQPNANIPSNNGIANG 222

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+YLI+ATFN PS+HRLP E++   + SGG   DG
Sbjct: 223 FTISLAGPQGQVVGGGVVGPLVTAGTVYLIAATFNGPSFHRLPAEEELARNNSGGGNEDG 282

Query: 757 RHQSPPPAVS-CADSGHPPA----ESCGISIYS-YQPSDVIWAPTARQ 882
                   VS   D GHPP+    ESCG+S+YS + PSDVIWAPTARQ
Sbjct: 283 SSPQQHAEVSGGGDGGHPPSTTAPESCGMSMYSCHLPSDVIWAPTARQ 330


>ref|XP_002285689.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
          Length = 309

 Score =  237 bits (604), Expect = 1e-59
 Identities = 131/223 (58%), Positives = 162/223 (72%), Gaps = 2/223 (0%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT++ EP+M+PYVLE+P G+D++EAI RF R+R
Sbjct: 91  DGATIEVVRRPRGRPPGSKNKPKPPVIITRDTEPAMSPYVLEVPGGVDIVEAIARFSRRR 150

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +GLCVLNG+G V+NVTL+QPSTT GATVTFHGRF+ILS+SATI+P +  + + ++ ANG
Sbjct: 151 NIGLCVLNGSGTVANVTLRQPSTTPGATVTFHGRFDILSISATIIPQSASSPIPSS-ANG 209

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+Y+I+A+FN+PSYHRLP ED+  +S SGG  NDG
Sbjct: 210 FTISLAGPQGQIVGGSVAGTLLAAGTVYVIAASFNNPSYHRLPGEDEVPNSGSGG--NDG 267

Query: 757 RHQSPPPAVSCADSGHPPAESCGISIYS-YQPSDVIWAPTARQ 882
             QSPP      DSGHPPAE   +SIYS + PSDVIWAPTARQ
Sbjct: 268 --QSPP--TGSGDSGHPPAE---MSIYSCHLPSDVIWAPTARQ 303


>emb|CAN74013.1| hypothetical protein VITISV_003550 [Vitis vinifera]
          Length = 417

 Score =  237 bits (604), Expect = 1e-59
 Identities = 131/223 (58%), Positives = 162/223 (72%), Gaps = 2/223 (0%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT++ EP+M+PYVLE+P G+D++EAI RF R+R
Sbjct: 199 DGATIEVVRRPRGRPPGSKNKPKPPVIITRDTEPAMSPYVLEVPGGVDIVEAIARFSRRR 258

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +GLCVLNG+G V+NVTL+QPSTT GATVTFHGRF+ILS+SATI+P +  + + ++ ANG
Sbjct: 259 NIGLCVLNGSGTVANVTLRQPSTTPGATVTFHGRFDILSISATIIPQSASSPIPSS-ANG 317

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+Y+I+A+FN+PSYHRLP ED+  +S SGG  NDG
Sbjct: 318 FTISLAGPQGQIVGGSVAGTLLAAGTVYVIAASFNNPSYHRLPGEDEVPNSGSGG--NDG 375

Query: 757 RHQSPPPAVSCADSGHPPAESCGISIYS-YQPSDVIWAPTARQ 882
             QSPP      DSGHPPAE   +SIYS + PSDVIWAPTARQ
Sbjct: 376 --QSPP--TGSGDSGHPPAE---MSIYSCHLPSDVIWAPTARQ 411


>emb|CDP11496.1| unnamed protein product [Coffea canephora]
          Length = 329

 Score =  229 bits (583), Expect = 3e-57
 Identities = 142/326 (43%), Positives = 177/326 (54%), Gaps = 48/326 (14%)
 Frame = +1

Query: 49  MKREYAEENQDRSNIKMISKFHQTKKFHQNPQPPFSTPV--------------------- 165
           MK +Y +E +D  +  M +K HQT+KF  +P PP   P                      
Sbjct: 1   MKGDYVKEEKDGHSNPMFAKLHQTQKFLHHPPPPQPQPTTLLHHSFNPRECQASEEVDSH 60

Query: 166 ---------EENRXXXXXXXXXXXXXXDGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAE 318
                                      DG TIEV          SKNK KPPVIIT+ AE
Sbjct: 61  HRSPTTPSASAATTSKTQTLVTPSSGNDGATIEVVRRPRGRPPGSKNKPKPPVIITREAE 120

Query: 319 PSMAPYVLELPPGIDVIEAITRFCRKRKMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHG 495
           PSM+PYVLE+  G+D++E +T FCRKR  GLC+LNG+G VSNVTL+QPSTT GA+VTFHG
Sbjct: 121 PSMSPYVLEISGGVDILETVTTFCRKRSTGLCILNGSGTVSNVTLRQPSTTPGASVTFHG 180

Query: 496 RFNILSLSATILPVN-----VPTSLTTALANGFAISLAXXXXXXXXXXXXXXXXSAGTIY 660
           RF+ILSLSATILP N        +L+  + NGF ISLA                +AGT+Y
Sbjct: 181 RFDILSLSATILPQNSHSFSTSAALSNGIGNGFTISLAGPQGQVVGGTVVGSLFTAGTVY 240

Query: 661 LISATFNSPSYHRLPMEDDATDSASGGEANDGRHQSPPPAVS-----CADSGHPPA---- 813
           LI+A+FNSPS+HRLP+ED+    ++   A      S  P VS         G  PA    
Sbjct: 241 LIAASFNSPSFHRLPLEDERNSGSAAAAAG-----SEGPTVSGGGGGDGGGGRSPAQGGG 295

Query: 814 -ESC-GISIYS-YQPSDVIWAPTARQ 882
            +SC G+S+YS + PSDVIWAPTARQ
Sbjct: 296 VDSCSGVSLYSCHLPSDVIWAPTARQ 321


>ref|XP_010107687.1| hypothetical protein L484_007706 [Morus notabilis]
           gi|587929504|gb|EXC16660.1| hypothetical protein
           L484_007706 [Morus notabilis]
          Length = 301

 Score =  226 bits (577), Expect = 2e-56
 Identities = 125/226 (55%), Positives = 161/226 (71%), Gaps = 5/226 (2%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKN+ KPPVIIT++ EP+M+PY+LE+P G DV++AI  FCR++
Sbjct: 72  DGATIEVVRRPRGRPPGSKNRPKPPVIITRDTEPAMSPYILEVPGGNDVVDAIATFCRRK 131

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            MGLCVL G+G V+NVTL+QPSTT GATVTFHGRF+ILS++AT LP + P   ++AL NG
Sbjct: 132 NMGLCVLTGSGTVANVTLRQPSTTPGATVTFHGRFDILSVTATFLPQSAPHG-SSALPNG 190

Query: 577 -FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLP-MEDDATDSASGGEAN 750
            F ISLA                +AGT+Y+++A+FN+PSYHRLP  ED+  +SA+   + 
Sbjct: 191 AFTISLAGPQGQIVGGLVAGALLAAGTVYVVAASFNNPSYHRLPAAEDEGRNSAATAASG 250

Query: 751 DGRHQSPPPAVSCADS-GHPPAESCGISIYSYQ-PSDVIWAPTARQ 882
           +G  QSPP +    DS GH PA+SCG+S+YS Q PSDVIWAPTARQ
Sbjct: 251 EG--QSPPGSGGGGDSGGHAPADSCGMSMYSCQLPSDVIWAPTARQ 294


>ref|XP_007226323.1| hypothetical protein PRUPE_ppa022234mg [Prunus persica]
           gi|462423259|gb|EMJ27522.1| hypothetical protein
           PRUPE_ppa022234mg [Prunus persica]
          Length = 296

 Score =  221 bits (564), Expect = 5e-55
 Identities = 116/225 (51%), Positives = 157/225 (69%), Gaps = 4/225 (1%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT+++EP M+PY+LE+P G D++EA++RFC ++
Sbjct: 71  DGATIEVIRRPRGRPPGSKNKPKPPVIITRDSEPPMSPYILEVPGGSDIVEAVSRFCCRK 130

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +GLC+L G+G V+NVTL+QPSTT GATVTFHGRF+ILS+SAT LP   P S   ++ +G
Sbjct: 131 NIGLCILTGSGTVANVTLRQPSTTPGATVTFHGRFDILSISATFLPQTTP-SCPVSVPSG 189

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+Y+I+A+FN+PSYHRLP ED+A  ++  G+A+  
Sbjct: 190 FTISLAGPQGQIVGGLVAGALVAAGTVYVIAASFNNPSYHRLPGEDEAVRNSGSGDAH-- 247

Query: 757 RHQSPPPAVSCADSGH--PPAESCGISIYS-YQPSDVIWAPTARQ 882
              SPP +      GH  P ++SCG+S+YS + P+DV+WAPTARQ
Sbjct: 248 ---SPPLSGGVESGGHAPPSSQSCGMSMYSCHLPTDVLWAPTARQ 289


>ref|XP_007018209.1| AT-hook DNA-binding family protein [Theobroma cacao]
           gi|508723537|gb|EOY15434.1| AT-hook DNA-binding family
           protein [Theobroma cacao]
          Length = 308

 Score =  221 bits (562), Expect = 9e-55
 Identities = 133/307 (43%), Positives = 173/307 (56%), Gaps = 30/307 (9%)
 Frame = +1

Query: 49  MKREYAEENQDRSNIKMISKFHQTKKFHQNPQPPFS------------------------ 156
           MK EY E   +  N  M SK H + + HQ+   PFS                        
Sbjct: 1   MKGEYVETKNENPN-NMFSKLHHSHQQHQHQNHPFSHHFQLSRDSQTPDSEDTSRTTTPT 59

Query: 157 --TPVEENRXXXXXXXXXXXXXXDGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMA 330
              P   +               DG TIEV          SKNK KPPVIIT+  EP+M+
Sbjct: 60  TKDPTTNHNSTLPSGGGGGTSGGDGATIEVIRRPRGRPPGSKNKPKPPVIITREPEPAMS 119

Query: 331 PYVLELPPGIDVIEAITRFCRKRKMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNI 507
           PY+LE+P G D++EAI+RF R++ +G+CVL G+G VSNVTL+Q STT GAT+TFHGRF+I
Sbjct: 120 PYILEIPGGNDIVEAISRFSRRKNIGICVLTGSGTVSNVTLRQLSTTPGATITFHGRFDI 179

Query: 508 LSLSATILPVNVPTSLTTALANGFAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSP 687
           LSLSAT L    P S +  + N F+ISLA                +AGT+++++ATFN+P
Sbjct: 180 LSLSATFL----PQSTSCHMPNTFSISLAGPQGQIVGGFVAGSLVAAGTVFIVAATFNNP 235

Query: 688 SYHRLPMEDDATDSASGGEANDGRHQSPPPAVSCADSGH-PPAESCGISIYSYQ--PSDV 858
           SYHRLP E++A ++ S G   +G  QSPP +    DSGH    +SCG+S+YS     SDV
Sbjct: 236 SYHRLPGEEEARNTVSSGGGGEG--QSPPLSGGGGDSGHGGGVDSCGVSMYSCHLGGSDV 293

Query: 859 IWAPTAR 879
           IWAPTAR
Sbjct: 294 IWAPTAR 300


>ref|XP_004148420.1| PREDICTED: AT-hook motif nuclear-localized protein 17 [Cucumis
           sativus] gi|700207639|gb|KGN62758.1| hypothetical
           protein Csa_2G370580 [Cucumis sativus]
          Length = 286

 Score =  221 bits (562), Expect = 9e-55
 Identities = 128/288 (44%), Positives = 171/288 (59%), Gaps = 10/288 (3%)
 Frame = +1

Query: 49  MKREYAEENQDRSNIKMISKFHQTKK---FHQNPQPPFSTPVEENRXXXXXXXXXXXXXX 219
           MK ++A      SN  M+SKFH +      H  PQPP   P+                  
Sbjct: 1   MKGDFAHPKSKTSN--MLSKFHLSPHPFTHHPPPQPPVDEPIAA-LPSPFKHHTDLTSTA 57

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPP+++T+  EP+M PYVLE+P G DV+EAI+RF R++
Sbjct: 58  DGSTIEVVRRPRGRPPGSKNKPKPPLVVTREPEPAMRPYVLEVPGGNDVVEAISRFSRRK 117

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +GLCVLNG+G V+NV+L+QPS T GATVTFHGRF ILS+SAT+ P + P      L NG
Sbjct: 118 NLGLCVLNGSGTVANVSLRQPSATPGATVTFHGRFEILSISATVFPQSTP----LPLPNG 173

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F+ISLA                +AGT+++++++FN+P YHRLP E++  +  SGG +  G
Sbjct: 174 FSISLAGPQGQIVGGLVAGALIAAGTVFVVASSFNNPFYHRLPDEEEIKNLGSGGGSGGG 233

Query: 757 RHQSPPPAVSCADSGH-----PPAESCGISIYS-YQPSDVIWAPTARQ 882
              SP  +     SG        AE+CG+++YS + PSDVIWAPTARQ
Sbjct: 234 EVHSPHVSGGGDSSGQGHGHGQIAETCGMAMYSCHAPSDVIWAPTARQ 281


>ref|XP_010277472.1| PREDICTED: putative DNA-binding protein ESCAROLA [Nelumbo nucifera]
          Length = 313

 Score =  220 bits (561), Expect = 1e-54
 Identities = 122/223 (54%), Positives = 156/223 (69%), Gaps = 2/223 (0%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT++ E +M P VLE+P G+DV++AI+RF R+R
Sbjct: 91  DGATIEVVRRPRGRPPGSKNKPKPPVIITRDTECAMRPQVLEVPGGLDVVDAISRFSRRR 150

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +G+CVL+G+G V+NVTL+QPST  GATVTFHGRF+ILS+SAT LP +  TSL +++ NG
Sbjct: 151 NLGVCVLSGSGTVANVTLRQPSTNPGATVTFHGRFDILSISATFLPPS-STSLPSSV-NG 208

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+++++ATF++PSYHRLP+ED+  +S SG      
Sbjct: 209 FTISLAGPQGQILGGSVVGSLLAAGTVFIVAATFSNPSYHRLPLEDEVPNSLSGNAG--- 265

Query: 757 RHQSPPPAVSCADSGHPPAESCGISIYS-YQPSDVIWAPTARQ 882
             QSP P      S  PPAESCG+SIYS + PSDVIWAPTARQ
Sbjct: 266 --QSPSPPGGGEGSHPPPAESCGMSIYSCHLPSDVIWAPTARQ 306


>ref|XP_004299724.1| PREDICTED: putative DNA-binding protein ESCAROLA [Fragaria vesca
           subsp. vesca]
          Length = 314

 Score =  220 bits (561), Expect = 1e-54
 Identities = 118/225 (52%), Positives = 157/225 (69%), Gaps = 4/225 (1%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT+++EP+M+PY+LE+P G D+++A++RF  ++
Sbjct: 91  DGATIEVVRRPRGRPPGSKNKPKPPVIITRDSEPAMSPYILEVPGGSDIVDAVSRFSCRK 150

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +GL +L G+G V+NVTL+QPS+T GATVTFHGRF+ILS+SAT LP    T+ T  + NG
Sbjct: 151 NIGLVILTGSGTVANVTLRQPSSTPGATVTFHGRFDILSISATFLP---QTTATGPIPNG 207

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMED--DATDSASGGEAN 750
           F ISLA                +AGT+Y+I+A+FN+PSYHRLP+ED     +S SGGEA 
Sbjct: 208 FTISLAGPQGQIVGGLVAGALIAAGTVYIIAASFNNPSYHRLPVEDVEAPRNSVSGGEA- 266

Query: 751 DGRHQSPPPAVSCADSGHPPAESCGISIYS-YQPSDVIWAPTARQ 882
               QSPP +      GH PA+SCG+++YS + P+DVIWAPTARQ
Sbjct: 267 ----QSPPLSTGGESGGHAPAQSCGMAMYSCHLPTDVIWAPTARQ 307


>ref|XP_008219285.1| PREDICTED: putative DNA-binding protein ESCAROLA [Prunus mume]
          Length = 323

 Score =  219 bits (559), Expect = 2e-54
 Identities = 114/225 (50%), Positives = 156/225 (69%), Gaps = 4/225 (1%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT+++EP M+PY+LE+P G D++EA++RFC ++
Sbjct: 98  DGATIEVIRRPRGRPPGSKNKPKPPVIITRDSEPPMSPYILEVPGGSDIVEAVSRFCCRK 157

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +GLC+L G+G V+NVTL+QPSTT GATVTFHGRF+ILS+SAT LP   P S   ++ +G
Sbjct: 158 NIGLCILTGSGTVANVTLRQPSTTPGATVTFHGRFDILSISATFLPQTTP-SCPVSVPSG 216

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+Y+I+A+FN+PSYHRLP ED+   ++  G+A+  
Sbjct: 217 FTISLAGPQGQIVGGLVAGALVAAGTVYVIAASFNNPSYHRLPGEDEGVRNSGSGDAH-- 274

Query: 757 RHQSPPPAVSCADSGH--PPAESCGISIYS-YQPSDVIWAPTARQ 882
              SPP +      GH  P ++SCG+S+YS + P+D++WAPTARQ
Sbjct: 275 ---SPPLSGGVESGGHAPPSSQSCGMSMYSCHLPTDILWAPTARQ 316


>ref|XP_003522748.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
          Length = 280

 Score =  219 bits (557), Expect = 3e-54
 Identities = 131/284 (46%), Positives = 167/284 (58%), Gaps = 7/284 (2%)
 Frame = +1

Query: 49  MKREYAEENQDRSNIK----MISKFHQTKKFHQNPQPPFSTPVEE-NRXXXXXXXXXXXX 213
           MK EY E+ Q     +    M SK     + H  P  PF    E+               
Sbjct: 1   MKGEYVEQQQQHPKSETPPSMFSKLQP--QHHPFPHHPFQLSAEDATTITPSTAQKANSS 58

Query: 214 XXDGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCR 393
             DG TIEV          SKNK KPPVIIT++ EP+M+PY+LE+  G DV+EAI +F  
Sbjct: 59  GGDGATIEVVRRPRGRPPGSKNKPKPPVIITRDPEPAMSPYILEVSGGNDVVEAIAQFSH 118

Query: 394 KRKMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALA 570
           ++ MG+CVL G+G V+NVTL+QPSTT G TVTFHGRF+ILS+SAT LP    +  + A+ 
Sbjct: 119 RKNMGICVLTGSGTVANVTLRQPSTTPGTTVTFHGRFDILSVSATFLPQQ--SGASPAVP 176

Query: 571 NGFAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEAN 750
           NGFAISLA                +AGT+++I+A+FN+P+YHRLP E++      G  A 
Sbjct: 177 NGFAISLAGPQGQIVGGLVAGGLMAAGTVFVIAASFNNPAYHRLPPEEE------GASAG 230

Query: 751 DGRHQSPPPAVSCADSGHPPAESCGISIYS-YQPSDVIWAPTAR 879
           DG     PP     DSGH  AESCG+S+YS + PSDVIWAPTAR
Sbjct: 231 DGH---SPPVSGGGDSGHGQAESCGMSMYSCHLPSDVIWAPTAR 271


>ref|XP_010266082.1| PREDICTED: putative DNA-binding protein ESCAROLA [Nelumbo nucifera]
          Length = 304

 Score =  218 bits (555), Expect = 6e-54
 Identities = 121/224 (54%), Positives = 157/224 (70%), Gaps = 3/224 (1%)
 Frame = +1

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPPVIIT++AE +M P VLE+P G+DV++AI+RF R+R
Sbjct: 83  DGATIEVVRRPRGRPPGSKNKPKPPVIITRDAECAMRPQVLEVPGGLDVVDAISRFVRRR 142

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +G+CVL G+G V+NV L+QPST  GATVTFHGRF+ILS+SAT LP +  TSL +++ NG
Sbjct: 143 NIGVCVLTGSGTVANVILRQPSTNPGATVTFHGRFDILSISATFLPPSA-TSLPSSV-NG 200

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           FAISLA                +AGT+++++ATF++PSYHRLP+ED+  +S +G   +  
Sbjct: 201 FAISLAGPQGQILGGSVVGPLLAAGTVFVVAATFSNPSYHRLPIEDEIPNSVTGNAGHS- 259

Query: 757 RHQSPPPAVSCADSGHPP-AESCGISIYS-YQPSDVIWAPTARQ 882
                PPA    +  HPP AESCG+SIYS + PSDVIWAPTARQ
Sbjct: 260 -----PPAPGGGEGSHPPSAESCGMSIYSCHLPSDVIWAPTARQ 298


>ref|XP_012838878.1| PREDICTED: AT-hook motif nuclear-localized protein 17-like
           [Erythranthe guttatus] gi|604331612|gb|EYU36470.1|
           hypothetical protein MIMGU_mgv1a010537mg [Erythranthe
           guttata]
          Length = 309

 Score =  218 bits (555), Expect = 6e-54
 Identities = 129/223 (57%), Positives = 146/223 (65%), Gaps = 3/223 (1%)
 Frame = +1

Query: 223 GDTIEVSXXXXXXXXXSKNKLKPPVIITQNA-EPSMAPYVLELPPGIDVIEAITRFCRKR 399
           G TIEV          SKNK KPPVIIT+++ EPSM+PYVLELPPG+DVIE+   FCRKR
Sbjct: 92  GATIEVVRRPRGRPPGSKNKAKPPVIITRDSSEPSMSPYVLELPPGVDVIESTASFCRKR 151

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            MGL VLNGNG V+NVT+KQPSTT GATVTFHGRF+ILS+SATILP          + NG
Sbjct: 152 NMGLSVLNGNGVVANVTIKQPSTTPGATVTFHGRFDILSISATILPAG-----ALPVGNG 206

Query: 577 -FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEAND 753
            F ISLA                SAGTIYLI+A+FN PS+ RL +  D  DSA+  E   
Sbjct: 207 SFTISLAGPQGQVVGGHVVGPLISAGTIYLIAASFNRPSFDRLQLAVDHADSAA-IEGGQ 265

Query: 754 GRHQSPPPAVSCADSGHPPAESCGISIYSYQPSDVIWAPTARQ 882
             H+  P AVS  DSG       G  IYSYQPSDVIWAPTARQ
Sbjct: 266 NHHRDSPKAVSGGDSG-------GTPIYSYQPSDVIWAPTARQ 301


>ref|XP_008444938.1| PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo]
          Length = 288

 Score =  218 bits (554), Expect = 7e-54
 Identities = 127/290 (43%), Positives = 169/290 (58%), Gaps = 12/290 (4%)
 Frame = +1

Query: 49  MKREYAEENQDRSNIKMISKFHQTKK---FHQNPQPPFSTPVEENRXXXXXXXXXXXXXX 219
           MK ++A      SN  M+SKFH +      H  PQPP   P                   
Sbjct: 1   MKGDFAHPKSKTSN--MLSKFHLSPHPFTHHPPPQPPADEPAAA-MPSPFKHHPDLTSTA 57

Query: 220 DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITRFCRKR 399
           DG TIEV          SKNK KPP+++T+  EP+M PYVLE+P G DV+EAI+RF R++
Sbjct: 58  DGSTIEVVRRPRGRPPGSKNKPKPPLVVTREPEPAMRPYVLEVPGGNDVVEAISRFSRRK 117

Query: 400 KMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTTALANG 576
            +GLCVLNG+G V+NV+L+QPS T GATVTFHGRF ILS+SAT+ P + P      + NG
Sbjct: 118 NLGLCVLNGSGTVANVSLRQPSATPGATVTFHGRFEILSISATVFPQSTP----LPIPNG 173

Query: 577 FAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGGEANDG 756
           F ISLA                +AGT+++++++FN+P YHRLP E++  +  SGG +  G
Sbjct: 174 FTISLAGPQGQIVGGLVAGALIAAGTVFVVASSFNNPLYHRLPDEEEIKNLGSGGGSGGG 233

Query: 757 RHQSPPPAVSCADSGH-------PPAESCGISIYS-YQPSDVIWAPTARQ 882
              SP  +     SG          AE+CG+++YS + PSDVIWAPTARQ
Sbjct: 234 EVHSPHVSGGGDSSGQGHGHGHGQIAETCGMAMYSCHAPSDVIWAPTARQ 283


>ref|XP_003526530.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max]
          Length = 284

 Score =  217 bits (553), Expect = 1e-53
 Identities = 132/287 (45%), Positives = 169/287 (58%), Gaps = 10/287 (3%)
 Frame = +1

Query: 49  MKREYAEENQD--RSNIKMISKFHQTK-KFHQNPQPPFSTPVEENRXXXXXXXXXXXXXX 219
           MK EY E+ Q   +S     S F + + + H  P  PF    EE                
Sbjct: 1   MKGEYLEQQQQHPKSETTPPSMFSKLQPQHHPFPHHPFQLSAEEENRVGAATTPSTVQKA 60

Query: 220 -----DGDTIEVSXXXXXXXXXSKNKLKPPVIITQNAEPSMAPYVLELPPGIDVIEAITR 384
                DG TIEV          SKNK KPPVIIT++ EP+M+PY+LE+  G DV+EAI +
Sbjct: 61  NSSGGDGATIEVVRRPRGRPPGSKNKPKPPVIITRDPEPAMSPYILEVSGGNDVVEAIAQ 120

Query: 385 FCRKRKMGLCVLNGNGAVSNVTLKQPSTT-GATVTFHGRFNILSLSATILPVNVPTSLTT 561
           F R++ MG+CVL G+G V+NVTL+QPSTT G TVTFHGRF+ILS+SAT LP    +  + 
Sbjct: 121 FSRRKNMGICVLTGSGTVANVTLRQPSTTPGTTVTFHGRFDILSVSATFLPQQ--SGASP 178

Query: 562 ALANGFAISLAXXXXXXXXXXXXXXXXSAGTIYLISATFNSPSYHRLPMEDDATDSASGG 741
           A+ NGFAISLA                +AGT+++I+A+FN+P+YHRLP E++      G 
Sbjct: 179 AVPNGFAISLAGPQGQIVGGLVAGGLMAAGTVFVIAASFNNPAYHRLPPEEE------GA 232

Query: 742 EANDGRHQSPPPAVSCADSGHPPAESCGISIYS-YQPSDVIWAPTAR 879
            A DG     P      DSGH  AESCG+S+YS + PSDVIWAPTAR
Sbjct: 233 SAGDGH---SPQVSGGGDSGHGQAESCGMSMYSCHLPSDVIWAPTAR 276

