BLASTX nr result
ID: Zanthoxylum22_contig00042046
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00042046
         (1011 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_012083556.1| PREDICTED: AT-hook motif nuclear-localized p...  247  1e-62
ref|XP_011034536.1| PREDICTED: putative DNA-binding protein ESCA...  245  5e-62
ref|XP_002511734.1| ESC, putative [Ricinus communis] gi|22354891...  245  5e-62
ref|XP_006445233.1| hypothetical protein CICLE_v10024070mg [Citr...  242  3e-61
gb|KDO85858.1| hypothetical protein CISIN_1g047830mg [Citrus sin...  242  4e-61
ref|XP_002320066.1| DNA-binding family protein [Populus trichoca...  242  4e-61
ref|XP_002279636.1| PREDICTED: putative DNA-binding protein ESCA...  241  9e-61
ref|XP_011017502.1| PREDICTED: putative DNA-binding protein ESCA...  237  1e-59
ref|XP_002301300.1| DNA-binding family protein [Populus trichoca...  237  1e-59
ref|XP_002281296.1| PREDICTED: putative DNA-binding protein ESCA...  235  5e-59
ref|XP_013445011.1| AT hook motif DNA-binding family protein [Me...  234  6e-59
ref|XP_007052032.1| AT-hook motif nuclear-localized protein 22 [...  234  8e-59
ref|XP_012475244.1| PREDICTED: AT-hook motif nuclear-localized p...  232  4e-58
ref|XP_007220373.1| hypothetical protein PRUPE_ppa018314mg [Prun...  231  9e-58
ref|XP_004510939.1| PREDICTED: AT-hook motif nuclear-localized p...  229  3e-57
ref|XP_004306909.1| PREDICTED: putative DNA-binding protein ESCA...  227  1e-56
ref|XP_012087074.1| PREDICTED: AT-hook motif nuclear-localized p...  226  2e-56
ref|XP_007134900.1| hypothetical protein PHAVU_010G085300g [Phas...
226 2e-56 ref|XP_003521618.1| PREDICTED: putative DNA-binding protein ESCA... 226 2e-56 ref|XP_010092850.1| hypothetical protein L484_022445 [Morus nota... 224 9e-56 >ref|XP_012083556.1| PREDICTED: AT-hook motif nuclear-localized protein 22 [Jatropha curcas] gi|643717114|gb|KDP28740.1| hypothetical protein JCGZ_14511 [Jatropha curcas] Length = 308 Score = 247 bits (630), Expect = 1e-62 Identities = 137/234 (58%), Positives = 151/234 (64%), Gaps = 11/234 (4%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PV AHG PL PPF TRD+ Q NSE+EQSGNGS +RGQKR+ D+ Sbjct: 1 MDPVAAHGRPLPPPFHTRDLHLHTHHQFQHHQQQNSEDEQSGNGSFNRGQKREHDEITTG 60 Query: 598 -------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEIS 440 EGKEL + GG E+ RRPRGRP GSKNK KPPIIITRDSANALRSHV+EI+ Sbjct: 61 TANNTSVEGKELVPANSGGEGEMGRRPRGRPAGSKNKAKPPIIITRDSANALRSHVMEIA 120 Query: 439 NGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXX 260 NGCDIM+SVS FARRRQRG+CILSGTGTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 121 NGCDIMESVSTFARRRQRGVCILSGTGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFL 180 Query: 259 XXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASFGNAAYERLPL Sbjct: 181 PPPAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 234 >ref|XP_011034536.1| PREDICTED: putative DNA-binding protein ESCAROLA [Populus euphratica] Length = 300 Score = 245 bits (625), Expect = 5e-62 Identities = 137/240 (57%), Positives = 155/240 (64%), Gaps = 8/240 (3%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PV+AHG PL PPF TRD NSE+EQSGNG L+RGQKR+ D+ Sbjct: 1 MDPVSAHGRPLPPPFHTRDFHLHQFQHHQQ---QNSEDEQSGNGDLNRGQKREHDEITNN 57 Query: 598 ----EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNGC 431 EG EL + GG EI+RRPRGRP GSKNKPKPPIIITRDSANALRSHV+EI++GC Sbjct: 58 NNTVEGLELVPSNSGGEGEISRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIASGC 117 Query: 430 DIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXXX 251 DIM+SVS FARRRQRG+CILSGTGTV+NVTL+QPA+PGAVV L GRFEI Sbjct: 118 
DIMESVSTFARRRQRGVCILSGTGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPPP 177 Query: 250 XXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPLXXXXESQVP 71 LT+Y V+MAASFGNAAYERLPL ESQ P Sbjct: 178 APPAASGLTVYLAGGQGQVIGGSVAGPLLASGPVVVMAASFGNAAYERLPLEEDIESQTP 237 >ref|XP_002511734.1| ESC, putative [Ricinus communis] gi|223548914|gb|EEF50403.1| ESC, putative [Ricinus communis] Length = 299 Score = 245 bits (625), Expect = 5e-62 Identities = 137/232 (59%), Positives = 153/232 (65%), Gaps = 9/232 (3%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQN-----NSEEEQSGNGSLSRGQKRDRD 602 M+PV AHG PL PPF TRD+ Q NSE+EQ+GNGS++RGQKR+ D Sbjct: 1 MDPVAAHGRPLPPPFHTRDLHLHPHHQFQHHHQQQQQQQNSEDEQTGNGSINRGQKREHD 60 Query: 601 D----EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNG 434 + EGKEL +GGG E+ RRPRGRP GSKNKPKPPIIITRDSANALRSHV+EI+NG Sbjct: 61 EITTPEGKELVPTTGGGDGEMTRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIANG 120 Query: 433 CDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXX 254 DIM+SVS FARRRQRG+CILSGTGTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 121 SDIMESVSTFARRRQRGVCILSGTGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLPP 180 Query: 253 XXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASFGNAAYERLPL Sbjct: 181 PAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 232 >ref|XP_006445233.1| hypothetical protein CICLE_v10024070mg [Citrus clementina] gi|568875722|ref|XP_006490941.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Citrus sinensis] gi|557547495|gb|ESR58473.1| hypothetical protein CICLE_v10024070mg [Citrus clementina] Length = 313 Score = 242 bits (618), Expect = 3e-61 Identities = 152/266 (57%), Positives = 166/266 (62%), Gaps = 22/266 (8%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQN----------NSEEEQSGNGSLSRGQ 617 MNP+TAHG PL PPF TRDI Q NSEE+ N SL+RGQ Sbjct: 1 MNPLTAHGRPLPPPFHTRDIHLQPHHHQFQQQQQHHHHHQQQQQNSEED---NSSLNRGQ 57 Query: 616 KRDRDD--------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALR 461 KRDR++ EG+ELAV +G GG EI RRPRGRP GSKNKPKPPIIITRDSANALR Sbjct: 58 
KRDRNENNIANAMEEGEELAVATGEGG-EITRRPRGRPAGSKNKPKPPIIITRDSANALR 116 Query: 460 SHVIEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIX 281 SHV+EI+NGCDIMDSVS FARRRQRGICILSGTGTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 117 SHVMEIANGCDIMDSVSNFARRRQRGICILSGTGTVTNVTLRQPASPGAVVTLHGRFEIL 176 Query: 280 XXXXXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLP 101 LTIY VIMAASFGNAAYERLP Sbjct: 177 SLAGSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLVASGPVVIMAASFGNAAYERLP 236 Query: 100 LXXXXESQVPSVLDQ----SPEINVG 35 L E+ V S+ SPE+NVG Sbjct: 237 L-EEEETPVASIPGSRPLGSPEVNVG 261 >gb|KDO85858.1| hypothetical protein CISIN_1g047830mg [Citrus sinensis] Length = 313 Score = 242 bits (617), Expect = 4e-61 Identities = 151/265 (56%), Positives = 165/265 (62%), Gaps = 21/265 (7%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQN----------NSEEEQSGNGSLSRGQ 617 MNP+TAHG PL PPF TRDI Q NSEE+ N SL+RGQ Sbjct: 1 MNPLTAHGRPLPPPFHTRDIHLQPHHHQFQQQQQHHHHHQQQQQNSEED---NSSLNRGQ 57 Query: 616 KRDRDD--------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALR 461 KRDR++ EG+ELAV +G GG EI RRPRGRP GSKNKPKPPIIITRDSANALR Sbjct: 58 KRDRNENNIANAMEEGEELAVATGEGG-EITRRPRGRPAGSKNKPKPPIIITRDSANALR 116 Query: 460 SHVIEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIX 281 SHV+EI+NGCDIMDSVS FARRRQRGICILSGTGTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 117 SHVMEIANGCDIMDSVSNFARRRQRGICILSGTGTVTNVTLRQPASPGAVVTLHGRFEIL 176 Query: 280 XXXXXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLP 101 LTIY VIMAASFGNAAYERLP Sbjct: 177 SLAGSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLVASGPVVIMAASFGNAAYERLP 236 Query: 100 L--XXXXESQVPSVLD-QSPEINVG 35 L E+ +P SPE+NVG Sbjct: 237 LEEEETPEASIPGSRPLGSPEVNVG 261 >ref|XP_002320066.1| DNA-binding family protein [Populus trichocarpa] gi|222860839|gb|EEE98381.1| DNA-binding family protein [Populus trichocarpa] Length = 300 Score = 242 bits (617), Expect = 4e-61 Identities = 137/240 (57%), Positives = 153/240 (63%), Gaps = 8/240 (3%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 
M+PV+AHG PL PPF TRD NSE+EQSGNG L+RGQKR+ D+ Sbjct: 1 MDPVSAHGRPLPPPFHTRDFHLHQFQHHQQ---QNSEDEQSGNGDLNRGQKREHDEINNN 57 Query: 598 ----EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNGC 431 EG EL S GG EI+RRPRGRP GSKNKPKPPIIITRDSANALRSHV+EI+ G Sbjct: 58 NNTVEGLELVPSSSGGEGEISRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIATGS 117 Query: 430 DIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXXX 251 DIM+SVS FARRRQRG+CILSGTGTV+NVTL+QPA+PGAVV L GRFEI Sbjct: 118 DIMESVSTFARRRQRGVCILSGTGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPPP 177 Query: 250 XXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPLXXXXESQVP 71 LT+Y V+MAASFGNAAYERLPL ESQ P Sbjct: 178 APPAASGLTVYLAGGQGQVIGGSVAGPLLASGPVVVMAASFGNAAYERLPLEEDIESQTP 237 >ref|XP_002279636.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera] gi|147867329|emb|CAN81187.1| hypothetical protein VITISV_029906 [Vitis vinifera] gi|296089162|emb|CBI38865.3| unnamed protein product [Vitis vinifera] Length = 300 Score = 241 bits (614), Expect = 9e-61 Identities = 137/233 (58%), Positives = 151/233 (64%), Gaps = 10/233 (4%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PVTAHG PL PPF TRD+ Q NSE+EQSG+ SL+R QKRDRD+ Sbjct: 1 MDPVTAHGRPLPPPFHTRDLQLHHHHQYQHHPQANSEDEQSGSSSLNRAQKRDRDESNAT 60 Query: 598 ------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISN 437 +GKE SG G EI RRPRGRP GSKNKPKPPIIITRDSANALRSHV+EI+ Sbjct: 61 NNTSPIDGKEFGTSSGDG--EITRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIAT 118 Query: 436 GCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXX 257 GCDIMDS++ FARRRQRGICILSG+GTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 119 GCDIMDSLNTFARRRQRGICILSGSGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLP 178 Query: 256 XXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASFGNAAYERLPL Sbjct: 179 PPAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 231 >ref|XP_011017502.1| PREDICTED: putative DNA-binding protein ESCAROLA [Populus euphratica] Length = 298 Score = 237 bits (604), Expect = 1e-59 Identities = 
134/232 (57%), Positives = 149/232 (64%), Gaps = 9/232 (3%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PV AHG PL PPF TRD NSE+EQSG+G+L+RGQKR+ + Sbjct: 1 MDPVAAHGRPLPPPFHTRDFHLHQFQHQQQ---QNSEDEQSGSGNLNRGQKREHAEIATN 57 Query: 598 -----EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNG 434 EGKEL S GG EI RRPRGRP GSKNKPKPPIIITRDSANALRSHV+EI++G Sbjct: 58 NNNTAEGKELVPSSAGGEGEITRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIASG 117 Query: 433 CDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXX 254 CDIM+SVS FA RRQRG+CILSGTGTV+NVTL+QPA+PGAVV L GRFEI Sbjct: 118 CDIMESVSTFAGRRQRGVCILSGTGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPP 177 Query: 253 XXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASFGNAAYERLPL Sbjct: 178 PAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 229 >ref|XP_002301300.1| DNA-binding family protein [Populus trichocarpa] gi|222843026|gb|EEE80573.1| DNA-binding family protein [Populus trichocarpa] Length = 298 Score = 237 bits (604), Expect = 1e-59 Identities = 134/232 (57%), Positives = 147/232 (63%), Gaps = 9/232 (3%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PV AHG PL PPF TRD NSE+EQSGNG+L+RGQKR+ + Sbjct: 1 MDPVAAHGRPLPPPFHTRDFHLHQFQHQQQ---QNSEDEQSGNGNLNRGQKREHAEIATN 57 Query: 598 -----EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNG 434 EGKEL S GG EI RRPRGRP GSKNKPKPPIIITRDS NALRSHV+EI+ G Sbjct: 58 NNNTAEGKELVPSSAGGEGEITRRPRGRPAGSKNKPKPPIIITRDSPNALRSHVMEIATG 117 Query: 433 CDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXX 254 CDIM+SVS FARRRQRG+CILS TGTV+NVTL+QPA+PGAVV L GRFEI Sbjct: 118 CDIMESVSTFARRRQRGVCILSATGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPP 177 Query: 253 XXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASFGNAAYERLPL Sbjct: 178 PAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 229 >ref|XP_002281296.1| PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera] Length = 302 Score = 235 bits (599), Expect = 
5e-59 Identities = 133/233 (57%), Positives = 153/233 (65%), Gaps = 10/233 (4%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQN-NSEEEQSGNGSLSRGQKRDRDD--- 599 M+PVTAHGH L PPF TRD+ Q NSE+EQSG+ L+RGQKRDRDD Sbjct: 1 MDPVTAHGHSLPPPFHTRDLHLHHQQQHQFHPQQQNSEDEQSGSSGLNRGQKRDRDDNNE 60 Query: 598 ------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISN 437 EG E+ +SG G EI+RRPRGRP GSKNKPKPPIIITRDSANALR+HV+EI++ Sbjct: 61 NTNGGSEGNEMVGLSGDG--EISRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIAD 118 Query: 436 GCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXX 257 GCDI++SV+ FARRRQRG+CI+SGTGTV+NVTLRQPA+PGA+V L GRFEI Sbjct: 119 GCDIVESVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVTLHGRFEILSLSGSFLP 178 Query: 256 XXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASF NAAYERLPL Sbjct: 179 PPAPPAATGLTIYLAGGQGQVVGGSVVGQLLASGPVVIMAASFSNAAYERLPL 231 >ref|XP_013445011.1| AT hook motif DNA-binding family protein [Medicago truncatula] gi|657373298|gb|KEH19036.1| AT hook motif DNA-binding family protein [Medicago truncatula] Length = 308 Score = 234 bits (598), Expect = 6e-59 Identities = 131/231 (56%), Positives = 148/231 (64%), Gaps = 13/231 (5%) Frame = -3 Query: 751 AHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDDE-------- 596 A G PL PPFLTRD+ Q N EE+QSGNGSLSRGQKR+R++E Sbjct: 5 AQGRPLPPPFLTRDLHLHPHHQFHTNHQTNEEEQQSGNGSLSRGQKRERNNEDGNNTPTG 64 Query: 595 --GKE---LAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNGC 431 GK+ GG G E+ RRPRGRP GSKNKPKPPIIITRDSANALRSHV+E++NGC Sbjct: 65 GEGKDDGGSGSAGGGSGGEMGRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEVANGC 124 Query: 430 DIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXXX 251 DIM+SV++FARRRQRG+CILSG+GTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 125 DIMESVTVFARRRQRGVCILSGSGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLPPP 184 Query: 250 XXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 L IY VIMAASFGNAAYERLPL Sbjct: 185 APPAASGLAIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 235 >ref|XP_007052032.1| AT-hook motif nuclear-localized protein 22 [Theobroma cacao] 
gi|508704293|gb|EOX96189.1| AT-hook motif nuclear-localized protein 22 [Theobroma cacao] Length = 326 Score = 234 bits (597), Expect = 8e-59 Identities = 137/240 (57%), Positives = 153/240 (63%), Gaps = 15/240 (6%) Frame = -3 Query: 772 KDMNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQN-NSEEEQSGNGSLSRGQKRDRDD- 599 K+M+PVTAHG PL PPFLTRD+ Q NSEEEQ+ RGQKRDR++ Sbjct: 21 KEMDPVTAHGRPLPPPFLTRDLHLNPHHQFQHHHQQENSEEEQN------RGQKRDREET 74 Query: 598 -------------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRS 458 EGKELA++ G G EI RRPRGRP+GSKNKPKPPIIITRDSANALRS Sbjct: 75 ATTTTATATTDTSEGKELAIIPGTEG-EITRRPRGRPSGSKNKPKPPIIITRDSANALRS 133 Query: 457 HVIEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXX 278 HV+EI+NGCDIM+S+S FARRRQRG+CILSG+GTV+NVTLRQP PGAVV L GRFEI Sbjct: 134 HVMEIANGCDIMESISTFARRRQRGVCILSGSGTVTNVTLRQPGAPGAVVTLHGRFEILS 193 Query: 277 XXXXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASFGNAAYERLPL Sbjct: 194 LSGSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLVASGPVVIMAASFGNAAYERLPL 253 >ref|XP_012475244.1| PREDICTED: AT-hook motif nuclear-localized protein 22-like [Gossypium raimondii] gi|763757461|gb|KJB24792.1| hypothetical protein B456_004G160700 [Gossypium raimondii] Length = 308 Score = 232 bits (591), Expect = 4e-58 Identities = 132/236 (55%), Positives = 147/236 (62%), Gaps = 13/236 (5%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PVTAHG PL PPFLTRD+ +++QS SRGQKRDR++ Sbjct: 1 MDPVTAHGRPLPPPFLTRDLHLNPQHQFQHHHNQQQQQQQSSEDEQSRGQKRDREETATT 60 Query: 598 ---------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIE 446 EGKELAVV G G EI RRPRGRP GSKNKPKPPIIITRDSANALRSHV+E Sbjct: 61 MGGGATDTSEGKELAVVPGAEG-EITRRPRGRPAGSKNKPKPPIIITRDSANALRSHVME 119 Query: 445 ISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXX 266 I++GCDIM+SVS FARRRQRG+ ILSG+GTV+NVTLRQP PGAVV L GRFEI Sbjct: 120 IADGCDIMESVSTFARRRQRGVSILSGSGTVTNVTLRQPGAPGAVVTLHGRFEILSLSGS 179 Query: 265 XXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY 
V+MAASFGNAAYERLPL Sbjct: 180 FLPPPAPPAASGLTIYLAGGQGQVVGGAVVGPLVASGPVVMMAASFGNAAYERLPL 235 >ref|XP_007220373.1| hypothetical protein PRUPE_ppa018314mg [Prunus persica] gi|462416835|gb|EMJ21572.1| hypothetical protein PRUPE_ppa018314mg [Prunus persica] Length = 312 Score = 231 bits (588), Expect = 9e-58 Identities = 136/254 (53%), Positives = 154/254 (60%), Gaps = 15/254 (5%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNN---SEEEQS---GNGSLSRGQKRDR 605 M+PV AHG PL PPFL+RD+ +N SE+EQ+ G G +SRG KRDR Sbjct: 1 MDPVAAHGRPLPPPFLSRDLHLHPHHQFQHHLHHNNHNSEDEQNSSGGGGLISRGIKRDR 60 Query: 604 DD---------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHV 452 D+ EGKEL S G G EI RRPRGRP GSKNK KPPIIITRDSANALRSHV Sbjct: 61 DENTSAATTSLEGKELGSTSAGEG-EITRRPRGRPAGSKNKAKPPIIITRDSANALRSHV 119 Query: 451 IEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXX 272 +E++NGCDIMDSVS FARRRQRG+CILSG+GTV+NVT+RQPA+PG+VV L GRFEI Sbjct: 120 MEVANGCDIMDSVSTFARRRQRGVCILSGSGTVTNVTIRQPASPGSVVTLHGRFEILSLS 179 Query: 271 XXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPLXX 92 LTIY VIMAASFGNAAYERLPL Sbjct: 180 GSFLPPPAPPAASGLTIYLAGVQGQVVGGGVVGPLLASGPVVIMAASFGNAAYERLPLEE 239 Query: 91 XXESQVPSVLDQSP 50 + V P Sbjct: 240 EEPAAAGQVQGSGP 253 >ref|XP_004510939.1| PREDICTED: AT-hook motif nuclear-localized protein 22-like [Cicer arietinum] Length = 305 Score = 229 bits (583), Expect = 3e-57 Identities = 129/232 (55%), Positives = 147/232 (63%), Gaps = 14/232 (6%) Frame = -3 Query: 751 AHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD--------- 599 A G PL PPFLTRD+ N +E+QSGNG+LSRGQKRDR++ Sbjct: 5 AQGRPLPPPFLTRDLHLHHQFHTNHQA--NEDEQQSGNGNLSRGQKRDRNNNNDDNNTPT 62 Query: 598 --EGKE---LAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNG 434 EGK+ GG G E+ RRPRGRP GSKNKPKPPIIITRDSANALRSHV+E++NG Sbjct: 63 GGEGKDEGGSGSAGGGSGGEMGRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEVANG 122 Query: 433 CDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXX 254 CDIM+SV++FARRRQRG+CILSG+GTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 123 
CDIMESVTVFARRRQRGVCILSGSGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLPP 182 Query: 253 XXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 L IY VIMAASFGNAAYERLPL Sbjct: 183 PAPPAASGLAIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 234 >ref|XP_004306909.1| PREDICTED: putative DNA-binding protein ESCAROLA [Fragaria vesca subsp. vesca] Length = 325 Score = 227 bits (579), Expect = 1e-56 Identities = 136/259 (52%), Positives = 153/259 (59%), Gaps = 20/259 (7%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQN-----NSEEEQSGNGSLSR--GQKRD 608 M+PV AHG PL PPFLTRD+ + NSE+EQ+ +G L+R G KRD Sbjct: 1 MDPVAAHGRPLPPPFLTRDLHLNHPHHQFQHHPHLLHHQNSEDEQNSSGGLNRHRGMKRD 60 Query: 607 RDDE------------GKELAVVSGGG-GSEINRRPRGRPTGSKNKPKPPIIITRDSANA 467 RDD K+L S G G EI RRPRGRP GSKNK KPPIIITRDSANA Sbjct: 61 RDDNTSGGNTPNSMDGNKDLLGGSNSGEGGEITRRPRGRPAGSKNKAKPPIIITRDSANA 120 Query: 466 LRSHVIEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFE 287 LRSHV+E++NGCDIMDSVS FARRRQRG+CILSG+GTV+NVTLRQPA+PGAVV L GRFE Sbjct: 121 LRSHVMEVANGCDIMDSVSTFARRRQRGVCILSGSGTVTNVTLRQPASPGAVVTLHGRFE 180 Query: 286 IXXXXXXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYER 107 I LTIY VIMAASFGNAAYER Sbjct: 181 ILSLSGSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYER 240 Query: 106 LPLXXXXESQVPSVLDQSP 50 LPL + V + P Sbjct: 241 LPLEEEDATAVTPMPGSGP 259 >ref|XP_012087074.1| PREDICTED: AT-hook motif nuclear-localized protein 24 [Jatropha curcas] gi|643712149|gb|KDP25577.1| hypothetical protein JCGZ_20733 [Jatropha curcas] Length = 295 Score = 226 bits (576), Expect = 2e-56 Identities = 128/231 (55%), Positives = 149/231 (64%), Gaps = 8/231 (3%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGS----LSRGQKRDRDD 599 M+PVTAHGH L PPF TRD NNSE+EQSG+ S L++ QKR+RD+ Sbjct: 1 MDPVTAHGHSLPPPFHTRDFQLHHQFPHHQQHNNNSEDEQSGSSSGAAGLNKSQKRERDE 60 Query: 598 ----EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSHVIEISNGC 431 EGKEL + G EI RRPRGRP GSKNKPKPPIIITRDSANALR+H++E+++GC Sbjct: 61 
GNNSEGKEL--IPTGSQGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHLMEVADGC 118 Query: 430 DIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXXXXXXXXXX 251 DI++SV+ FARRRQRG+ I+SGTGTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 119 DIVESVATFARRRQRGVSIMSGTGTVTNVTLRQPASPGAVVTLHGRFEILSLAGSFLPPP 178 Query: 250 XXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 LTIY VIMAASF NAAYERLPL Sbjct: 179 APPAATGLTIYLAGGQGQVVGGSVVGTLTASGPVVIMAASFSNAAYERLPL 229 >ref|XP_007134900.1| hypothetical protein PHAVU_010G085300g [Phaseolus vulgaris] gi|561007945|gb|ESW06894.1| hypothetical protein PHAVU_010G085300g [Phaseolus vulgaris] Length = 310 Score = 226 bits (576), Expect = 2e-56 Identities = 127/239 (53%), Positives = 147/239 (61%), Gaps = 16/239 (6%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PV A G PL PPFLTRD+ + + E+++ NG RGQKR RD+ Sbjct: 1 MDPVAAQGRPLPPPFLTRDLHLHPHHQFQPHHNHQNTEDEAANG---RGQKRGRDENTGS 57 Query: 598 ------------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSH 455 +G+E GGGGSE+ RRPRGRP GSKNKPKPPIIITRDSANALRSH Sbjct: 58 GGGATTPPHAGGDGQEPGSGDGGGGSEMGRRPRGRPAGSKNKPKPPIIITRDSANALRSH 117 Query: 454 VIEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXX 275 V+EI+NGCDIM+SV+ FARRRQRG+C+LSG+GTV+NVTLRQPA+PGAVV L GRFEI Sbjct: 118 VMEIANGCDIMESVTAFARRRQRGVCVLSGSGTVTNVTLRQPASPGAVVTLHGRFEILSL 177 Query: 274 XXXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 L IY VIMAASFGNAAYERLPL Sbjct: 178 SGSFLPPPAPPAASGLAIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPL 236 >ref|XP_003521618.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max] gi|947117002|gb|KRH65251.1| hypothetical protein GLYMA_03G022700 [Glycine max] Length = 310 Score = 226 bits (576), Expect = 2e-56 Identities = 130/239 (54%), Positives = 147/239 (61%), Gaps = 16/239 (6%) Frame = -3 Query: 766 MNPVTAHGHPLSPPFLTRDIXXXXXXXXXXXXQNNSEEEQSGNGSLSRGQKRDRDD---- 599 M+PV A G PL PPFLTRD+ N + E+++GNG RGQKRDRD+ Sbjct: 1 MDPVAAQGRPLPPPFLTRDLHLHPHHQFQPHHNNQNTEDEAGNG---RGQKRDRDENVGG 57 
Query: 598 ------------EGKELAVVSGGGGSEINRRPRGRPTGSKNKPKPPIIITRDSANALRSH 455 EGKE GGGS++ RRPRGRP GSKNKPKPPIIITRDSANALRSH Sbjct: 58 GGGATTPPHGGGEGKEPGS-EDGGGSDMGRRPRGRPAGSKNKPKPPIIITRDSANALRSH 116 Query: 454 VIEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQGRFEIXXX 275 V+EI+NGCDIM+SV+ FARRRQRGIC+LSG+GTV+NVTLRQPA+P AVV L GRFEI Sbjct: 117 VMEITNGCDIMESVTAFARRRQRGICLLSGSGTVTNVTLRQPASPSAVVTLHGRFEILSL 176 Query: 274 XXXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAAYERLPL 98 L IY VIMAASFGNAAYERLPL Sbjct: 177 SGSFLPPPAPPAASGLAIYLAGGQGQVVGGSVVGPLVASGPVVIMAASFGNAAYERLPL 235 >ref|XP_010092850.1| hypothetical protein L484_022445 [Morus notabilis] gi|587862883|gb|EXB52668.1| hypothetical protein L484_022445 [Morus notabilis] Length = 326 Score = 224 bits (571), Expect = 9e-56 Identities = 141/273 (51%), Positives = 160/273 (58%), Gaps = 29/273 (10%) Frame = -3 Query: 766 MNPVTA-HGHPLSPPFLTRDIXXXXXXXXXXXXQN---------NSEEEQSGNGSLSRGQ 617 M+P+ A HG PL PPFL+RD+ Q+ NSE+EQ+ N +LSRGQ Sbjct: 1 MDPIAAAHGRPLPPPFLSRDLHLHQFQQLQHHHQHQQQQQQQQQNSEDEQNSN-ALSRGQ 59 Query: 616 KRDRDDEG------KELAVVSGGGGSE-------INRRPRGRPTGSKNKPKPPIIITRDS 476 KRDRDD EL GGGG I RRPRGRP GSKNKPKPPIIITRDS Sbjct: 60 KRDRDDTATTTPTTSELGGGGGGGGGGAGDSMDIITRRPRGRPAGSKNKPKPPIIITRDS 119 Query: 475 ANALRSHVIEISNGCDIMDSVSIFARRRQRGICILSGTGTVSNVTLRQPATPGAVVNLQG 296 ANALRSHV+EI+N CD+MDS+S FARRRQRGIC+LSGTGTV+NVTLRQPA+PGAVV+L G Sbjct: 120 ANALRSHVMEIANACDVMDSMSAFARRRQRGICVLSGTGTVTNVTLRQPASPGAVVSLHG 179 Query: 295 RFEIXXXXXXXXXXXXXXXXXXLTIYXXXXXXXXXXXXXXXXXXXXXXXVIMAASFGNAA 116 RFEI LTIY VIMAASFGNAA Sbjct: 180 RFEILSLSGSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASGPVVIMAASFGNAA 239 Query: 115 YERLPLXXXXES------QVPSVLDQSPEINVG 35 YERLPL ++ QV L +NVG Sbjct: 240 YERLPLEEEDQANNQVSMQVSGGLGSPGSLNVG 272