BLASTX nr result
ID: Glycyrrhiza34_contig00008200
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza34_contig00008200
         (1745 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

XP_014618251.1 PREDICTED: AT-hook motif nuclear-localized protei...   285   6e-89
KHN30735.1 Putative DNA-binding protein ESCAROLA [Glycine soja]       284   2e-88
XP_014512428.1 PREDICTED: AT-hook motif nuclear-localized protei...   277   8e-86
KHN01050.1 Putative DNA-binding protein ESCAROLA [Glycine soja]       276   2e-85
KRG92293.1 hypothetical protein GLYMA_20G202300 [Glycine max]         276   3e-85
XP_007144108.1 hypothetical protein PHAVU_007G129400g, partial [...   276   5e-85
XP_014628043.1 PREDICTED: AT-hook motif nuclear-localized protei...   276   5e-85
XP_007145832.1 hypothetical protein PHAVU_007G271800g [Phaseolus...   271   1e-83
GAU47901.1 hypothetical protein TSUD_404610 [Trifolium subterran...   271   2e-83
XP_016173389.1 PREDICTED: AT-hook motif nuclear-localized protei...   271   3e-83
XP_013468659.1 AT hook motif DNA-binding family protein [Medicag...   271   8e-83
EEF33084.1 DNA binding protein, putative [Ricinus communis]           270   8e-83
OMO54799.1 hypothetical protein CCACVL1_27564 [Corchorus capsula...   269   9e-83
XP_015580943.1 PREDICTED: AT-hook motif nuclear-localized protei...   270   9e-83
OIW07651.1 hypothetical protein TanjilG_07693 [Lupinus angustifo...   267   3e-82
XP_017975810.1 PREDICTED: AT-hook motif nuclear-localized protei...   267   4e-82
EOY05966.1 AT-hook motif nuclear localized protein 20 [Theobroma...   267   4e-82
KOM33959.1 hypothetical protein LR48_Vigan02g010900 [Vigna angul...   267   5e-82
XP_014618360.1 PREDICTED: AT-hook motif nuclear-localized protei...
266 7e-82 XP_010262258.1 PREDICTED: AT-hook motif nuclear-localized protei... 266 8e-82 >XP_014618251.1 PREDICTED: AT-hook motif nuclear-localized protein 20-like [Glycine max] KRH34516.1 hypothetical protein GLYMA_10G188400 [Glycine max] Length = 290 Score = 285 bits (730), Expect = 6e-89 Identities = 171/301 (56%), Positives = 178/301 (59%) Frame = +2 Query: 611 IAGLANPWWTVQXXXXXXXXXXXXXXXXXXXXXXXKRHVXXXXXXXXXXXXXXXXYNRDN 790 +A LANPWWT Q KRH ++ DN Sbjct: 1 MATLANPWWTGQGGLSGVDHPGTHSPGLS------KRHSDLGINENSDSHNNREEFDEDN 54 Query: 791 GSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAESV 970 DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVME+ GGADVAESV Sbjct: 55 -RDEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEITGGADVAESV 112 Query: 971 AHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGST 1150 A FARRRQRGVCVLSGSGSVANVTLRQP+APGAVVALHGRFEILSLTGTFLPGPAPPGST Sbjct: 113 AQFARRRQRGVCVLSGSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPGST 172 Query: 1151 GLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXXXXX 1330 GLTVY PVM+IAATFANATYERLPL Sbjct: 173 GLTVYLAGGQGQVVGGSVVGSLVAAGPVMVIAATFANATYERLPLDEDDEGPSSMVGAQG 232 Query: 1331 XXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAWPHG 1510 QLQ GGI DPS G GQVGHEALAW HG Sbjct: 233 GGGSPPLPLGIGSSGGGQLQ-GGI--PDPS---SLPLYNLPPNGGGGGQVGHEALAWAHG 286 Query: 1511 R 1513 R Sbjct: 287 R 287 >KHN30735.1 Putative DNA-binding protein ESCAROLA [Glycine soja] Length = 290 Score = 284 bits (726), Expect = 2e-88 Identities = 170/301 (56%), Positives = 177/301 (58%) Frame = +2 Query: 611 IAGLANPWWTVQXXXXXXXXXXXXXXXXXXXXXXXKRHVXXXXXXXXXXXXXXXXYNRDN 790 +A LANPWWT Q KRH ++ DN Sbjct: 1 MATLANPWWTGQGGLSGVDHPGTHSPGLS------KRHSDLGINENSDSHNNREEFDEDN 54 Query: 791 GSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAESV 970 DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPN L+SHVME+ GGADVAESV Sbjct: 55 -RDEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNTLRSHVMEITGGADVAESV 112 Query: 971 AHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGST 1150 A 
FARRRQRGVCVLSGSGSVANVTLRQP+APGAVVALHGRFEILSLTGTFLPGPAPPGST Sbjct: 113 AQFARRRQRGVCVLSGSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPGST 172 Query: 1151 GLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXXXXX 1330 GLTVY PVM+IAATFANATYERLPL Sbjct: 173 GLTVYLAGGQGQVVGGSVVGSLVAAGPVMVIAATFANATYERLPLDEDDEGPSSMVGAQG 232 Query: 1331 XXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAWPHG 1510 QLQ GGI DPS G GQVGHEALAW HG Sbjct: 233 GGGSPPLPLGIGSSGGGQLQ-GGI--PDPS---SLPLYNLPPNGGGGGQVGHEALAWAHG 286 Query: 1511 R 1513 R Sbjct: 287 R 287 >XP_014512428.1 PREDICTED: AT-hook motif nuclear-localized protein 20-like, partial [Vigna radiata var. radiata] Length = 287 Score = 277 bits (709), Expect = 8e-86 Identities = 163/298 (54%), Positives = 172/298 (57%) Frame = +2 Query: 620 LANPWWTVQXXXXXXXXXXXXXXXXXXXXXXXKRHVXXXXXXXXXXXXXXXXYNRDNGSD 799 LANPWWT Q KRH ++ D Sbjct: 5 LANPWWTGQGGLSGVDPGTHSPGLS-------KRHTDLAINETSGGHNIEED---EDNRD 54 Query: 800 EPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAESVAHF 979 EP+EGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVME+ GGADVAESVA F Sbjct: 55 EPREGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEITGGADVAESVAQF 113 Query: 980 ARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGSTGLT 1159 ARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGSTGLT Sbjct: 114 ARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGSTGLT 173 Query: 1160 VYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXXXXXXXX 1339 VY PVM+IAATF+NATYERLPL Sbjct: 174 VYLAGGQGQVLGGSVVGPLVATGPVMVIAATFSNATYERLPLDEDDEGPSSAAAVQGGGS 233 Query: 1340 XXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAWPHGR 1513 QLQ G +P + G GQVGHEALAW HGR Sbjct: 234 PPPLGIGSGGGGGGQLQGGIPDPTSLPL-------YNLPPNGGGGQVGHEALAWGHGR 284 >KHN01050.1 Putative DNA-binding protein ESCAROLA [Glycine soja] Length = 295 Score = 276 bits (707), Expect = 2e-85 Identities = 153/243 (62%), Positives = 161/243 (66%) Frame = +2 Query: 785 DNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAE 964 ++ DEPKEGAV EVGTRRPRGRP 
GSKNKPKPPIFVTRDSPN L+SHVMEV GGADVAE Sbjct: 58 EDNRDEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNTLRSHVMEVTGGADVAE 116 Query: 965 SVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPG 1144 SVA FARRRQRGVCVLSGSGSVANVTLRQP+APGAVVALHGRFEILSLTGTFLPGPAPPG Sbjct: 117 SVAQFARRRQRGVCVLSGSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPG 176 Query: 1145 STGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXXX 1324 STGLTVY PVM+IAATFANATYERLPL Sbjct: 177 STGLTVYLTGGQGQIVGGSVVGSLVAAGPVMVIAATFANATYERLPLDEDDEGPSSAAGA 236 Query: 1325 XXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAWP 1504 QLQ G +P+ + G GQVGHEALAW Sbjct: 237 QGGGSSPPPPLGIGSSGGGQLQGGMPDPSSMPL-------YNLPPNGGVGQVGHEALAWA 289 Query: 1505 HGR 1513 HGR Sbjct: 290 HGR 292 >KRG92293.1 hypothetical protein GLYMA_20G202300 [Glycine max] Length = 303 Score = 276 bits (707), Expect = 3e-85 Identities = 153/243 (62%), Positives = 161/243 (66%) Frame = +2 Query: 785 DNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAE 964 ++ DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPN L+SHVMEV GGADVAE Sbjct: 66 EDNRDEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNTLRSHVMEVTGGADVAE 124 Query: 965 SVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPG 1144 SVA FARRRQRGVCVLSGSGSVANVTLRQP+APGAVVALHGRFEILSLTGTFLPGPAPPG Sbjct: 125 SVAQFARRRQRGVCVLSGSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPG 184 Query: 1145 STGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXXX 1324 STGLTVY PVM+IAATFANATYERLPL Sbjct: 185 STGLTVYLTGGQGQIVGGSVVGSLVAAGPVMVIAATFANATYERLPLDEDDEGPSSAAGA 244 Query: 1325 XXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAWP 1504 QLQ G +P+ + G GQVGHEALAW Sbjct: 245 QGGGSSPPPPLGIGSSGGGQLQGGMPDPSSMPL-------YNLPPNGGVGQVGHEALAWA 297 Query: 1505 HGR 1513 HGR Sbjct: 298 HGR 300 >XP_007144108.1 hypothetical protein PHAVU_007G129400g, partial [Phaseolus vulgaris] ESW16102.1 hypothetical protein PHAVU_007G129400g, partial [Phaseolus vulgaris] Length = 310 Score = 276 bits (706), Expect = 5e-85 Identities = 165/298 (55%), Positives = 
173/298 (58%) Frame = +2 Query: 620 LANPWWTVQXXXXXXXXXXXXXXXXXXXXXXXKRHVXXXXXXXXXXXXXXXXYNRDNGSD 799 LANPWWT Q KRH ++ D Sbjct: 30 LANPWWTGQGGLSGIDPGTHSPGLS-------KRHTDLVINESSGGHDIEED---EDNRD 79 Query: 800 EPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAESVAHF 979 EPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVME+ GGADVAESVA F Sbjct: 80 EPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEITGGADVAESVAQF 138 Query: 980 ARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGSTGLT 1159 ARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGSTGLT Sbjct: 139 ARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPGSTGLT 198 Query: 1160 VYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXXXXXXXX 1339 VY PVM+IAATFANATYERLPL Sbjct: 199 VYLAGGQGQVLGGSVVGPLVATGPVMVIAATFANATYERLPLDEDDEGPSSAAAVQGGGS 258 Query: 1340 XXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAWPHGR 1513 QLQ G +P+ + G GQVGHEALAW HGR Sbjct: 259 PPPLGIGSGGGV--QLQGGMPDPSSLPL-------YNLPPNVGGGQVGHEALAWAHGR 307 >XP_014628043.1 PREDICTED: AT-hook motif nuclear-localized protein 20-like [Glycine max] Length = 324 Score = 276 bits (707), Expect = 5e-85 Identities = 153/243 (62%), Positives = 161/243 (66%) Frame = +2 Query: 785 DNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAE 964 ++ DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPN L+SHVMEV GGADVAE Sbjct: 87 EDNRDEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNTLRSHVMEVTGGADVAE 145 Query: 965 SVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPG 1144 SVA FARRRQRGVCVLSGSGSVANVTLRQP+APGAVVALHGRFEILSLTGTFLPGPAPPG Sbjct: 146 SVAQFARRRQRGVCVLSGSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPG 205 Query: 1145 STGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXXX 1324 STGLTVY PVM+IAATFANATYERLPL Sbjct: 206 STGLTVYLTGGQGQIVGGSVVGSLVAAGPVMVIAATFANATYERLPLDEDDEGPSSAAGA 265 Query: 1325 XXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAWP 1504 QLQ G +P+ + G GQVGHEALAW Sbjct: 266 QGGGSSPPPPLGIGSSGGGQLQGGMPDPSSMPL-------YNLPPNGGVGQVGHEALAWA 318 Query: 1505 HGR 1513 HGR Sbjct: 
319 HGR 321 >XP_007145832.1 hypothetical protein PHAVU_007G271800g [Phaseolus vulgaris] ESW17826.1 hypothetical protein PHAVU_007G271800g [Phaseolus vulgaris] Length = 268 Score = 271 bits (693), Expect = 1e-83 Identities = 157/247 (63%), Positives = 163/247 (65%), Gaps = 3/247 (1%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 +RDNG DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPN+L+SHVMEVAGGADV Sbjct: 21 DRDNG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNSLRSHVMEVAGGADV 78 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AESVA FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAP Sbjct: 79 AESVAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP 138 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXX 1318 PG+TGLTVY PVM+IAATFANATYERLPL Sbjct: 139 PGATGLTVYLAGGQGQVVGGSVVGPLVAAGPVMIIAATFANATYERLPLEEEDEDGGGGS 198 Query: 1319 XXXXXXXXXXXXXXXXXXXXN---QLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHE 1489 + GGI DPS S G GQVGHE Sbjct: 199 VQGGSTLGGSPPGIGGGGGGDGGGSHLPGGI--PDPS-SLPLYNLPPNLLSNGGGQVGHE 255 Query: 1490 ALAWPHG 1510 A AW HG Sbjct: 256 AFAWAHG 262 >GAU47901.1 hypothetical protein TSUD_404610 [Trifolium subterraneum] Length = 291 Score = 271 bits (693), Expect = 2e-83 Identities = 154/252 (61%), Positives = 163/252 (64%), Gaps = 8/252 (3%) Frame = +2 Query: 785 DNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAE 964 D+ DEP+EGAV EVG RRPRGRP GSKN+PKPPIFVTRDSPNALKSHVMEVAGGAD+AE Sbjct: 41 DDNRDEPREGAV-EVGNRRPRGRPPGSKNRPKPPIFVTRDSPNALKSHVMEVAGGADIAE 99 Query: 965 SVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPG 1144 SVA FARRRQRGVCVLSGSG+VANVTLRQP+APGAVVALHGRFEILSLTGTFLPGPAPPG Sbjct: 100 SVAQFARRRQRGVCVLSGSGTVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPG 159 Query: 1145 STGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPL---XXXXXXXXXX 1315 STGLTVY PVMLI+ATF NATYERLPL Sbjct: 160 STGLTVYLAGGQGQVVGGCVVGTLVAVGPVMLISATFTNATYERLPLDDDDNDNNNEGPS 219 Query: 1316 
XXXXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEAL 1495 +QLQ GGI DPS GQ+GHEAL Sbjct: 220 SAAGVQGGGTSGGSPPPGIGHHQLQQGGI--PDPSSMHPLYNLPPNLLPNNGGQMGHEAL 277 Query: 1496 AWP-----HGRP 1516 AW HGRP Sbjct: 278 AWAAAAAGHGRP 289 >XP_016173389.1 PREDICTED: AT-hook motif nuclear-localized protein 20-like [Arachis ipaensis] Length = 315 Score = 271 bits (694), Expect = 3e-83 Identities = 155/257 (60%), Positives = 161/257 (62%), Gaps = 11/257 (4%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 NRDNGSDEP+EGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVME+AGGADV Sbjct: 61 NRDNGSDEPREGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEIAGGADV 119 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AESVA FARRRQRGVCVLSGSGSVANVT+RQP APGAVV LHGRFEILSLTG FLPGPAP Sbjct: 120 AESVAQFARRRQRGVCVLSGSGSVANVTIRQPTAPGAVV-LHGRFEILSLTGAFLPGPAP 178 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXX 1318 PGSTGLTVY PVM+IAATFANATYERLPL Sbjct: 179 PGSTGLTVYLAGGQGQVIGGSVVGSLVAVGPVMVIAATFANATYERLPLDEEDEGPSGGG 238 Query: 1319 XXXXXXXXXXXXXXXXXXXXNQLQA-----------GGINPADPSISXXXXXXXXXXXXX 1465 L GG DP + Sbjct: 239 GGGSSGGTGGGAGGGGSPPPQPLPGIGGSGGHHSLHGGGGLPDP--TSLPLYNMPPNLMP 296 Query: 1466 GSGQVGHEALAWPHGRP 1516 GQVGHEA WPHGRP Sbjct: 297 NGGQVGHEAYPWPHGRP 313 >XP_013468659.1 AT hook motif DNA-binding family protein [Medicago truncatula] KEH42696.1 AT hook motif DNA-binding family protein [Medicago truncatula] Length = 322 Score = 271 bits (692), Expect = 8e-83 Identities = 166/308 (53%), Positives = 175/308 (56%), Gaps = 4/308 (1%) Frame = +2 Query: 605 TEIAGLANPWWTVQXXXXXXXXXXXXXXXXXXXXXXXKRHVXXXXXXXXXXXXXXXXYNR 784 T A LANPWWT+Q K H NR Sbjct: 28 TTTAVLANPWWTIQGGLSGVDPGTHSPSDLN------KHHNTNLTINENHDDEEEDDDNR 81 Query: 785 DNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVAE 964 D EP+EGAV EVG RRPRGRP GSKN+PKPPIFVTRDSPNALKSHVMEVAGGAD+AE Sbjct: 82 D----EPREGAV-EVGNRRPRGRPPGSKNRPKPPIFVTRDSPNALKSHVMEVAGGADIAE 136 Query: 965 
SVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPPG 1144 SVA FARRRQRGVCVLSGSGSVANVTLRQP+APGAVVALHGRFEILSLTGTFLPGPAPPG Sbjct: 137 SVAQFARRRQRGVCVLSGSGSVANVTLRQPSAPGAVVALHGRFEILSLTGTFLPGPAPPG 196 Query: 1145 STGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPL--XXXXXXXXXXX 1318 STGLTVY PVMLIAATF NATYERLPL Sbjct: 197 STGLTVYLAGGQGQVVGGCVVGTLVAVGPVMLIAATFTNATYERLPLDDDDNDNNEGPNS 256 Query: 1319 XXXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALA 1498 +QLQ G +P+ S GQ+GHEALA Sbjct: 257 AGGVQGGGASTGGSPPPGIGHQLQGGLPDPS----SMPLYNLPPNLLPNNGGQMGHEALA 312 Query: 1499 W--PHGRP 1516 W HGRP Sbjct: 313 WAAAHGRP 320 >EEF33084.1 DNA binding protein, putative [Ricinus communis] Length = 301 Score = 270 bits (690), Expect = 8e-83 Identities = 152/246 (61%), Positives = 159/246 (64%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 +RD G DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVMEV GGADV Sbjct: 64 DRDTG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVVGGADV 121 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AE VA FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAP Sbjct: 122 AECVAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP 181 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXX 1318 PGSTGLTVY PVM+IAATFANATYERLPL Sbjct: 182 PGSTGLTVYLAGGQGQVVGGSVVGSLIAAGPVMVIAATFANATYERLPLEDDEEAASAGQ 241 Query: 1319 XXXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALA 1498 + + G P P S GQ+GH+A A Sbjct: 242 GHIQGGSNNSPP---------PIGSTGQQPGLPDPSALPVYNLPPNLIPNGGQLGHDAYA 292 Query: 1499 WPHGRP 1516 W HGRP Sbjct: 293 WAHGRP 298 >OMO54799.1 hypothetical protein CCACVL1_27564 [Corchorus capsularis] Length = 282 Score = 269 bits (688), Expect = 9e-83 Identities = 153/260 (58%), Positives = 162/260 (62%), Gaps = 14/260 (5%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 +RD G DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVMEVA G DV Sbjct: 37 
DRDTG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVASGTDV 94 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AES+A FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAP Sbjct: 95 AESIAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP 154 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXX 1318 PGSTGLTVY PVM+IAATF+NATYERLP+ Sbjct: 155 PGSTGLTVYLAGGQGQVVGGSVVGSLIAAGPVMVIAATFSNATYERLPI----------- 203 Query: 1319 XXXXXXXXXXXXXXXXXXXXNQLQAGGINPA--------------DPSISXXXXXXXXXX 1456 Q+Q GG + DPS Sbjct: 204 ---EEEEEGGSAGHGGGGGGGQIQGGGAGSSPPAIGSSGPQTGLPDPSSLPIYNLPPNLL 260 Query: 1457 XXXGSGQVGHEALAWPHGRP 1516 G GQ+GHEA W HGRP Sbjct: 261 SNGGGGQLGHEAYGWAHGRP 280 >XP_015580943.1 PREDICTED: AT-hook motif nuclear-localized protein 20, partial [Ricinus communis] Length = 305 Score = 270 bits (690), Expect = 9e-83 Identities = 152/246 (61%), Positives = 159/246 (64%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 +RD G DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVMEV GGADV Sbjct: 68 DRDTG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVVGGADV 125 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AE VA FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAP Sbjct: 126 AECVAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP 185 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXX 1318 PGSTGLTVY PVM+IAATFANATYERLPL Sbjct: 186 PGSTGLTVYLAGGQGQVVGGSVVGSLIAAGPVMVIAATFANATYERLPLEDDEEAASAGQ 245 Query: 1319 XXXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALA 1498 + + G P P S GQ+GH+A A Sbjct: 246 GHIQGGSNNSPP---------PIGSTGQQPGLPDPSALPVYNLPPNLIPNGGQLGHDAYA 296 Query: 1499 WPHGRP 1516 W HGRP Sbjct: 297 WAHGRP 302 >OIW07651.1 hypothetical protein TanjilG_07693 [Lupinus angustifolius] Length = 270 Score = 267 bits (683), Expect = 3e-82 Identities = 154/252 (61%), Positives = 162/252 (64%), Gaps = 7/252 (2%) Frame = +2 Query: 782 
RDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVA 961 +DNG DEPKEGAV E+G RRPRGRP GSKNKPKPPIFVTRDSPN L+SHV+EVAGGADVA Sbjct: 24 KDNG-DEPKEGAV-EIGNRRPRGRPPGSKNKPKPPIFVTRDSPNTLRSHVLEVAGGADVA 81 Query: 962 ESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPP 1141 ESVA FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRF+ILSLTG FLPGPAPP Sbjct: 82 ESVAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFDILSLTGAFLPGPAPP 141 Query: 1142 GSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPL-------XXXXX 1300 G+TGLTVY PVMLIAATFANATYE+LPL Sbjct: 142 GATGLTVYLAGGQGQVVGGSVVGSLVAAGPVMLIAATFANATYEKLPLEDDDEGGGGGGG 201 Query: 1301 XXXXXXXXXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQV 1480 +QLQ GGI P S GQV Sbjct: 202 GNSGVQGGGRGVGGSPPPGIGNSGGAHQLQ-GGI----PDPSSLPLYNLTSNLIPNGGQV 256 Query: 1481 GHEALAWPHGRP 1516 GHEA AW HGRP Sbjct: 257 GHEAFAWAHGRP 268 >XP_017975810.1 PREDICTED: AT-hook motif nuclear-localized protein 20 [Theobroma cacao] Length = 273 Score = 267 bits (683), Expect = 4e-82 Identities = 150/246 (60%), Positives = 158/246 (64%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 +RD G DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVMEVA G DV Sbjct: 36 DRDTG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVASGTDV 93 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AES+A FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAP Sbjct: 94 AESIAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP 153 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXX 1318 PGSTGLTVY PVM+IAATFANATYERLP+ Sbjct: 154 PGSTGLTVYLAGGQGQVVGGSVVGSLIAAGPVMVIAATFANATYERLPIEDDEEAGSGGH 213 Query: 1319 XXXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALA 1498 + + G P S GQ+GHEA A Sbjct: 214 GGQIQGGAGNSPP--------AIGSSGPQTGLPDPSSLPIYNLPPNLLANGGQLGHEAYA 265 Query: 1499 WPHGRP 1516 W HGRP Sbjct: 266 WAHGRP 271 >EOY05966.1 AT-hook motif nuclear localized protein 20 [Theobroma cacao] Length = 273 Score = 267 bits (683), Expect = 
4e-82 Identities = 150/246 (60%), Positives = 158/246 (64%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 +RD G DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVMEVA G DV Sbjct: 36 DRDTG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVASGTDV 93 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AES+A FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAP Sbjct: 94 AESIAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP 153 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXX 1318 PGSTGLTVY PVM+IAATFANATYERLP+ Sbjct: 154 PGSTGLTVYLAGGQGQVVGGSVVGSLIAAGPVMVIAATFANATYERLPIEDDEEAGSGGH 213 Query: 1319 XXXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALA 1498 + + G P S GQ+GHEA A Sbjct: 214 GGQIQGGAGNSPP--------AIGSSGPQTGLPDPSSLPIYNLPPNLLANGGQLGHEAYA 265 Query: 1499 WPHGRP 1516 W HGRP Sbjct: 266 WAHGRP 271 >KOM33959.1 hypothetical protein LR48_Vigan02g010900 [Vigna angularis] BAT96606.1 hypothetical protein VIGAN_08357300 [Vigna angularis var. 
angularis] Length = 273 Score = 267 bits (682), Expect = 5e-82 Identities = 155/250 (62%), Positives = 162/250 (64%), Gaps = 6/250 (2%) Frame = +2 Query: 779 NRDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADV 958 +RDNG DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPN+L+SHVMEVAGGADV Sbjct: 21 DRDNG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNSLRSHVMEVAGGADV 78 Query: 959 AESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAP 1138 AESVA FARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAP Sbjct: 79 AESVAQFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAP 138 Query: 1139 PGSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPL---XXXXXXXX 1309 PG+TGLTVY PVM+IAATFANATYERLPL Sbjct: 139 PGATGLTVYLAGGQGQVVGGSVVGPLVAAGPVMIIAATFANATYERLPLEEEDEDGGGGG 198 Query: 1310 XXXXXXXXXXXXXXXXXXXXXXXNQLQAGGINPA---DPSISXXXXXXXXXXXXXGSGQV 1480 + + G P DPS S G G V Sbjct: 199 SVQGGSTLGGSPPGIGSSGGGGGDGVSGGSHLPGGIPDPS-SLPLYNLPPNLLSNGGGPV 257 Query: 1481 GHEALAWPHG 1510 GHEA AW HG Sbjct: 258 GHEAYAWAHG 267 >XP_014618360.1 PREDICTED: AT-hook motif nuclear-localized protein 20-like isoform X1 [Glycine max] KRH31728.1 hypothetical protein GLYMA_10G008400 [Glycine max] Length = 270 Score = 266 bits (681), Expect = 7e-82 Identities = 152/244 (62%), Positives = 157/244 (64%), Gaps = 1/244 (0%) Frame = +2 Query: 782 RDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVA 961 RDNG DEPKEGAV E GTRRPRGRP GSKNKPKPPIFVTRDSPN+L+SHVMEVAGGADVA Sbjct: 25 RDNG-DEPKEGAV-EAGTRRPRGRPPGSKNKPKPPIFVTRDSPNSLRSHVMEVAGGADVA 82 Query: 962 ESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPP 1141 ESVA FARRRQRGVCVLSGSGSVANVTLRQP+APGAVVALHGRFEILSLTG FLPGPAPP Sbjct: 83 ESVAQFARRRQRGVCVLSGSGSVANVTLRQPSAPGAVVALHGRFEILSLTGAFLPGPAPP 142 Query: 1142 GSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXX 1321 G+TGLTVY PVM+IAATFANATYERLPL Sbjct: 143 GATGLTVYLAGGQGQVVGGSVVGSLVAAGPVMVIAATFANATYERLPLEEEEDDGGGSVQ 202 Query: 1322 XXXXXXXXXXXXXXXXXXXNQLQAGGINPAD-PSISXXXXXXXXXXXXXGSGQVGHEALA 1498 GG P P S GQVGHEA A 
Sbjct: 203 GGSTLGGSPHGIGSSGGGGG--SGGGHLPGGIPGPSSLPLYNLPPNLLPNGGQVGHEAFA 260 Query: 1499 WPHG 1510 W HG Sbjct: 261 WAHG 264 >XP_010262258.1 PREDICTED: AT-hook motif nuclear-localized protein 20 [Nelumbo nucifera] XP_019053885.1 PREDICTED: AT-hook motif nuclear-localized protein 20 [Nelumbo nucifera] Length = 255 Score = 266 bits (679), Expect = 8e-82 Identities = 152/245 (62%), Positives = 161/245 (65%) Frame = +2 Query: 782 RDNGSDEPKEGAVHEVGTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEVAGGADVA 961 R+NG DEPKEGAV EVGTRRPRGRP GSKNKPKPPIFVTRDSPNAL+SHVMEVAGGADVA Sbjct: 18 RENG-DEPKEGAV-EVGTRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVAGGADVA 75 Query: 962 ESVAHFARRRQRGVCVLSGSGSVANVTLRQPAAPGAVVALHGRFEILSLTGTFLPGPAPP 1141 ESVA FARRRQRGVCVLSGSG+VANVTLRQPAAPGAVVALHGRFEILSLTG FLPGPAPP Sbjct: 76 ESVAQFARRRQRGVCVLSGSGAVANVTLRQPAAPGAVVALHGRFEILSLTGAFLPGPAPP 135 Query: 1142 GSTGLTVYXXXXXXXXXXXXXXXXXXXXXPVMLIAATFANATYERLPLXXXXXXXXXXXX 1321 GSTGLTVY PVM+IAATFANATYERLPL Sbjct: 136 GSTGLTVYLAGGQGQVVGGSVVGTLVAAGPVMVIAATFANATYERLPLEEEEDEAGSGGQ 195 Query: 1322 XXXXXXXXXXXXXXXXXXXNQLQAGGINPADPSISXXXXXXXXXXXXXGSGQVGHEALAW 1501 Q Q+G +P+ I GQ+ H+A AW Sbjct: 196 GQLAGGAGSSPPAIGSSA--QQQSGLPDPSSLPI-----YNLPPNLIPNGGQLNHDAFAW 248 Query: 1502 PHGRP 1516 H RP Sbjct: 249 AHARP 253