BLASTX nr result
ID: Rehmannia25_contig00017275
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00017275
         (1341 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EPS73173.1| hypothetical protein M569_01583, partial [Genlise...  281   3e-73
ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-...  251   4e-64
ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding fact...  243   1e-61
ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-...  234   8e-59
emb|CBI27069.3| unnamed protein product [Vitis vinifera]             234   8e-59
ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...  227   8e-57
ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-...  225   3e-56
gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus pe...  225   3e-56
ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding fact...  222   2e-55
gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus...  219   2e-54
ref|XP_006583671.1| PREDICTED: PAX3- and PAX7-binding protein 1-...  214   7e-53
ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-...  214   7e-53
ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like pro...  211   7e-52
ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Popu...  207   6e-51
ref|XP_006379382.1| hypothetical protein POPTR_0008s00320g [Popu...  207   6e-51
gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus nota...  198   5e-48
ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding fact...  196   2e-47
gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein,...
191 6e-46 ref|XP_002311888.1| predicted protein [Populus trichocarpa] 190 1e-45 ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-... 181 6e-43 >gb|EPS73173.1| hypothetical protein M569_01583, partial [Genlisea aurea] Length = 765 Score = 281 bits (720), Expect = 3e-73 Identities = 189/427 (44%), Positives = 223/427 (52%), Gaps = 16/427 (3%) Frame = +3 Query: 42 KSRNFRRRAXXXXXXXX---NKSAAPST-----------TNKPSAXXXXXXXXXXXXXXX 179 KSRNFRRR+ N SA PST +K SA Sbjct: 1 KSRNFRRRSGVEEVDEEDGDNPSAVPSTPAKIKGTIPSSASKSSAVNKPQKSASQSGRKS 60 Query: 180 LLSFADDDDES--PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQ 353 LLSFA D +ES P DR PH SSS+PSNVQ Sbjct: 61 LLSFAGDVEESFSPAPTKSSHSSSSSSSLRSSKGSAHQLTSAKDRNAPHPSSSSIPSNVQ 120 Query: 354 PQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRS 533 PQAG YTKE LLELQ+NT+TLAAPAR+ LKGLIKPV+S+DL G Sbjct: 121 PQAGTYTKETLLELQRNTRTLAAPARHKPKAEQETVVV-LKGLIKPVVSSDLG----GSG 175 Query: 534 QNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERL 713 + D FD DL D ++L L GS DK+ +PD+A IEAI+AKRERL Sbjct: 176 HDSAAHDADFDGN-IDLGAENDATLTKLSGLGFEGGSEGDKDVIPDRATIEAIRAKRERL 234 Query: 714 RQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAM 893 RQAKAAAPDY+ALDGGSNHG AEGLSDEEPEFRGRIGFF +K G DK+GVF+D E RAM Sbjct: 235 RQAKAAAPDYVALDGGSNHGAAEGLSDEEPEFRGRIGFFADKAGVHDKRGVFEDLEQRAM 294 Query: 894 PKERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFGYLGT 1073 P++R +E S KMWE EQVRKGLGKRL + G Sbjct: 295 PRDRFVESGSDAEDEEDKMWEEEQVRKGLGKRLGNGVGGKGVTVNIAGSGLTTVHHLGGP 354 Query: 1074 GTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQESHGR 1253 + H + + + +G+D MSI QQA+LAKK L NL R++ESH + Sbjct: 355 QPTSGHSIIASSN--GDRVSDAASVVGSWGLDSMSISQQADLAKKTLTTNLARLKESHRQ 412 Query: 1254 TMMSLAK 1274 T L K Sbjct: 413 TKALLDK 419 >ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Solanum tuberosum] Length = 939 Score = 251 bits (642), Expect = 4e-64 Identities = 181/430 (42%), Positives = 224/430 (52%), Gaps = 16/430 (3%) Frame = +3 Query: 36 
SVKSRNFRRRAXXXXXXXXNKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLSFADDDD--E 209 S KSRNFRRR + + TTN +A LLSFADD+D + Sbjct: 2 SGKSRNFRRRGGDDGD---DDETSAKTTNGTAAKPTTTASATKPKKKSLLSFADDEDSDD 58 Query: 210 SPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX--DRIGPHHPSSSLPSNVQPQAGVYTKEA 383 +PF DRI P PS + SNVQPQAG YTKEA Sbjct: 59 TPFVRPSSKPSSASSRITKPSSSSSAHKLTSGKDRITPKPPSFT--SNVQPQAGTYTKEA 116 Query: 384 LLELQKNTKTL----AAPARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQNLGDD 551 LLELQKNT+TL +A + LKGL+KP S + T Q DD Sbjct: 117 LLELQKNTRTLVGSRSAQPKPEPRPGPVEPVIVLKGLVKPPFS--VTAQTQQNGQESEDD 174 Query: 552 DMSFDQKGKDLRVVRDDASSRLKDLELGPGSRE-DKEG--MPDQAMIEAIKAKRERLRQA 722 +M DQ G + +RL + L SR+ D G +PD+ I+AI+AKRERLRQA Sbjct: 175 EMDVDQFGGTV--------NRLGSMALEKDSRKKDDVGSVIPDKMTIDAIRAKRERLRQA 226 Query: 723 KAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKE 902 + AA D+IALD G NHGEAEGLSDEEPEF+ RIGF+GEKIG ++GVF+DFED+AM K+ Sbjct: 227 RPAAQDFIALDEGGNHGEAEGLSDEEPEFQQRIGFYGEKIGS-GRRGVFEDFEDKAMQKD 285 Query: 903 RGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDD-----XXXXXXXXXXXXXXXXXXXFGYL 1067 G KMWE EQVRKGLGKRLDD FG Sbjct: 286 GGFRSDDDEEDEEEKMWEEEQVRKGLGKRLDDGSNRGVMSSVVSSAAAVQNVQKANFGSS 345 Query: 1068 GTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQESH 1247 G S V+ VQ++DV L +D +SI ++AE+AKKAL E++ R++ESH Sbjct: 346 AVGAS-VYSSVQSIDVSDGPTIGGGVVGGLPSLDALSISKKAEVAKKALYESMGRLKESH 404 Query: 1248 GRTMMSLAKT 1277 GRT+ SL KT Sbjct: 405 GRTVTSLHKT 414 >ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Solanum lycopersicum] Length = 941 Score = 243 bits (620), Expect = 1e-61 Identities = 179/437 (40%), Positives = 222/437 (50%), Gaps = 23/437 (5%) Frame = +3 Query: 36 SVKSRNFRRRAXXXXXXXXNKS-------AAPSTTNKPSAXXXXXXXXXXXXXXXLLSFA 194 S KSRNFRRR + A P+TT SA LLSFA Sbjct: 2 SGKSRNFRRRGGDDGDDDETATKSTNGTAAKPTTTASASAAKPKKKS--------LLSFA 53 Query: 195 DDD--DESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX--DRIGPHHPSSSLPSNVQPQA 362 DD+ D++PF DRI P +S SNVQPQA Sbjct: 54 DDEESDDTPFVRPSSKPSSASSRITKPSSSSSAHKLTSGKDRITPK--PTSFTSNVQPQA 111 Query: 363 
GVYTKEALLELQKNTKTL----AAPARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGR 530 G YTKEALLELQKNT+TL ++ + LKGL+KP S G+ Sbjct: 112 GTYTKEALLELQKNTRTLVGSRSSQPKPEPRPGPVEPVIVLKGLVKPPFSVSAQTQQNGK 171 Query: 531 SQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSRE-DKEG--MPDQAMIEAIKAK 701 DD+M DQ G + +RL + L SR+ D G +PD+ I+AI+AK Sbjct: 172 ESE--DDEMDVDQFGGTV--------NRLGSMALEKDSRKKDDVGSVIPDKMTIDAIRAK 221 Query: 702 RERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFE 881 RERLRQA+ AA D+IALD G NHGEAEGLSDEEPEF+ RIGF+GEKIG +KGVF+DF+ Sbjct: 222 RERLRQARPAAQDFIALDEGGNHGEAEGLSDEEPEFQQRIGFYGEKIGS-GRKGVFEDFD 280 Query: 882 DRAMPKERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDD-----XXXXXXXXXXXXXXXX 1046 D+A+ K+ G KMWE EQVRKGLGKRLDD Sbjct: 281 DKALQKDGGFRSDDDEEDEEDKMWEEEQVRKGLGKRLDDGSNRGVMSSVVSSAAAVQNAQ 340 Query: 1047 XXXFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENL 1226 FG G S V+ VQ++DV L +D +SI +AE+AKKAL E++ Sbjct: 341 KANFGSSAVGAS-VYSSVQSIDVSDGPTIGGGVVGGLPSLDALSISMKAEVAKKALYESM 399 Query: 1227 RRVQESHGRTMMSLAKT 1277 R++ESHGRT+ SL KT Sbjct: 400 GRLKESHGRTVTSLHKT 416 >ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 913 Score = 234 bits (596), Expect = 8e-59 Identities = 174/433 (40%), Positives = 211/433 (48%), Gaps = 17/433 (3%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLSFADDD-- 203 MS+ KSRNFRRR N +TT PS LLSFAD+D Sbjct: 1 MSTAKSRNFRRRGGDTESNDGNDGGT-TTTTFPSKPTSSAKPKKKPQAPKLLSFADEDEQ 59 Query: 204 -DESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAGVYTKE 380 DE+P DRI H S S+PSNVQPQAG YTKE Sbjct: 60 TDENP--RPRASKPYRSAATAKKPSSSHKITTLKDRIA-HSSSPSVPSNVQPQAGTYTKE 116 Query: 381 ALLELQKNTKTLAAPARNXXXXXXXXXXXX-LKGLIKPVISNDLDIGTTGRSQNLGDDDM 557 AL ELQKNT+TL + + LKGL+KP+ S+ G D Sbjct: 117 ALRELQKNTRTLVTSSSSRSDPKPSSEPVIVLKGLVKPL-----------GSEPQGRDSY 165 Query: 558 SFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGM--PDQAMIEAIKAKRERLRQAKAA 731 S + R + +L ++KEG PD I AI+AKRERLRQA+ A Sbjct: 166 S-------------EGEHREVEAKLATVGIQNKEGSFYPDDETIRAIRAKRERLRQARPA 212 
Query: 732 APDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMP---KE 902 APDYI+LDGGSNHG AEGLSDEEPEFRGRI FGEK+ G KKGVF++ E+R M K Sbjct: 213 APDYISLDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVDG-GKKGVFEEVEERIMDVRFKG 271 Query: 903 RGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDD--------XXXXXXXXXXXXXXXXXXXF 1058 EVV KMWE EQ RKGLGKR+D+ + Sbjct: 272 GEDEVVDDDDDDEEKMWEEEQFRKGLGKRMDEGSARVDVSVMQGSQSPHNFVVPSAAKVY 331 Query: 1059 GYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQ 1238 G + + + V P + V L +DV+ I QQAE A+KAL EN+RR++ Sbjct: 332 GAVPSAAASVSPSIGGV------------IESLPALDVVPISQQAEAARKALLENVRRLK 379 Query: 1239 ESHGRTMMSLAKT 1277 ESHGRTM SL+KT Sbjct: 380 ESHGRTMSSLSKT 392 >emb|CBI27069.3| unnamed protein product [Vitis vinifera] Length = 425 Score = 234 bits (596), Expect = 8e-59 Identities = 175/432 (40%), Positives = 214/432 (49%), Gaps = 18/432 (4%) Frame = +3 Query: 36 SVKSRNFRRRAXXXXXXXXNKSAAP--STTNKPSAXXXXXXXXXXXXXXX-LLSFADDDD 206 S + RNFRRRA N P T+KPS LLSFADD++ Sbjct: 2 SSRPRNFRRRADDDDNDDTNGDGPPLIKPTSKPSTTTATTAAAAKPKKPPKLLSFADDEE 61 Query: 207 -ESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------DRIGPHHPSSSLPSNVQPQA 362 ESP DR+ P S+SLPSNVQPQA Sbjct: 62 NESPSRSSSRSTQPPSRPSKTSSRFTKLSSSSSHKITTTKDRLTPS--SASLPSNVQPQA 119 Query: 363 GVYTKEALLELQKNTKTLAA--PARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQ 536 G YTKEAL ELQKNT+TLA+ PA + LKGL+KP+ + + Sbjct: 120 GTYTKEALRELQKNTRTLASSRPASSEPKPSLEPVIV-LKGLVKPISAAE---------- 168 Query: 537 NLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLR 716 D D++ +D +RL + +G G ++ +PDQA I AI+AKRERLR Sbjct: 169 -----DAVIDEEN-------EDTETRLASMGIGKG----RDSIPDQATINAIRAKRERLR 212 Query: 717 QAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMP 896 Q++AAAPDYI+LDGGSNHG AEGLSDEEPEF+GRI FGEK KKGVF+D ++R M Sbjct: 213 QSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIAMFGEK-PESGKKGVFEDVDERGME 271 Query: 897 KERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDD-XXXXXXXXXXXXXXXXXXXFGYLG- 1070 + K+WE EQ RKGLGKR+DD F Y Sbjct: 272 GGFKKDAHDSDDEEEEKIWEEEQFRKGLGKRMDDGSSRVVSSSVPVVQKVQQQKFMYSSV 331 Query: 1071 
---TGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQE 1241 T GV P+ L G D MS+ QQAELAKKAL+ENLRR++E Sbjct: 332 TAYTSVPGVSAPLN----------IGGAVGPLPGFDAMSLSQQAELAKKALHENLRRLKE 381 Query: 1242 SHGRTMMSLAKT 1277 SHGRTM SL +T Sbjct: 382 SHGRTMSSLTRT 393 >ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis vinifera] Length = 913 Score = 227 bits (579), Expect = 8e-57 Identities = 173/432 (40%), Positives = 210/432 (48%), Gaps = 18/432 (4%) Frame = +3 Query: 36 SVKSRNFRRRAXXXXXXXXNKSAAP--STTNKPSAXXXXXXXXXXXXXXX-LLSFADDDD 206 S + RNFRRRA N P T+KPS LLSFADD++ Sbjct: 2 SSRPRNFRRRADDDDNDDTNGDGPPLIKPTSKPSTTTATTAAAAKPKKPPKLLSFADDEE 61 Query: 207 -ESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------DRIGPHHPSSSLPSNVQPQA 362 ESP DR+ P S+SLPSNVQPQA Sbjct: 62 NESPSRSSSRSTQPPSRPSKTSSRFTKLSSSSSHKITTTKDRLTPS--SASLPSNVQPQA 119 Query: 363 GVYTKEALLELQKNTKTLAA--PARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQ 536 G YTKEAL ELQKNT+TLA+ PA + LKGL+KP+ + + + Sbjct: 120 GTYTKEALRELQKNTRTLASSRPA-SSEPKPSLEPVIVLKGLVKPISAAE---DAVIDEE 175 Query: 537 NLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLR 716 N+ ++ S D+ G+D +PDQA I AI+AKRERLR Sbjct: 176 NVEEEPESKDKGGRD--------------------------SIPDQATINAIRAKRERLR 209 Query: 717 QAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMP 896 Q++AAAPDYI+LDGGSNHG AEGLSDEEPEF+GRI FGEK KKGVF+D ++R M Sbjct: 210 QSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIAMFGEK-PESGKKGVFEDVDERGME 268 Query: 897 KERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDD-XXXXXXXXXXXXXXXXXXXFGYLG- 1070 + K+WE EQ RKGLGKR+DD F Y Sbjct: 269 GGFKKDAHDSDDEEEEKIWEEEQFRKGLGKRMDDGSSRVVSSSVPVVQKVQQQKFMYSSV 328 Query: 1071 ---TGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQE 1241 T GV P+ L G D MS+ QQAELAKKAL+ENLRR++E Sbjct: 329 TAYTSVPGVSAPLN----------IGGAVGPLPGFDAMSLSQQAELAKKALHENLRRLKE 378 Query: 1242 SHGRTMMSLAKT 1277 SHGRTM SL +T Sbjct: 379 SHGRTMSSLTRT 390 >ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 916 Score = 225 bits (574), Expect = 3e-56 
Identities = 168/431 (38%), Positives = 210/431 (48%), Gaps = 15/431 (3%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLSFADDDDE 209 MS+ KSRNFRRR + ++T PS LLSFADD+DE Sbjct: 1 MSTAKSRNFRRRGGDDTESNDDNDGDTTSTTLPSKPPSSAKPKKKPQAPKLLSFADDEDE 60 Query: 210 SPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX-DRIGPHHPSSSLPSNVQPQAGVYTKEAL 386 + DRI H S S+P+NVQPQAG YTKEAL Sbjct: 61 TDENPRPRASKPHRTAATAKKPSSSHKITTLKDRIA-HTSSPSVPTNVQPQAGTYTKEAL 119 Query: 387 LELQKNTKTLAAPARNXXXXXXXXXXXX-LKGLIKPVISNDLDIGTTGRSQNLGDDDMSF 563 ELQKNT+TL + + + LKG +KP L T GR D Sbjct: 120 RELQKNTRTLVSSSSSRSDPKPSSEPVIVLKGHVKP-----LGPETQGR-------DSDS 167 Query: 564 DQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAAPDY 743 D +G+ V K +G ++ED PD+ I AI+AKRERLR A+ AAPDY Sbjct: 168 DSEGEHREV-------EAKLATVGIQNKEDSF-YPDEETIRAIRAKRERLRLARPAAPDY 219 Query: 744 IALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMP---KERGIE 914 I+LDGGSNHG AEGLSDEEPEFRGRI FGEK+ G KKGVF++ E+R + K E Sbjct: 220 ISLDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVDG-GKKGVFEEVEERRVDLRFKGGEEE 278 Query: 915 VVSXXXXXXXKMWEAEQVRKGLGKRLDD----------XXXXXXXXXXXXXXXXXXXFGY 1064 V+ KMWE EQ RKGLGKR+D+ +G Sbjct: 279 VLDDDDDEEEKMWEEEQFRKGLGKRMDEGSARVDVAAAAVQGAQLQHNFVVPSAAKVYGA 338 Query: 1065 LGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQES 1244 + + + V P + L +DV+ I QQAE A+KAL EN+RR++ES Sbjct: 339 VPSAAASVSPSIGGA------------IESLPVLDVVPISQQAEAARKALLENVRRLKES 386 Query: 1245 HGRTMMSLAKT 1277 HGRTM SL+KT Sbjct: 387 HGRTMSSLSKT 397 >gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus persica] Length = 925 Score = 225 bits (574), Expect = 3e-56 Identities = 170/432 (39%), Positives = 224/432 (51%), Gaps = 18/432 (4%) Frame = +3 Query: 36 SVKSRNFRRRAXXXXXXXX--NKSAAPST------TNKPSAXXXXXXXXXXXXXXXLLSF 191 S ++RNFRRRA N + P+T ++KPS+ LLSF Sbjct: 2 SSRARNFRRRADDDDDKNDDPNDTGTPATIPTVKSSSKPSSSSSSKPKKPHNQAPKLLSF 61 Query: 192 ADDDDESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX-DRIGPHHPSS---SLPSNVQPQ 359 DD++ + DR+ H SS SLPSNVQPQ Sbjct: 62 
VDDEESAAAPSRSSSSKPDKPSSRLGKPSSAHKMTALKDRLA--HTSSVSTSLPSNVQPQ 119 Query: 360 AGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPV--ISNDLDIGTTGRS 533 AG YTKEAL ELQKNT+TLA+ + LKGL+KP IS+ L R Sbjct: 120 AGTYTKEALRELQKNTRTLASSRPSSEPTIV------LKGLVKPTGTISDTL---REARE 170 Query: 534 QNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGM-PDQAMIEAIKAKRER 710 + +D+ ++ R +DDA +RL + G + G+ PDQA I AI+AKRER Sbjct: 171 LDSDNDEEQEKERASLFRRDKDDAEARLASM--GIDKAKGSSGLFPDQATINAIRAKRER 228 Query: 711 LRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDR- 887 LR+++AAAPD+I+LD GSNHG AEGLSDEEPEFRGRI FG+ + G KKGVF+D +DR Sbjct: 229 LRKSRAAAPDFISLDSGSNHGAAEGLSDEEPEFRGRIAIFGDNMEG-SKKGVFEDVDDRA 287 Query: 888 --AMPKERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFG 1061 A+ +++ I+ K+WE EQ RKGLGKR+DD Sbjct: 288 ADAVLRQKSID-RDEDEDEEEKIWEEEQFRKGLGKRMDDGSSIGVVSTSAPVVQSVPQPK 346 Query: 1062 YLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQE 1241 + +G + VQ+V V G +VMSI QAE+AKKAL EN+ +++E Sbjct: 347 ATYSAMAG-YSSVQSVPVGPSIGGAIGASQ---GSNVMSIKAQAEIAKKALEENVMKLKE 402 Query: 1242 SHGRTMMSLAKT 1277 SHGRTM+SL KT Sbjct: 403 SHGRTMLSLTKT 414 >ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cicer arietinum] Length = 916 Score = 222 bits (566), Expect = 2e-55 Identities = 167/425 (39%), Positives = 209/425 (49%), Gaps = 9/425 (2%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLSFADDD-D 206 MS+ KSRNFRRR + S+ PS +KPS+ LLSFADD+ D Sbjct: 1 MSTAKSRNFRRRNDTNEDDHADTSSTPSLPSKPSSSAPKPKKPQAPK---LLSFADDEND 57 Query: 207 ESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAGVYTKEAL 386 DRI H PS S SNVQPQAG YTKEAL Sbjct: 58 NENENPRPRSSKPHRSGVSKSSSSSHKITTHKDRIS-HSPSPSFLSNVQPQAGTYTKEAL 116 Query: 387 LELQKNTKTLAAPARNXXXXXXXXXXXX----LKGLIKPVISNDLDIGTTGRSQNLGDDD 554 ELQKNT+TL + + LKGL+KP S GR + D+ Sbjct: 117 RELQKNTRTLVTGSTSRPSSTSXXPSSEPVIVLKGLLKPASSEP-----QGRESDSEDEH 171 Query: 555 MSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAA 734 + K + + + S +PD+ I+AI+A+RERLRQA+ 
AA Sbjct: 172 KEVEAKFASVGIQNGNDSL-----------------IPDEETIKAIRARRERLRQARPAA 214 Query: 735 PDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPK--ERG 908 DYI+LDGGSNHG AEGLSDEEPEFRGRI FGEK G KKGVF+D ++R + G Sbjct: 215 QDYISLDGGSNHGAAEGLSDEEPEFRGRIALFGEK-GEGGKKGVFEDVDERGVDGRFNGG 273 Query: 909 IEVVSXXXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGV 1088 +VV KMWE EQ RKGLGKR+D+ ++ + V Sbjct: 274 GDVVVEEEDEEEKMWEEEQFRKGLGKRMDEGPGRVSGGDVSVVQVAQQP-KFVVPSAATV 332 Query: 1089 HPPVQNV--DVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTMM 1262 + V NV +DV+SI QQAE+A+KAL +N+RR++ESHGRTM Sbjct: 333 YGAVPNVVAAAASVSTSIGGAIPATPALDVISISQQAEIARKALLDNVRRLKESHGRTMS 392 Query: 1263 SLAKT 1277 SL KT Sbjct: 393 SLNKT 397 >gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus vulgaris] Length = 882 Score = 219 bits (559), Expect = 2e-54 Identities = 167/424 (39%), Positives = 204/424 (48%), Gaps = 8/424 (1%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLSFADDDD- 206 MS+ KSRNFRRR + ++KP + LLSFADD++ Sbjct: 1 MSTAKSRNFRRRGGGDTEGNDEDGDTSTLSSKPPSSAKPKKPQAPK----LLSFADDEEN 56 Query: 207 ESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAGVYTKEAL 386 E+P DRI PS +PSNVQPQAG YTKE L Sbjct: 57 ENP------RPRSAKPQRSSKPSSAHKITTLKDRIASSSPS--VPSNVQPQAGTYTKETL 108 Query: 387 LELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQNLGDDDMSFD 566 ELQKNT+TL + LKGL+KPV S GR + D D Sbjct: 109 RELQKNTRTLVTSSSRSEPKPPGEPVIVLKGLVKPVASEP-----QGR-----ESDSEGD 158 Query: 567 QKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAAPDYI 746 K + +L L L G PD+ I+AI+AKRERLRQA+ AA DYI Sbjct: 159 HK---------EVEGKLGGLGLHNGK---DSFFPDEETIKAIRAKRERLRQARPAAQDYI 206 Query: 747 ALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKERGIEVVSX 926 +LDGGSNHG AEGLSDEEPEFRGRI FGEK+ G KKGVF++ E+R + + + Sbjct: 207 SLDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVEG-GKKGVFEEVEERRV--DVRFKEEEE 263 Query: 927 XXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGVHPPVQN 1106 KMWE EQ RKGLGKR+D+ G+ V P VQ Sbjct: 264 
DDDEEEKMWEEEQFRKGLGKRMDE-----------------------GSARVDV-PVVQG 299 Query: 1107 VDVXXXXXXXXXXXXXLFG-------IDVMSIPQQAELAKKALNENLRRVQESHGRTMMS 1265 FG +DV+S+ QQAE AKKAL EN+RR++ESHGRTM S Sbjct: 300 AQQHKYVVPSAAVPNAGFGTIESMPALDVLSLSQQAESAKKALVENVRRLKESHGRTMSS 359 Query: 1266 LAKT 1277 L+KT Sbjct: 360 LSKT 363 >ref|XP_006583671.1| PREDICTED: PAX3- and PAX7-binding protein 1-like isoform X2 [Glycine max] Length = 838 Score = 214 bits (545), Expect = 7e-53 Identities = 160/417 (38%), Positives = 201/417 (48%), Gaps = 1/417 (0%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLSFADDDDE 209 MS+ KSRNFRRR + + + +KP + LLSFADD++ Sbjct: 1 MSAAKSRNFRRRGGDTEANEDDGDTSTTFRSKPPSSAKPKKPQAPK----LLSFADDEEI 56 Query: 210 SPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAGVYTKEALL 389 S DRI SSS+ SNVQPQAG YTKEAL Sbjct: 57 S----NPRPRSSAKPQRPSKPSSSHKITTLKDRIAH---SSSVSSNVQPQAGTYTKEALR 109 Query: 390 ELQKNTKTLAAPARNXXXXXXXXXXXX-LKGLIKPVISNDLDIGTTGRSQNLGDDDMSFD 566 ELQKNT+TL + + LKGL+KPV+S GR D Sbjct: 110 ELQKNTRTLVSSSTTTTTSSSRSEPVIVLKGLVKPVVSEP-----QGRHS---------D 155 Query: 567 QKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAAPDYI 746 +G+ V +L L + G PD+ I+AI+AKRERLR+A+ AAPDYI Sbjct: 156 SEGEHKEV-----EGKLSSLGIQNGK---DSFFPDEETIKAIRAKRERLRKARPAAPDYI 207 Query: 747 ALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKERGIEVVSX 926 +LDGGSNHG AEGLSDEEPEFRGRI F EK G KKGVF++ E+R +E + Sbjct: 208 SLDGGSNHGAAEGLSDEEPEFRGRIAMFEEKGEGGGKKGVFEEVEERLRDEEE-----ND 262 Query: 927 XXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGVHPPVQN 1106 KMWE EQ RKGLGKR+D+ GV P + Sbjct: 263 DDYEEEKMWEEEQFRKGLGKRMDEGAARVDVPVVQGAQQNKFVVSSAAAVYGGV--PSAD 320 Query: 1107 VDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTMMSLAKT 1277 V + +DV+ + QQAE A+KAL EN+RR++ESH RTM SL+KT Sbjct: 321 ARVPSVSPSIGGATESMPALDVVPMSQQAERARKALVENVRRLKESHERTMSSLSKT 377 >ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-like isoform X1 [Glycine max] Length = 896 Score = 214 bits (545), Expect = 7e-53 
Identities = 160/417 (38%), Positives = 201/417 (48%), Gaps = 1/417 (0%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLSFADDDDE 209 MS+ KSRNFRRR + + + +KP + LLSFADD++ Sbjct: 1 MSAAKSRNFRRRGGDTEANEDDGDTSTTFRSKPPSSAKPKKPQAPK----LLSFADDEEI 56 Query: 210 SPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAGVYTKEALL 389 S DRI SSS+ SNVQPQAG YTKEAL Sbjct: 57 S----NPRPRSSAKPQRPSKPSSSHKITTLKDRIAH---SSSVSSNVQPQAGTYTKEALR 109 Query: 390 ELQKNTKTLAAPARNXXXXXXXXXXXX-LKGLIKPVISNDLDIGTTGRSQNLGDDDMSFD 566 ELQKNT+TL + + LKGL+KPV+S GR D Sbjct: 110 ELQKNTRTLVSSSTTTTTSSSRSEPVIVLKGLVKPVVSEP-----QGRHS---------D 155 Query: 567 QKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAAPDYI 746 +G+ V +L L + G PD+ I+AI+AKRERLR+A+ AAPDYI Sbjct: 156 SEGEHKEV-----EGKLSSLGIQNGK---DSFFPDEETIKAIRAKRERLRKARPAAPDYI 207 Query: 747 ALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKERGIEVVSX 926 +LDGGSNHG AEGLSDEEPEFRGRI F EK G KKGVF++ E+R +E + Sbjct: 208 SLDGGSNHGAAEGLSDEEPEFRGRIAMFEEKGEGGGKKGVFEEVEERLRDEEE-----ND 262 Query: 927 XXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGVHPPVQN 1106 KMWE EQ RKGLGKR+D+ GV P + Sbjct: 263 DDYEEEKMWEEEQFRKGLGKRMDEGAARVDVPVVQGAQQNKFVVSSAAAVYGGV--PSAD 320 Query: 1107 VDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTMMSLAKT 1277 V + +DV+ + QQAE A+KAL EN+RR++ESH RTM SL+KT Sbjct: 321 ARVPSVSPSIGGATESMPALDVVPMSQQAERARKALVENVRRLKESHERTMSSLSKT 377 >ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] gi|355512167|gb|AES93790.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] Length = 892 Score = 211 bits (536), Expect = 7e-52 Identities = 163/432 (37%), Positives = 207/432 (47%), Gaps = 16/432 (3%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTT-NKPSAXXXXXXXXXXXXXXXLLSFADD-- 200 MSS KSRNFRRR + P+T +KPSA LLSFADD Sbjct: 1 MSSAKSRNFRRRTDTN-----SDDDTPTTVPSKPSAPKPKKPPK-------LLSFADDEI 48 Query: 201 --DDESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAGVYT 374 D+E+P +RI H PS S 
PSNVQPQAG YT Sbjct: 49 DADNETP---RPRSSKPHHHRPKPSSSSSHKITTHKNRITSHSPSPS-PSNVQPQAGTYT 104 Query: 375 KEALLELQKNTKTLAAPAR-----NXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQN 539 EAL ELQKNT+TL P + LKGL+KPV T ++ Sbjct: 105 LEALRELQKNTRTLVTPTTASRPISSEPKPSSEPVIVLKGLLKPV---------TSEPES 155 Query: 540 LGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGM-PDQAMIEAIKAKRERLR 716 +++ F+ K + G + K+ P + I+A KAKRER+R Sbjct: 156 DSEENGEFEAKFASV------------------GIKNGKDSFFPGEEDIKAAKAKRERMR 197 Query: 717 QAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVF----DDFED 884 +A AAAPDYI+LDGGSNHG AEGLSDEEPE+RGRI FG K G +KKGVF + F+D Sbjct: 198 KAGAAAPDYISLDGGSNHGAAEGLSDEEPEYRGRIAMFGGKKGDGEKKGVFEVADERFDD 257 Query: 885 RAMPKERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFGY 1064 + +E G +WE EQ +KGLGKR D+ + Sbjct: 258 VVVDEEDG-------------LWEEEQFKKGLGKRRDEGSARVGGGGEVPVVQAAQQPNF 304 Query: 1065 LGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGI-DVMSIPQQAELAKKALNENLRRVQE 1241 +G + V+ V NV + DV+SI QQAE+AKKA+ +N+RR++E Sbjct: 305 VGPSVANVYGAVPNVVAAASANTSIGGAIPATPVLDVISISQQAEIAKKAMLDNIRRLKE 364 Query: 1242 SHGRTMMSLAKT 1277 SHGRTM SL KT Sbjct: 365 SHGRTMSSLNKT 376 >ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] gi|550332058|gb|ERP57180.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] Length = 972 Score = 207 bits (528), Expect = 6e-51 Identities = 161/457 (35%), Positives = 205/457 (44%), Gaps = 42/457 (9%) Frame = +3 Query: 33 SSVKSRNFRRRAXXXXXXXX--------NKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLS 188 SS KSRNFRRR N A PSTT KP LLS Sbjct: 3 SSSKSRNFRRRGDVDDEKTDANTNNTDTNAKATPSTTRKPPPPQSTKPKPKK-----LLS 57 Query: 189 FADDD-DESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAG 365 FA+D+ DE DR+ P + SNVQPQAG Sbjct: 58 FAEDEEDEQAVTRIPSSKSKPKPKPKPTSSSSHKLTVSQDRLPPTTSYLTTASNVQPQAG 117 Query: 366 VYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISN----DLDIGTTGRS 533 YTKEALLELQ+NT+TLA + LKGL+KP S + + + + Sbjct: 118 TYTKEALLELQRNTRTLAKSTKTTTPASASEPKIILKGLLKPSFSPSPNPNPNYSSNHQQ 177 Query: 534 
QNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERL 713 Q+ DD + + KD DDA +RL + LG + +D PD+ I+ I+AKRERL Sbjct: 178 QDDADDQSEDENEDKDNGA--DDAQNRLASMGLGKSTSDDYSCFPDEDTIKKIRAKRERL 235 Query: 714 RQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKI-GGPDKKGVF------- 869 RQ++AAAPDYI+LD GSNH G SDEEPEFR RI G GVF Sbjct: 236 RQSRAAAPDYISLDSGSNH--QGGFSDEEPEFRTRIAMIGTMTKDTATHGGVFDAAADDD 293 Query: 870 -DDFEDRAMPKE--------------------RGIEVVSXXXXXXXKMWEAEQVRKGLGK 986 DD +DR++ + VV ++WE EQ RKGLGK Sbjct: 294 EDDDDDRSIKAKALAMMGTHHHHAVVDDGNVAAAASVVHDEEDEEDRIWEEEQFRKGLGK 353 Query: 987 RLDDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGI 1166 R+DD G + T + P + G+ Sbjct: 354 RMDDASAPIANRALASTA------GAAASSTIPMQPQQRPTPGYGSIPSIGGAFGSSQGL 407 Query: 1167 DVMSIPQQAELAKKALNENLRRVQESHGRTMMSLAKT 1277 DV+SIPQQA++AKKAL +NLRR++ESHGRT+ L+KT Sbjct: 408 DVLSIPQQADIAKKALQDNLRRLKESHGRTISLLSKT 444 >ref|XP_006379382.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] gi|550332057|gb|ERP57179.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] Length = 834 Score = 207 bits (528), Expect = 6e-51 Identities = 161/457 (35%), Positives = 205/457 (44%), Gaps = 42/457 (9%) Frame = +3 Query: 33 SSVKSRNFRRRAXXXXXXXX--------NKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLS 188 SS KSRNFRRR N A PSTT KP LLS Sbjct: 3 SSSKSRNFRRRGDVDDEKTDANTNNTDTNAKATPSTTRKPPPPQSTKPKPKK-----LLS 57 Query: 189 FADDD-DESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAG 365 FA+D+ DE DR+ P + SNVQPQAG Sbjct: 58 FAEDEEDEQAVTRIPSSKSKPKPKPKPTSSSSHKLTVSQDRLPPTTSYLTTASNVQPQAG 117 Query: 366 VYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISN----DLDIGTTGRS 533 YTKEALLELQ+NT+TLA + LKGL+KP S + + + + Sbjct: 118 TYTKEALLELQRNTRTLAKSTKTTTPASASEPKIILKGLLKPSFSPSPNPNPNYSSNHQQ 177 Query: 534 QNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERL 713 Q+ DD + + KD DDA +RL + LG + +D PD+ I+ I+AKRERL Sbjct: 178 QDDADDQSEDENEDKDNGA--DDAQNRLASMGLGKSTSDDYSCFPDEDTIKKIRAKRERL 235 Query: 714 
RQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKI-GGPDKKGVF------- 869 RQ++AAAPDYI+LD GSNH G SDEEPEFR RI G GVF Sbjct: 236 RQSRAAAPDYISLDSGSNH--QGGFSDEEPEFRTRIAMIGTMTKDTATHGGVFDAAADDD 293 Query: 870 -DDFEDRAMPKE--------------------RGIEVVSXXXXXXXKMWEAEQVRKGLGK 986 DD +DR++ + VV ++WE EQ RKGLGK Sbjct: 294 EDDDDDRSIKAKALAMMGTHHHHAVVDDGNVAAAASVVHDEEDEEDRIWEEEQFRKGLGK 353 Query: 987 RLDDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGI 1166 R+DD G + T + P + G+ Sbjct: 354 RMDDASAPIANRALASTA------GAAASSTIPMQPQQRPTPGYGSIPSIGGAFGSSQGL 407 Query: 1167 DVMSIPQQAELAKKALNENLRRVQESHGRTMMSLAKT 1277 DV+SIPQQA++AKKAL +NLRR++ESHGRT+ L+KT Sbjct: 408 DVLSIPQQADIAKKALQDNLRRLKESHGRTISLLSKT 444 >gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus notabilis] Length = 952 Score = 198 bits (503), Expect = 5e-48 Identities = 162/454 (35%), Positives = 210/454 (46%), Gaps = 41/454 (9%) Frame = +3 Query: 36 SVKSRNFRRRAXXXXXXXXNKSA-------APSTTN----------KPSAXXXXXXXXXX 164 S ++RNFRRR N + PSTT KPS+ Sbjct: 2 SNRARNFRRRTGGDDDDDDNYNIKDSNAKNGPSTTTATTTTTKSLLKPSSTSASKPKRPP 61 Query: 165 XXXXXLLSFADDDDE---SPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSS 335 LLSFADD+D S DR+ PH SSS Sbjct: 62 NQSTKLLSFADDEDNETPSRSKPSSSSKLSSSSSRLSKPTSSHKMTALKDRL-PHSSSSS 120 Query: 336 -------LPSNVQPQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPV 494 LPSNVQPQAG YTKEAL ELQKNT+TLA+ + LKGL+KP Sbjct: 121 PSSSSLSLPSNVQPQAGTYTKEALRELQKNTRTLASSKPSSEPVIV------LKGLLKP- 173 Query: 495 ISNDLDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEG---- 662 L D D + +D + L +E+G R+ Sbjct: 174 -------------SELAKSDWKLDSEEEDEPDELKERRGELASMEIGAKGRDRDNSSPEP 220 Query: 663 -MPDQAMIEAIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEK 839 +PDQA I AI+AKRERLRQ++AAAPD+IALD GSNHGEAEGLSDEEPE + RI FGEK Sbjct: 221 LIPDQATINAIRAKRERLRQSRAAAPDFIALDAGSNHGEAEGLSDEEPENQTRIAMFGEK 280 Query: 840 IGGPDKKGVF-DDFEDRA-----MPKERGI--EVVSXXXXXXXKMWEAEQVRKGLGK-RL 992 GP KKGVF DD +DR + +++G+ E K+WE EQ RKGLGK R+ Sbjct: 281 
AEGP-KKGVFEDDIDDRGIELGLLRRKQGVLEENHEDDEDEEDKIWEEEQFRKGLGKTRI 339 Query: 993 DDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDV 1172 DD ++ + S PP + + G+ + Sbjct: 340 DDGGKNSVVPVVKRETQQK----FVSSVGSQTLPP--SASIGGTFGGSSGGSSTGLGLGM 393 Query: 1173 MSIPQQAELAKKALNENLRRVQESHGRTMMSLAK 1274 M QQAE+A A+++N+RR++E+H + ++SL K Sbjct: 394 MPFSQQAEIALNAIDDNVRRLKETHDQDLVSLNK 427 >ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Fragaria vesca subsp. vesca] Length = 914 Score = 196 bits (498), Expect = 2e-47 Identities = 158/441 (35%), Positives = 202/441 (45%), Gaps = 26/441 (5%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPSTT---NKPSAXXXXXXXXXXXXXXXLLSFADD 200 MSS + +NFRRR + + ST +KPS+ LLSF DD Sbjct: 1 MSSARPKNFRRRIDDDDDDDADTPSTTSTLKSLSKPSSSAAKPKKPQSQAPK-LLSFVDD 59 Query: 201 DDE---SPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRI---GPHHPSSSLPSNVQPQA 362 ++ S DR+ S+SLPSNVQPQA Sbjct: 60 EENATPSRSSSSSSKRDKSSSSRLAKPSSAHKLTAAKDRLVNSTSSTASASLPSNVQPQA 119 Query: 363 GVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQNL 542 G YTKEAL ELQKNT+TLA+ +R L+G IKP ++ D R + Sbjct: 120 GTYTKEALRELQKNTRTLAS-SRTSSAAAAAEPTIVLRGSIKPADASIADAVNGARELDS 178 Query: 543 GDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQA 722 D++ Q+G K+ PDQA IEAI+ KRERLR++ Sbjct: 179 DDEE----QQGS-------------------------KDRYPDQATIEAIRKKRERLRKS 209 Query: 723 KAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMP-- 896 K AAPD+IALD GSNHG AEGLSDEEPEFR RI FGEK+ +KKGVF+D +D + Sbjct: 210 KPAAPDFIALDSGSNHGAAEGLSDEEPEFRNRIAMFGEKM--ENKKGVFEDVDDTGVDGG 267 Query: 897 KERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXXFGYLGTG 1076 R VV K+WE EQ RKGLGKR+D+ LG Sbjct: 268 LRRESVVVEDDEDEEEKIWEEEQFRKGLGKRVDNDG------------------ASLGVS 309 Query: 1077 TS--GVHPPVQNVDVXXXXXXXXXXXXXLFGI-------------DVMSIPQQAELAKKA 1211 S VH L G+ + +SI +Q+E+A+KA Sbjct: 310 ASVPRVHSAAPQPKASYNSIAGYSLAQSLAGVASIGGATGASQGSNALSINEQSEIAQKA 369 Query: 1212 LNENLRRVQESHGRTMMSLAK 1274 L EN+R+++ESHGRT MSL K Sbjct: 370 
LLENVRKLKESHGRTKMSLTK 390 >gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] gi|508727414|gb|EOY19311.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] Length = 934 Score = 191 bits (485), Expect = 6e-46 Identities = 158/439 (35%), Positives = 202/439 (46%), Gaps = 25/439 (5%) Frame = +3 Query: 33 SSVKSRNFRRRAXXXXXXXXNKSAAPS-------TTNKPSAXXXXXXXXXXXXXXXLLSF 191 S++++RNFRRR + + P+ T KPS+ LLSF Sbjct: 3 SAIRARNFRRRGDDIDDDGNDDNNTPNIASATVTATKKPSSSKPTAKKPPK-----LLSF 57 Query: 192 ADDDDESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHH--PSSSLPSNVQPQAG 365 ADD++E S+LPSNVQPQAG Sbjct: 58 ADDENEEETTKPSSNRNRDKEREKPFSSRVSKPLSAHKITSTKDCKTPSTLPSNVQPQAG 117 Query: 366 VYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQNLG 545 YTKEALLELQKN +TLAAP+ + LKGL+KP +SQNL Sbjct: 118 TYTKEALLELQKNMRTLAAPS-SRASSVSSEPKIVLKGLLKP------------QSQNLN 164 Query: 546 DDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAK 725 + ++ +DD SRL + G G D PDQA I+AIKAK++R+R++ Sbjct: 165 SE----RDNDPPEKLQKDDTESRLATMAAGKGVDLDFSAFPDQATIDAIKAKKDRVRKSF 220 Query: 726 A-AAPDYIALDGGSNHG---EAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAM 893 A APDYI+LD GSN G E E DEEPEF GR+ FGE KKGVF+ E+RA+ Sbjct: 221 ARPAPDYISLDRGSNLGGAMEEELSDDEEPEFPGRL--FGES----GKKGVFEVIEERAV 274 Query: 894 P---KERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDD---------XXXXXXXXXXXXX 1037 ++ GI KMWE EQ RKGLGKR+DD Sbjct: 275 GVGLRKDGIHDEDDDDNEEEKMWEEEQFRKGLGKRMDDSSNRVVSSSNNSGGVGMVHNMQ 334 Query: 1038 XXXXXXFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALN 1217 +GY G+ G P + G+DV SI QQAE+ KKAL Sbjct: 335 QQHQQRYGYSTMGSYGSMMPSVS---PAPPSSIVGAAGASQGLDVTSISQQAEITKKALQ 391 Query: 1218 ENLRRVQESHGRTMMSLAK 1274 EN+RR++ESH RT+ SL K Sbjct: 392 ENVRRLKESHDRTISSLTK 410 >ref|XP_002311888.1| predicted protein [Populus trichocarpa] Length = 476 Score = 190 bits (482), Expect = 1e-45 Identities = 152/444 (34%), Positives = 194/444 (43%), Gaps = 42/444 (9%) Frame = +3 Query: 33 
SSVKSRNFRRRAXXXXXXXX--------NKSAAPSTTNKPSAXXXXXXXXXXXXXXXLLS 188 SS KSRNFRRR N A PSTT KP LLS Sbjct: 3 SSSKSRNFRRRGDVDDEKTDANTNNTDTNAKATPSTTRKPPPPQSTKPKPKK-----LLS 57 Query: 189 FADDD-DESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSSSLPSNVQPQAG 365 FA+D+ DE DR+ P + SNVQPQAG Sbjct: 58 FAEDEEDEQAVTRIPSSKSKPKPKPKPTSSSSHKLTVSQDRLPPTTSYLTTASNVQPQAG 117 Query: 366 VYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISN----DLDIGTTGRS 533 YTKEALLELQ+NT+TLA + LKGL+KP S + + + + Sbjct: 118 TYTKEALLELQRNTRTLAKSTKTTTPASASEPKIILKGLLKPSFSPSPNPNPNYSSNHQQ 177 Query: 534 QNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERL 713 Q+ DD + + KD DDA +RL + LG + +D PD+ I+ I+AKRERL Sbjct: 178 QDDADDQSEDENEDKDNGA--DDAQNRLASMGLGKSTSDDYSCFPDEDTIKKIRAKRERL 235 Query: 714 RQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKI-GGPDKKGVF------- 869 RQ++AAAPDYI+LD GSNH G SDEEPEFR RI G GVF Sbjct: 236 RQSRAAAPDYISLDSGSNH--QGGFSDEEPEFRTRIAMIGTMTKDTATHGGVFDAAADDD 293 Query: 870 -DDFEDRAMPKE--------------------RGIEVVSXXXXXXXKMWEAEQVRKGLGK 986 DD +DR++ + VV ++WE EQ RKGLGK Sbjct: 294 EDDDDDRSIKAKALAMMGTHHHHAVVDDGNVAAAASVVHDEEDEEDRIWEEEQFRKGLGK 353 Query: 987 RLDDXXXXXXXXXXXXXXXXXXXFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGI 1166 R+DD G + T + P + G+ Sbjct: 354 RMDDASAPIANRALASTA------GAAASSTIPMQPQQRPTPGYGSIPSIGGAFGSSQGL 407 Query: 1167 DVMSIPQQAELAKKALNENLRRVQ 1238 DV+SIPQQA++AKKAL +NLRR++ Sbjct: 408 DVLSIPQQADIAKKALQDNLRRLK 431 >ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Citrus sinensis] Length = 913 Score = 181 bits (459), Expect = 6e-43 Identities = 157/429 (36%), Positives = 197/429 (45%), Gaps = 13/429 (3%) Frame = +3 Query: 30 MSSVKSRNFRRRAXXXXXXXXNKSAAPST---TNKPSAXXXXXXXXXXXXXXXLLSFADD 200 MSS ++RNFRRRA + + + +T T KP + LLSFADD Sbjct: 1 MSSSRARNFRRRADDDEDNNDDNTPSAATTTATKKPPS---------SSKPKKLLSFADD 51 Query: 201 DDESPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXDRIGPHHPSS--SLPSNVQPQAGVYT 374 ++E +R SS SL SNVQ QAG YT Sbjct: 52 EEEKSEIPTSNRDRTRPSSRLSKPSSSHKITASKERQSSSATSSSTSLLSNVQAQAGTYT 111 Query: 375 
KEALLELQKNTKTLAAPARNXXXXXXXXXXXXLKGLIKPVISNDLDIGTTGRSQNLGDDD 554 +E LLEL+KNTKTL AP+ L+G IKP SN T Q D Sbjct: 112 EEYLLELRKNTKTLKAPSSK----PPAEPVVVLRGSIKPEDSN-----LTRVQQKPSRDS 162 Query: 555 MSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEG-MPDQAMIEAIKAKRERLRQAKAA 731 D K A + + LG G + G + D+A I+AI+AK++RLRQ+ A Sbjct: 163 SDSDSDHK--------AETEKRFASLGVGKIAVQSGVIYDEAEIKAIRAKKDRLRQSGAK 214 Query: 732 APDYIALDGGSN--HGEAEGLSDEEPEFRGRIGFFGEK-IGGPDKKGVF--DDFEDRAMP 896 APDYI LDGGS+ G+AEG SDEEPEF R+ FGE+ G KKGVF DD ++ P Sbjct: 215 APDYIPLDGGSSSLRGDAEGSSDEEPEFPRRVAMFGERTASGKKKKGVFEDDDVDEDERP 274 Query: 897 KERGIEVVSXXXXXXXKMWEAEQVRKGLGKRLDD--XXXXXXXXXXXXXXXXXXXFGYLG 1070 +E MWE EQVRKGLGKR+DD F Y Sbjct: 275 VVARVE-NDYEYVDEDVMWEEEQVRKGLGKRIDDGSVRVGANTSSSVAMPQQQQQFSYST 333 Query: 1071 TGTSGVHPPVQNVDVXXXXXXXXXXXXXLFGIDVMSIPQQAELAKKALNENLRRVQESHG 1250 T T P+ ++ G+D MSI Q+AE A KAL N+ R++ESH Sbjct: 334 TVT-----PIPSIGGAIGASQ---------GLDTMSIAQKAESAMKALQTNVNRLKESHA 379 Query: 1251 RTMMSLAKT 1277 RTM SL KT Sbjct: 380 RTMSSLKKT 388