BLASTX nr result
ID: Atropa21_contig00029105
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00029105
         (1188 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   474   e-131
ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding fact...   469   e-130
ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...   294   4e-77
ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding fact...   267   7e-69
ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding fact...   266   2e-68
gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein,...   261   3e-67
ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   254   5e-65
gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus pe...   252   2e-64
ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding fact...   251   4e-64
ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   248   3e-63
ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   247   8e-63
ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citr...   246   1e-62
gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus nota...   243   1e-61
ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putativ...   243   1e-61
gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus...   237   6e-60
gb|EPS73173.1| hypothetical protein M569_01583, partial [Genlise...   234   5e-59
ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding fact...   234   5e-59
ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [A...   234   7e-59
ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like pro...
229 2e-57 ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Popu... 229 2e-57 >ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Solanum tuberosum] Length = 939 Score = 474 bits (1220), Expect = e-131 Identities = 281/419 (67%), Positives = 289/419 (68%), Gaps = 23/419 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEEPEFQQRIGFYGEKI SGRRGVFEDF E+KAM +KDG F Sbjct: 249 SDEEPEFQQRIGFYGEKIGSGRRGVFEDF-EDKAM-QKDGGFRS-----DDDEEDEEEKM 301 Query: 181 XXXXQVRKGLGKRLED-KXXXXXXXXXXXXXXXXXXQKANFGNSG-GATVYSSVQSIDVS 354 QVRKGLGKRL+D QKANFG+S GA+VYSSVQSIDVS Sbjct: 302 WEEEQVRKGLGKRLDDGSNRGVMSSVVSSAAAVQNVQKANFGSSAVGASVYSSVQSIDVS 361 Query: 355 DGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSA 534 DG TIGGGV VG LPSLDALSISKKAEVAKKALYESMGRLKESH RTV SL+KTEENLSA Sbjct: 362 DGPTIGGGV-VGGLPSLDALSISKKAEVAKKALYESMGRLKESHGRTVTSLHKTEENLSA 420 Query: 535 SLSKVTMLENSLSAAG---------------------DKGPYIEELEDQMQKLHXXXXXX 651 SLSKVT LENSLSAAG DKGPYIEELEDQMQKLH Sbjct: 421 SLSKVTTLENSLSAAGEKYMFMQKLRDFVSVICALLQDKGPYIEELEDQMQKLHEERAAA 480 Query: 652 XXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVE 831 DNDDEMKELEAAV+AARQVLSRGGSN MRKGGDL +E Sbjct: 481 ILERRAADNDDEMKELEAAVSAARQVLSRGGSNAATIEAATAAAQTSTAAMRKGGDLPIE 540 Query: 832 LDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXX 1011 LDEFGRDKNLQKRMDTT NDVKRMSAIKCDSSYQKIEG Sbjct: 541 LDEFGRDKNLQKRMDTTRRAEARKRRRVKNDVKRMSAIKCDSSYQKIEGESSTDESDSES 600 Query: 1012 XAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 AYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF Sbjct: 601 TAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 659 >ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Solanum lycopersicum] Length = 941 Score = 469 bits (1208), Expect = e-130 Identities = 279/419 (66%), Positives = 288/419 (68%), Gaps = 23/419 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 
180 SDEEPEFQQRIGFYGEKI SGR+GVFEDF ++KA L+KDG F Sbjct: 251 SDEEPEFQQRIGFYGEKIGSGRKGVFEDF-DDKA-LQKDGGFRS-----DDDEEDEEDKM 303 Query: 181 XXXXQVRKGLGKRLED-KXXXXXXXXXXXXXXXXXXQKANFGNSG-GATVYSSVQSIDVS 354 QVRKGLGKRL+D QKANFG+S GA+VYSSVQSIDVS Sbjct: 304 WEEEQVRKGLGKRLDDGSNRGVMSSVVSSAAAVQNAQKANFGSSAVGASVYSSVQSIDVS 363 Query: 355 DGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSA 534 DG TIGGGV VG LPSLDALSIS KAEVAKKALYESMGRLKESH RTV SL+KTEENLSA Sbjct: 364 DGPTIGGGV-VGGLPSLDALSISMKAEVAKKALYESMGRLKESHGRTVTSLHKTEENLSA 422 Query: 535 SLSKVTMLENSLSAAG---------------------DKGPYIEELEDQMQKLHXXXXXX 651 SLSKVT LENSLSAAG DKGPYIEELEDQMQKLH Sbjct: 423 SLSKVTTLENSLSAAGEKYMFMQKLRDFVSVICALLQDKGPYIEELEDQMQKLHEERAAA 482 Query: 652 XXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVE 831 DNDDEMKELEAAV+AARQVLSRGGSN MRKGGDL VE Sbjct: 483 ILERRAADNDDEMKELEAAVSAARQVLSRGGSNAATIEAATAAAQTSTAAMRKGGDLPVE 542 Query: 832 LDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXX 1011 LDEFGRDKNLQKRMDTT NDVKRMSAIKCDSSYQKIEG Sbjct: 543 LDEFGRDKNLQKRMDTTRRAEARKRRRMKNDVKRMSAIKCDSSYQKIEGESSTDESDSES 602 Query: 1012 XAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 AYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF Sbjct: 603 TAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 661 >ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis vinifera] Length = 913 Score = 294 bits (753), Expect = 4e-77 Identities = 188/417 (45%), Positives = 236/417 (56%), Gaps = 21/417 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEEPEFQ RI +GEK +SG++GVFED V+E+ M GGF Sbjct: 234 SDEEPEFQGRIAMFGEKPESGKKGVFED-VDERGME------GGFKKDAHDSDDEEEEKI 286 Query: 181 XXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVSDG 360 Q RKGLGKR++D Q+ F S T Y+SV VS Sbjct: 287 WEEEQFRKGLGKRMDD---GSSRVVSSSVPVVQKVQQQKFMYSS-VTAYTSVPG--VSAP 340 Query: 361 HTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSASL 540 IGG V G LP 
DA+S+S++AE+AKKAL+E++ RLKESH RT++SL +T+ENLS+SL Sbjct: 341 LNIGGAV--GPLPGFDAMSLSQQAELAKKALHENLRRLKESHGRTMSSLTRTDENLSSSL 398 Query: 541 SKVTMLENSLSAAGDK---------------------GPYIEELEDQMQKLHXXXXXXXX 657 S +T LE SL+AAG+K P+IEELE+QMQKLH Sbjct: 399 SNITTLEKSLTAAGEKFIFMQXLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASAIL 458 Query: 658 XXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVELD 837 DND EM E++A+V+AA V ++ GSN MR+ +L V+LD Sbjct: 459 ERRAADND-EMMEIQASVDAAMSVFTKSGSNEAMVAAARTAAQAASAAMREQTNLPVKLD 517 Query: 838 EFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXXXA 1017 E+GRD NLQK MD D KRM+ ++ +SS+QKIEG A Sbjct: 518 EYGRDINLQKCMDKNRRSEARQRKRDRWDAKRMTFLENESSHQKIEGESSTDESDSETTA 577 Query: 1018 YQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 YQSNRD LLQ +EQIFGDA EEYSQLS V E+ +RWKK Y+SSYRDAYMSLS+P IF Sbjct: 578 YQSNRDLLLQTAEQIFGDAAEEYSQLSAVKERIERWKKQYSSSYRDAYMSLSVPAIF 634 >ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis sativus] Length = 889 Score = 267 bits (682), Expect = 7e-69 Identities = 176/417 (42%), Positives = 224/417 (53%), Gaps = 21/417 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEE EF RI G K++S ++GVFE+ E+ G G Sbjct: 210 SDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQ-------GIDGARTNIIEHSDEDEEEKI 262 Query: 181 XXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVSDG 360 Q RKGLGKR++D Q + + G YSSV S VS Sbjct: 263 WEEEQFRKGLGKRMDD-GSTRVESTSVPVVPSVQPQNLIYPTTIG---YSSVPS--VSTA 316 Query: 361 HTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSASL 540 +IGG V + + LD LSIS++AE+AK A+ ESMGRLKES+ RT S+ KT+ENLSASL Sbjct: 317 TSIGGSVSISQ--GLDGLSISQQAEIAKTAMQESMGRLKESYRRTAMSVLKTDENLSASL 374 Query: 541 SKVTMLENSLSAAGDK---------------------GPYIEELEDQMQKLHXXXXXXXX 657 K+T LE +LSAAGDK P+IEELE+QMQKLH Sbjct: 375 LKITDLEKALSAAGDKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASTVV 434 Query: 658 XXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVELD 837 DNDDEM E+E AV AA +L++ GS+ R+ +L +LD Sbjct: 
435 ERRVADNDDEMVEIETAVKAAISILNKKGSSNEMITAATSAAQAAIALSREQANLPTKLD 494 Query: 838 EFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXXXA 1017 EFGRD NLQKRMD D KR+++++ D +QK+EG A Sbjct: 495 EFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDG-HQKVEGESSTDESDSDSAA 553 Query: 1018 YQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 YQSNRD LLQ +EQIF DA EE+SQLSVV ++F+ WK+DY+++YRDAYMSLSIP IF Sbjct: 554 YQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEAWKRDYSATYRDAYMSLSIPAIF 610 >ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis sativus] Length = 920 Score = 266 bits (679), Expect = 2e-68 Identities = 175/417 (41%), Positives = 224/417 (53%), Gaps = 21/417 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEE EF RI G K++S ++GVFE+ E+ G G Sbjct: 241 SDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQ-------GIDGARTNIIEHSDEDEEEKI 293 Query: 181 XXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVSDG 360 Q RKGLGKR++D Q + + G YSSV S+ S Sbjct: 294 WEEEQFRKGLGKRMDD-GSTRVESTSVPVVPSVQPQNLIYPTTIG---YSSVPSM--STA 347 Query: 361 HTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSASL 540 +IGG V + + LD LSIS++AE+AK A+ ESMGRLKES+ RT S+ KT+ENLSASL Sbjct: 348 TSIGGSVSISQ--GLDGLSISQQAEIAKTAMQESMGRLKESYRRTAMSVLKTDENLSASL 405 Query: 541 SKVTMLENSLSAAGDK---------------------GPYIEELEDQMQKLHXXXXXXXX 657 K+T LE +LSAAGDK P+IEELE+QMQKLH Sbjct: 406 LKITDLEKALSAAGDKFMFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASTVV 465 Query: 658 XXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVELD 837 DNDDEM E+E AV AA +L++ GS+ R+ +L +LD Sbjct: 466 ERRVADNDDEMVEIETAVKAAISILNKKGSSNEMVTAATSAAQAAIALSREQANLPTKLD 525 Query: 838 EFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXXXA 1017 EFGRD NLQKRMD D KR+++++ D +QK+EG A Sbjct: 526 EFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDG-HQKVEGESSTDESDSDSAA 584 Query: 1018 YQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 YQSNRD LLQ +EQIF DA EE+SQLSVV ++F+ WK+DY+++YRDAYMSLSIP IF Sbjct: 585 
YQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEAWKRDYSATYRDAYMSLSIPAIF 641 >gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] gi|508727414|gb|EOY19311.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] Length = 934 Score = 261 bits (668), Expect = 3e-67 Identities = 176/424 (41%), Positives = 225/424 (53%), Gaps = 29/424 (6%) Frame = +1 Query: 4 DEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAM---LKKDGRFGGFXXXXXXXXXXXXX 174 DEEPEF R+ +GE SG++GVFE +EE+A+ L+KDG Sbjct: 247 DEEPEFPGRL--FGE---SGKKGVFE-VIEERAVGVGLRKDG------IHDEDDDDNEEE 294 Query: 175 XXXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQ-----KANFGNSGGATVYSSVQ 339 Q RKGLGKR++D + +G S + S + Sbjct: 295 KMWEEEQFRKGLGKRMDDSSNRVVSSSNNSGGVGMVHNMQQQHQQRYGYSTMGSYGSMMP 354 Query: 340 SIDVSDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTE 519 S+ + +I G G LD SIS++AE+ KKAL E++ RLKESHDRT++SL K + Sbjct: 355 SVSPAPPSSIVGAA--GASQGLDVTSISQQAEITKKALQENVRRLKESHDRTISSLTKAD 412 Query: 520 ENLSASLSKVTMLENSLSAAGDK---------------------GPYIEELEDQMQKLHX 636 ENLSASL +T LE SLSAAG+K P IEELE+ MQKL+ Sbjct: 413 ENLSASLFNITALEKSLSAAGEKFIFMQKLRDFVSVICEFLQHKAPLIEELEEHMQKLNE 472 Query: 637 XXXXXXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGG 816 +NDDEM E+EAAV AA V S G++ +R Sbjct: 473 ERALSVLERRSANNDDEMVEVEAAVTAAMLVFSECGNSAAMIEVAANAAQAAAAAIRGQV 532 Query: 817 DLSVELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXX 996 +L V+LDEFGRD N QK +D D KR+S+++ DSSYQKIEG Sbjct: 533 NLPVKLDEFGRDVNRQKHLDMERRAEARQRRKARFDSKRLSSMEIDSSYQKIEGESSTDE 592 Query: 997 XXXXXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSI 1176 AY+SNRD LLQ +++IFGDA EEYSQLS+V E+F+RWKKDY+SSYRDAYMSLSI Sbjct: 593 SDSESTAYRSNRDMLLQTADEIFGDASEEYSQLSLVKERFERWKKDYSSSYRDAYMSLSI 652 Query: 1177 PVIF 1188 P IF Sbjct: 653 PAIF 656 >ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 913 Score = 254 bits (649), Expect = 5e-65 Identities = 169/420 (40%), Positives = 225/420 (53%), Gaps = 
24/420 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEEPEF+ RI +GEK+D G++GVFE+ VEE+ M D RF G Sbjct: 232 SDEEPEFRGRIAMFGEKVDGGKKGVFEE-VEERIM---DVRFKGGEDEVVDDDDDDEEKM 287 Query: 181 XXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVSDG 360 Q RKGLGKR+++ NF A VY +V S S Sbjct: 288 WEEEQFRKGLGKRMDE-----GSARVDVSVMQGSQSPHNFVVPSAAKVYGAVPSAAASVS 342 Query: 361 HTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSASL 540 +IGG + LP+LD + IS++AE A+KAL E++ RLKESH RT++SL+KT+ENLSASL Sbjct: 343 PSIGG--VIESLPALDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKTDENLSASL 400 Query: 541 SKVTMLENSLSAAGD---------------------KGPYIEELEDQMQKLHXXXXXXXX 657 +T LENSL A + K YIEELE+QM+KLH Sbjct: 401 LNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKAFYIEELEEQMKKLHEDRALAIS 460 Query: 658 XXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVELD 837 +NDDEM E+E AV AA VLS+ G+N +RK DL V+LD Sbjct: 461 ERRATNNDDEMIEVEEAVKAAMSVLSKKGNN---MEAAKIAAQEAFSAVRKQRDLPVKLD 517 Query: 838 EFGRDKNLQKRMD---TTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXX 1008 EFGRD NL+KRM+ T D ++++++ D KIEG Sbjct: 518 EFGRDLNLEKRMNMKAKTRSEACQRKRSQAFDSNKVTSMELDD--HKIEGESSTDESDSE 575 Query: 1009 XXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 AYQS D +LQ +++IF DA EEY QLS+V + + WK++++SSY+DAYMSLS+P+IF Sbjct: 576 SQAYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKREHSSSYKDAYMSLSLPLIF 635 >gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus persica] Length = 925 Score = 252 bits (643), Expect = 2e-64 Identities = 162/408 (39%), Positives = 218/408 (53%), Gaps = 12/408 (2%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKA--MLKKDGRFGGFXXXXXXXXXXXXX 174 SDEEPEF+ RI +G+ ++ ++GVFED + A +L++ Sbjct: 255 SDEEPEFRGRIAIFGDNMEGSKKGVFEDVDDRAADAVLRQKS-------IDRDEDEDEEE 307 Query: 175 XXXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVS 354 Q RKGLGKR++D KA + G YSSVQS+ V Sbjct: 308 KIWEEEQFRKGLGKRMDDGSSIGVVSTSAPVVQSVPQPKATYSAMAG---YSSVQSVPV- 363 Query: 355 
DGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSA 534 G +IGG + G + +SI +AE+AKKAL E++ +LKESH RT+ SL KT+ENLS+ Sbjct: 364 -GPSIGGAI--GASQGSNVMSIKAQAEIAKKALEENVMKLKESHGRTMLSLTKTDENLSS 420 Query: 535 SLSKVTMLENSLSAAGDK----------GPYIEELEDQMQKLHXXXXXXXXXXXXXDNDD 684 SL +T LE SLSAA +K P IEELE++MQK+H D DD Sbjct: 421 SLLNITALEKSLSAADEKYKGMEIGSVKAPLIEELEEEMQKIHEQRASATLERRSAD-DD 479 Query: 685 EMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVELDEFGRDKNLQ 864 EM E+EAAV AA + S+ GS+ R+ +L V+LDEFGRD NLQ Sbjct: 480 EMMEVEAAVKAAMSIFSKEGSSAEIIAAAKSAAQAATTAEREQTNLPVKLDEFGRDMNLQ 539 Query: 865 KRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXXXAYQSNRDQLL 1044 KR D + KR+S+++ DS+++ IEG AY +R +L Sbjct: 540 KRRDMKGRSEAHQHRKRRYESKRLSSMEVDSTHRTIEGESSTDESDSESNAYHKHRQLVL 599 Query: 1045 QVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 + + Q+F DA EEYS+LS+V E+F+ WK DYASSYRDAYMSLS P IF Sbjct: 600 ETAAQVFSDAAEEYSKLSLVKERFEEWKTDYASSYRDAYMSLSAPAIF 647 >ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cicer arietinum] Length = 916 Score = 251 bits (641), Expect = 4e-64 Identities = 170/421 (40%), Positives = 225/421 (53%), Gaps = 25/421 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEEPEF+ RI +GEK + G++GVFED V+E+ + DGRF G Sbjct: 233 SDEEPEFRGRIALFGEKGEGGKKGVFED-VDERGV---DGRFNG-GGDVVVEEEDEEEKM 287 Query: 181 XXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSI---DV 351 Q RKGLGKR+++ Q+ F ATVY +V ++ Sbjct: 288 WEEEQFRKGLGKRMDE---GPGRVSGGDVSVVQVAQQPKFVVPSAATVYGAVPNVVAAAA 344 Query: 352 SDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLS 531 S +IGG + P+LD +SIS++AE+A+KAL +++ RLKESH RT++SLNKT+ENLS Sbjct: 345 SVSTSIGGAI--PATPALDVISISQQAEIARKALLDNVRRLKESHGRTMSSLNKTDENLS 402 Query: 532 ASLSKVTMLENSLSAAGD---------------------KGPYIEELEDQMQKLHXXXXX 648 ASL +T LENSL A + K YIEELEDQM+KLH Sbjct: 403 ASLLNITDLENSLVVADEKYRFMQKLRNYVTNICDFLQHKAFYIEELEDQMKKLHEDRAS 462 Query: 649 
XXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSV 828 + DDEM E+EAAV AA VLSR G N +RK D V Sbjct: 463 AIFEKRATNIDDEMVEVEAAVKAAMSVLSRKGDN---LEAARSAAQDAFSAVRKQRDFPV 519 Query: 829 ELDEFGRDKNLQKRMD-TTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXX 1005 +LDEFGRD NL+KRM D ++++++ D K+EG Sbjct: 520 QLDEFGRDLNLEKRMKMKVMAEARQRRKSKAFDSNKLASMEVDD--HKVEGESSTDESDS 577 Query: 1006 XXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVI 1185 AYQS RD +LQ +++IF DA EEYSQLS+V K + WK++Y SSY DAY+SLS+P+I Sbjct: 578 ESQAYQSQRDLVLQAADEIFSDASEEYSQLSLVKNKMEEWKREYFSSYNDAYISLSLPLI 637 Query: 1186 F 1188 F Sbjct: 638 F 638 >ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Citrus sinensis] Length = 913 Score = 248 bits (634), Expect = 3e-63 Identities = 169/425 (39%), Positives = 224/425 (52%), Gaps = 30/425 (7%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRR--GVFEDF-VEEK-----AMLKKDGRFGGFXXXXXXX 156 SDEEPEF +R+ +GE+ SG++ GVFED V+E A ++ D + Sbjct: 236 SDEEPEFPRRVAMFGERTASGKKKKGVFEDDDVDEDERPVVARVENDYEY---------- 285 Query: 157 XXXXXXXXXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSV 336 QVRKGLGKR++D Q+ ++ Sbjct: 286 --VDEDVMWEEEQVRKGLGKRIDDGSVRVGANTSSSVAMPQQQQQFSY------------ 331 Query: 337 QSIDVSDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKT 516 S V+ +IGG + G LD +SI++KAE A KAL ++ RLKESH RT++SL KT Sbjct: 332 -STTVTPIPSIGGAI--GASQGLDTMSIAQKAESAMKALQTNVNRLKESHARTMSSLKKT 388 Query: 517 EENLSASLSKVTMLENSLSAAG---------------------DKGPYIEELEDQMQKLH 633 +E+LS+SL K+T LE+SLSAAG DK PYIE LE +MQKL+ Sbjct: 389 DEDLSSSLLKITDLESSLSAAGEKFIFMQKLRDYVSVICDFLQDKAPYIETLEAEMQKLN 448 Query: 634 XXXXXXXXXXXXXDNDDEMKELEAAVNAARQVLS-RGGSNXXXXXXXXXXXXXXXXXMRK 810 DNDDEM E+EAA+ AA V+ RG S +++ Sbjct: 449 KERASAILERRAADNDDEMTEVEAAIKAATLVIGDRGNSASKLIAASSAAQAAAAAAVKE 508 Query: 811 GGDLSVELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXX 990 +L V+LDEFGRD NLQKR D D+K++S++ D S QK+EG Sbjct: 509 QTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKLEGESTT 568 Query: 991 
XXXXXXXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSL 1170 AYQSNR++LL+ +E IF DA EEYSQLSVV E+F++WK+DY+SSYRDAYMSL Sbjct: 569 DESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYRDAYMSL 628 Query: 1171 SIPVI 1185 S P I Sbjct: 629 STPAI 633 >ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 916 Score = 247 bits (630), Expect = 8e-63 Identities = 165/418 (39%), Positives = 222/418 (53%), Gaps = 22/418 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEEPEF+ RI +GEK+D G++GVFE+ VEE+ + D RF G Sbjct: 235 SDEEPEFRGRIAMFGEKVDGGKKGVFEE-VEERRV---DLRFKGGEEEVLDDDDDEEEKM 290 Query: 181 XXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVSDG 360 Q RKGLGKR+++ + NF A VY +V S S Sbjct: 291 WEEEQFRKGLGKRMDEGSARVDVAAAAVQGAQL---QHNFVVPSAAKVYGAVPSAAASVS 347 Query: 361 HTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSASL 540 +IGG + LP LD + IS++AE A+KAL E++ RLKESH RT++SL+KT+ENLSASL Sbjct: 348 PSIGGAI--ESLPVLDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKTDENLSASL 405 Query: 541 SKVTMLENSLSAAGD---------------------KGPYIEELEDQMQKLHXXXXXXXX 657 +T LENSL A + K YIEELE+QM+KLH Sbjct: 406 LNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKACYIEELEEQMKKLHQDRASAIF 465 Query: 658 XXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVELD 837 +NDDEM E+E AV AA VL + G+N +RK DL V+LD Sbjct: 466 ERRATNNDDEMVEVEEAVKAAMSVLIKKGNN---MEAAKIAAQEAFAAVRKQRDLPVKLD 522 Query: 838 EFGRDKNLQKRMD-TTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXXX 1014 EFGRD NL+KRM+ ++++++ D KIEG Sbjct: 523 EFGRDLNLEKRMNMKVRAEACQRKRSLAFGYNKVTSMEWDD--HKIEGESSTDESDSESQ 580 Query: 1015 AYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 AYQS D +LQ +++IF DA EEY QLS+V + + WK++Y+S+Y+DAYMSLS+P+IF Sbjct: 581 AYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKREYSSTYKDAYMSLSLPLIF 638 >ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citrus clementina] gi|557551111|gb|ESR61740.1| hypothetical protein CICLE_v10014191mg [Citrus clementina] Length = 913 Score = 246 
bits (628), Expect = 1e-62 Identities = 167/425 (39%), Positives = 223/425 (52%), Gaps = 30/425 (7%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRR--GVFEDF-VEEK-----AMLKKDGRFGGFXXXXXXX 156 SDEEPEF +R+ +GE+ SG++ GVFED V+E A ++ D + Sbjct: 236 SDEEPEFPRRVAMFGERTASGKKKKGVFEDDDVDEDERPVVARVENDYEY---------- 285 Query: 157 XXXXXXXXXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSV 336 QVRKGLGKR++D Q+ ++ + Sbjct: 286 --VDEDVMWEEEQVRKGLGKRIDDSSVRVGANTSSSVAMPQQQQQFSYPTT--------- 334 Query: 337 QSIDVSDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKT 516 V+ +IGG + G LD +SI++KAE A KAL ++ RLKESH RT++SL KT Sbjct: 335 ----VTPIPSIGGAI--GASQGLDTMSIAQKAESAMKALQTNVNRLKESHARTMSSLKKT 388 Query: 517 EENLSASLSKVTMLENSLSAAG---------------------DKGPYIEELEDQMQKLH 633 +E+LS+SL K+T LE+SLSAAG DK PYIE LE +MQKL+ Sbjct: 389 DEDLSSSLLKITDLESSLSAAGERFIFMQKLRDYVSVICDFLQDKAPYIETLEAEMQKLN 448 Query: 634 XXXXXXXXXXXXXDNDDEMKELEAAVNAARQVLS-RGGSNXXXXXXXXXXXXXXXXXMRK 810 DNDDEM E+EAA+ AA + RG S +++ Sbjct: 449 KERASAILERRAADNDDEMTEVEAAIKAATLFIGDRGNSASKLTAASSAAQAAAAAAIKE 508 Query: 811 GGDLSVELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXX 990 +L V+LDEFGRD NLQKR D D+K++S++ D S QK+EG Sbjct: 509 QTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKLEGESTT 568 Query: 991 XXXXXXXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSL 1170 AYQSNR++LL+ +E IF DA EEYSQLSVV E+F++WK+DY+SSYRDAYMSL Sbjct: 569 DESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYRDAYMSL 628 Query: 1171 SIPVI 1185 S P I Sbjct: 629 STPAI 633 >gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus notabilis] Length = 952 Score = 243 bits (620), Expect = 1e-61 Identities = 160/422 (37%), Positives = 219/422 (51%), Gaps = 26/422 (6%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAM-LKKDGRFGGFXXXXXXXXXXXXXX 177 SDEEPE Q RI +GEK + ++GVFED ++++ + L R G Sbjct: 264 SDEEPENQTRIAMFGEKAEGPKKGVFEDDIDDRGIELGLLRRKQGVLEENHEDDEDEEDK 323 Query: 178 XXXXXQVRKGLGK-RLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVS 354 Q RKGLGK R++D + F +S G+ 
SI + Sbjct: 324 IWEEEQFRKGLGKTRIDD------GGKNSVVPVVKRETQQKFVSSVGSQTLPPSASIGGT 377 Query: 355 DGHTIGG---GVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEEN 525 G + GG G+ +G +P S++AE+A A+ +++ RLKE+HD+ + SLNK ++N Sbjct: 378 FGGSSGGSSTGLGLGMMP------FSQQAEIALNAIDDNVRRLKETHDQDLVSLNKADKN 431 Query: 526 LSASLSKVTMLENSLSAAGDK---------------------GPYIEELEDQMQKLHXXX 642 LS SL +T LE SLSAA +K P+IEELEDQMQKLH Sbjct: 432 LSDSLLNITALEKSLSAADEKYKFTQKLRDFISIICDFLQHKAPFIEELEDQMQKLHEKH 491 Query: 643 XXXXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDL 822 +NDDEM E+EA VNAA + S+ GSN +R+ G+L Sbjct: 492 ASAIVERRTANNDDEMMEVEAEVNAAMSIFSKKGSNVDVVAAAKSAAQAASAALREQGNL 551 Query: 823 SVELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXX 1002 V+LDEFGRD NLQKRM+ D KR+S++ D YQ++EG Sbjct: 552 PVKLDEFGRDMNLQKRMEMKGRAEARQCRKARFDSKRLSSMDVDGPYQRMEGESSTDESD 611 Query: 1003 XXXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPV 1182 A++S+R+ LLQ + IF DA EEYSQLSVV E+F+ WK++Y+S+Y DAYMSLS P Sbjct: 612 SESTAFESHRELLLQTAAHIFSDASEEYSQLSVVKERFEEWKREYSSTYSDAYMSLSAPS 671 Query: 1183 IF 1188 IF Sbjct: 672 IF 673 >ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis] gi|223548165|gb|EEF49657.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis] Length = 885 Score = 243 bits (620), Expect = 1e-61 Identities = 163/428 (38%), Positives = 216/428 (50%), Gaps = 32/428 (7%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSG--RRGVFEDF---------VEEKAMLKKDGRFGGFXXXX 147 SDEEPEF+ RI G+K ++ VF+DF EE + +D Sbjct: 215 SDEEPEFRNRIAMIGKKDNTTPTTHAVFQDFDNGNDSHVIAEETVVNDEDEE-------- 266 Query: 148 XXXXXXXXXXXXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVY 327 Q RK LGKR++D F +T+ Sbjct: 267 --------DKIWEEEQFRKALGKRMDDPSSSTPSL---------------FPTPSTSTIT 303 Query: 328 SSVQSIDVSDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASL 507 ++ TIGG G P LDALS+ +++ +A+KAL +++ RLKESH+RTV+SL Sbjct: 304 TTNNHRHSHIVPTIGGAF--GPTPGLDALSVPQQSHIARKALLDNLTRLKESHNRTVSSL 361 Query: 508 
NKTEENLSASLSKVTMLENSLSAAGDK---------------------GPYIEELEDQMQ 624 K +ENLSASL +T LE SLSAAG+K PYIEELE+QMQ Sbjct: 362 TKADENLSASLMNITALEKSLSAAGEKFIFMQKLRDFVSVICEFLQHKAPYIEELEEQMQ 421 Query: 625 KLHXXXXXXXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXM 804 LH DNDDEM E++ A+ AA++V S GSN M Sbjct: 422 TLHEQRASAILERRTADNDDEMMEVKTALEAAKKVFSARGSNEAAITAAMNAAQDASASM 481 Query: 805 RKGGDLSVELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXX 984 ++ +L V+LDEFGRD N QKR+D K++S+++ D S QK+EG Sbjct: 482 KEQINLPVKLDEFGRDINQQKRLDMKRRAEARQRRKAQ---KKLSSVEVDGSNQKVEGES 538 Query: 985 XXXXXXXXXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYM 1164 AYQSNRD LLQ ++QIFGDA EEY QLSVV ++F+ WKK+Y++SYRDAYM Sbjct: 539 STDESDSESAAYQSNRDLLLQTADQIFGDASEEYCQLSVVKQRFENWKKEYSTSYRDAYM 598 Query: 1165 SLSIPVIF 1188 S+S P IF Sbjct: 599 SISAPAIF 606 >gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus vulgaris] Length = 882 Score = 237 bits (605), Expect = 6e-60 Identities = 161/419 (38%), Positives = 219/419 (52%), Gaps = 23/419 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEEPEF+ RI +GEK++ G++GVFE+ VEE+ + D RF Sbjct: 221 SDEEPEFRGRIAMFGEKVEGGKKGVFEE-VEERRV---DVRF-----KEEEEDDDEEEKM 271 Query: 181 XXXXQVRKGLGKRLED-KXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVSD 357 Q RKGLGKR+++ A N+G T+ S Sbjct: 272 WEEEQFRKGLGKRMDEGSARVDVPVVQGAQQHKYVVPSAAVPNAGFGTIES--------- 322 Query: 358 GHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSAS 537 +P+LD LS+S++AE AKKAL E++ RLKESH RT++SL+KT+ENLSAS Sbjct: 323 ------------MPALDVLSLSQQAESAKKALVENVRRLKESHGRTMSSLSKTDENLSAS 370 Query: 538 LSKVTMLENSLSAAGDK---------------------GPYIEELEDQMQKLHXXXXXXX 654 L +T LENSL A DK YIEELE+Q++KLH Sbjct: 371 LLNITALENSLVVADDKYRFMQKLRNYVTNICDFLQHKAFYIEELEEQIKKLHGDRATAI 430 Query: 655 XXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVEL 834 +NDDE+ E+EAAV AA VL++ G+N +RK DL V+L Sbjct: 431 FEKRTTNNDDEIVEVEAAVKAAMSVLNKKGNNMEAAKSAAQEAYTA---VRKQKDLPVKL 487 Query: 835 
DEFGRDKNLQKRMDTTXXXXXXXXXXXXN-DVKRMSAIKCDSSYQKIEGXXXXXXXXXXX 1011 DEFGRD NL+KRM D ++++++ D KIEG Sbjct: 488 DEFGRDLNLEKRMQMKMRAVARQRKRSQLFDSNKLTSMELDD--HKIEGESSTDESDSES 545 Query: 1012 XAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 AY+S RD +LQ +++IFGDA EEY QLS+V + + WK+DY+SSY+DAYMSLS+P++F Sbjct: 546 QAYESQRDLVLQAADEIFGDASEEYGQLSLVKRRMEEWKRDYSSSYKDAYMSLSLPLVF 604 >gb|EPS73173.1| hypothetical protein M569_01583, partial [Genlisea aurea] Length = 765 Score = 234 bits (597), Expect = 5e-59 Identities = 167/426 (39%), Positives = 214/426 (50%), Gaps = 30/426 (7%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKID-SGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXX 177 SDEEPEF+ RIGF+ +K +RGVFED +E++AM + RF Sbjct: 260 SDEEPEFRGRIGFFADKAGVHDKRGVFED-LEQRAMPRD--RF----VESGSDAEDEEDK 312 Query: 178 XXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVY--------SS 333 QVRKGLGKRL + N SG TV+ S Sbjct: 313 MWEEEQVRKGLGKRLGNGVGGKGVT-------------VNIAGSGLTTVHHLGGPQPTSG 359 Query: 334 VQSIDVSDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNK 513 I S+G + V LD++SIS++A++AKK L ++ RLKESH +T A L+K Sbjct: 360 HSIIASSNGDRVSDAASVVGSWGLDSMSISQQADLAKKTLTTNLARLKESHRQTKALLDK 419 Query: 514 TEENLSASLSKVTMLENSLSAAGDK---------------------GPYIEELEDQMQKL 630 +ENLS+SL +VT LENSLSA+ +K PYIEELE+QMQKL Sbjct: 420 NDENLSSSLQRVTTLENSLSASEEKFLFMQKLREFVSVICEFLQHKAPYIEELEEQMQKL 479 Query: 631 HXXXXXXXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRK 810 H DNDDEM E++ A ++L GGSN Sbjct: 480 HEEQARAIEERRQADNDDEMSEIQMA---RARLLKGGGSNAATAAAGHD----------- 525 Query: 811 GGDLSVELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXX 990 D +ELDEFGRD NLQK+MD D KR A+ S Q++EG Sbjct: 526 --DAPMELDEFGRDMNLQKKMDVARRSKSRQRRRARADAKRKLALDRSGSPQEMEGELST 583 Query: 991 XXXXXXXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSL 1170 A+QS+R +LL+V+++IF DA +EYSQ +VVEKF+RWK YASSYRDAYMSL Sbjct: 584 DESETESRAHQSSRSELLRVADKIFSDAADEYSQFQIVVEKFERWKSRYASSYRDAYMSL 643 Query: 1171 SIPVIF 1188 S P IF Sbjct: 644 SAPAIF 649 >ref|XP_004298307.1| PREDICTED: 
GC-rich sequence DNA-binding factor 1-like [Fragaria vesca subsp. vesca] Length = 914 Score = 234 bits (597), Expect = 5e-59 Identities = 158/420 (37%), Positives = 219/420 (52%), Gaps = 24/420 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SDEEPEF+ RI +GEK+++ ++GVFED + G GG Sbjct: 232 SDEEPEFRNRIAMFGEKMEN-KKGVFED-------VDDTGVDGGLRRESVVVEDDEDEEE 283 Query: 181 XXXX--QVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQ-KANFGNSGGATVYSSVQSIDV 351 Q RKGLGKR+++ Q KA++ + G YS QS+ Sbjct: 284 KIWEEEQFRKGLGKRVDNDGASLGVSASVPRVHSAAPQPKASYNSIAG---YSLAQSL-- 338 Query: 352 SDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLS 531 + +IGG G +ALSI++++E+A+KAL E++ +LKESH RT SL K E+LS Sbjct: 339 AGVASIGGAT--GASQGSNALSINEQSEIAQKALLENVRKLKESHGRTKMSLTKANESLS 396 Query: 532 ASLSKVTMLENSLSAAG---------------------DKGPYIEELEDQMQKLHXXXXX 648 ASL +T LE SLSAA DK P IEELE++MQK Sbjct: 397 ASLLNITDLEKSLSAADEKYKFMQELRDFVSTICDFLQDKAPLIEELEEEMQKQRDERAS 456 Query: 649 XXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSV 828 DNDDEM E+EAAVNAA + S+ G++ +R+ +L V Sbjct: 457 AIFERRIADNDDEMMEVEAAVNAAMSIFSKEGTSAGVIAVAKSAAQAASAAVREQKNLPV 516 Query: 829 ELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXX 1008 +LDEFGRD NL+KR+D + KR S++ DS + +EG Sbjct: 517 KLDEFGRDMNLKKRLDMKGRAEARQRRRKRYEAKRESSMDVDSPDRTVEGESSTDESDGE 576 Query: 1009 XXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 Y+S+R +L ++Q+F DA EEYSQLS+V E+F++WK++Y SSYRDAYMSLS+P+IF Sbjct: 577 SKEYESHRQLVLGTADQVFSDAAEEYSQLSLVKERFEKWKREYRSSYRDAYMSLSVPIIF 636 >ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [Amborella trichopoda] gi|548841232|gb|ERN01295.1| hypothetical protein AMTR_s00002p00252610 [Amborella trichopoda] Length = 946 Score = 234 bits (596), Expect = 7e-59 Identities = 159/417 (38%), Positives = 216/417 (51%), Gaps = 21/417 (5%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGEKIDSGRRGVFEDFVEEKAMLKKDGRFGGFXXXXXXXXXXXXXXX 180 SD+E EFQ RI GE +S R+GVFE+ E+ LK++ R Sbjct: 272 
SDDESEFQGRIALLGEGNNSSRKGVFENADEKVFELKREER------ETEVDDDDEEDKK 325 Query: 181 XXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSIDVSDG 360 Q RK LGKR++D Q + + SGG+ Y S VS+ Sbjct: 326 WEEEQFRKALGKRMDDNSNRGSVQSVASAGSVKAVQSSVY--SGGS--YHGASSGLVSN- 380 Query: 361 HTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENLSASL 540 + VG S++ ++ S++AEVA +AL +SM RLKESHDRT++S+ +T+ NLSASL Sbjct: 381 ------LGVGVTRSVEFMTTSQQAEVATQALRDSMARLKESHDRTISSIVRTDNNLSASL 434 Query: 541 SKVTMLENSLSAAG---------------------DKGPYIEELEDQMQKLHXXXXXXXX 657 S + LE SLSAAG DK P+IEELE+QMQ+LH Sbjct: 435 SNIIDLEKSLSAAGEKYLFMQKLRDFVSVICDFLQDKAPFIEELEEQMQRLHEERASAIV 494 Query: 658 XXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLSVELD 837 D+ DEM E+EAAVNAA V ++GGS ++ +L VELD Sbjct: 495 QRRADDDADEMAEIEAAVNAAISVFNKGGS----VSSAASAAQAASLAAKEQSNLPVELD 550 Query: 838 EFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXXXXXXA 1017 EFGRD NLQKRMD+ ++ KR+ + SSYQ+IEG A Sbjct: 551 EFGRDVNLQKRMDSKRRAEARKRRKAWSESKRIRTVGDGSSYQRIEGESSTDESDSDSTA 610 Query: 1018 YQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIF 1188 Y+S+ D+LLQ + +IF DA +E+S LSVV +F+ WK+ Y +YRDAYMS++ IF Sbjct: 611 YRSSCDELLQTASEIFSDAADEFSNLSVVKVRFEGWKRQYLPTYRDAYMSMNASAIF 667 >ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] gi|355512167|gb|AES93790.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] Length = 892 Score = 229 bits (584), Expect = 2e-57 Identities = 157/422 (37%), Positives = 219/422 (51%), Gaps = 26/422 (6%) Frame = +1 Query: 1 SDEEPEFQQRIGFYG-EKIDSGRRGVFE---DFVEEKAMLKKDGRFGGFXXXXXXXXXXX 168 SDEEPE++ RI +G +K D ++GVFE + ++ + ++DG + Sbjct: 222 SDEEPEYRGRIAMFGGKKGDGEKKGVFEVADERFDDVVVDEEDGLW-------------- 267 Query: 169 XXXXXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQKANFGNSGGATVYSSVQSID 348 Q +KGLGKR ++ Q+ NF A VY +V ++ Sbjct: 268 -----EEEQFKKGLGKRRDE--GSARVGGGGEVPVVQAAQQPNFVGPSVANVYGAVPNVV 320 Query: 349 VSDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESMGRLKESHDRTVASLNKTEENL 528 + G + 
P LD +SIS++AE+AKKA+ +++ RLKESH RT++SLNKT+ENL Sbjct: 321 AAASANTSIGGAIPATPVLDVISISQQAEIAKKAMLDNIRRLKESHGRTMSSLNKTDENL 380 Query: 529 SASLSKVTMLENSLSAAGD---------------------KGPYIEELEDQMQKLHXXXX 645 SASL K+T LE+SL A + K YIEELEDQM+KLH Sbjct: 381 SASLLKITDLESSLVVADEKYRFMQKLRNYISNICDFLQHKAYYIEELEDQMKKLHEDRA 440 Query: 646 XXXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXXXXXXXXXXXXXXXMRKGGDLS 825 +NDDEM E+EAAV AA VLSR G N +RK D Sbjct: 441 SAIFEKRATNNDDEMVEVEAAVKAAMLVLSRKGDN---VEAARSAAQDAFAAVRKQRDFP 497 Query: 826 VELDEFGRDKNLQKRMD-TTXXXXXXXXXXXXNDVKRMSAIKCDSSYQKIEGXXXXXXXX 1002 V+LDEFGRD NL+KR D K+ ++++ D K+EG Sbjct: 498 VQLDEFGRDLNLEKRKQMKVMAEARQRRRSKAFDSKKSASMEIDD--HKVEGESSTDESD 555 Query: 1003 XXXXAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPV 1182 AYQS RD +LQ +++IF DA EEYSQLS+V + + WK++Y+SSY +AY+SLS+P+ Sbjct: 556 SESQAYQSQRDLVLQAADEIFSDASEEYSQLSLVKTRMEEWKREYSSSYNEAYISLSLPL 615 Query: 1183 IF 1188 IF Sbjct: 616 IF 617 >ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] gi|550332058|gb|ERP57180.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] Length = 972 Score = 229 bits (583), Expect = 2e-57 Identities = 170/445 (38%), Positives = 213/445 (47%), Gaps = 49/445 (11%) Frame = +1 Query: 1 SDEEPEFQQRIGFYGE--KIDSGRRGVF-------EDFVEEKAMLKK------------- 114 SDEEPEF+ RI G K + GVF ED +++++ K Sbjct: 259 SDEEPEFRTRIAMIGTMTKDTATHGGVFDAAADDDEDDDDDRSIKAKALAMMGTHHHHAV 318 Query: 115 --DGRFGGFXXXXXXXXXXXXXXXXXXXQVRKGLGKRLEDKXXXXXXXXXXXXXXXXXXQ 288 DG Q RKGLGKR++D Sbjct: 319 VDDGNVAA-AASVVHDEEDEEDRIWEEEQFRKGLGKRMDDASAPIANRALASTAGAAASS 377 Query: 289 KANFGNSGGATV-YSSVQSIDVSDGHTIGGGVFVGELPSLDALSISKKAEVAKKALYESM 465 T Y S+ SI GG F G LD LSI ++A++AKKAL +++ Sbjct: 378 TIPMQPQQRPTPGYGSIPSI---------GGAF-GSSQGLDVLSIPQQADIAKKALQDNL 427 Query: 466 GRLKESHDRTVASLNKTEENLSASLSKVTMLENSLSAAGDK------------------- 588 RLKESH RT++ L+KT+ENLSASL VT LE S+SAAG+K Sbjct: 428 RRLKESHGRTISLLSKTDENLSASLMNVTALEKSISAAGEKFIFMQKLRDFVSVICEFLQ 487 Query: 589 
--GPYIEELEDQMQKLHXXXXXXXXXXXXXDNDDEMKELEAAVNAARQVLSRGGSNXXXX 762 IEELE++MQKLH DN+DEM E+EAAV AA V S G++ Sbjct: 488 HKATLIEELEERMQKLHEEQASLILERRTADNEDEMMEVEAAVKAAMSVFSARGNSAATI 547 Query: 763 XXXXXXXXXXXXXMRKGGDLSVELDEFGRDKNLQKRMDTTXXXXXXXXXXXXNDVKRMSA 942 ++ +L V+LDEFGRD NLQKRMD D KR+S Sbjct: 548 DAAKSAAAAALVALKDQANLPVKLDEFGRDINLQKRMDMEKRAKARQRRKARFDSKRLSY 607 Query: 943 IKCDSSYQKIEGXXXXXXXXXXXX---AYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEK 1113 ++ DSS QKIEG AYQS RD LL+ +E+IF DA EEYSQLSVV E+ Sbjct: 608 MEVDSSDQKIEGELSTDESDSDSEKNAAYQSTRDLLLRTAEEIFSDASEEYSQLSVVKER 667 Query: 1114 FDRWKKDYASSYRDAYMSLSIPVIF 1188 F+ WKK+Y +SYRDAYMSLS P IF Sbjct: 668 FETWKKEYFASYRDAYMSLSAPAIF 692