BLASTX nr result
ID: Rehmannia23_contig00000215
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00000215 (3044 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   803   0.0
ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding fact...   799   0.0
ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...   788   0.0
gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein,...   755   0.0
ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Popu...   746   0.0
ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding fact...   739   0.0
ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding fact...   739   0.0
gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus nota...   734   0.0
ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding fact...   725   0.0
ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   715   0.0
ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citr...   714   0.0
ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   703   0.0
gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus pe...   700   0.0
ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   686   0.0
ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding fact...   686   0.0
ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putativ...   686   0.0
gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus...   685   0.0
ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   674   0.0
ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like pro...
673 0.0 ref|NP_196472.1| GC-rich sequence DNA-binding factor-like protei... 657 0.0 >ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Solanum tuberosum] Length = 939 Score = 803 bits (2075), Expect = 0.0 Identities = 478/959 (49%), Positives = 579/959 (60%), Gaps = 17/959 (1%) Frame = -1 Query: 3026 SAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDEXX 2847 S KSRNFRRR G + +LLSFADD+D Sbjct: 2 SGKSRNFRRRGGDDGD---DDETSAKTTNGTAAKPTTTASATKPKKKSLLSFADDEDSDD 58 Query: 2846 XXXXXXXXXXXXXXXXXXXXXXXXXXXXK----DRIGPHHPSSSLPSNVQPQAGVYTKEA 2679 DRI P PS + SNVQPQAG YTKEA Sbjct: 59 TPFVRPSSKPSSASSRITKPSSSSSAHKLTSGKDRITPKPPSFT--SNVQPQAGTYTKEA 116 Query: 2678 LLELQKNTKTL----AAPARNXXXXXXXXXXXVLKGLIKPVISNDLDIGTTGRSQNLGDD 2511 LLELQKNT+TL +A + VLKGL+KP S + T Q DD Sbjct: 117 LLELQKNTRTLVGSRSAQPKPEPRPGPVEPVIVLKGLVKPPFS--VTAQTQQNGQESEDD 174 Query: 2510 DMSFDQKGKDLRVVRDDASSRLKDLELGPGSRE-DKEG--MPDQAMIEAIKAKRERLRQA 2340 +M DQ G + +RL + L SR+ D G +PD+ I+AI+AKRERLRQA Sbjct: 175 EMDVDQFGGTV--------NRLGSMALEKDSRKKDDVGSVIPDKMTIDAIRAKRERLRQA 226 Query: 2339 KAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKE 2160 + AA D+IALD G NHGEAEGLSDEEPEF+ RIGF+GEKIG ++GVF+DFED+AM K+ Sbjct: 227 RPAAQDFIALDEGGNHGEAEGLSDEEPEFQQRIGFYGEKIGS-GRRGVFEDFEDKAMQKD 285 Query: 2159 RGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXS-----FGYL 1995 G +KMWE EQVRKGLGKRLDD + FG Sbjct: 286 GGFRSDDDEEDEEEKMWEEEQVRKGLGKRLDDGSNRGVMSSVVSSAAAVQNVQKANFGSS 345 Query: 1994 GTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQESH 1815 G S V+ VQ++DV L +D +SI ++AE+AKKAL E++ R++ESH Sbjct: 346 AVGAS-VYSSVQSIDVSDGPTIGGGVVGGLPSLDALSISKKAEVAKKALYESMGRLKESH 404 Query: 1814 GRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIE 1635 GRT+ SL KT AGEK++FMQKLR+FVSVIC LQ K P+IE Sbjct: 405 GRTVTSLHKTEENLSASLSKVTTLENSLSAAGEKYMFMQKLRDFVSVICALLQDKGPYIE 464 Query: 1634 ELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXXX 1455 ELE+QMQKLH A+DNDDE+ E+E A+ AAR L +GG N Sbjct: 465 
ELEDQMQKLHEERAAAILERRAADNDDEMKELEAAVSAARQVLSRGGSNAATIEAATAAA 524 Query: 1454 XXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRKLTMENDNSI 1275 P+ELDEFGRD NLQKRMD T D KR ++ D+S Sbjct: 525 QTSTAAMRKGGDLPIELDEFGRDKNLQKRMDTTRRAEARKRRRVKNDVKRMSAIKCDSSY 584 Query: 1274 QQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYAS 1095 Q++ QLL V++++F DA EEYSQ S+VVE+F+RWKKDYAS Sbjct: 585 QKIEGESSTDESDSESTAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYAS 644 Query: 1094 SYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLP-EDENEISEX 918 SYRDAYMSLSIP IFSPYVRLELLKWDPLHE+ DF+DM WH+ LF+YG+P E E EIS Sbjct: 645 SYRDAYMSLSIPVIFSPYVRLELLKWDPLHENTDFMDMNWHNSLFSYGIPPEGEAEIS-- 702 Query: 917 XXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSAL 738 N+IP+LVEKLAIPILH+QLA CWD+LST ET AVSAM LV+RY S SAL Sbjct: 703 --VDDTDVNLIPQLVEKLAIPILHNQLANCWDMLSTSETVCAVSAMRLVLRYGPFSGSAL 760 Query: 737 GDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILA 558 +LI VLRDRL AV +L VPTW L M+AVP+AARVAAYRFG S+RL+RNICL+++I A Sbjct: 761 SNLIAVLRDRLADAVANLKVPTWDTLVMRAVPDAARVAAYRFGMSIRLIRNICLFHEIFA 820 Query: 557 LPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQ 378 +PVLE++ LD+LLSGK+LPHL SI SN+HDA+ RTERVV SLHGVW GP TGD S KL+ Sbjct: 821 MPVLEELVLDQLLSGKILPHLRSIQSNIHDAVTRTERVVTSLHGVWAGPKATGDFSPKLR 880 Query: 377 PLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 PLVDYLL + + LEKKH SS+ E +T + RRLKKMLVELN+YD+AR +SRTFN+KEAL Sbjct: 881 PLVDYLLSLARVLEKKHSSSSGEIDTSKFARRLKKMLVELNQYDYARDISRTFNIKEAL 939 >ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Solanum lycopersicum] Length = 941 Score = 799 bits (2064), Expect = 0.0 Identities = 458/866 (52%), Positives = 555/866 (64%), Gaps = 13/866 (1%) Frame = -1 Query: 2759 DRIGPHHPSSSLPSNVQPQAGVYTKEALLELQKNTKTL----AAPARNXXXXXXXXXXXV 2592 DRI P +S SNVQPQAG YTKEALLELQKNT+TL ++ + V Sbjct: 94 DRITPK--PTSFTSNVQPQAGTYTKEALLELQKNTRTLVGSRSSQPKPEPRPGPVEPVIV 151 Query: 2591 LKGLIKPVISNDLDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSRE 2412 
LKGL+KP S G+ DD+M DQ G + +RL + L SR+ Sbjct: 152 LKGLVKPPFSVSAQTQQNGKESE--DDEMDVDQFGGTV--------NRLGSMALEKDSRK 201 Query: 2411 -DKEG--MPDQAMIEAIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRI 2241 D G +PD+ I+AI+AKRERLRQA+ AA D+IALD G NHGEAEGLSDEEPEF+ RI Sbjct: 202 KDDVGSVIPDKMTIDAIRAKRERLRQARPAAQDFIALDEGGNHGEAEGLSDEEPEFQQRI 261 Query: 2240 GFFGEKIGGPDKKGVFDDFEDRAMPKERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDX 2061 GF+GEKIG +KGVF+DF+D+A+ K+ G DKMWE EQVRKGLGKRLDD Sbjct: 262 GFYGEKIGS-GRKGVFEDFDDKALQKDGGFRSDDDEEDEEDKMWEEEQVRKGLGKRLDDG 320 Query: 2060 XXXXXXXXXXXXXXXXXS-----FGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGI 1896 + FG G S V+ VQ++DV L + Sbjct: 321 SNRGVMSSVVSSAAAVQNAQKANFGSSAVGAS-VYSSVQSIDVSDGPTIGGGVVGGLPSL 379 Query: 1895 DVMSIPQQAELAKKALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGE 1716 D +SI +AE+AKKAL E++ R++ESHGRT+ SL KT AGE Sbjct: 380 DALSISMKAEVAKKALYESMGRLKESHGRTVTSLHKTEENLSASLSKVTTLENSLSAAGE 439 Query: 1715 KFLFMQKLREFVSVICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIE 1536 K++FMQKLR+FVSVIC LQ K P+IEELE+QMQKLH A+DNDDE+ E+E Sbjct: 440 KYMFMQKLRDFVSVICALLQDKGPYIEELEDQMQKLHEERAAAILERRAADNDDEMKELE 499 Query: 1535 QAIIAARTELRKGGGNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDIT 1356 A+ AAR L +GG N PVELDEFGRD NLQKRMD T Sbjct: 500 AAVSAARQVLSRGGSNAATIEAATAAAQTSTAAMRKGGDLPVELDEFGRDKNLQKRMDTT 559 Query: 1355 XXXXXXXXXXXXADSKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEV 1176 D KR ++ D+S Q++ QLL V++++ Sbjct: 560 RRAEARKRRRMKNDVKRMSAIKCDSSYQKIEGESSTDESDSESTAYQSNRDQLLQVSEQI 619 Query: 1175 FSDAAEEYSQFSIVVERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDA 996 F DA EEYSQ S+VVE+F+RWKKDYASSYRDAYMSLSIP IFSPYVRLELLKWDPLHE+ Sbjct: 620 FGDAHEEYSQLSVVVEKFDRWKKDYASSYRDAYMSLSIPVIFSPYVRLELLKWDPLHENT 679 Query: 995 DFIDMKWHSLLFNYGL-PEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDI 819 DF+DM WH+ LF+YG+ PE E EIS N+IP+LVEKLAIPILH+QLA CWD+ Sbjct: 680 DFMDMNWHNSLFSYGISPEGETEISADDTDV----NLIPQLVEKLAIPILHNQLANCWDM 735 Query: 818 LSTRETTYAVSAMNLVIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPN 639 LST ET AVSAM LV+RY S SAL +LI 
VLRDRL AV +L VPTW L M+AVP+ Sbjct: 736 LSTSETVCAVSAMRLVLRYGPFSGSALSNLIAVLRDRLADAVANLKVPTWDTLVMRAVPD 795 Query: 638 AARVAAYRFGTSVRLMRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIV 459 AARVAAYRFG S+RL+RNICL+++I A+PVLE++ LD+LLSGK++PHL SI SN+HDA+ Sbjct: 796 AARVAAYRFGMSIRLIRNICLFHEIFAMPVLEELVLDQLLSGKIVPHLRSIQSNIHDAVT 855 Query: 458 RTERVVASLHGVWTGPNVTGDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRL 279 RTERVV SLHGVW GP TGD S KL+PLVDYLL + + LEKKH SS+ E ET + RRL Sbjct: 856 RTERVVTSLHGVWAGPKATGDCSPKLRPLVDYLLSLARVLEKKHSSSSGEIETSKFARRL 915 Query: 278 KKMLVELNEYDHARALSRTFNLKEAL 201 KKMLVELN+YD+AR +SRTFN+KEAL Sbjct: 916 KKMLVELNQYDYARDISRTFNIKEAL 941 >ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis vinifera] Length = 913 Score = 788 bits (2036), Expect = 0.0 Identities = 450/860 (52%), Positives = 546/860 (63%), Gaps = 7/860 (0%) Frame = -1 Query: 2759 DRIGPHHPSSSLPSNVQPQAGVYTKEALLELQKNTKTLAA--PARNXXXXXXXXXXXVLK 2586 DR+ P S+SLPSNVQPQAG YTKEAL ELQKNT+TLA+ PA + LK Sbjct: 102 DRLTPS--SASLPSNVQPQAGTYTKEALRELQKNTRTLASSRPASSEPKPSLEPVIV-LK 158 Query: 2585 GLIKPVISNDLDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDK 2406 GL+KP+ + + +N+ ++ S D+ G+D Sbjct: 159 GLVKPISAAE---DAVIDEENVEEEPESKDKGGRD------------------------- 190 Query: 2405 EGMPDQAMIEAIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGE 2226 +PDQA I AI+AKRERLRQ++AAAPDYI+LDGGSNHG AEGLSDEEPEF+GRI FGE Sbjct: 191 -SIPDQATINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIAMFGE 249 Query: 2225 KIGGPDKKGVFDDFEDRAMPKERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXX 2046 K KKGVF+D ++R M + +K+WE EQ RKGLGKR+DD Sbjct: 250 KPES-GKKGVFEDVDERGMEGGFKKDAHDSDDEEEEKIWEEEQFRKGLGKRMDDGSSRVV 308 Query: 2045 XXXXXXXXXXXXS-FGYLG----TGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSI 1881 F Y T GV P+ L G D MS+ Sbjct: 309 SSSVPVVQKVQQQKFMYSSVTAYTSVPGVSAPLN----------IGGAVGPLPGFDAMSL 358 Query: 1880 PQQAELAKKALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFM 1701 QQAELAKKAL+ENLRR++ESHGRTM SL +T AGEKF+FM Sbjct: 359 
SQQAELAKKALHENLRRLKESHGRTMSSLTRTDENLSSSLSNITTLEKSLTAAGEKFIFM 418 Query: 1700 QKLREFVSVICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIA 1521 Q LR+FVSVIC+FLQHKAPFIEELEEQMQKLH A+DND E+ EI+ ++ A Sbjct: 419 QXLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASAILERRAADND-EMMEIQASVDA 477 Query: 1520 ARTELRKGGGNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXX 1341 A + K G N PV+LDE+GRD+NLQK MD Sbjct: 478 AMSVFTKSGSNEAMVAAARTAAQAASAAMREQTNLPVKLDEYGRDINLQKCMDKNRRSEA 537 Query: 1340 XXXXXXXADSKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAA 1161 D+KR +EN++S Q++ LL A+++F DAA Sbjct: 538 RQRKRDRWDAKRMTFLENESSHQKIEGESSTDESDSETTAYQSNRDLLLQTAEQIFGDAA 597 Query: 1160 EEYSQFSIVVERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDM 981 EEYSQ S V ER ERWKK Y+SSYRDAYMSLS+PAIFSPYVRLELLKWDPL+E+ADF DM Sbjct: 598 EEYSQLSAVKERIERWKKQYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEEADFDDM 657 Query: 980 KWHSLLFNYGLPEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRET 801 KWHSLLFNYGL ED N+ S N++PELVE++A+PILHH+LA+CWDI STRET Sbjct: 658 KWHSLLFNYGLSEDGNDFSPDDADA----NLVPELVERVALPILHHELAHCWDIFSTRET 713 Query: 800 TYAVSAMNLVIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAA 621 AVSA NLVIRY+ SS ALG+L+ V+ RL KA+T+ MVP W+ L MKAVPNAARVAA Sbjct: 714 KNAVSATNLVIRYIPASSEALGELLAVVHKRLYKALTNFMVPPWNILVMKAVPNAARVAA 773 Query: 620 YRFGTSVRLMRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVV 441 YRFG S+RLMRNICLW ILALPVLEK+ LD+LLSG+VLPH+ +I S+VHDAI RTER++ Sbjct: 774 YRFGMSIRLMRNICLWKDILALPVLEKLVLDQLLSGQVLPHIENIASDVHDAITRTERII 833 Query: 440 ASLHGVWTGPNVTGDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVE 261 +SL GVW GP+VTG++S KLQPLVDY+L +GK LEK+H+ E++T RL RRLK+MLVE Sbjct: 834 SSLSGVWAGPSVTGERSNKLQPLVDYVLRLGKRLEKRHLPGVTESDTSRLARRLKRMLVE 893 Query: 260 LNEYDHARALSRTFNLKEAL 201 LNEYD AR +SRTF+LKEAL Sbjct: 894 LNEYDKARDISRTFHLKEAL 913 >gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] gi|508727414|gb|EOY19311.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] 
Length = 934 Score = 755 bits (1950), Expect = 0.0 Identities = 438/860 (50%), Positives = 536/860 (62%), Gaps = 16/860 (1%) Frame = -1 Query: 2732 SSLPSNVQPQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISNDL 2553 S+LPSNVQPQAG YTKEALLELQKN +TLAAP+ + VLKGL+KP Sbjct: 106 STLPSNVQPQAGTYTKEALLELQKNMRTLAAPS-SRASSVSSEPKIVLKGLLKP------ 158 Query: 2552 DIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEA 2373 +SQNL + + + ++ +DD SRL + G G D PDQA I+A Sbjct: 159 ------QSQNLNSERDNDPPE----KLQKDDTESRLATMAAGKGVDLDFSAFPDQATIDA 208 Query: 2372 IKAKRERLRQAKAA-APDYIALDGGSNHG---EAEGLSDEEPEFRGRIGFFGEKIGGPDK 2205 IKAK++R+R++ A APDYI+LD GSN G E E DEEPEF GR+ FGE K Sbjct: 209 IKAKKDRVRKSFARPAPDYISLDRGSNLGGAMEEELSDDEEPEFPGRL--FGES----GK 262 Query: 2204 KGVFDDFEDRAMP---KERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXX 2034 KGVF+ E+RA+ ++ GI +KMWE EQ RKGLGKR+DD Sbjct: 263 KGVFEVIEERAVGVGLRKDGIHDEDDDDNEEEKMWEEEQFRKGLGKRMDDSSNRVVSSSN 322 Query: 2033 XXXXXXXXS---------FGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSI 1881 +GY G+ G P + G+DV SI Sbjct: 323 NSGGVGMVHNMQQQHQQRYGYSTMGSYGSMMPSVSPAPPSSIVGAAGASQ---GLDVTSI 379 Query: 1880 PQQAELAKKALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFM 1701 QQAE+ KKAL EN+RR++ESH RT+ SL K AGEKF+FM Sbjct: 380 SQQAEITKKALQENVRRLKESHDRTISSLTKADENLSASLFNITALEKSLSAAGEKFIFM 439 Query: 1700 QKLREFVSVICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIA 1521 QKLR+FVSVICEFLQHKAP IEELEE MQKL+ +++NDDE+ E+E A+ A Sbjct: 440 QKLRDFVSVICEFLQHKAPLIEELEEHMQKLNEERALSVLERRSANNDDEMVEVEAAVTA 499 Query: 1520 ARTELRKGGGNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXX 1341 A + G + PV+LDEFGRDVN QK +D+ Sbjct: 500 AMLVFSECGNSAAMIEVAANAAQAAAAAIRGQVNLPVKLDEFGRDVNRQKHLDMERRAEA 559 Query: 1340 XXXXXXXADSKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAA 1161 DSKR +ME D+S Q++ LL ADE+F DA+ Sbjct: 560 RQRRKARFDSKRLSSMEIDSSYQKIEGESSTDESDSESTAYRSNRDMLLQTADEIFGDAS 619 Query: 1160 EEYSQFSIVVERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDM 981 EEYSQ S+V ERFERWKKDY+SSYRDAYMSLSIPAIFSPYVRLELLKWDPLH D DF 
DM Sbjct: 620 EEYSQLSLVKERFERWKKDYSSSYRDAYMSLSIPAIFSPYVRLELLKWDPLHVDEDFSDM 679 Query: 980 KWHSLLFNYGLPEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRET 801 KWH+LLFNYG PED + + N++P LVEK+A+P+LHH++++CWD+LS +ET Sbjct: 680 KWHNLLFNYGFPEDGSFAPDDADA-----NLVPALVEKVALPVLHHEISHCWDMLSMQET 734 Query: 800 TYAVSAMNLVIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAA 621 AVSA +L+I YV SS AL +L+ +R RL++AV D+MVPTWSPL MKAVPNAARVAA Sbjct: 735 KNAVSATSLIIDYVPASSEALAELLVTIRTRLSEAVADIMVPTWSPLVMKAVPNAARVAA 794 Query: 620 YRFGTSVRLMRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVV 441 YRFG SVRLMRNICLW +ILALP+LEK+ALDELL GK+LPH+ +I S+VHDA+ RTER+V Sbjct: 795 YRFGMSVRLMRNICLWKEILALPILEKLALDELLYGKILPHVRNITSDVHDAVTRTERIV 854 Query: 440 ASLHGVWTGPNVTGDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVE 261 ASL GVW G NV D SRKLQPLVDY+LL+GKTLE++H S E+ TG L RRLKKMLVE Sbjct: 855 ASLSGVWAGTNVIQDSSRKLQPLVDYVLLLGKTLERRHASGVTESGTGGLARRLKKMLVE 914 Query: 260 LNEYDHARALSRTFNLKEAL 201 LNEYD AR ++R F+LKEAL Sbjct: 915 LNEYDSARDIARRFHLKEAL 934 >ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] gi|550332058|gb|ERP57180.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa] Length = 972 Score = 746 bits (1925), Expect = 0.0 Identities = 455/984 (46%), Positives = 561/984 (57%), Gaps = 41/984 (4%) Frame = -1 Query: 3029 SSAKSRNFRRRAGXXXXXXD---NXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDD 2859 SS+KSRNFRRR D N LLSFA+D+ Sbjct: 3 SSSKSRNFRRRGDVDDEKTDANTNNTDTNAKATPSTTRKPPPPQSTKPKPKKLLSFAEDE 62 Query: 2858 -DEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSSLPSNVQPQAGVYTKE 2682 DE +DR+ P + SNVQPQAG YTKE Sbjct: 63 EDEQAVTRIPSSKSKPKPKPKPTSSSSHKLTVSQDRLPPTTSYLTTASNVQPQAGTYTKE 122 Query: 2681 ALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISN----DLDIGTTGRSQNLGD 2514 ALLELQ+NT+TLA + +LKGL+KP S + + + + Q+ D Sbjct: 123 ALLELQRNTRTLAKSTKTTTPASASEPKIILKGLLKPSFSPSPNPNPNYSSNHQQQDDAD 182 Query: 2513 DDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKA 2334 D + + KD DDA +RL + LG + +D PD+ I+ I+AKRERLRQ++A Sbjct: 183 
DQSEDENEDKDNGA--DDAQNRLASMGLGKSTSDDYSCFPDEDTIKKIRAKRERLRQSRA 240 Query: 2333 AAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKG-VFD--------DFE 2181 AAPDYI+LD GSNH G SDEEPEFR RI G G VFD D + Sbjct: 241 AAPDYISLDSGSNH--QGGFSDEEPEFRTRIAMIGTMTKDTATHGGVFDAAADDDEDDDD 298 Query: 2180 DRAMPKER--------------------GIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDX 2061 DR++ + VV D++WE EQ RKGLGKR+DD Sbjct: 299 DRSIKAKALAMMGTHHHHAVVDDGNVAAAASVVHDEEDEEDRIWEEEQFRKGLGKRMDDA 358 Query: 2060 XXXXXXXXXXXXXXXXXSFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSI 1881 G + T + P + S G+DV+SI Sbjct: 359 SAPIANRALASTA------GAAASSTIPMQPQQRPTPGYGSIPSIGGAFGSSQGLDVLSI 412 Query: 1880 PQQAELAKKALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFM 1701 PQQA++AKKAL +NLRR++ESHGRT+ L+KT AGEKF+FM Sbjct: 413 PQQADIAKKALQDNLRRLKESHGRTISLLSKTDENLSASLMNVTALEKSISAAGEKFIFM 472 Query: 1700 QKLREFVSVICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIA 1521 QKLR+FVSVICEFLQHKA IEELEE+MQKLH +DN+DE+ E+E A+ A Sbjct: 473 QKLRDFVSVICEFLQHKATLIEELEERMQKLHEEQASLILERRTADNEDEMMEVEAAVKA 532 Query: 1520 ARTELRKGGGNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXX 1341 A + G + PV+LDEFGRD+NLQKRMD+ Sbjct: 533 AMSVFSARGNSAATIDAAKSAAAAALVALKDQANLPVKLDEFGRDINLQKRMDMEKRAKA 592 Query: 1340 XXXXXXXADSKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQ---LLLVADEVFS 1170 DSKR ME D+S Q++ S LL A+E+FS Sbjct: 593 RQRRKARFDSKRLSYMEVDSSDQKIEGELSTDESDSDSEKNAAYQSTRDLLLRTAEEIFS 652 Query: 1169 DAAEEYSQFSIVVERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADF 990 DA+EEYSQ S+V ERFE WKK+Y +SYRDAYMSLS PAIFSPYVRLELLKWDPLHED+DF Sbjct: 653 DASEEYSQLSVVKERFETWKKEYFASYRDAYMSLSAPAIFSPYVRLELLKWDPLHEDSDF 712 Query: 989 IDMKWHSLLFNYGLPEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILST 810 DMKWHSLLFNYGLPED ++++ N++P LVEK+AIPIL+H++A+CWD+LST Sbjct: 713 FDMKWHSLLFNYGLPEDGSDLNPDDVDA----NLVPGLVEKIAIPILYHEIAHCWDMLST 768 Query: 809 RETTYAVSAMNLVIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAAR 630 +ET A+SA +LVI YV +S AL +L+ +R RL AV +VPTWS L +KAVP+AA+ Sbjct: 769 
QETKNAISATSLVINYVPATSEALSELLAAIRTRLADAVASTVVPTWSLLVLKAVPSAAQ 828 Query: 629 VAAYRFGTSVRLMRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTE 450 VAAYRFG SVRLMRNICLW ILALPVLEK+ LDELL GKVLPH+ SI SNVHDA+ RTE Sbjct: 829 VAAYRFGMSVRLMRNICLWKDILALPVLEKLVLDELLCGKVLPHVRSIASNVHDAVTRTE 888 Query: 449 RVVASLHGVWTGPNVTGD-QSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKK 273 R+VASL W GP+ T D S KLQPLVD++L IG TLEK+HVS ETET L RRLKK Sbjct: 889 RIVASLSRAWAGPSATSDHSSHKLQPLVDFILSIGMTLEKRHVSGVTETETSGLARRLKK 948 Query: 272 MLVELNEYDHARALSRTFNLKEAL 201 MLVELN+YD+AR ++RTF+LKEAL Sbjct: 949 MLVELNDYDNARDMARTFHLKEAL 972 >ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis sativus] Length = 889 Score = 739 bits (1909), Expect = 0.0 Identities = 419/847 (49%), Positives = 522/847 (61%), Gaps = 2/847 (0%) Frame = -1 Query: 2735 SSSLPSNVQPQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISND 2556 S+S+PSNVQPQAGVYTKEAL ELQKNT+TLA+ + VLKGL+KP Sbjct: 84 SASVPSNVQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPA-EQV 142 Query: 2555 LDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIE 2376 D + + DD+ D G + PDQA I Sbjct: 143 PDSAREAKESSSEDDEAGKDSSGSSI---------------------------PDQATIN 175 Query: 2375 AIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGV 2196 AI+AKRER+RQA AAPDYI+LD GSN LSDEE EF GRI G K+ KKGV Sbjct: 176 AIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLES-SKKGV 234 Query: 2195 FDDFEDRAMPKERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXX 2016 F++ +++ + R + +K+WE EQ RKGLGKR+DD Sbjct: 235 FEEVDEQGIDGARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVPSV 294 Query: 2015 XXSFGYLGT--GTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNE 1842 T G S V P V G+D +SI QQAE+AK A+ E Sbjct: 295 QPQNLIYPTTIGYSSV-PSVSTATSIGGSVSISQ------GLDGLSISQQAEIAKTAMQE 347 Query: 1841 NLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEF 1662 ++ R++ES+ RT MS+ KT AG+KF+FMQKLR+FVSVIC+F Sbjct: 348 SMGRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSAAGDKFIFMQKLRDFVSVICDF 407 Query: 1661 
LQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXX 1482 LQHKAPFIEELEEQMQKLH +DNDDE+ EIE A+ AA + L K G + Sbjct: 408 LQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNE 467 Query: 1481 XXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRK 1302 P +LDEFGRD+NLQKRMD+ DSKR Sbjct: 468 MITAATSAAQAAIALSREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRL 527 Query: 1301 LTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERF 1122 +ME D Q++ LL A+++FSDAAEE+SQ S+V +RF Sbjct: 528 ASMEVDGH-QKVEGESSTDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRF 586 Query: 1121 ERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPE 942 E WK+DY+++YRDAYMSLSIPAIFSPYVRLELLKWDPLHE ADF DM WHSLLFNYG+PE Sbjct: 587 EAWKRDYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPE 646 Query: 941 DENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRY 762 D ++ + N++PELVEK+A+PILHH++A+CWD+LSTRET A A +L+ Y Sbjct: 647 DGSDFAPNDADA----NLVPELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNY 702 Query: 761 VDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNI 582 V SS AL +L+ V+R RL+ A+ DL VPTW+ L KAVPNAAR+AAYRFG SVRLMRNI Sbjct: 703 VPPSSEALTELLVVIRTRLSGAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNI 762 Query: 581 CLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVT 402 CLW +I+ALP+LEK+AL+ELL GKVLPH+ SI +N+HDA+ RTER++ASL GVWTG + Sbjct: 763 CLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGII 822 Query: 401 GDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRT 222 GD+S KLQPLVDY+LL+G+TLEKKH+S E+ET L RRLKKMLVELNEYD+AR +++T Sbjct: 823 GDRSHKLQPLVDYVLLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKT 882 Query: 221 FNLKEAL 201 F+LKEAL Sbjct: 883 FHLKEAL 889 >ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis sativus] Length = 920 Score = 739 bits (1909), Expect = 0.0 Identities = 419/847 (49%), Positives = 526/847 (62%), Gaps = 2/847 (0%) Frame = -1 Query: 2735 SSSLPSNVQPQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISND 2556 S+S+PSNVQPQAGVYTKEAL ELQKNT+TLA+ 
+ VLKGL+KP Sbjct: 114 SASVPSNVQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPA---- 169 Query: 2555 LDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEG--MPDQAM 2382 +Q R ++ +S +D E G R+D G +PDQA Sbjct: 170 -------------------EQVPDSAREAKESSS---EDDEAG---RKDSSGSSIPDQAT 204 Query: 2381 IEAIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKK 2202 I AI+AKRER+RQA AAPDYI+LD GSN LSDEE EF GRI G K+ KK Sbjct: 205 INAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLES-SKK 263 Query: 2201 GVFDDFEDRAMPKERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXX 2022 GVF++ +++ + R + +K+WE EQ RKGLGKR+DD Sbjct: 264 GVFEEVDEQGIDGARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVP 323 Query: 2021 XXXXSFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNE 1842 T P + G+D +SI QQAE+AK A+ E Sbjct: 324 SVQPQNLIYPTTIGYSSVPSMSTATSIGGSVSISQ-----GLDGLSISQQAEIAKTAMQE 378 Query: 1841 NLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEF 1662 ++ R++ES+ RT MS+ KT AG+KF+FMQKLR+FVSVIC+F Sbjct: 379 SMGRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSAAGDKFMFMQKLRDFVSVICDF 438 Query: 1661 LQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXX 1482 LQHKAPFIEELEEQMQKLH +DNDDE+ EIE A+ AA + L K G + Sbjct: 439 LQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNE 498 Query: 1481 XXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRK 1302 P +LDEFGRD+NLQKRMD+ DSKR Sbjct: 499 MVTAATSAAQAAIALSREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRL 558 Query: 1301 LTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERF 1122 +ME D Q++ LL A+++FSDAAEE+SQ S+V +RF Sbjct: 559 ASMEVDGH-QKVEGESSTDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRF 617 Query: 1121 ERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPE 942 E WK+DY+++YRDAYMSLSIPAIFSPYVRLELLKWDPLHE ADF DM WHSLLFNYG+PE Sbjct: 618 EAWKRDYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPE 677 Query: 941 DENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRY 762 D ++ + N++PELVEK+A+PILHH++A+CWD+LSTRET A A +L+ Y Sbjct: 678 
DGSDFAPNDADA----NLVPELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNY 733 Query: 761 VDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNI 582 V SS AL +L+ V+R RL+ A+ DL VPTW+ L KAVPNAAR+AAYRFG SVRLMRNI Sbjct: 734 VPPSSEALTELLVVIRTRLSGAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNI 793 Query: 581 CLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVT 402 CLW +I+ALP+LEK+AL+ELL GKVLPH+ SI +N+HDA+ RTER++ASL GVWTG + Sbjct: 794 CLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGII 853 Query: 401 GDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRT 222 GD+S KLQPLVDY+LL+G+TLEKKH+S E+ET L RRLKKMLVELNEYD+AR +++T Sbjct: 854 GDRSHKLQPLVDYVLLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKT 913 Query: 221 FNLKEAL 201 F+LKEAL Sbjct: 914 FHLKEAL 920 >gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus notabilis] Length = 952 Score = 734 bits (1894), Expect = 0.0 Identities = 422/863 (48%), Positives = 528/863 (61%), Gaps = 14/863 (1%) Frame = -1 Query: 2747 PHHPSSSLPSNVQPQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPV 2568 P S SLPSNVQPQAG YTKEAL ELQKNT+TLA+ + LKGL+KP Sbjct: 121 PSSSSLSLPSNVQPQAGTYTKEALRELQKNTRTLASSKPSSEPVIV------LKGLLKP- 173 Query: 2567 ISNDLDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEG---- 2400 L D D + +D + L +E+G R+ Sbjct: 174 -------------SELAKSDWKLDSEEEDEPDELKERRGELASMEIGAKGRDRDNSSPEP 220 Query: 2399 -MPDQAMIEAIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEK 2223 +PDQA I AI+AKRERLRQ++AAAPD+IALD GSNHGEAEGLSDEEPE + RI FGEK Sbjct: 221 LIPDQATINAIRAKRERLRQSRAAAPDFIALDAGSNHGEAEGLSDEEPENQTRIAMFGEK 280 Query: 2222 IGGPDKKGVF-DDFEDRAMP-----KERGI--EVVSXXXXXXDKMWEAEQVRKGLGK-RL 2070 GP KKGVF DD +DR + +++G+ E DK+WE EQ RKGLGK R+ Sbjct: 281 AEGP-KKGVFEDDIDDRGIELGLLRRKQGVLEENHEDDEDEEDKIWEEEQFRKGLGKTRI 339 Query: 2069 DDXXXXXXXXXXXXXXXXXXSFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDV 1890 DD ++ + S PP + + + G+ + Sbjct: 340 DDGGKNSVVPVVKRETQQK----FVSSVGSQTLPP--SASIGGTFGGSSGGSSTGLGLGM 393 Query: 1889 MSIPQQAELAKKALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKF 
1710 M QQAE+A A+++N+RR++E+H + ++SL K A EK+ Sbjct: 394 MPFSQQAEIALNAIDDNVRRLKETHDQDLVSLNKADKNLSDSLLNITALEKSLSAADEKY 453 Query: 1709 LFMQKLREFVSVICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQA 1530 F QKLR+F+S+IC+FLQHKAPFIEELE+QMQKLH ++NDDE+ E+E Sbjct: 454 KFTQKLRDFISIICDFLQHKAPFIEELEDQMQKLHEKHASAIVERRTANNDDEMMEVEAE 513 Query: 1529 IIAARTELRKGGGNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXX 1350 + AA + K G N PV+LDEFGRD+NLQKRM++ Sbjct: 514 VNAAMSIFSKKGSNVDVVAAAKSAAQAASAALREQGNLPVKLDEFGRDMNLQKRMEMKGR 573 Query: 1349 XXXXXXXXXXADSKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFS 1170 DSKR +M+ D Q+M LL A +FS Sbjct: 574 AEARQCRKARFDSKRLSSMDVDGPYQRMEGESSTDESDSESTAFESHRELLLQTAAHIFS 633 Query: 1169 DAAEEYSQFSIVVERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADF 990 DA+EEYSQ S+V ERFE WK++Y+S+Y DAYMSLS P+IFSPYVRLELLKWDPLHE DF Sbjct: 634 DASEEYSQLSVVKERFEEWKREYSSTYSDAYMSLSAPSIFSPYVRLELLKWDPLHEKTDF 693 Query: 989 IDMKWHSLLFNYGLPEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILST 810 ++M WHSLL +YG+PED + N++PELVEK+A+ ILHH++ +CWD+LST Sbjct: 694 LNMNWHSLLMDYGVPEDGGGFAPDDADA----NLVPELVEKVALRILHHEIVHCWDMLST 749 Query: 809 RETTYAVSAMNLVIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAAR 630 ET AV+A +LV YV SS AL DL+ +R RL AV +L VPTWSP ++AVPNAAR Sbjct: 750 LETRNAVAATSLVTDYVPASSEALADLLVAIRTRLADAVANLTVPTWSPPVLQAVPNAAR 809 Query: 629 VAAYRFGTSVRLMRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTE 450 +AAYRFG SVRLM+NICLW +ILALPVLEK+ALDELL GKVLPH+ SI +NVHDAI RTE Sbjct: 810 LAAYRFGVSVRLMKNICLWKEILALPVLEKLALDELLCGKVLPHVRSIAANVHDAIPRTE 869 Query: 449 RVVASLHGVWTGPNVTGDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKM 270 ++VASL GVW GP+VTGD+SRKLQPLVDYL+L+ K LEKKH S E+ET L RRLKKM Sbjct: 870 KIVASLSGVWAGPSVTGDRSRKLQPLVDYLMLLRKILEKKHESGVTESETSGLARRLKKM 929 Query: 269 LVELNEYDHARALSRTFNLKEAL 201 LVELNEYD AR ++RTF+LKEAL Sbjct: 930 LVELNEYDKARDIARTFHLKEAL 952 >ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Fragaria vesca subsp. 
vesca] Length = 914 Score = 725 bits (1872), Expect = 0.0 Identities = 421/862 (48%), Positives = 528/862 (61%), Gaps = 17/862 (1%) Frame = -1 Query: 2735 SSSLPSNVQPQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISND 2556 S+SLPSNVQPQAG YTKEAL ELQKNT+TLA+ +R VL+G IKP ++ Sbjct: 108 SASLPSNVQPQAGTYTKEALRELQKNTRTLAS-SRTSSAAAAAEPTIVLRGSIKPADASI 166 Query: 2555 LDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIE 2376 D R + D++ Q+G K+ PDQA IE Sbjct: 167 ADAVNGARELDSDDEE----QQGS-------------------------KDRYPDQATIE 197 Query: 2375 AIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGV 2196 AI+ KRERLR++K AAPD+IALD GSNHG AEGLSDEEPEFR RI FGEK+ +KKGV Sbjct: 198 AIRKKRERLRKSKPAAPDFIALDSGSNHGAAEGLSDEEPEFRNRIAMFGEKM--ENKKGV 255 Query: 2195 FDDFEDRAMPK--ERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXX 2022 F+D +D + R VV +K+WE EQ RKGLGKR+D+ Sbjct: 256 FEDVDDTGVDGGLRRESVVVEDDEDEEEKIWEEEQFRKGLGKRVDNDGAS---------- 305 Query: 2021 XXXXSFGYLGTGTS--GVHPPVQNVDVXXXXXXXXXXXXSLFGI-------------DVM 1887 LG S VH SL G+ + + Sbjct: 306 --------LGVSASVPRVHSAAPQPKASYNSIAGYSLAQSLAGVASIGGATGASQGSNAL 357 Query: 1886 SIPQQAELAKKALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFL 1707 SI +Q+E+A+KAL EN+R+++ESHGRT MSL K A EK+ Sbjct: 358 SINEQSEIAQKALLENVRKLKESHGRTKMSLTKANESLSASLLNITDLEKSLSAADEKYK 417 Query: 1706 FMQKLREFVSVICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAI 1527 FMQ+LR+FVS IC+FLQ KAP IEELEE+MQK +DNDDE+ E+E A+ Sbjct: 418 FMQELRDFVSTICDFLQDKAPLIEELEEEMQKQRDERASAIFERRIADNDDEMMEVEAAV 477 Query: 1526 IAARTELRKGGGNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXX 1347 AA + K G + PV+LDEFGRD+NL+KR+D+ Sbjct: 478 NAAMSIFSKEGTSAGVIAVAKSAAQAASAAVREQKNLPVKLDEFGRDMNLKKRLDMKGRA 537 Query: 1346 XXXXXXXXXADSKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSD 1167 ++KR+ +M+ D+ + + +L AD+VFSD Sbjct: 538 EARQRRRKRYEAKRESSMDVDSPDRTVEGESSTDESDGESKEYESHRQLVLGTADQVFSD 597 Query: 1166 AAEEYSQFSIVVERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFI 987 AAEEYSQ S+V ERFE+WK++Y SSYRDAYMSLS+P IFSPYVRLELLKWDPL E+ DF+ 
Sbjct: 598 AAEEYSQLSLVKERFEKWKREYRSSYRDAYMSLSVPIIFSPYVRLELLKWDPLRENTDFV 657 Query: 986 DMKWHSLLFNYGLPEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTR 807 M WH LL NYG+PED ++ + N+IP LVEK+A+PILHHQ+ +CWDILSTR Sbjct: 658 KMSWHELLENYGVPEDGSDFASDDADA----NLIPALVEKVALPILHHQIVHCWDILSTR 713 Query: 806 ETTYAVSAMNLVIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARV 627 ET AV+A +LV YV SS AL DL+ +R RL AV+ LMVPTWSPL +KAVPNAAR+ Sbjct: 714 ETKNAVAATSLVTDYVS-SSEALEDLLVAIRTRLADAVSKLMVPTWSPLVLKAVPNAARI 772 Query: 626 AAYRFGTSVRLMRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTER 447 AAYRFG SVRLM+NICLW +ILALPVLEK+A++ELL GKV+PH+ SI ++VHDA+ RTER Sbjct: 773 AAYRFGMSVRLMKNICLWKEILALPVLEKLAINELLCGKVIPHIRSIAADVHDAVTRTER 832 Query: 446 VVASLHGVWTGPNVTGDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKML 267 V+ASL GVW+G +VTGD+SRKLQ LVDY+L +GKT+EKKH ++ETG L RRLKKML Sbjct: 833 VIASLSGVWSGSDVTGDRSRKLQSLVDYVLTLGKTIEKKHSLGVTQSETGGLARRLKKML 892 Query: 266 VELNEYDHARALSRTFNLKEAL 201 VELNEYD AR ++RTF+LKEAL Sbjct: 893 VELNEYDKARDVARTFHLKEAL 914 >ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Citrus sinensis] Length = 913 Score = 715 bits (1846), Expect = 0.0 Identities = 447/955 (46%), Positives = 545/955 (57%), Gaps = 11/955 (1%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDE 2853 MSS+++RNFRRRA D+ LLSFADD++E Sbjct: 1 MSSSRARNFRRRADDDEDNNDDNTPSAATTTATKKPPSSSKPKK------LLSFADDEEE 54 Query: 2852 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSS--LPSNVQPQAGVYTKEA 2679 K+R SSS L SNVQ QAG YT+E Sbjct: 55 KSEIPTSNRDRTRPSSRLSKPSSSHKITASKERQSSSATSSSTSLLSNVQAQAGTYTEEY 114 Query: 2678 LLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISNDLDIGTTGRSQNLGDDDMSF 2499 LLEL+KNTKTL AP+ L+G IKP SN T Q D Sbjct: 115 LLELRKNTKTLKAPSSKPPAEPVVV----LRGSIKPEDSN-----LTRVQQKPSRDSSDS 165 Query: 2498 DQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMP-DQAMIEAIKAKRERLRQAKAAAPD 2322 D K A + + LG G + G+ D+A I+AI+AK++RLRQ+ A APD Sbjct: 166 DSDHK--------AETEKRFASLGVGKIAVQSGVIYDEAEIKAIRAKKDRLRQSGAKAPD 217 Query: 2321 
YIALDGGSN--HGEAEGLSDEEPEFRGRIGFFGEKIG-GPDKKGVF--DDFEDRAMPKER 2157 YI LDGGS+ G+AEG SDEEPEF R+ FGE+ G KKGVF DD ++ P Sbjct: 218 YIPLDGGSSSLRGDAEGSSDEEPEFPRRVAMFGERTASGKKKKGVFEDDDVDEDERPVVA 277 Query: 2156 GIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXS--FGYLGTGT 1983 +E D MWE EQVRKGLGKR+DD F Y T T Sbjct: 278 RVEN-DYEYVDEDVMWEEEQVRKGLGKRIDDGSVRVGANTSSSVAMPQQQQQFSYSTTVT 336 Query: 1982 SGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTM 1803 P+ ++ G+D MSI Q+AE A KAL N+ R++ESH RTM Sbjct: 337 -----PIPSIGGAIGASQ---------GLDTMSIAQKAESAMKALQTNVNRLKESHARTM 382 Query: 1802 MSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIEELEE 1623 SL KT AGEKF+FMQKLR++VSVIC+FLQ KAP+IE LE Sbjct: 383 SSLKKTDEDLSSSLLKITDLESSLSAAGEKFIFMQKLRDYVSVICDFLQDKAPYIETLEA 442 Query: 1622 QMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELR-KGGGNXXXXXXXXXXXXXX 1446 +MQKL+ A+DNDDE+ E+E AI AA + +G Sbjct: 443 EMQKLNKERASAILERRAADNDDEMTEVEAAIKAATLVIGDRGNSASKLIAASSAAQAAA 502 Query: 1445 XXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRKLTMENDNSIQQM 1266 PV+LDEFGRD+NLQKR D+ D K+ +M+ D S Q++ Sbjct: 503 AAAVKEQTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKL 562 Query: 1265 XXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYASSYR 1086 +LL A+ +FSDAAEEYSQ S+V ERFE+WK+DY+SSYR Sbjct: 563 EGESTTDESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYR 622 Query: 1085 DAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISEXXXXX 906 DAYMSLS PAI SPYVRLELLKWDPLHEDADF +MKWH+LLFNYGLP+D + + Sbjct: 623 DAYMSLSTPAIMSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDFAHDDADA 682 Query: 905 XXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSALGDLI 726 N++P LVEK+A+PILHH +AYCWD+LSTRET AVSA LV+ YV SS AL DL+ Sbjct: 683 ----NLVPTLVEKVALPILHHDIAYCWDMLSTRETKNAVSATILVMAYVPTSSEALKDLL 738 Query: 725 TVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILALPVL 546 + RL +AV ++ VPTWS L M AVPNAAR+AAYRFG SVRLMRNICLW ++ ALP+L Sbjct: 739 VAIHTRLAEAVANIAVPTWSSLAMSAVPNAARIAAYRFGVSVRLMRNICLWKEVFALPIL 798 Query: 545 
EKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQPLVD 366 EK+ALDELL KVLPH+ SI SNVHDAI RTER+VASL GVW GP+VTG KLQPLVD Sbjct: 799 EKLALDELLCRKVLPHVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQPLVD 858 Query: 365 YLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 ++L + KTLEKKH+ E+ET L RRLKKMLVELNEYD+AR ++RTF+LKEAL Sbjct: 859 FMLSLAKTLEKKHLPGVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913 >ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citrus clementina] gi|557551111|gb|ESR61740.1| hypothetical protein CICLE_v10014191mg [Citrus clementina] Length = 913 Score = 714 bits (1842), Expect = 0.0 Identities = 445/955 (46%), Positives = 545/955 (57%), Gaps = 11/955 (1%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDE 2853 MSS+++RNFRRRA D+ LLSFADD++E Sbjct: 1 MSSSRARNFRRRADDDEDNNDDNTPSVATTTATKKPPSSSKPKK------LLSFADDEEE 54 Query: 2852 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSS--LPSNVQPQAGVYTKEA 2679 K+R SSS L SNVQ QAG YT+E Sbjct: 55 KSEIPTSNRDRTRPSSRLSKPSSSHKITASKERQSSSATSSSTSLLSNVQAQAGTYTEEY 114 Query: 2678 LLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISNDLDIGTTGRSQNLGDDDMSF 2499 LLEL+KNTKTL AP+ L+G IKP SN T Q D Sbjct: 115 LLELRKNTKTLKAPSSKPPAEPVVV----LRGSIKPEDSN-----LTRVQQKPSRDSSDS 165 Query: 2498 DQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMP-DQAMIEAIKAKRERLRQAKAAAPD 2322 D K A + + LG G + G+ D+A I+AI+AK++RLRQ+ A APD Sbjct: 166 DSDHK--------AETEKRFASLGVGKIAVQSGVIYDEAEIKAIRAKKDRLRQSGAKAPD 217 Query: 2321 YIALDGGSN--HGEAEGLSDEEPEFRGRIGFFGEKIG-GPDKKGVF--DDFEDRAMPKER 2157 YI LDGGS+ G+AEG SDEEPEF R+ FGE+ G KKGVF DD ++ P Sbjct: 218 YIPLDGGSSSLRGDAEGSSDEEPEFPRRVAMFGERTASGKKKKGVFEDDDVDEDERPVVA 277 Query: 2156 GIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXS--FGYLGTGT 1983 +E D MWE EQVRKGLGKR+DD F Y T T Sbjct: 278 RVEN-DYEYVDEDVMWEEEQVRKGLGKRIDDSSVRVGANTSSSVAMPQQQQQFSYPTTVT 336 Query: 1982 SGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTM 1803 P+ ++ G+D MSI Q+AE A KAL N+ R++ESH RTM Sbjct: 337 
-----PIPSIGGAIGASQ---------GLDTMSIAQKAESAMKALQTNVNRLKESHARTM 382 Query: 1802 MSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIEELEE 1623 SL KT AGE+F+FMQKLR++VSVIC+FLQ KAP+IE LE Sbjct: 383 SSLKKTDEDLSSSLLKITDLESSLSAAGERFIFMQKLRDYVSVICDFLQDKAPYIETLEA 442 Query: 1622 QMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELR-KGGGNXXXXXXXXXXXXXX 1446 +MQKL+ A+DNDDE+ E+E AI AA + +G Sbjct: 443 EMQKLNKERASAILERRAADNDDEMTEVEAAIKAATLFIGDRGNSASKLTAASSAAQAAA 502 Query: 1445 XXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRKLTMENDNSIQQM 1266 PV+LDEFGRD+NLQKR D+ D K+ +M+ D S Q++ Sbjct: 503 AAAIKEQTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKL 562 Query: 1265 XXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYASSYR 1086 +LL A+ +FSDAAEEYSQ S+V ERFE+WK+DY+SSYR Sbjct: 563 EGESTTDESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYR 622 Query: 1085 DAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISEXXXXX 906 DAYMSLS PAI SPYVRLELLKWDPLHEDADF +MKWH+LLFNYGLP+D + + Sbjct: 623 DAYMSLSTPAIMSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDFAHDDADA 682 Query: 905 XXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSALGDLI 726 N++P LVEK+A+PILHH +AYCWD+LSTRET VSA LV+ YV SS AL DL+ Sbjct: 683 ----NLVPTLVEKVALPILHHDIAYCWDMLSTRETKNVVSATILVMAYVPTSSEALKDLL 738 Query: 725 TVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILALPVL 546 + RL +AV ++ VPTWSPL M AVPN+AR+AAYRFG SVRLMRNICLW ++ ALP+L Sbjct: 739 VAIHTRLAEAVANIAVPTWSPLAMSAVPNSARIAAYRFGVSVRLMRNICLWKEVFALPIL 798 Query: 545 EKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQPLVD 366 EK+ALDELL KVLPH+ SI SNVHDAI RTER+VASL GVW GP+VTG KLQPLVD Sbjct: 799 EKLALDELLCRKVLPHVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQPLVD 858 Query: 365 YLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 ++L + KTLEKKH+ E+ET L RRLKKMLVELNEYD+AR ++RTF+LKEAL Sbjct: 859 FMLSLAKTLEKKHLPGVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913 >ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 913 Score = 703 bits (1815), Expect = 0.0 Identities 
= 427/962 (44%), Positives = 549/962 (57%), Gaps = 18/962 (1%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDE 2853 MS+AKSRNFRRR G N LLSFAD+D++ Sbjct: 1 MSTAKSRNFRRRGGDTESNDGNDGGTTTTTFPSKPTSSAKPKKKPQAPK-LLSFADEDEQ 59 Query: 2852 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXK-DRIGPHHPSSSLPSNVQPQAGVYTKEAL 2676 DRI H S S+PSNVQPQAG YTKEAL Sbjct: 60 TDENPRPRASKPYRSAATAKKPSSSHKITTLKDRIA-HSSSPSVPSNVQPQAGTYTKEAL 118 Query: 2675 LELQKNTKTLAAPARNXXXXXXXXXXXV-LKGLIKPVISNDLDIGTTGRSQNLGDDDMSF 2499 ELQKNT+TL + + + LKGL+KP+ S+ G D S Sbjct: 119 RELQKNTRTLVTSSSSRSDPKPSSEPVIVLKGLVKPL-----------GSEPQGRDSYS- 166 Query: 2498 DQKGKDLRVVRDDASSRLKDLELGPGSREDKEGM--PDQAMIEAIKAKRERLRQAKAAAP 2325 + R + +L ++KEG PD I AI+AKRERLRQA+ AAP Sbjct: 167 ------------EGEHREVEAKLATVGIQNKEGSFYPDDETIRAIRAKRERLRQARPAAP 214 Query: 2324 DYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMP---KERG 2154 DYI+LDGGSNHG AEGLSDEEPEFRGRI FGEK+ G KKGVF++ E+R M K Sbjct: 215 DYISLDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVDG-GKKGVFEEVEERIMDVRFKGGE 273 Query: 2153 IEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXS--------FGY 1998 EVV +KMWE EQ RKGLGKR+D+ +G Sbjct: 274 DEVVDDDDDDEEKMWEEEQFRKGLGKRMDEGSARVDVSVMQGSQSPHNFVVPSAAKVYGA 333 Query: 1997 LGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQES 1818 + + + V P + V SL +DV+ I QQAE A+KAL EN+RR++ES Sbjct: 334 VPSAAASVSPSIGGV------------IESLPALDVVPISQQAEAARKALLENVRRLKES 381 Query: 1817 HGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFI 1638 HGRTM SL+KT A EK+ FMQKLR +V+ IC+FLQHKA +I Sbjct: 382 HGRTMSSLSKTDENLSASLLNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKAFYI 441 Query: 1637 EELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXX 1458 EELEEQM+KLH A++NDDE+ E+E+A+ AA + L K G N Sbjct: 442 EELEEQMKKLHEDRALAISERRATNNDDEMIEVEEAVKAAMSVLSKKGNNMEAAKIAAQE 501 Query: 1457 XXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDI---TXXXXXXXXXXXXADSKRKLTMEN 1287 PV+LDEFGRD+NL+KRM++ T DS + +ME Sbjct: 502 AFSAVRKQRDL---PVKLDEFGRDLNLEKRMNMKAKTRSEACQRKRSQAFDSNKVTSMEL 558 Query: 1286 
DNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKK 1107 D+ ++ +L ADE+FSDA+EEY Q S+V R E WK+ Sbjct: 559 DD--HKIEGESSTDESDSESQAYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKR 616 Query: 1106 DYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEI 927 +++SSY+DAYMSLS+P IFSPYVRLELL+WDPLH DF +MKW+ LLF YGLPED + Sbjct: 617 EHSSSYKDAYMSLSLPLIFSPYVRLELLRWDPLHNGVDFQEMKWYKLLFTYGLPEDGKDF 676 Query: 926 SEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSS 747 ++P LVEK+A+PILH+++++CWD++S +ET A++A L++++V S Sbjct: 677 VHDDGDADL--ELVPNLVEKVALPILHYEISHCWDMVSQQETVNAIAATKLMVQHVSHES 734 Query: 746 SALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNK 567 AL DL+ ++ RL AV DL VPTWSP + AVP+AARVAAYRFG SVRL+RNICLW Sbjct: 735 EALADLLVSIQTRLADAVADLTVPTWSPSVLAAVPDAARVAAYRFGVSVRLLRNICLWKD 794 Query: 566 ILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSR 387 + ++PVLEK+ALDELL KVLPHL I NV DAI RTER++ASL G+W GP+V GD++R Sbjct: 795 VFSMPVLEKVALDELLCRKVLPHLRVISENVQDAITRTERIIASLSGIWAGPSVIGDKNR 854 Query: 386 KLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKE 207 KLQPLV Y+L +G+ LE+++V E +T L RRLKK+L +LNEYDHAR ++RTF+LKE Sbjct: 855 KLQPLVTYVLSLGRILERRNVP---ENDTSHLARRLKKILADLNEYDHARNMARTFHLKE 911 Query: 206 AL 201 AL Sbjct: 912 AL 913 >gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus persica] Length = 925 Score = 700 bits (1807), Expect = 0.0 Identities = 420/851 (49%), Positives = 525/851 (61%), Gaps = 6/851 (0%) Frame = -1 Query: 2735 SSSLPSNVQPQAGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPV--IS 2562 S+SLPSNVQPQAG YTKEAL ELQKNT+TLA+ + LKGL+KP IS Sbjct: 109 STSLPSNVQPQAGTYTKEALRELQKNTRTLASSRPSSEPTIV------LKGLVKPTGTIS 162 Query: 2561 NDLDIGTTGRSQNLGDDDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGM-PDQA 2385 + L R + +D+ ++ R +DDA +RL + G + G+ PDQA Sbjct: 163 DTL---REARELDSDNDEEQEKERASLFRRDKDDAEARLASM--GIDKAKGSSGLFPDQA 217 Query: 2384 MIEAIKAKRERLRQAKAAAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDK 2205 I AI+AKRERLR+++AAAPD+I+LD GSNHG AEGLSDEEPEFRGRI FG+ + G K Sbjct: 218 
TINAIRAKRERLRKSRAAAPDFISLDSGSNHGAAEGLSDEEPEFRGRIAIFGDNMEG-SK 276 Query: 2204 KGVFDDFEDRAMP---KERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXX 2034 KGVF+D +DRA +++ I+ +K+WE EQ RKGLGKR+DD Sbjct: 277 KGVFEDVDDRAADAVLRQKSIDR-DEDEDEEEKIWEEEQFRKGLGKRMDDGSSIGVVSTS 335 Query: 2033 XXXXXXXXSFGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKK 1854 + +G + VQ+V V G +VMSI QAE+AKK Sbjct: 336 APVVQSVPQPKATYSAMAG-YSSVQSVPVGPSIGGAIGASQ---GSNVMSIKAQAEIAKK 391 Query: 1853 ALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSV 1674 AL EN+ +++ESHGRTM+SL KT A EK+ M E SV Sbjct: 392 ALEENVMKLKESHGRTMLSLTKTDENLSSSLLNITALEKSLSAADEKYKGM----EIGSV 447 Query: 1673 ICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGG 1494 KAP IEELEE+MQK+H ++D DDE+ E+E A+ AA + K G Sbjct: 448 -------KAPLIEELEEEMQKIHEQRASATLERRSAD-DDEMMEVEAAVKAAMSIFSKEG 499 Query: 1493 GNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXAD 1314 + PV+LDEFGRD+NLQKR D+ + Sbjct: 500 SSAEIIAAAKSAAQAATTAEREQTNLPVKLDEFGRDMNLQKRRDMKGRSEAHQHRKRRYE 559 Query: 1313 SKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIV 1134 SKR +ME D++ + + +L A +VFSDAAEEYS+ S+V Sbjct: 560 SKRLSSMEVDSTHRTIEGESSTDESDSESNAYHKHRQLVLETAAQVFSDAAEEYSKLSLV 619 Query: 1133 VERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNY 954 ERFE WK DYASSYRDAYMSLS PAIFSPYVRLEL+KWDPL E DF++M WHSLL +Y Sbjct: 620 KERFEEWKTDYASSYRDAYMSLSAPAIFSPYVRLELVKWDPLREKTDFLNMSWHSLLADY 679 Query: 953 GLPEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNL 774 LPED ++ + N++P+LVEK+A+PIL HQ+ +CWDILSTRET AV+A ++ Sbjct: 680 NLPEDGSDFAPDDADA----NLVPDLVEKVALPILLHQVVHCWDILSTRETKNAVAATSV 735 Query: 773 VIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRL 594 V YV SS AL DL+ +R RL AVT+L VPTWSPL + AVPNAAR+AAYRFG SVRL Sbjct: 736 VTDYVPPSSEALADLLVAIRTRLADAVTNLTVPTWSPLVLTAVPNAARIAAYRFGLSVRL 795 Query: 593 MRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTG 414 M+NICLW +ILA PVLEK+A++ELL GKVLPH+ SI +NVHDAI RTER+VASL GVW G Sbjct: 796 
MKNICLWKEILAFPVLEKLAIEELLCGKVLPHVRSIAANVHDAITRTERIVASLSGVWAG 855 Query: 413 PNVTGDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARA 234 NVTGD+ RKLQ LVDY+L +G+TLEKKH ++E L RRLKKMLV+LNEYD AR Sbjct: 856 SNVTGDR-RKLQSLVDYVLSLGRTLEKKHSLGVTQSEISGLARRLKKMLVDLNEYDKARD 914 Query: 233 LSRTFNLKEAL 201 L+RTFNLKEAL Sbjct: 915 LTRTFNLKEAL 925 >ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 916 Score = 686 bits (1771), Expect = 0.0 Identities = 425/960 (44%), Positives = 550/960 (57%), Gaps = 16/960 (1%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDE 2853 MS+AKSRNFRRR G D+ LLSFADD+DE Sbjct: 1 MSTAKSRNFRRRGGDDTESNDDNDGDTTSTTLPSKPPSSAKPKKKPQAPKLLSFADDEDE 60 Query: 2852 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXK-DRIGPHHPSSSLPSNVQPQAGVYTKEAL 2676 DRI H S S+P+NVQPQAG YTKEAL Sbjct: 61 TDENPRPRASKPHRTAATAKKPSSSHKITTLKDRIA-HTSSPSVPTNVQPQAGTYTKEAL 119 Query: 2675 LELQKNTKTLAAPARNXXXXXXXXXXXV-LKGLIKPVISNDLDIGTTGRSQNLGDDDMSF 2499 ELQKNT+TL + + + + LKG +KP L T GR D Sbjct: 120 RELQKNTRTLVSSSSSRSDPKPSSEPVIVLKGHVKP-----LGPETQGR-------DSDS 167 Query: 2498 DQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAAPDY 2319 D +G+ V K +G ++ED PD+ I AI+AKRERLR A+ AAPDY Sbjct: 168 DSEGEHREV-------EAKLATVGIQNKEDSF-YPDEETIRAIRAKRERLRLARPAAPDY 219 Query: 2318 IALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMP---KERGIE 2148 I+LDGGSNHG AEGLSDEEPEFRGRI FGEK+ G KKGVF++ E+R + K E Sbjct: 220 ISLDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVDG-GKKGVFEEVEERRVDLRFKGGEEE 278 Query: 2147 VVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXS----------FGY 1998 V+ +KMWE EQ RKGLGKR+D+ +G Sbjct: 279 VLDDDDDEEEKMWEEEQFRKGLGKRMDEGSARVDVAAAAVQGAQLQHNFVVPSAAKVYGA 338 Query: 1997 LGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQES 1818 + + + V P + SL +DV+ I QQAE A+KAL EN+RR++ES Sbjct: 339 VPSAAASVSPSIGGA------------IESLPVLDVVPISQQAEAARKALLENVRRLKES 386 Query: 1817 HGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFI 1638 HGRTM SL+KT A EK+ FMQKLR +V+ 
IC+FLQHKA +I Sbjct: 387 HGRTMSSLSKTDENLSASLLNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKACYI 446 Query: 1637 EELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXX 1458 EELEEQM+KLH A++NDDE+ E+E+A+ AA + L K G N Sbjct: 447 EELEEQMKKLHQDRASAIFERRATNNDDEMVEVEEAVKAAMSVLIKKGNNMEAAKIAAQE 506 Query: 1457 XXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRKLT-MENDN 1281 PV+LDEFGRD+NL+KRM++ A K+T ME D+ Sbjct: 507 AFAAVRKQRDL---PVKLDEFGRDLNLEKRMNMKVRAEACQRKRSLAFGYNKVTSMEWDD 563 Query: 1280 SIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDY 1101 ++ +L ADE+FSDA+EEY Q S+V R E WK++Y Sbjct: 564 --HKIEGESSTDESDSESQAYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKREY 621 Query: 1100 ASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISE 921 +S+Y+DAYMSLS+P IFSPYVRLELL+WDPLH+ DF +MKW+ LLF YGLPED + Sbjct: 622 SSTYKDAYMSLSLPLIFSPYVRLELLRWDPLHKGVDFQEMKWYKLLFTYGLPEDGKDFVH 681 Query: 920 XXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSA 741 ++P LVEK+A+PILH+++++CWD+LS +ET A++A L++++V S A Sbjct: 682 DDGDADL--ELVPNLVEKVALPILHYEISHCWDMLSQQETVNAIAATKLIVQHVSHESEA 739 Query: 740 LGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKIL 561 L L+ +R RL AV +L VPTWS + AVP+AARVAAYRFG SVRL+RNI W + Sbjct: 740 LAGLLVSIRTRLADAVANLTVPTWSLPVLAAVPDAARVAAYRFGVSVRLLRNIGSWKDVF 799 Query: 560 ALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKL 381 ++ VLEK+ALDELL GKVLPHL I NV DAI RTER++ASL GVW+GP+V GD++RKL Sbjct: 800 SMAVLEKVALDELLCGKVLPHLRVISENVQDAITRTERIIASLSGVWSGPSVIGDKNRKL 859 Query: 380 QPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 QPLV Y+L +G+ LE+++V E++T L RRLKK+LV+LNEYDHAR+++RTF+LKEAL Sbjct: 860 QPLVTYVLSLGRILERRNVP---ESDTSHLARRLKKILVDLNEYDHARSMARTFHLKEAL 916 >ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cicer arietinum] Length = 916 Score = 686 bits (1770), Expect = 0.0 Identities = 424/954 (44%), Positives = 545/954 (57%), Gaps = 10/954 (1%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDD-D 2856 
MS+AKSRNFRRR D+ LLSFADD+ D Sbjct: 1 MSTAKSRNFRRR---NDTNEDDHADTSSTPSLPSKPSSSAPKPKKPQAPKLLSFADDEND 57 Query: 2855 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSSLPSNVQPQAGVYTKEAL 2676 KDRI H PS S SNVQPQAG YTKEAL Sbjct: 58 NENENPRPRSSKPHRSGVSKSSSSSHKITTHKDRIS-HSPSPSFLSNVQPQAGTYTKEAL 116 Query: 2675 LELQKNTKTLAAPARNXXXXXXXXXXXV----LKGLIKPVISNDLDIGTTGRSQNLGDDD 2508 ELQKNT+TL + + LKGL+KP S GR + D+ Sbjct: 117 RELQKNTRTLVTGSTSRPSSTSXXPSSEPVIVLKGLLKPASSEP-----QGRESDSEDEH 171 Query: 2507 MSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAA 2328 + K + + + S +PD+ I+AI+A+RERLRQA+ AA Sbjct: 172 KEVEAKFASVGIQNGNDSL-----------------IPDEETIKAIRARRERLRQARPAA 214 Query: 2327 PDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKE--RG 2154 DYI+LDGGSNHG AEGLSDEEPEFRGRI FGEK G KKGVF+D ++R + G Sbjct: 215 QDYISLDGGSNHGAAEGLSDEEPEFRGRIALFGEK-GEGGKKGVFEDVDERGVDGRFNGG 273 Query: 2153 IEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXSFGYLGTGTSGV 1974 +VV +KMWE EQ RKGLGKR+D+ ++ + V Sbjct: 274 GDVVVEEEDEEEKMWEEEQFRKGLGKRMDEGPGRVSGGDVSVVQVAQQP-KFVVPSAATV 332 Query: 1973 HPPVQNV--DVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTMM 1800 + V NV + +DV+SI QQAE+A+KAL +N+RR++ESHGRTM Sbjct: 333 YGAVPNVVAAAASVSTSIGGAIPATPALDVISISQQAEIARKALLDNVRRLKESHGRTMS 392 Query: 1799 SLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIEELEEQ 1620 SL KT A EK+ FMQKLR +V+ IC+FLQHKA +IEELE+Q Sbjct: 393 SLNKTDENLSASLLNITDLENSLVVADEKYRFMQKLRNYVTNICDFLQHKAFYIEELEDQ 452 Query: 1619 MQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXXXXXXXX 1440 M+KLH A++ DDE+ E+E A+ AA + L + G N Sbjct: 453 MKKLHEDRASAIFEKRATNIDDEMVEVEAAVKAAMSVLSRKGDNLEAARSAAQDAFSAVR 512 Query: 1439 XXXXXXXAPVELDEFGRDVNLQKRMDI-TXXXXXXXXXXXXADSKRKLTMENDNSIQQMX 1263 PV+LDEFGRD+NL+KRM + DS + +ME D+ ++ Sbjct: 513 KQRDF---PVQLDEFGRDLNLEKRMKMKVMAEARQRRKSKAFDSNKLASMEVDD--HKVE 567 Query: 1262 XXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYASSYRD 1083 +L ADE+FSDA+EEYSQ S+V + E WK++Y SSY D Sbjct: 568 
GESSTDESDSESQAYQSQRDLVLQAADEIFSDASEEYSQLSLVKNKMEEWKREYFSSYND 627 Query: 1082 AYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISEXXXXXX 903 AY+SLS+P IFSPYVRLELL+WDPLH+ DF +MKW+ LLF YGLPED + Sbjct: 628 AYISLSLPLIFSPYVRLELLRWDPLHKGLDFQEMKWYKLLFTYGLPEDGKDFVHDDGDAD 687 Query: 902 XXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSALGDLIT 723 ++P LVEK+A+PI H+++++CWD+LS +ET A+SA L++++V S AL +L+ Sbjct: 688 L--ELVPNLVEKVALPIFHYEISHCWDMLSQQETMNAISATKLIVQHVSHESEALAELLV 745 Query: 722 VLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILALPVLE 543 +R RL AV +L VPTWSPL + AVP+AARVAAYRFG SVRL+RNICLW I A+PVLE Sbjct: 746 SIRTRLADAVANLTVPTWSPLVLSAVPDAARVAAYRFGVSVRLLRNICLWKDIFAMPVLE 805 Query: 542 KIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQPLVDY 363 K+ALDELL KVLPH SI NVHDAI RTER++ASL GVW GP+VTGD++RKLQPLV Y Sbjct: 806 KLALDELLYDKVLPHFRSISENVHDAITRTERIIASLSGVWAGPSVTGDRNRKLQPLVVY 865 Query: 362 LLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 +L +G+ LE+++V E++T L RRLKK+LV+LNEYDHAR ++RTF+LKEAL Sbjct: 866 VLSLGRVLERRNVP---ESDTSYLARRLKKILVDLNEYDHARNMARTFHLKEAL 916 >ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis] gi|223548165|gb|EEF49657.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis] Length = 885 Score = 686 bits (1769), Expect = 0.0 Identities = 416/952 (43%), Positives = 523/952 (54%), Gaps = 9/952 (0%) Frame = -1 Query: 3029 SSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDEX 2850 +S+KSRNFRRR N LLSFADD++E Sbjct: 3 TSSKSRNFRRRGDENEDNESNSNTTNPSYSSRKSSSKPKK---------LLSFADDEEED 53 Query: 2849 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSSLPSNVQ------PQAGVYT 2688 DR+ +S+ +N PQAG YT Sbjct: 54 EETPRPSKQKPSKTKSSHKLTAPK------DRLSSSSTTSTTSTNTNSNNVLLPQAGTYT 107 Query: 2687 KEALLELQKNTKTLAAPARNXXXXXXXXXXXV--LKGLIKPVISNDLDIGTTGRSQNLGD 2514 KEALLELQK T+TLA P+ LKGL+KP + L N D Sbjct: 108 KEALLELQKKTRTLAKPSSKPPPPPPSSSEPKIILKGLLKPTLPQTL---------NQQD 158 Query: 2513 
DDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKA 2334 D D+ ++ D ED +PD+ I+ I+AKRERLRQ++A Sbjct: 159 ADPPQDE------IIID----------------EDYSLIPDEDTIKKIRAKRERLRQSRA 196 Query: 2333 AAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGG-PDKKGVFDDFEDRAMPKER 2157 APDYI+LDGG+ +A SDEEPEFR RI G+K P VF DF++ Sbjct: 197 TAPDYISLDGGAATSDA--FSDEEPEFRNRIAMIGKKDNTTPTTHAVFQDFDNGNDSHVI 254 Query: 2156 GIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXSFGYLGTGTSG 1977 E V DK+WE EQ RK LGKR+DD T ++ Sbjct: 255 AEETVVNDEDEEDKIWEEEQFRKALGKRMDDPSSSTP--------------SLFPTPSTS 300 Query: 1976 VHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTMMS 1797 N G+D +S+PQQ+ +A+KAL +NL R++ESH RT+ S Sbjct: 301 TITTTNNHRHSHIVPTIGGAFGPTPGLDALSVPQQSHIARKALLDNLTRLKESHNRTVSS 360 Query: 1796 LAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIEELEEQM 1617 L K AGEKF+FMQKLR+FVSVICEFLQHKAP+IEELEEQM Sbjct: 361 LTKADENLSASLMNITALEKSLSAAGEKFIFMQKLRDFVSVICEFLQHKAPYIEELEEQM 420 Query: 1616 QKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXXXXXXXXX 1437 Q LH +DNDDE+ E++ A+ AA+ G N Sbjct: 421 QTLHEQRASAILERRTADNDDEMMEVKTALEAAKKVFSARGSNEAAITAAMNAAQDASAS 480 Query: 1436 XXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRKLTMENDNSIQQMXXX 1257 PV+LDEFGRD+N QKR+D+ K+ ++E D S Q++ Sbjct: 481 MKEQINLPVKLDEFGRDINQQKRLDMKRRAEARQRRKA---QKKLSSVEVDGSNQKVEGE 537 Query: 1256 XXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYASSYRDAY 1077 LL AD++F DA+EEY Q S+V +RFE WKK+Y++SYRDAY Sbjct: 538 SSTDESDSESAAYQSNRDLLLQTADQIFGDASEEYCQLSVVKQRFENWKKEYSTSYRDAY 597 Query: 1076 MSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISEXXXXXXXX 897 MS+S PAIFSPYVRLELLKWDPLHEDA F MKWHSLL +YGLP+D +++S Sbjct: 598 MSISAPAIFSPYVRLELLKWDPLHEDAGFFHMKWHSLLSDYGLPQDGSDLSPEDADA--- 654 Query: 896 ANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSALGDLITVL 717 N++PELVEK+AIPILHH++A+CWD+LSTRET AV A NLV YV SS AL +L+ + Sbjct: 655 -NLVPELVEKVAIPILHHEIAHCWDMLSTRETKNAVFATNLVTDYVPASSEALAELLLAI 713 Query: 716 RDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILALPVLEKI 537 R 
RLT AV +MVPTWSP+E+KAVP AA++AAYRFG SVRLM+NICLW IL+LPVLEK+ Sbjct: 714 RTRLTDAVVSIMVPTWSPIELKAVPRAAQIAAYRFGMSVRLMKNICLWKDILSLPVLEKL 773 Query: 536 ALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQPLVDYLL 357 ALD+LL KVLPHL S+ SNVHDA+ RTER++ASL GVW G +VT +S KLQPLVD ++ Sbjct: 774 ALDDLLCRKVLPHLQSVASNVHDAVTRTERIIASLSGVWAGTSVTASRSHKLQPLVDCVM 833 Query: 356 LIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 +GK L+ KH A E E L RRLKKMLVELN+YD AR ++R F+L+EAL Sbjct: 834 SLGKRLKDKHPLGASEIEVSGLARRLKKMLVELNDYDKAREIARMFSLREAL 885 >gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus vulgaris] Length = 882 Score = 685 bits (1768), Expect = 0.0 Identities = 424/953 (44%), Positives = 535/953 (56%), Gaps = 9/953 (0%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDE 2853 MS+AKSRNFRRR G D LLSFADD++ Sbjct: 1 MSTAKSRNFRRRGGGDTEGNDEDGDTSTLSSKPPSSAKPKKPQAPK----LLSFADDEEN 56 Query: 2852 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSSLPSNVQPQAGVYTKEALL 2673 DRI PS +PSNVQPQAG YTKE L Sbjct: 57 ENPRPRSAKPQRSSKPSSAHKITTLK-----DRIASSSPS--VPSNVQPQAGTYTKETLR 109 Query: 2672 ELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISNDLDIGTTGRSQNLGDDDMSFDQ 2493 ELQKNT+TL + VLKGL+KPV S GR + D D Sbjct: 110 ELQKNTRTLVTSSSRSEPKPPGEPVIVLKGLVKPVASEP-----QGR-----ESDSEGDH 159 Query: 2492 KGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAAPDYIA 2313 K + +L L L G PD+ I+AI+AKRERLRQA+ AA DYI+ Sbjct: 160 K---------EVEGKLGGLGLHNGK---DSFFPDEETIKAIRAKRERLRQARPAAQDYIS 207 Query: 2312 LDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKERGIEVVSXX 2133 LDGGSNHG AEGLSDEEPEFRGRI FGEK+ G KKGVF++ E+R + E Sbjct: 208 LDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVEG-GKKGVFEEVEERRVDVRFKEE--EED 264 Query: 2132 XXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXSFGYLGTGTSGVHPPV-QN 1956 +KMWE EQ RKGLGKR+D+ G++ V PV Q Sbjct: 265 DDEEEKMWEEEQFRKGLGKRMDE-------------------------GSARVDVPVVQG 299 Query: 1955 VDVXXXXXXXXXXXXSLFG-------IDVMSIPQQAELAKKALNENLRRVQESHGRTMMS 1797 + FG +DV+S+ QQAE AKKAL EN+RR++ESHGRTM S Sbjct: 300 
AQQHKYVVPSAAVPNAGFGTIESMPALDVLSLSQQAESAKKALVENVRRLKESHGRTMSS 359 Query: 1796 LAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIEELEEQM 1617 L+KT A +K+ FMQKLR +V+ IC+FLQHKA +IEELEEQ+ Sbjct: 360 LSKTDENLSASLLNITALENSLVVADDKYRFMQKLRNYVTNICDFLQHKAFYIEELEEQI 419 Query: 1616 QKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXXXXXXXXX 1437 +KLH ++NDDE+ E+E A+ AA + L K G N Sbjct: 420 KKLHGDRATAIFEKRTTNNDDEIVEVEAAVKAAMSVLNKKGNNMEAAKSAAQEAYTAVRK 479 Query: 1436 XXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRKLT-MENDNSIQQMXX 1260 PV+LDEFGRD+NL+KRM + KLT ME D+ ++ Sbjct: 480 QKDL---PVKLDEFGRDLNLEKRMQMKMRAVARQRKRSQLFDSNKLTSMELDD--HKIEG 534 Query: 1259 XXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYASSYRDA 1080 +L ADE+F DA+EEY Q S+V R E WK+DY+SSY+DA Sbjct: 535 ESSTDESDSESQAYESQRDLVLQAADEIFGDASEEYGQLSLVKRRMEEWKRDYSSSYKDA 594 Query: 1079 YMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISEXXXXXXX 900 YMSLS+P +FSPYVRLELL+WDPLH+ DF +MKW+ LLF YGLPED + Sbjct: 595 YMSLSLPLVFSPYVRLELLRWDPLHKGIDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADL 654 Query: 899 XANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSALGDLITV 720 ++P LVEK+A+PIL +++++CWD+LS RET A++A L++++V S AL DL+ Sbjct: 655 --ELVPNLVEKVALPILQYEISHCWDMLSQRETMNAIAATKLIVQHVSRKSEALTDLLVS 712 Query: 719 LRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILALPVLEK 540 +R RL AV +L VPTWSP+ + AVP+AARVAAYRFG SVRL+RNICLW + + VLEK Sbjct: 713 IRTRLADAVANLKVPTWSPVVLVAVPDAARVAAYRFGVSVRLLRNICLWKDVFSTSVLEK 772 Query: 539 IALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQPLVDYL 360 +ALDELL GKVLPHL I NV DAI RTERV+ASL GVW GP+V GD+ KLQPL+ Y+ Sbjct: 773 LALDELLFGKVLPHLRIISENVQDAITRTERVIASLSGVWAGPSVIGDKKHKLQPLLTYV 832 Query: 359 LLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 L +G+ LE+++V E++T L RRLKK+LV+LNEYDHAR ++RTF+LKEAL Sbjct: 833 LSLGRILERRNVP---ESDTSYLARRLKKILVDLNEYDHARTMARTFHLKEAL 882 >ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-like isoform X1 [Glycine max] Length = 896 Score = 674 bits (1739), Expect = 0.0 Identities 
= 415/946 (43%), Positives = 534/946 (56%), Gaps = 2/946 (0%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDDDE 2853 MS+AKSRNFRRR G D+ LLSFADD++ Sbjct: 1 MSAAKSRNFRRRGGDTEANEDDGDTSTTFRSKPPSSAKPKKPQAPK----LLSFADDEE- 55 Query: 2852 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSSLPSNVQPQAGVYTKEALL 2673 KDRI SSS+ SNVQPQAG YTKEAL Sbjct: 56 ---ISNPRPRSSAKPQRPSKPSSSHKITTLKDRIAH---SSSVSSNVQPQAGTYTKEALR 109 Query: 2672 ELQKNTKTLAAPARNXXXXXXXXXXXV-LKGLIKPVISNDLDIGTTGRSQNLGDDDMSFD 2496 ELQKNT+TL + + + LKGL+KPV+S GR D Sbjct: 110 ELQKNTRTLVSSSTTTTTSSSRSEPVIVLKGLVKPVVSEP-----QGRHS---------D 155 Query: 2495 QKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKRERLRQAKAAAPDYI 2316 +G+ V +L L + G PD+ I+AI+AKRERLR+A+ AAPDYI Sbjct: 156 SEGEHKEV-----EGKLSSLGIQNGK---DSFFPDEETIKAIRAKRERLRKARPAAPDYI 207 Query: 2315 ALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDFEDRAMPKERGIEVVSX 2136 +LDGGSNHG AEGLSDEEPEFRGRI F EK G KKGVF++ E+R +E + Sbjct: 208 SLDGGSNHGAAEGLSDEEPEFRGRIAMFEEKGEGGGKKGVFEEVEERLRDEEENDD---- 263 Query: 2135 XXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXSFGYLGTGTSGVHPPVQN 1956 +KMWE EQ RKGLGKR+D+ GV P + Sbjct: 264 -DYEEEKMWEEEQFRKGLGKRMDEGAARVDVPVVQGAQQNKFVVSSAAAVYGGV--PSAD 320 Query: 1955 VDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKKALNENLRRVQESHGRTMMSLAKTXXX 1776 V S+ +DV+ + QQAE A+KAL EN+RR++ESH RTM SL+KT Sbjct: 321 ARVPSVSPSIGGATESMPALDVVPMSQQAERARKALVENVRRLKESHERTMSSLSKTDEN 380 Query: 1775 XXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIEELEEQMQKLHXXX 1596 A EK+ FMQKLR +VS +C+FLQHKA +IEELEEQM+KLH Sbjct: 381 LSASFLKITALENSLVVADEKYRFMQKLRNYVSNMCDFLQHKAFYIEELEEQMKKLHEDR 440 Query: 1595 XXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXXXXXXXXXXXXXXXA 1416 ++NDDE+ E+E A+ A + L K G N Sbjct: 441 ASAIFERRTTNNDDEMIEVEAAVKAVMSVLNKKGNNMEAAKSAAQEAFAAVRKQKDL--- 497 Query: 1415 PVELDEFGRDVNLQKRMDITXXXXXXXXXXXXADSKRKL-TMENDNSIQQMXXXXXXXXX 1239 PV+LDEFGRD+NL+KRM + A + KL +ME D+ ++ Sbjct: 498 PVKLDEFGRDLNLEKRMQMKVRAEAHQRKRSQAFNSNKLASMELDDP--KIEGESSTDES 555 Query: 1238 
XXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYASSYRDAYMSLSIP 1059 +L AD +FSDA+EEY Q S V R E WK++Y+SSY+DAYMSLS+P Sbjct: 556 DSESQAYQSQRDLVLQAADGIFSDASEEYGQLSFVKRRMEEWKREYSSSYKDAYMSLSLP 615 Query: 1058 AIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISEXXXXXXXXANIIPE 879 +FSPYVRLELL+WDPLH+ DF +MKW+ LLF YGLPED + ++P Sbjct: 616 LVFSPYVRLELLRWDPLHKGLDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADL--ELVPN 673 Query: 878 LVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSALGDLITVLRDRLTK 699 LVEK+A+PILH+++++CWD+LS +ET A++A L++++V S AL DL+ +R RL Sbjct: 674 LVEKVALPILHYEISHCWDMLSQQETVNAIAATKLIVQHVSHESEALADLLVSIRTRLAD 733 Query: 698 AVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILALPVLEKIALDELL 519 AV +L VPTWSP + AV +AARVAAYRFG SVRL+RNIC W + ++PVLE +ALDELL Sbjct: 734 AVANLTVPTWSPPVVAAVADAARVAAYRFGVSVRLLRNICSWKDVFSMPVLENLALDELL 793 Query: 518 SGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQPLVDYLLLIGKTL 339 GKVLPHL I NV DAI RTER++ASL GVW GP+V D+ RKLQPL+ Y+L +G+ L Sbjct: 794 FGKVLPHLRIISENVQDAITRTERIIASLSGVWAGPSVIADRKRKLQPLLTYVLSLGRIL 853 Query: 338 EKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 E++ +A E++T L RRLKK+LV+LNEYDHAR ++RTF+LKEAL Sbjct: 854 ERR---NAPESDTSHLARRLKKILVDLNEYDHARTMARTFHLKEAL 896 >ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] gi|355512167|gb|AES93790.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] Length = 892 Score = 673 bits (1736), Expect = 0.0 Identities = 414/957 (43%), Positives = 533/957 (55%), Gaps = 13/957 (1%) Frame = -1 Query: 3032 MSSAKSRNFRRRAGXXXXXXDNXXXXXXXXXXXXXXXXXXXXXXXXXXXTLLSFADDD-D 2856 MSSAKSRNFRRR LLSFADD+ D Sbjct: 1 MSSAKSRNFRRRTDTNSDDDT-----------PTTVPSKPSAPKPKKPPKLLSFADDEID 49 Query: 2855 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSSLPSNVQPQAGVYTKEAL 2676 K+RI H PS S PSNVQPQAG YT EAL Sbjct: 50 ADNETPRPRSSKPHHHRPKPSSSSSHKITTHKNRITSHSPSPS-PSNVQPQAGTYTLEAL 108 Query: 2675 LELQKNTKTLAAPAR-----NXXXXXXXXXXXVLKGLIKPVISNDLDIGTTGRSQNLGDD 2511 ELQKNT+TL P + VLKGL+KPV T ++ ++ Sbjct: 109 
RELQKNTRTLVTPTTASRPISSEPKPSSEPVIVLKGLLKPV---------TSEPESDSEE 159 Query: 2510 DMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGM-PDQAMIEAIKAKRERLRQAKA 2334 + F+ K + G + K+ P + I+A KAKRER+R+A A Sbjct: 160 NGEFEAKFASV------------------GIKNGKDSFFPGEEDIKAAKAKRERMRKAGA 201 Query: 2333 AAPDYIALDGGSNHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFD----DFEDRAMP 2166 AAPDYI+LDGGSNHG AEGLSDEEPE+RGRI FG K G +KKGVF+ F+D + Sbjct: 202 AAPDYISLDGGSNHGAAEGLSDEEPEYRGRIAMFGGKKGDGEKKGVFEVADERFDDVVVD 261 Query: 2165 KERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXSFGYLGTG 1986 +E G+ WE EQ +KGLGKR D+ ++G Sbjct: 262 EEDGL-------------WEEEQFKKGLGKRRDEGSARVGGGGEVPVVQAAQQPNFVGPS 308 Query: 1985 TSGVHPPVQNVDVXXXXXXXXXXXXSLFGI-DVMSIPQQAELAKKALNENLRRVQESHGR 1809 + V+ V NV + DV+SI QQAE+AKKA+ +N+RR++ESHGR Sbjct: 309 VANVYGAVPNVVAAASANTSIGGAIPATPVLDVISISQQAEIAKKAMLDNIRRLKESHGR 368 Query: 1808 TMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSVICEFLQHKAPFIEEL 1629 TM SL KT A EK+ FMQKLR ++S IC+FLQHKA +IEEL Sbjct: 369 TMSSLNKTDENLSASLLKITDLESSLVVADEKYRFMQKLRNYISNICDFLQHKAYYIEEL 428 Query: 1628 EEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGGGNXXXXXXXXXXXXX 1449 E+QM+KLH A++NDDE+ E+E A+ AA L + G N Sbjct: 429 EDQMKKLHEDRASAIFEKRATNNDDEMVEVEAAVKAAMLVLSRKGDNVEAARSAAQDAFA 488 Query: 1448 XXXXXXXXXXAPVELDEFGRDVNLQKRMDI-TXXXXXXXXXXXXADSKRKLTMENDNSIQ 1272 PV+LDEFGRD+NL+KR + DSK+ +ME D+ Sbjct: 489 AVRKQRDF---PVQLDEFGRDLNLEKRKQMKVMAEARQRRRSKAFDSKKSASMEIDD--H 543 Query: 1271 QMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIVVERFERWKKDYASS 1092 ++ +L ADE+FSDA+EEYSQ S+V R E WK++Y+SS Sbjct: 544 KVEGESSTDESDSESQAYQSQRDLVLQAADEIFSDASEEYSQLSLVKTRMEEWKREYSSS 603 Query: 1091 YRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNYGLPEDENEISEXXX 912 Y +AY+SLS+P IFSPYVRLELL+WDPLH+ DF DMKW+ LLF YGLPED + Sbjct: 604 YNEAYISLSLPLIFSPYVRLELLRWDPLHKGLDFQDMKWYKLLFTYGLPEDGKDFVHDDG 663 Query: 911 XXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNLVIRYVDLSSSALGD 732 ++P LVEK+A+PILH+++++CWD+LS +ET A++A L++++V S AL Sbjct: 664 
DADL--ELVPNLVEKVALPILHYEVSHCWDMLSQQETMNAIAATKLIVQHVSRESEALAG 721 Query: 731 LITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRLMRNICLWNKILALP 552 L+ +R RL AV +L VPTWSPL + AVP+AA++AAYRFG SVRL+RNICLW I A+ Sbjct: 722 LLVSIRTRLADAVANLTVPTWSPLVLAAVPDAAKIAAYRFGVSVRLLRNICLWKDIFAMS 781 Query: 551 VLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTGPNVTGDQSRKLQPL 372 VLEK+ALDELL KVLPH SI NV DAI RTER++ SL GVW GP+VTGD+SRKLQPL Sbjct: 782 VLEKLALDELLYAKVLPHFRSISENVQDAITRTERIIDSLSGVWAGPSVTGDKSRKLQPL 841 Query: 371 VDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARALSRTFNLKEAL 201 V Y+L +G+ LE+++V + L RRLKK+LV+LNEYDHAR ++RTF+LKEAL Sbjct: 842 VAYVLSLGRILERRNVPES------DLARRLKKILVDLNEYDHARTMARTFHLKEAL 892 >ref|NP_196472.1| GC-rich sequence DNA-binding factor-like protein ILP1 [Arabidopsis thaliana] gi|9759349|dbj|BAB10004.1| unnamed protein product [Arabidopsis thaliana] gi|117413996|dbj|BAF36503.1| transcriptional repressor ILP1 [Arabidopsis thaliana] gi|332003936|gb|AED91319.1| GC-rich sequence DNA-binding factor-like protein ILP1 [Arabidopsis thaliana] Length = 908 Score = 657 bits (1696), Expect = 0.0 Identities = 394/911 (43%), Positives = 513/911 (56%), Gaps = 17/911 (1%) Frame = -1 Query: 2882 LLSFADDDDEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDRIGPHHPSSSLPSNVQPQ 2703 LLSFADD++E R SNV PQ Sbjct: 52 LLSFADDEEEEEDGAPRVTIKPKNGRDRVKSSSRLGVSGSSHRHSSTKERRPASSNVLPQ 111 Query: 2702 AGVYTKEALLELQKNTKTLAAPARNXXXXXXXXXXXVLKGLIKPVISNDLDIGTTGRSQN 2523 AG Y+KEALLELQKNT+TL + LKGLIKP ++ Q+ Sbjct: 112 AGSYSKEALLELQKNTRTLPYSRSSANAEPKVV----LKGLIKPPQDHE--------QQS 159 Query: 2522 LGD-----DDMSFDQKGKDLRVVRDDASSRLKDLELGPGSREDKEGMPDQAMIEAIKAKR 2358 L D D+ FD++G++ + ++ DQA I I+AK+ Sbjct: 160 LKDVVKQVSDLDFDEEGEE---------------------EQHEDAFADQAAI--IRAKK 196 Query: 2357 ERLRQAKAA-APDYIALDGGS-NHGEAEGLSDEEPEFRGRIGFFGEKIGGPDKKGVFDDF 2184 ER+RQ+++A APDYI+LDGG NH EG+SDE+ +F+G F G + DKKGVFD Sbjct: 197 ERMRQSRSAPAPDYISLDGGIVNHSAVEGVSDEDADFQGI--FVGPRPQKDDKKGVFDFG 254 Query: 2183 
EDRAMPKERGIEVVSXXXXXXDKMWEAEQVRKGLGKRLDDXXXXXXXXXXXXXXXXXXS- 2007 ++ KE + DK+WE EQ +KG+GKR+D+ Sbjct: 255 DENPTAKETTTSSIYEDEDEEDKLWEEEQFKKGIGKRMDEGSHRTVTSNGIGVPLHSKQQ 314 Query: 2006 ---------FGYLGTGTSGVHPPVQNVDVXXXXXXXXXXXXSLFGIDVMSIPQQAELAKK 1854 + Y GT P+ NV V +D + + QQAELAKK Sbjct: 315 TLPQQQPQMYAY-HAGT-----PMPNVSVAPTIGPAT-------SVDTLPMSQQAELAKK 361 Query: 1853 ALNENLRRVQESHGRTMMSLAKTXXXXXXXXXXXXXXXXXXXXAGEKFLFMQKLREFVSV 1674 AL +N+++++ESH +T+ SL KT AG+K++FMQKLR+F+SV Sbjct: 362 ALKDNVKKLKESHAKTLSSLTKTDENLTASLMSITALESSLSAAGDKYVFMQKLRDFISV 421 Query: 1673 ICEFLQHKAPFIEELEEQMQKLHXXXXXXXXXXXASDNDDELFEIEQAIIAARTELRKGG 1494 IC+F+Q+K IEE+E+QM++L+ +DN+DE+ E+ A+ AA T L K G Sbjct: 422 ICDFMQNKGSLIEEIEDQMKELNEKHALSILERRIADNNDEMIELGAAVKAAMTVLNKHG 481 Query: 1493 GNXXXXXXXXXXXXXXXXXXXXXXXAPVELDEFGRDVNLQKRMDITXXXXXXXXXXXXAD 1314 + PV+LDEFGRD NLQKR ++ + Sbjct: 482 SSSSVIAAATGAALAASTSIRQQMNQPVKLDEFGRDENLQKRREVEQRAAARQKRRARFE 541 Query: 1313 SKRKLTMENDNSIQQMXXXXXXXXXXXXXXXXXXXXSQLLLVADEVFSDAAEEYSQFSIV 1134 +KR ME D ++ LL AD+VFSDA+EEYSQ S V Sbjct: 542 NKRASAMEVDGPSLKIEGESSTDESDTETSAYKETRDSLLQCADKVFSDASEEYSQLSKV 601 Query: 1133 VERFERWKKDYASSYRDAYMSLSIPAIFSPYVRLELLKWDPLHEDADFIDMKWHSLLFNY 954 RFERWK+DY+S+YRDAYMSL++P+IFSPYVRLELLKWDPLH+D DF DMKWH LLF+Y Sbjct: 602 KARFERWKRDYSSTYRDAYMSLTVPSIFSPYVRLELLKWDPLHQDVDFFDMKWHGLLFDY 661 Query: 953 GLPEDENEISEXXXXXXXXANIIPELVEKLAIPILHHQLAYCWDILSTRETTYAVSAMNL 774 G PED ++ + N++PELVEK+AIPILHHQ+ CWDILSTRET AV+A +L Sbjct: 662 GKPEDGDDFAPDDTDA----NLVPELVEKVAIPILHHQIVRCWDILSTRETRNAVAATSL 717 Query: 773 VIRYVDLSSSALGDLITVLRDRLTKAVTDLMVPTWSPLEMKAVPNAARVAAYRFGTSVRL 594 V YV SS AL +L +R RL +A+ + VPTW PL +KAVPN +VAAYRFGTSVRL Sbjct: 718 VTNYVSASSEALAELFAAIRARLVEAIAAISVPTWDPLVLKAVPNTPQVAAYRFGTSVRL 777 Query: 593 MRNICLWNKILALPVLEKIALDELLSGKVLPHLHSIHSNVHDAIVRTERVVASLHGVWTG 414 MRNIC+W ILALPVLE +AL +LL GKVLPH+ SI SN+HDA+ RTER+VASL GVWTG Sbjct: 778 MRNICMWKDILALPVLENLALSDLLFGKVLPHVRSIASNIHDAVTRTERIVASLSGVWTG 837 Query: 413 
PNVTGDQSRKLQPLVDYLLLIGKTLEKKHVSSAMETETGRLVRRLKKMLVELNEYDHARA 234 P+VT SR LQPLVD L + + LEK+ S + ET L RRLK++LVEL+E+DHAR Sbjct: 838 PSVTRTHSRPLQPLVDCTLTLRRILEKRLGSGLDDAETTGLARRLKRILVELHEHDHARE 897 Query: 233 LSRTFNLKEAL 201 + RTFNLKEA+ Sbjct: 898 IVRTFNLKEAV 908