BLASTX nr result
ID: Papaver25_contig00002154
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00002154
         (3987 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...   843   0.0
gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus nota...   822   0.0
ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding fact...   795   0.0
ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding fact...   794   0.0
ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding fact...   794   0.0
ref|XP_007010500.1| GC-rich sequence DNA-binding factor-like pro...   788   0.0
ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [A...   786   0.0
ref|XP_007225333.1| hypothetical protein PRUPE_ppa001044mg [Prun...   785   0.0
ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   773   0.0
ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putativ...   770   0.0
ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citr...   769   0.0
ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding fact...   748   0.0
ref|XP_007160943.1| hypothetical protein PHAVU_001G030200g [Phas...   730   0.0
ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   723   0.0
ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   721   0.0
ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like pro...   720   0.0
ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   705   0.0
gb|EYU22626.1| hypothetical protein MIMGU_mgv1a001081mg [Mimulus...   696   0.0
ref|XP_006399356.1| hypothetical protein EUTSA_v10012615mg [Eutr...
684 0.0 ref|XP_002873370.1| increased level of polyploidy1-1D [Arabidops... 633 e-178 >ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis vinifera] Length = 913 Score = 843 bits (2177), Expect = 0.0 Identities = 482/891 (54%), Positives = 587/891 (65%), Gaps = 21/891 (2%) Frame = +3 Query: 42 LLSFADEEGDEESPFAR-------PXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXX 200 LLSFAD+E + ESP P +HKIT+ K+R+ Sbjct: 53 LLSFADDE-ENESPSRSSSRSTQPPSRPSKTSSRFTKLSSSSSHKITTTKDRLTPSSASL 111 Query: 201 XXXXXNVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXX--EPKIVLKGLIKPIYXXX 374 NVQPQAG YTKE L ELQKNTRT+ EP IVLKGL+KPI Sbjct: 112 PS---NVQPQAGTYTKEALRELQKNTRTLASSRPASSEPKPSLEPVIVLKGLVKPISAAE 168 Query: 375 XXXXXXXXXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPAS 554 +D ++ E S GG +D IPDQATINAIRAKRERLRQSRA A Sbjct: 169 DAV---------IDEENVEEEPESKDKGG---RDSIPDQATINAIRAKRERLRQSRAAAP 216 Query: 555 DYISLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLR 734 DYISLD GSNHG AEG+SDEEPEFQ RIA+FG+K + KG FED +D R Sbjct: 217 DYISLDGGSNHGAAEGLSDEEPEFQGRIAMFGEKPESG--KKGVFED---------VDER 265 Query: 735 N-GGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXX-----LXXXXXXX 896 G Q RKG G + Sbjct: 266 GMEGGFKKDAHDSDDEEEEKIWEEEQFRKGLGKRMDDGSSRVVSSSVPVVQKVQQQKFMY 325 Query: 897 XXXXXXXXXPGHNVSPGLNIGGSAGVMK--KVLSIPQQATVASQAMRESLQRLKETHGRT 1070 PG VS LNIGG+ G + +S+ QQA +A +A+ E+L+RLKE+HGRT Sbjct: 326 SSVTAYTSVPG--VSAPLNIGGAVGPLPGFDAMSLSQQAELAKKALHENLRRLKESHGRT 383 Query: 1071 MSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELE 1250 MS+L R DEN+S++LSNI LE SL A EKF+FMQ L+DFVSVICDFLQHKAP+IEELE Sbjct: 384 MSSLTRTDENLSSSLSNITTLEKSLTAAGEKFIFMQXLRDFVSVICDFLQHKAPFIEELE 443 Query: 1251 EQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVI----NAAQLV 1418 EQMQKLHEERA A+LERR ADN DEM+EI+A + AAM ++K GS+ A++ AAQ Sbjct: 444 EQMQKLHEERASAILERRAADN-DEMMEIQASVDAAMSVFTKSGSNEAMVAAARTAAQAA 502 Query: 1419 SSKTREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHI 1598 S+ REQTNLPVKLDE GRD+NLQ M+ + + + I Sbjct: 503 
SAAMREQTNLPVKLDEYGRDINLQKCMDKNRRSEARQRKRDRWDAKRMTFLENESSHQKI 562 Query: 1599 EGXXXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYR 1778 EG Y+SNR++LLQT+EQIFGDA EE+S+L+ VKE+ E WKK++ SSYR Sbjct: 563 EGESSTDESDSETTAYQSNRDLLLQTAEQIFGDAAEEYSQLSAVKERIERWKKQYSSSYR 622 Query: 1779 DAYMSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADA 1958 DAYMSLSVPAIFSPYVRLELLKWDPL+EE+DF DM+WHSLLF+YGL E G DF+ DDADA Sbjct: 623 DAYMSLSVPAIFSPYVRLELLKWDPLYEEADFDDMKWHSLLFNYGLSEDGNDFSPDDADA 682 Query: 1959 NLVPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIH 2138 NLVP LVE+VALPILHH++AHCWD+ STR TKNAVSATNLVI Y+PA+ EAL +LL+ +H Sbjct: 683 NLVPELVERVALPILHHELAHCWDIFSTRETKNAVSATNLVIRYIPASSEALGELLAVVH 742 Query: 2139 SRLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLA 2318 RL A+ N VP W+ +V+KAVP+AAR+AAYRFGM++RL++NICLWKDILA +E+L Sbjct: 743 KRLYKALTNFMVPPWNILVMKAVPNAARVAAYRFGMSIRLMRNICLWKDILALPVLEKLV 802 Query: 2319 FDELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLT 2498 D+LLSG+VLPH+ +I ++HDAITRTERI++SL+GVW+G SVT E S KLQPLVDYVL Sbjct: 803 LDQLLSGQVLPHIENIASDVHDAITRTERIISSLSGVWAGPSVTGERSNKLQPLVDYVLR 862 Query: 2499 LGKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 LGK LEK+H GV+E++T+ LARRLK+MLV+LNE+DKAR I RTF LKEA+ Sbjct: 863 LGKRLEKRHLPGVTESDTSRLARRLKRMLVELNEYDKARDISRTFHLKEAL 913 >gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus notabilis] Length = 952 Score = 822 bits (2122), Expect = 0.0 Identities = 471/896 (52%), Positives = 586/896 (65%), Gaps = 26/896 (2%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXT-HKITSMKERIXXXXXXXXXXXX- 215 LLSFAD+E +E ++P + HK+T++K+R+ Sbjct: 67 LLSFADDEDNETPSRSKPSSSSKLSSSSSRLSKPTSSHKMTALKDRLPHSSSSSPSSSSL 126 Query: 216 ----NVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXX 383 NVQPQAG YTKE L ELQKNTRT+ EP IVLKGL+KP Sbjct: 127 SLPSNVQPQAGTYTKEALRELQKNTRTLASSKPSS-----EPVIVLKGLLKPSELAKSDW 181 Query: 384 XXXXXXLDRMD-VDDAETRLGSMGIGGEG-DKD------LIPDQATINAIRAKRERLRQS 539 D D + + L SM IG +G D+D 
LIPDQATINAIRAKRERLRQS Sbjct: 182 KLDSEEEDEPDELKERRGELASMEIGAKGRDRDNSSPEPLIPDQATINAIRAKRERLRQS 241 Query: 540 RAPASDYISLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEV 719 RA A D+I+LDAGSNHGEAEG+SDEEPE QTRIA+FG+K+ G KG FED I + Sbjct: 242 RAAAPDFIALDAGSNHGEAEGLSDEEPENQTRIAMFGEKAE--GPKKGVFEDD---IDDR 296 Query: 720 PIDL---RNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXX 890 I+L R Q RKG G + Sbjct: 297 GIELGLLRRKQGVLEENHEDDEDEEDKIWEEEQFRKGLGKTRIDDGGKNSVVPVVKRETQ 356 Query: 891 XXXXXXXXXXXPGHNVSPGLNIGGSAGVMKKVLSI-----PQQATVASQAMRESLQRLKE 1055 + S G GGS+G L + QQA +A A+ ++++RLKE Sbjct: 357 QKFVSSVGSQTLPPSASIGGTFGGSSGGSSTGLGLGMMPFSQQAEIALNAIDDNVRRLKE 416 Query: 1056 THGRTMSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPY 1235 TH + + +L++ D+N+S +L NI LE SL+ ADEK+ F QKL+DF+S+ICDFLQHKAP+ Sbjct: 417 THDQDLVSLNKADKNLSDSLLNITALEKSLSAADEKYKFTQKLRDFISIICDFLQHKAPF 476 Query: 1236 IEELEEQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVI----N 1403 IEELE+QMQKLHE+ A A++ERRTA+N DEM+E+EA ++AAM +SK GS+ V+ + Sbjct: 477 IEELEDQMQKLHEKHASAIVERRTANNDDEMMEVEAEVNAAMSIFSKKGSNVDVVAAAKS 536 Query: 1404 AAQLVSSKTREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIF 1583 AAQ S+ REQ NLPVKLDE GRDMNLQ +S++ Sbjct: 537 AAQAASAALREQGNLPVKLDEFGRDMNLQKRMEMKGRAEARQCRKARFDSKRLSSMDVDG 596 Query: 1584 PYHHIEGXXXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRF 1763 PY +EG ++S+RE+LLQT+ IF DA EE+S+L++VKE+FE WK+ + Sbjct: 597 PYQRMEGESSTDESDSESTAFESHRELLLQTAAHIFSDASEEYSQLSVVKERFEEWKREY 656 Query: 1764 FSSYRDAYMSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNS 1943 S+Y DAYMSLS P+IFSPYVRLELLKWDPLHE++DF +M WHSLL DYG+PE GG F Sbjct: 657 SSTYSDAYMSLSAPSIFSPYVRLELLKWDPLHEKTDFLNMNWHSLLMDYGVPEDGGGFAP 716 Query: 1944 DDADANLVPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDL 2123 DDADANLVP LVEKVAL ILHH+I HCWD+LST T+NAV+AT+LV YVPA+ EAL DL Sbjct: 717 DDADANLVPELVEKVALRILHHEIVHCWDMLSTLETRNAVAATSLVTDYVPASSEALADL 776 Query: 2124 LSAIHSRLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSA 2303 L AI +RLADAVAN+TVPTWS 
V++AVP+AAR+AAYRFG++VRL+KNICLWK+ILA Sbjct: 777 LVAIRTRLADAVANLTVPTWSPPVLQAVPNAARLAAYRFGVSVRLMKNICLWKEILALPV 836 Query: 2304 IEQLAFDELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLV 2483 +E+LA DELL GKVLPHVRSI N+HDAI RTE+IVASL+GVW+G SVT + S KLQPLV Sbjct: 837 LEKLALDELLCGKVLPHVRSIAANVHDAIPRTEKIVASLSGVWAGPSVTGDRSRKLQPLV 896 Query: 2484 DYVLTLGKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 DY++ L K LEKKH SGV+E+ET+GLARRLKKMLV+LNE+DKAR I RTF LKEA+ Sbjct: 897 DYLMLLRKILEKKHESGVTESETSGLARRLKKMLVELNEYDKARDIARTFHLKEAL 952 >ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis sativus] Length = 920 Score = 795 bits (2053), Expect = 0.0 Identities = 457/883 (51%), Positives = 569/883 (64%), Gaps = 13/883 (1%) Frame = +3 Query: 42 LLSFA-DEEGDEE-SPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXX 215 LLSFA DEE D P + THKIT++K+RI Sbjct: 60 LLSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPS 119 Query: 216 NVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXX-EPKIVLKGLIKPIYXXXXXXXXX 392 NVQPQAG YTKE L ELQKNTRT+ EP IVLKGL+KP Sbjct: 120 NVQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPAEQVPDSAREA 179 Query: 393 XXXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPASDYISLD 572 + DD R S G IPDQATINAIRAKRER+RQ+ A DYISLD Sbjct: 180 K---ESSSEDDEAGRKDSSGSS-------IPDQATINAIRAKRERMRQAGVAAPDYISLD 229 Query: 573 AGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLRNGGXXX 752 AGSN +SDEE EF RIA+ G K + KG FE+ + E ID G Sbjct: 230 AGSNRTAPGELSDEEAEFPGRIAMIGGKLESS--KKGVFEE----VDEQGID----GART 279 Query: 753 XXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXPGH 932 Q RKG G + G+ Sbjct: 280 NIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTS-VPVVPSVQPQNLIYPTTIGY 338 Query: 933 NVSPGLN----IGGSAGVMKKV--LSIPQQATVASQAMRESLQRLKETHGRTMSALDRND 1094 + P ++ IGGS + + + LSI QQA +A AM+ES+ RLKE++ RT ++ + D Sbjct: 339 SSVPSMSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQESMGRLKESYRRTAMSVLKTD 398 Query: 1095 ENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKLHE 1274 EN+SA+L I DLE +L+ A 
+KF+FMQKL+DFVSVICDFLQHKAP+IEELEEQMQKLHE Sbjct: 399 ENLSASLLKITDLEKALSAAGDKFMFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHE 458 Query: 1275 ERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSS----TAVINAAQLVSSKTREQT 1442 ERA V+ERR ADN DEM+EIE + AA+ +K GSS TA +AAQ + +REQ Sbjct: 459 ERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEMVTAATSAAQAAIALSREQA 518 Query: 1443 NLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXXXX 1622 NLP KLDE GRD+NLQ ++++ ++ + +EG Sbjct: 519 NLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASM-EVDGHQKVEGESSTDE 577 Query: 1623 XXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSLSV 1802 Y+SNR++LLQT+EQIF DA EEFS+L++VK++FE WK+ + ++YRDAYMSLS+ Sbjct: 578 SDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEAWKRDYSATYRDAYMSLSI 637 Query: 1803 PAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANLVPGLVE 1982 PAIFSPYVRLELLKWDPLHE +DFFDM WHSLLF+YG+PE G DF +DADANLVP LVE Sbjct: 638 PAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVE 697 Query: 1983 KVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADAVA 2162 KVALPILHH+IAHCWD+LSTR T+NA AT+L+ YVP + EAL +LL I +RL+ A+ Sbjct: 698 KVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLSGAIE 757 Query: 2163 NITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLSGK 2342 ++TVPTW+++V KAVP+AARIAAYRFGM+VRL++NICLWK+I+A +E+LA +ELL GK Sbjct: 758 DLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKEIIALPILEKLALEELLYGK 817 Query: 2343 VLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLEKK 2522 VLPHVRSIT NIHDA+TRTERI+ASL GVW+GS + + S+KLQPLVDYVL LG+TLEKK Sbjct: 818 VLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRTLEKK 877 Query: 2523 HASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 H SG++E+ET+GLARRLKKMLV+LNE+D AR I +TF LKEA+ Sbjct: 878 HISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 920 >ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Fragaria vesca subsp. 
vesca] Length = 914 Score = 794 bits (2050), Expect = 0.0 Identities = 444/847 (52%), Positives = 560/847 (66%), Gaps = 13/847 (1%) Frame = +3 Query: 150 HKITSMKERIXXXXXXXXXXXX--NVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXX 323 HK+T+ K+R+ NVQPQAG YTKE L ELQKNTRT+ Sbjct: 90 HKLTAAKDRLVNSTSSTASASLPSNVQPQAGTYTKEALRELQKNTRTLASSRTSSAAAAA 149 Query: 324 EPKIVLKGLIKPIYXXXXXXXXXXXXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATIN 503 EP IVL+G IKP LD DD E +G KD PDQATI Sbjct: 150 EPTIVLRGSIKPADASIADAVNGARELDS---DDEEQ---------QGSKDRYPDQATIE 197 Query: 504 AIRAKRERLRQSRAPASDYISLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKG 683 AIR KRERLR+S+ A D+I+LD+GSNHG AEG+SDEEPEF+ RIA+FG+K N KG Sbjct: 198 AIRKKRERLRKSKPAAPDFIALDSGSNHGAAEGLSDEEPEFRNRIAMFGEKMEN---KKG 254 Query: 684 FFEDGRKLIKEVPIDLRNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXX 863 FED + + +D G Q RKG G Sbjct: 255 VFED----VDDTGVD--GGLRRESVVVEDDEDEEEKIWEEEQFRKGLGKRVDNDGASLGV 308 Query: 864 XXXLXXXXXXXXXXXXXXXX-PGHNVSPGL----NIGGSAGVMK--KVLSIPQQATVASQ 1022 + G++++ L +IGG+ G + LSI +Q+ +A + Sbjct: 309 SASVPRVHSAAPQPKASYNSIAGYSLAQSLAGVASIGGATGASQGSNALSINEQSEIAQK 368 Query: 1023 AMRESLQRLKETHGRTMSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSV 1202 A+ E++++LKE+HGRT +L + +E++SA+L NI DLE SL+ ADEK+ FMQ+L+DFVS Sbjct: 369 ALLENVRKLKESHGRTKMSLTKANESLSASLLNITDLEKSLSAADEKYKFMQELRDFVST 428 Query: 1203 ICDFLQHKAPYIEELEEQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGG 1382 ICDFLQ KAP IEELEE+MQK +ERA A+ ERR ADN DEM+E+EA ++AAM +SK G Sbjct: 429 ICDFLQDKAPLIEELEEEMQKQRDERASAIFERRIADNDDEMMEVEAAVNAAMSIFSKEG 488 Query: 1383 SSTAVI----NAAQLVSSKTREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXX 1550 +S VI +AAQ S+ REQ NLPVKLDE GRDMNL+ Sbjct: 489 TSAGVIAVAKSAAQAASAAVREQKNLPVKLDEFGRDMNLKKRLDMKGRAEARQRRRKRYE 548 Query: 1551 XXXMSAVGDIFPYHHIEGXXXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALV 1730 S++ P +EG Y+S+R+++L T++Q+F DA EE+S+L+LV Sbjct: 549 AKRESSMDVDSPDRTVEGESSTDESDGESKEYESHRQLVLGTADQVFSDAAEEYSQLSLV 608 Query: 1731 KEKFETWKKRFFSSYRDAYMSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDY 1910 KE+FE WK+ + SSYRDAYMSLSVP 
IFSPYVRLELLKWDPL E +DF M WH LL +Y Sbjct: 609 KERFEKWKREYRSSYRDAYMSLSVPIIFSPYVRLELLKWDPLRENTDFVKMSWHELLENY 668 Query: 1911 GLPEHGGDFNSDDADANLVPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITY 2090 G+PE G DF SDDADANL+P LVEKVALPILHH I HCWD+LSTR TKNAV+AT+LV Y Sbjct: 669 GVPEDGSDFASDDADANLIPALVEKVALPILHHQIVHCWDILSTRETKNAVAATSLVTDY 728 Query: 2091 VPANGEALKDLLSAIHSRLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNI 2270 V ++ EAL+DLL AI +RLADAV+ + VPTWS +V+KAVP+AARIAAYRFGM+VRL+KNI Sbjct: 729 V-SSSEALEDLLVAIRTRLADAVSKLMVPTWSPLVLKAVPNAARIAAYRFGMSVRLMKNI 787 Query: 2271 CLWKDILASSAIEQLAFDELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVT 2450 CLWK+ILA +E+LA +ELL GKV+PH+RSI ++HDA+TRTER++ASL+GVWSGS VT Sbjct: 788 CLWKEILALPVLEKLAINELLCGKVIPHIRSIAADVHDAVTRTERVIASLSGVWSGSDVT 847 Query: 2451 MEHSYKLQPLVDYVLTLGKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRT 2630 + S KLQ LVDYVLTLGKT+EKKH+ GV+++ET GLARRLKKMLV+LNE+DKAR + RT Sbjct: 848 GDRSRKLQSLVDYVLTLGKTIEKKHSLGVTQSETGGLARRLKKMLVELNEYDKARDVART 907 Query: 2631 FQLKEAV 2651 F LKEA+ Sbjct: 908 FHLKEAL 914 >ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis sativus] Length = 889 Score = 794 bits (2050), Expect = 0.0 Identities = 449/850 (52%), Positives = 558/850 (65%), Gaps = 15/850 (1%) Frame = +3 Query: 147 THKITSMKERIXXXXXXXXXXXXNVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXX- 323 THKIT++K+RI NVQPQAG YTKE L ELQKNTRT+ Sbjct: 67 THKITALKDRIAHSSSISASVPSNVQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSA 126 Query: 324 EPKIVLKGLIKPIYXXXXXXXXXXXXLDRMDVDDAETRLGSMGIGGEGDKDL----IPDQ 491 EP IVLKGL+KP D A S E KD IPDQ Sbjct: 127 EPVIVLKGLLKPAEQVP---------------DSAREAKESSSEDDEAGKDSSGSSIPDQ 171 Query: 492 ATINAIRAKRERLRQSRAPASDYISLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVG 671 ATINAIRAKRER+RQ+ A DYISLDAGSN +SDEE EF RIA+ G K + Sbjct: 172 ATINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESS- 230 Query: 672 VTKGFFEDGRKLIKEVPIDLRNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXX 851 KG FE+ + E ID G Q RKG G Sbjct: 231 -KKGVFEE----VDEQGID----GARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGST 281 
Query: 852 XXXXXXXLXXXXXXXXXXXXXXXXPGHN----VSPGLNIGGSAGVMKKV--LSIPQQATV 1013 + G++ VS +IGGS + + + LSI QQA + Sbjct: 282 RVESTS-VPVVPSVQPQNLIYPTTIGYSSVPSVSTATSIGGSVSISQGLDGLSISQQAEI 340 Query: 1014 ASQAMRESLQRLKETHGRTMSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDF 1193 A AM+ES+ RLKE++ RT ++ + DEN+SA+L I DLE +L+ A +KF+FMQKL+DF Sbjct: 341 AKTAMQESMGRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSAAGDKFIFMQKLRDF 400 Query: 1194 VSVICDFLQHKAPYIEELEEQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYS 1373 VSVICDFLQHKAP+IEELEEQMQKLHEERA V+ERR ADN DEM+EIE + AA+ + Sbjct: 401 VSVICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIETAVKAAISILN 460 Query: 1374 KGGSS----TAVINAAQLVSSKTREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXX 1541 K GSS TA +AAQ + +REQ NLP KLDE GRD+NLQ Sbjct: 461 KKGSSNEMITAATSAAQAAIALSREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRS 520 Query: 1542 XXXXXXMSAVGDIFPYHHIEGXXXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKL 1721 ++++ ++ + +EG Y+SNR++LLQT+EQIF DA EEFS+L Sbjct: 521 QYDSKRLASM-EVDGHQKVEGESSTDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQL 579 Query: 1722 ALVKEKFETWKKRFFSSYRDAYMSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLL 1901 ++VK++FE WK+ + ++YRDAYMSLS+PAIFSPYVRLELLKWDPLHE +DFFDM WHSLL Sbjct: 580 SVVKQRFEAWKRDYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLL 639 Query: 1902 FDYGLPEHGGDFNSDDADANLVPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLV 2081 F+YG+PE G DF +DADANLVP LVEKVALPILHH+IAHCWD+LSTR T+NA AT+L+ Sbjct: 640 FNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLI 699 Query: 2082 ITYVPANGEALKDLLSAIHSRLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLL 2261 YVP + EAL +LL I +RL+ A+ ++TVPTW+++V KAVP+AARIAAYRFGM+VRL+ Sbjct: 700 TNYVPPSSEALTELLVVIRTRLSGAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLM 759 Query: 2262 KNICLWKDILASSAIEQLAFDELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGS 2441 +NICLWK+I+A +E+LA +ELL GKVLPHVRSIT NIHDA+TRTERI+ASL GVW+GS Sbjct: 760 RNICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGS 819 Query: 2442 SVTMEHSYKLQPLVDYVLTLGKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAI 2621 + + S+KLQPLVDYVL LG+TLEKKH SG++E+ET+GLARRLKKMLV+LNE+D AR I Sbjct: 820 
GIIGDRSHKLQPLVDYVLLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDI 879 Query: 2622 LRTFQLKEAV 2651 +TF LKEA+ Sbjct: 880 AKTFHLKEAL 889 >ref|XP_007010500.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] gi|590567380|ref|XP_007010501.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] gi|508727413|gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] gi|508727414|gb|EOY19311.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1 [Theobroma cacao] Length = 934 Score = 788 bits (2034), Expect = 0.0 Identities = 465/907 (51%), Positives = 579/907 (63%), Gaps = 37/907 (4%) Frame = +3 Query: 42 LLSFADEEGDEE----SPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXX 209 LLSFAD+E +EE S HKITS K+ Sbjct: 54 LLSFADDENEEETTKPSSNRNRDKEREKPFSSRVSKPLSAHKITSTKD-----CKTPSTL 108 Query: 210 XXNVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXX 389 NVQPQAG YTKE LLELQKN RT+ EPKIVLKGL+KP Sbjct: 109 PSNVQPQAGTYTKEALLELQKNMRTLAAPSSRASSVSSEPKIVLKGLLKP-QSQNLNSER 167 Query: 390 XXXXLDRMDVDDAETRLGSMGIGGEGDKDL--IPDQATINAIRAKRERLRQSRA-PASDY 560 +++ DD E+RL +M G D D PDQATI+AI+AK++R+R+S A PA DY Sbjct: 168 DNDPPEKLQKDDTESRLATMAAGKGVDLDFSAFPDQATIDAIKAKKDRVRKSFARPAPDY 227 Query: 561 ISLDAGSNHG---EAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKE--VPI 725 ISLD GSN G E E DEEPEF R LFG+ KG FE +I+E V + Sbjct: 228 ISLDRGSNLGGAMEEELSDDEEPEFPGR--LFGESGK-----KGVFE----VIEERAVGV 276 Query: 726 DLRNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXX 905 LR G Q RKG G + Sbjct: 277 GLRKDG---IHDEDDDDNEEEKMWEEEQFRKGLG-----KRMDDSSNRVVSSSNNSGGVG 328 Query: 906 XXXXXXPGHNVSPGLNIGGSAGVMKKVLS---------------------IPQQATVASQ 1022 H G + GS G M +S I QQA + + Sbjct: 329 MVHNMQQQHQQRYGYSTMGSYGSMMPSVSPAPPSSIVGAAGASQGLDVTSISQQAEITKK 388 Query: 1023 AMRESLQRLKETHGRTMSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSV 1202 A++E+++RLKE+H RT+S+L + DEN+SA+L NI LE SL+ A EKF+FMQKL+DFVSV Sbjct: 389 
ALQENVRRLKESHDRTISSLTKADENLSASLFNITALEKSLSAAGEKFIFMQKLRDFVSV 448 Query: 1203 ICDFLQHKAPYIEELEEQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGG 1382 IC+FLQHKAP IEELEE MQKL+EERA++VLERR+A+N DEM+E+EA ++AAML +S+ G Sbjct: 449 ICEFLQHKAPLIEELEEHMQKLNEERALSVLERRSANNDDEMVEVEAAVTAAMLVFSECG 508 Query: 1383 SSTAVI----NAAQLVSSKTREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXX 1550 +S A+I NAAQ ++ R Q NLPVKLDE GRD+N Q Sbjct: 509 NSAAMIEVAANAAQAAAAAIRGQVNLPVKLDEFGRDVNRQKHLDMERRAEARQRRKARFD 568 Query: 1551 XXXMSAVGDIFPYHHIEGXXXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALV 1730 +S++ Y IEG Y+SNR+MLLQT+++IFGDA EE+S+L+LV Sbjct: 569 SKRLSSMEIDSSYQKIEGESSTDESDSESTAYRSNRDMLLQTADEIFGDASEEYSQLSLV 628 Query: 1731 KEKFETWKKRFFSSYRDAYMSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDY 1910 KE+FE WKK + SSYRDAYMSLS+PAIFSPYVRLELLKWDPLH + DF DM+WH+LLF+Y Sbjct: 629 KERFERWKKDYSSSYRDAYMSLSIPAIFSPYVRLELLKWDPLHVDEDFSDMKWHNLLFNY 688 Query: 1911 GLPEHGGDFNSDDADANLVPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITY 2090 G PE G F DDADANLVP LVEKVALP+LHH+I+HCWD+LS + TKNAVSAT+L+I Y Sbjct: 689 GFPE-DGSFAPDDADANLVPALVEKVALPVLHHEISHCWDMLSMQETKNAVSATSLIIDY 747 Query: 2091 VPANGEALKDLLSAIHSRLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNI 2270 VPA+ EAL +LL I +RL++AVA+I VPTWS +V+KAVP+AAR+AAYRFGM+VRL++NI Sbjct: 748 VPASSEALAELLVTIRTRLSEAVADIMVPTWSPLVMKAVPNAARVAAYRFGMSVRLMRNI 807 Query: 2271 CLWKDILASSAIEQLAFDELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVT 2450 CLWK+ILA +E+LA DELL GK+LPHVR+IT ++HDA+TRTERIVASL+GVW+G++V Sbjct: 808 CLWKEILALPILEKLALDELLYGKILPHVRNITSDVHDAVTRTERIVASLSGVWAGTNVI 867 Query: 2451 MEHSYKLQPLVDYVLTLGKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRT 2630 + S KLQPLVDYVL LGKTLE++HASGV+E+ T GLARRLKKMLV+LNE+D AR I R Sbjct: 868 QDSSRKLQPLVDYVLLLGKTLERRHASGVTESGTGGLARRLKKMLVELNEYDSARDIARR 927 Query: 2631 FQLKEAV 2651 F LKEA+ Sbjct: 928 FHLKEAL 934 >ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [Amborella trichopoda] gi|548841232|gb|ERN01295.1| hypothetical protein AMTR_s00002p00252610 [Amborella trichopoda] Length = 946 Score = 786 bits 
(2031), Expect = 0.0 Identities = 441/850 (51%), Positives = 558/850 (65%), Gaps = 15/850 (1%) Frame = +3 Query: 147 THKITSMKERIXXXXXXXXXXXXNVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXE 326 +HKI + K+R NVQPQAG+YTKE+LLELQKNT+T+G E Sbjct: 110 SHKIIAGKDRTSIQSPSVPS---NVQPQAGQYTKEKLLELQKNTKTLGGSKPPSETKPAE 166 Query: 327 PKIVLKGLIKPIYXXXXXXXXXXXXLDRMD-------VDDAETRLGSMGIGGEGDKDLIP 485 P IVLKGL+KPI D ++AE+ LG MGIG ++ P Sbjct: 167 PVIVLKGLVKPILEERKSEKTQVRESMENDREKFSREKEEAESSLGKMGIGQPKEEVGSP 226 Query: 486 --DQATINAIRAKRERLRQSRAPASDYISLDAGS----NHGEAEGISDEEPEFQTRIALF 647 DQATINAI+AKRERLRQ+R A DYISLD+G + G SD+E EFQ RIAL Sbjct: 227 VLDQATINAIKAKRERLRQARM-APDYISLDSGGARSMRDSDGLGSSDDESEFQGRIALL 285 Query: 648 GDKSSNVGVTKGFFEDGRKLIKEVPIDLRNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRG 827 G+ N KG FE+ + + E+ + R Q RK G Sbjct: 286 GE--GNNSSRKGVFENADEKVFELKREERE-------TEVDDDDEEDKKWEEEQFRKALG 336 Query: 828 XXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXPGHNVSPGLNIGGSAGVMKKV--LSIPQ 1001 H S GL GV + V ++ Q Sbjct: 337 KRMDDNSNRGSVQSVASAGSVKAVQSSVYSGGSYHGASSGLVSNLGVGVTRSVEFMTTSQ 396 Query: 1002 QATVASQAMRESLQRLKETHGRTMSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQK 1181 QA VA+QA+R+S+ RLKE+H RT+S++ R D N+SA+LSNIIDLE SL+ A EK++FMQK Sbjct: 397 QAEVATQALRDSMARLKESHDRTISSIVRTDNNLSASLSNIIDLEKSLSAAGEKYLFMQK 456 Query: 1182 LQDFVSVICDFLQHKAPYIEELEEQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAM 1361 L+DFVSVICDFLQ KAP+IEELEEQMQ+LHEERA A+++RR D+ADEM EIEA ++AA+ Sbjct: 457 LRDFVSVICDFLQDKAPFIEELEEQMQRLHEERASAIVQRRADDDADEMAEIEAAVNAAI 516 Query: 1362 LEYSKGGSSTAVINAAQLVSSKTREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXX 1541 ++KGGS ++ +AAQ S +EQ+NLPV+LDE GRD+NLQ Sbjct: 517 SVFNKGGSVSSAASAAQAASLAAKEQSNLPVELDEFGRDVNLQKRMDSKRRAEARKRRKA 576 Query: 1542 XXXXXXMSAVGDIFPYHHIEGXXXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKL 1721 + VGD Y IEG Y+S+ + LLQT+ +IF DA +EFS L Sbjct: 577 WSESKRIRTVGDGSSYQRIEGESSTDESDSDSTAYRSSCDELLQTASEIFSDAADEFSNL 636 Query: 1722 ALVKEKFETWKKRFFSSYRDAYMSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLL 1901 ++VK +FE WK+++ +YRDAYMS++ AIFSPYVRLELLKWDPL++ +DF DM+WHSLL Sbjct: 637 
SVVKVRFEGWKRQYLPTYRDAYMSMNASAIFSPYVRLELLKWDPLYKYTDFDDMRWHSLL 696 Query: 1902 FDYGLPEHGGDFNSDDADANLVPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLV 2081 FDYG+ + SDD+DA+L+P LVEKVALPILHHDIAHCWD+LST+ TKNAVSAT L+ Sbjct: 697 FDYGIKAGASGYESDDSDADLIPKLVEKVALPILHHDIAHCWDMLSTKETKNAVSATKLL 756 Query: 2082 ITYVPANGEALKDLLSAIHSRLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLL 2261 I Y+PA+ EAL++LL ++ +RL++AV+ + VPTWST+VI AVP AA+IAAYRFG +VRL+ Sbjct: 757 IDYIPASSEALQELLVSVRTRLSEAVSKLKVPTWSTLVINAVPQAAQIAAYRFGTSVRLM 816 Query: 2262 KNICLWKDILASSAIEQLAFDELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGS 2441 KNICLWKDI+A +EQL DELL +VLPHVR+I PNIHDAITRTER+VASL GVW+G Sbjct: 817 KNICLWKDIIALPVLEQLVLDELLCARVLPHVRNIMPNIHDAITRTERVVASLAGVWTGR 876 Query: 2442 SVTMEHSYKLQPLVDYVLTLGKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAI 2621 + + S KLQPLVDY+++LGKTLEKKHA GVS ETTGLARRLK MLV+LNE+DK RAI Sbjct: 877 DLIGDRSSKLQPLVDYLMSLGKTLEKKHALGVSTEETTGLARRLKCMLVELNEYDKGRAI 936 Query: 2622 LRTFQLKEAV 2651 LRTFQL+EA+ Sbjct: 937 LRTFQLREAL 946 >ref|XP_007225333.1| hypothetical protein PRUPE_ppa001044mg [Prunus persica] gi|462422269|gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus persica] Length = 925 Score = 785 bits (2028), Expect = 0.0 Identities = 461/900 (51%), Positives = 568/900 (63%), Gaps = 30/900 (3%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXNV 221 LLSF D+E +P +R HK+T++K+R+ NV Sbjct: 58 LLSFVDDEESAAAP-SRSSSSKPDKPSSRLGKPSSAHKMTALKDRLAHTSSVSTSLPSNV 116 Query: 222 QPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPI------------- 362 QPQAG YTKE L ELQKNTRT+ EP IVLKGL+KP Sbjct: 117 QPQAGTYTKEALRELQKNTRTLASSRPSS-----EPTIVLKGLVKPTGTISDTLREAREL 171 Query: 363 -YXXXXXXXXXXXXLDRMDVDDAETRLGSMGIG-GEGDKDLIPDQATINAIRAKRERLRQ 536 L R D DDAE RL SMGI +G L PDQATINAIRAKRERLR+ Sbjct: 172 DSDNDEEQEKERASLFRRDKDDAEARLASMGIDKAKGSSGLFPDQATINAIRAKRERLRK 231 Query: 537 SRAPASDYISLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFED-----GR 701 SRA A D+ISLD+GSNHG AEG+SDEEPEF+ RIA+FGD G KG FED Sbjct: 232 
SRAAAPDFISLDSGSNHGAAEGLSDEEPEFRGRIAIFGDNME--GSKKGVFEDVDDRAAD 289 Query: 702 KLIKEVPIDLRNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXX 881 ++++ ID Q RKG G Sbjct: 290 AVLRQKSID-----------RDEDEDEEEKIWEEEQFRKGLGKRMDDGSSIGVVSTSAPV 338 Query: 882 XXXXXXXXXXXXXXPGHN----VSPGLNIGGSAGVMK--KVLSIPQQATVASQAMRESLQ 1043 G++ V G +IGG+ G + V+SI QA +A +A+ E++ Sbjct: 339 VQSVPQPKATYSAMAGYSSVQSVPVGPSIGGAIGASQGSNVMSIKAQAEIAKKALEENVM 398 Query: 1044 RLKETHGRTMSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQH 1223 +LKE+HGRTM +L + DEN+S++L NI LE SL+ ADEK+ K + SV Sbjct: 399 KLKESHGRTMLSLTKTDENLSSSLLNITALEKSLSAADEKY----KGMEIGSV------- 447 Query: 1224 KAPYIEELEEQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVI- 1400 KAP IEELEE+MQK+HE+RA A LERR+AD+ DEM+E+EA + AAM +SK GSS +I Sbjct: 448 KAPLIEELEEEMQKIHEQRASATLERRSADD-DEMMEVEAAVKAAMSIFSKEGSSAEIIA 506 Query: 1401 ---NAAQLVSSKTREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAV 1571 +AAQ ++ REQTNLPVKLDE GRDMNLQ +S++ Sbjct: 507 AAKSAAQAATTAEREQTNLPVKLDEFGRDMNLQKRRDMKGRSEAHQHRKRRYESKRLSSM 566 Query: 1572 GDIFPYHHIEGXXXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETW 1751 + IEG Y +R+++L+T+ Q+F DA EE+SKL+LVKE+FE W Sbjct: 567 EVDSTHRTIEGESSTDESDSESNAYHKHRQLVLETAAQVFSDAAEEYSKLSLVKERFEEW 626 Query: 1752 KKRFFSSYRDAYMSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGG 1931 K + SSYRDAYMSLS PAIFSPYVRLEL+KWDPL E++DF +M WHSLL DY LPE G Sbjct: 627 KTDYASSYRDAYMSLSAPAIFSPYVRLELVKWDPLREKTDFLNMSWHSLLADYNLPEDGS 686 Query: 1932 DFNSDDADANLVPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEA 2111 DF DDADANLVP LVEKVALPIL H + HCWD+LSTR TKNAV+AT++V YVP + EA Sbjct: 687 DFAPDDADANLVPDLVEKVALPILLHQVVHCWDILSTRETKNAVAATSVVTDYVPPSSEA 746 Query: 2112 LKDLLSAIHSRLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDIL 2291 L DLL AI +RLADAV N+TVPTWS +V+ AVP+AARIAAYRFG++VRL+KNICLWK+IL Sbjct: 747 LADLLVAIRTRLADAVTNLTVPTWSPLVLTAVPNAARIAAYRFGLSVRLMKNICLWKEIL 806 Query: 2292 ASSAIEQLAFDELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKL 2471 A +E+LA +ELL GKVLPHVRSI N+HDAITRTERIVASL+GVW+GS+VT + KL 
Sbjct: 807 AFPVLEKLAIEELLCGKVLPHVRSIAANVHDAITRTERIVASLSGVWAGSNVTGDRR-KL 865 Query: 2472 QPLVDYVLTLGKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 Q LVDYVL+LG+TLEKKH+ GV+++E +GLARRLKKMLVDLNE+DKAR + RTF LKEA+ Sbjct: 866 QSLVDYVLSLGRTLEKKHSLGVTQSEISGLARRLKKMLVDLNEYDKARDLTRTFNLKEAL 925 >ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Citrus sinensis] Length = 913 Score = 773 bits (1995), Expect = 0.0 Identities = 450/886 (50%), Positives = 564/886 (63%), Gaps = 16/886 (1%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKER-IXXXXXXXXXXXXN 218 LLSFAD+E EE +HKIT+ KER N Sbjct: 45 LLSFADDE--EEKSEIPTSNRDRTRPSSRLSKPSSSHKITASKERQSSSATSSSTSLLSN 102 Query: 219 VQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXXXXX 398 VQ QAG YT+E LLEL+KNT+T+ EP +VL+G IKP Sbjct: 103 VQAQAGTYTEEYLLELRKNTKTL---KAPSSKPPAEPVVVLRGSIKP-EDSNLTRVQQKP 158 Query: 399 XLDRMDVD-----DAETRLGSMGIGGEG-DKDLIPDQATINAIRAKRERLRQSRAPASDY 560 D D D + E R S+G+G +I D+A I AIRAK++RLRQS A A DY Sbjct: 159 SRDSSDSDSDHKAETEKRFASLGVGKIAVQSGVIYDEAEIKAIRAKKDRLRQSGAKAPDY 218 Query: 561 ISLDAGSN--HGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLR 734 I LD GS+ G+AEG SDEEPEF R+A+FG+++++ KG FED E P+ R Sbjct: 219 IPLDGGSSSLRGDAEGSSDEEPEFPRRVAMFGERTASGKKKKGVFEDDDVDEDERPVVAR 278 Query: 735 NGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXX 914 Q RKG G Sbjct: 279 -------VENDYEYVDEDVMWEEEQVRKGLGKRIDDGSVRVGANTSSSVAMPQQQQQFSY 331 Query: 915 XXXPGHNVSPGLNIGGSAGVMKKV--LSIPQQATVASQAMRESLQRLKETHGRTMSALDR 1088 V+P +IGG+ G + + +SI Q+A A +A++ ++ RLKE+H RTMS+L + Sbjct: 332 ST----TVTPIPSIGGAIGASQGLDTMSIAQKAESAMKALQTNVNRLKESHARTMSSLKK 387 Query: 1089 NDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKL 1268 DE++S++L I DLE+SL+ A EKF+FMQKL+D+VSVICDFLQ KAPYIE LE +MQKL Sbjct: 388 TDEDLSSSLLKITDLESSLSAAGEKFIFMQKLRDYVSVICDFLQDKAPYIETLEAEMQKL 447 Query: 1269 HEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVINAAQ-----LVSSKTR 1433 ++ERA A+LERR ADN DEM E+EA + AA L G+S + + AA ++ + Sbjct: 448 
NKERASAILERRAADNDDEMTEVEAAIKAATLVIGDRGNSASKLIAASSAAQAAAAAAVK 507 Query: 1434 EQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXX 1613 EQTNLPVKLDE GRDMNLQ +S++ +EG Sbjct: 508 EQTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKLEGEST 567 Query: 1614 XXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMS 1793 Y+SNRE LL+T+E IF DA EE+S+L++VKE+FE WK+ + SSYRDAYMS Sbjct: 568 TDESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYRDAYMS 627 Query: 1794 LSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANLVPG 1973 LS PAI SPYVRLELLKWDPLHE++DF +M+WH+LLF+YGLP+ G DF DDADANLVP Sbjct: 628 LSTPAIMSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDFAHDDADANLVPT 687 Query: 1974 LVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLAD 2153 LVEKVALPILHHDIA+CWD+LSTR TKNAVSAT LV+ YVP + EALKDLL AIH+RLA+ Sbjct: 688 LVEKVALPILHHDIAYCWDMLSTRETKNAVSATILVMAYVPTSSEALKDLLVAIHTRLAE 747 Query: 2154 AVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELL 2333 AVANI VPTWS++ + AVP+AARIAAYRFG++VRL++NICLWK++ A +E+LA DELL Sbjct: 748 AVANIAVPTWSSLAMSAVPNAARIAAYRFGVSVRLMRNICLWKEVFALPILEKLALDELL 807 Query: 2334 SGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTL 2513 KVLPHVRSI N+HDAI+RTERIVASL+GVW+G SVT +KLQPLVD++L+L KTL Sbjct: 808 CRKVLPHVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQPLVDFMLSLAKTL 867 Query: 2514 EKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 EKKH GV+E+ET GLARRLKKMLV+LNE+D AR I RTF LKEA+ Sbjct: 868 EKKHLPGVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913 >ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis] gi|223548165|gb|EEF49657.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis] Length = 885 Score = 770 bits (1989), Expect = 0.0 Identities = 443/885 (50%), Positives = 552/885 (62%), Gaps = 15/885 (1%) Frame = +3 Query: 42 LLSFAD-EEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXN 218 LLSFAD EE DEE+P RP +HK+T+ K+R+ Sbjct: 43 LLSFADDEEEDEETP--RPSKQKPSKTKS-------SHKLTAPKDRLSSSSTTSTTSTNT 93 Query: 219 
-----VQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXX---EPKIVLKGLIKPIYXXX 374 + PQAG YTKE LLELQK TRT+ EPKI+LKGL+KP Sbjct: 94 NSNNVLLPQAGTYTKEALLELQKKTRTLAKPSSKPPPPPPSSSEPKIILKGLLKPTLPQT 153 Query: 375 XXXXXXXXXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPAS 554 D + +D+ D LIPD+ TI IRAKRERLRQSRA A Sbjct: 154 LNQQDADPPQDEIIIDE--------------DYSLIPDEDTIKKIRAKRERLRQSRATAP 199 Query: 555 DYISLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLR 734 DYISLD G+ +A SDEEPEF+ RIA+ G K + T F+ D Sbjct: 200 DYISLDGGAATSDA--FSDEEPEFRNRIAMIGKKDNTTPTTHAVFQ-----------DFD 246 Query: 735 NGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXX 914 NG + + R L Sbjct: 247 NGNDSHVIAEETVVNDEDEEDKIWEEEQFRKALGKRMDDPSSSTPSLFPTPSTSTITTTN 306 Query: 915 XXXPGHNVSPGLNIGGSAGVMKKV--LSIPQQATVASQAMRESLQRLKETHGRTMSALDR 1088 H V IGG+ G + LS+PQQ+ +A +A+ ++L RLKE+H RT+S+L + Sbjct: 307 NHRHSHIVP---TIGGAFGPTPGLDALSVPQQSHIARKALLDNLTRLKESHNRTVSSLTK 363 Query: 1089 NDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKL 1268 DEN+SA+L NI LE SL+ A EKF+FMQKL+DFVSVIC+FLQHKAPYIEELEEQMQ L Sbjct: 364 ADENLSASLMNITALEKSLSAAGEKFIFMQKLRDFVSVICEFLQHKAPYIEELEEQMQTL 423 Query: 1269 HEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSS----TAVINAAQLVSSKTRE 1436 HE+RA A+LERRTADN DEM+E++ L AA +S GS+ TA +NAAQ S+ +E Sbjct: 424 HEQRASAILERRTADNDDEMMEVKTALEAAKKVFSARGSNEAAITAAMNAAQDASASMKE 483 Query: 1437 QTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXX 1616 Q NLPVKLDE GRD+N Q + G +EG Sbjct: 484 QINLPVKLDEFGRDINQQKRLDMKRRAEARQRRKAQKKLSSVEVDGS---NQKVEGESST 540 Query: 1617 XXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSL 1796 Y+SNR++LLQT++QIFGDA EE+ +L++VK++FE WKK + +SYRDAYMS+ Sbjct: 541 DESDSESAAYQSNRDLLLQTADQIFGDASEEYCQLSVVKQRFENWKKEYSTSYRDAYMSI 600 Query: 1797 SVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANLVPGL 1976 S PAIFSPYVRLELLKWDPLHE++ FF M+WHSLL DYGLP+ G D + +DADANLVP L Sbjct: 601 SAPAIFSPYVRLELLKWDPLHEDAGFFHMKWHSLLSDYGLPQDGSDLSPEDADANLVPEL 660 Query: 1977 
VEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADA 2156 VEKVA+PILHH+IAHCWD+LSTR TKNAV ATNLV YVPA+ EAL +LL AI +RL DA Sbjct: 661 VEKVAIPILHHEIAHCWDMLSTRETKNAVFATNLVTDYVPASSEALAELLLAIRTRLTDA 720 Query: 2157 VANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLS 2336 V +I VPTWS + +KAVP AA+IAAYRFGM+VRL+KNICLWKDIL+ +E+LA D+LL Sbjct: 721 VVSIMVPTWSPIELKAVPRAAQIAAYRFGMSVRLMKNICLWKDILSLPVLEKLALDDLLC 780 Query: 2337 GKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLE 2516 KVLPH++S+ N+HDA+TRTERI+ASL+GVW+G+SVT S+KLQPLVD V++LGK L+ Sbjct: 781 RKVLPHLQSVASNVHDAVTRTERIIASLSGVWAGTSVTASRSHKLQPLVDCVMSLGKRLK 840 Query: 2517 KKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 KH G SE E +GLARRLKKMLV+LN++DKAR I R F L+EA+ Sbjct: 841 DKHPLGASEIEVSGLARRLKKMLVELNDYDKAREIARMFSLREAL 885 >ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citrus clementina] gi|557551111|gb|ESR61740.1| hypothetical protein CICLE_v10014191mg [Citrus clementina] Length = 913 Score = 769 bits (1986), Expect = 0.0 Identities = 450/886 (50%), Positives = 564/886 (63%), Gaps = 16/886 (1%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKER-IXXXXXXXXXXXXN 218 LLSFAD+E EE +HKIT+ KER N Sbjct: 45 LLSFADDE--EEKSEIPTSNRDRTRPSSRLSKPSSSHKITASKERQSSSATSSSTSLLSN 102 Query: 219 VQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXXXXX 398 VQ QAG YT+E LLEL+KNT+T+ EP +VL+G IKP Sbjct: 103 VQAQAGTYTEEYLLELRKNTKTL---KAPSSKPPAEPVVVLRGSIKP-EDSNLTRVQQKP 158 Query: 399 XLDRMDVD-----DAETRLGSMGIGGEG-DKDLIPDQATINAIRAKRERLRQSRAPASDY 560 D D D + E R S+G+G +I D+A I AIRAK++RLRQS A A DY Sbjct: 159 SRDSSDSDSDHKAETEKRFASLGVGKIAVQSGVIYDEAEIKAIRAKKDRLRQSGAKAPDY 218 Query: 561 ISLDAGSN--HGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLR 734 I LD GS+ G+AEG SDEEPEF R+A+FG+++++ KG FED E P+ R Sbjct: 219 IPLDGGSSSLRGDAEGSSDEEPEFPRRVAMFGERTASGKKKKGVFEDDDVDEDERPVVAR 278 Query: 735 NGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXX 914 Q RKG G Sbjct: 279 
-------VENDYEYVDEDVMWEEEQVRKGLGKRIDDSSVRVGANTSSSVAMPQQQQQFSY 331 Query: 915 XXXPGHNVSPGLNIGGSAGVMKKV--LSIPQQATVASQAMRESLQRLKETHGRTMSALDR 1088 V+P +IGG+ G + + +SI Q+A A +A++ ++ RLKE+H RTMS+L + Sbjct: 332 PT----TVTPIPSIGGAIGASQGLDTMSIAQKAESAMKALQTNVNRLKESHARTMSSLKK 387 Query: 1089 NDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKL 1268 DE++S++L I DLE+SL+ A E+F+FMQKL+D+VSVICDFLQ KAPYIE LE +MQKL Sbjct: 388 TDEDLSSSLLKITDLESSLSAAGERFIFMQKLRDYVSVICDFLQDKAPYIETLEAEMQKL 447 Query: 1269 HEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSS----TAVINAAQLVSSKT-R 1433 ++ERA A+LERR ADN DEM E+EA + AA L G+S TA +AAQ ++ + Sbjct: 448 NKERASAILERRAADNDDEMTEVEAAIKAATLFIGDRGNSASKLTAASSAAQAAAAAAIK 507 Query: 1434 EQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXX 1613 EQTNLPVKLDE GRDMNLQ +S++ +EG Sbjct: 508 EQTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKLEGEST 567 Query: 1614 XXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMS 1793 Y+SNRE LL+T+E IF DA EE+S+L++VKE+FE WK+ + SSYRDAYMS Sbjct: 568 TDESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYRDAYMS 627 Query: 1794 LSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANLVPG 1973 LS PAI SPYVRLELLKWDPLHE++DF +M+WH+LLF+YGLP+ G DF DDADANLVP Sbjct: 628 LSTPAIMSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDFAHDDADANLVPT 687 Query: 1974 LVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLAD 2153 LVEKVALPILHHDIA+CWD+LSTR TKN VSAT LV+ YVP + EALKDLL AIH+RLA+ Sbjct: 688 LVEKVALPILHHDIAYCWDMLSTRETKNVVSATILVMAYVPTSSEALKDLLVAIHTRLAE 747 Query: 2154 AVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELL 2333 AVANI VPTWS + + AVP++ARIAAYRFG++VRL++NICLWK++ A +E+LA DELL Sbjct: 748 AVANIAVPTWSPLAMSAVPNSARIAAYRFGVSVRLMRNICLWKEVFALPILEKLALDELL 807 Query: 2334 SGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTL 2513 KVLPHVRSI N+HDAI+RTERIVASL+GVW+G SVT +KLQPLVD++L+L KTL Sbjct: 808 CRKVLPHVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQPLVDFMLSLAKTL 867 Query: 2514 EKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 EKKH GV+E+ET 
GLARRLKKMLV+LNE+D AR I RTF LKEA+ Sbjct: 868 EKKHLPGVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913 >ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cicer arietinum] Length = 916 Score = 748 bits (1931), Expect = 0.0 Identities = 441/890 (49%), Positives = 563/890 (63%), Gaps = 20/890 (2%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXNV 221 LLSFAD+E D E+ RP +HKIT+ K+RI NV Sbjct: 48 LLSFADDENDNENENPRPRSSKPHRSGVSKSSSS-SHKITTHKDRISHSPSPSFLS--NV 104 Query: 222 QPQAGEYTKERLLELQKNTRTI-----GXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXX 386 QPQAG YTKE L ELQKNTRT+ EP IVLKGL+KP Sbjct: 105 QPQAGTYTKEALRELQKNTRTLVTGSTSRPSSTSXXPSSEPVIVLKGLLKPASSEPQGRE 164 Query: 387 XXXXXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPASDYIS 566 + + E + S+GI G+ LIPD+ TI AIRA+RERLRQ+R A DYIS Sbjct: 165 SDSEDEHK----EVEAKFASVGIQN-GNDSLIPDEETIKAIRARRERLRQARPAAQDYIS 219 Query: 567 LDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLR-NGG 743 LD GSNHG AEG+SDEEPEF+ RIALFG+K G KG FED + E +D R NGG Sbjct: 220 LDGGSNHGAAEGLSDEEPEFRGRIALFGEKGE--GGKKGVFED----VDERGVDGRFNGG 273 Query: 744 XXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXX 923 Q RKG G Sbjct: 274 GDVVVEEEDEEEKMWEEE---QFRKGLGKRMDEGPGRVSGGDVSVVQVAQQPKFVVPSAA 330 Query: 924 PGHNVSPGL---------NIGGS--AGVMKKVLSIPQQATVASQAMRESLQRLKETHGRT 1070 + P + +IGG+ A V+SI QQA +A +A+ ++++RLKE+HGRT Sbjct: 331 TVYGAVPNVVAAAASVSTSIGGAIPATPALDVISISQQAEIARKALLDNVRRLKESHGRT 390 Query: 1071 MSALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELE 1250 MS+L++ DEN+SA+L NI DLENSL ADEK+ FMQKL+++V+ ICDFLQHKA YIEELE Sbjct: 391 MSSLNKTDENLSASLLNITDLENSLVVADEKYRFMQKLRNYVTNICDFLQHKAFYIEELE 450 Query: 1251 EQMQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYS-KGGSSTAVINAAQLVSSK 1427 +QM+KLHE+RA A+ E+R + DEM+E+EA + AAM S KG + A +AAQ S Sbjct: 451 DQMKKLHEDRASAIFEKRATNIDDEMVEVEAAVKAAMSVLSRKGDNLEAARSAAQDAFSA 510 Query: 1428 TREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGX 1607 R+Q + PV+LDE GRD+NL+ A ++ H +EG Sbjct: 511 
VRKQRDFPVQLDEFGRDLNLEKRMKMKVMAEARQRRKSKAFDSNKLASMEVDD-HKVEGE 569 Query: 1608 XXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAY 1787 Y+S R+++LQ +++IF DA EE+S+L+LVK K E WK+ +FSSY DAY Sbjct: 570 SSTDESDSESQAYQSQRDLVLQAADEIFSDASEEYSQLSLVKNKMEEWKREYFSSYNDAY 629 Query: 1788 MSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANL- 1964 +SLS+P IFSPYVRLELL+WDPLH+ DF +M+W+ LLF YGLPE G DF DD DA+L Sbjct: 630 ISLSLPLIFSPYVRLELLRWDPLHKGLDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLE 689 Query: 1965 -VPGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHS 2141 VP LVEKVALPI H++I+HCWD+LS + T NA+SAT L++ +V EAL +LL +I + Sbjct: 690 LVPNLVEKVALPIFHYEISHCWDMLSQQETMNAISATKLIVQHVSHESEALAELLVSIRT 749 Query: 2142 RLADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAF 2321 RLADAVAN+TVPTWS +V+ AVPDAAR+AAYRFG++VRLL+NICLWKDI A +E+LA Sbjct: 750 RLADAVANLTVPTWSPLVLSAVPDAARVAAYRFGVSVRLLRNICLWKDIFAMPVLEKLAL 809 Query: 2322 DELLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTL 2501 DELL KVLPH RSI+ N+HDAITRTERI+ASL+GVW+G SVT + + KLQPLV YVL+L Sbjct: 810 DELLYDKVLPHFRSISENVHDAITRTERIIASLSGVWAGPSVTGDRNRKLQPLVVYVLSL 869 Query: 2502 GKTLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 G+ LE+++ V E++T+ LARRLKK+LVDLNE+D AR + RTF LKEA+ Sbjct: 870 GRVLERRN---VPESDTSYLARRLKKILVDLNEYDHARNMARTFHLKEAL 916 >ref|XP_007160943.1| hypothetical protein PHAVU_001G030200g [Phaseolus vulgaris] gi|561034407|gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus vulgaris] Length = 882 Score = 730 bits (1885), Expect = 0.0 Identities = 423/877 (48%), Positives = 552/877 (62%), Gaps = 7/877 (0%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXNV 221 LLSFAD+E + E+P R HKIT++K+RI NV Sbjct: 47 LLSFADDE-ENENPRPRSAKPQRSSKPSS------AHKITTLKDRIASSSPSVPS---NV 96 Query: 222 QPQAGEYTKERLLELQKNTRT-IGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXXXXX 398 QPQAG YTKE L ELQKNTRT + EP IVLKGL+KP+ Sbjct: 97 QPQAGTYTKETLRELQKNTRTLVTSSSRSEPKPPGEPVIVLKGLVKPVASEPQGRESDSE 156 Query: 399 
XLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPASDYISLDAG 578 D + E +LG +G+ G PD+ TI AIRAKRERLRQ+R A DYISLD G Sbjct: 157 G----DHKEVEGKLGGLGLHN-GKDSFFPDEETIKAIRAKRERLRQARPAAQDYISLDGG 211 Query: 579 SNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLRNGGXXXXX 758 SNHG AEG+SDEEPEF+ RIA+FG+K G KG FE+ ++E +D+R Sbjct: 212 SNHGAAEGLSDEEPEFRGRIAMFGEKVE--GGKKGVFEE----VEERRVDVR------FK 259 Query: 759 XXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXPGHNV 938 Q RKG G G Sbjct: 260 EEEEDDDEEEKMWEEEQFRKGLGKRMDEGSARVDVPVVQGAQQHKYVVPSAAVPNAGFGT 319 Query: 939 ---SPGLNIGGSAGVMKKVLSIPQQATVASQAMRESLQRLKETHGRTMSALDRNDENMSA 1109 P L++ LS+ QQA A +A+ E+++RLKE+HGRTMS+L + DEN+SA Sbjct: 320 IESMPALDV----------LSLSQQAESAKKALVENVRRLKESHGRTMSSLSKTDENLSA 369 Query: 1110 ALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKLHEERAVA 1289 +L NI LENSL AD+K+ FMQKL+++V+ ICDFLQHKA YIEELEEQ++KLH +RA A Sbjct: 370 SLLNITALENSLVVADDKYRFMQKLRNYVTNICDFLQHKAFYIEELEEQIKKLHGDRATA 429 Query: 1290 VLERRTADNADEMIEIEAPLSAAM-LEYSKGGSSTAVINAAQLVSSKTREQTNLPVKLDE 1466 + E+RT +N DE++E+EA + AAM + KG + A +AAQ + R+Q +LPVKLDE Sbjct: 430 IFEKRTTNNDDEIVEVEAAVKAAMSVLNKKGNNMEAAKSAAQEAYTAVRKQKDLPVKLDE 489 Query: 1467 LGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXXXXXXXXXXXY 1646 GRD+NL+ ++ H IEG Y Sbjct: 490 FGRDLNLEKRMQMKMRAVARQRKRSQLFDSNKLTSMEL-DDHKIEGESSTDESDSESQAY 548 Query: 1647 KSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSLSVPAIFSPYV 1826 +S R+++LQ +++IFGDA EE+ +L+LVK + E WK+ + SSY+DAYMSLS+P +FSPYV Sbjct: 549 ESQRDLVLQAADEIFGDASEEYGQLSLVKRRMEEWKRDYSSSYKDAYMSLSLPLVFSPYV 608 Query: 1827 RLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADAN--LVPGLVEKVALPI 2000 RLELL+WDPLH+ DF +M+W+ LLF YGLPE G DF DD DA+ LVP LVEKVALPI Sbjct: 609 RLELLRWDPLHKGIDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNLVEKVALPI 668 Query: 2001 LHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADAVANITVPT 2180 L ++I+HCWD+LS R T NA++AT L++ +V EAL DLL +I +RLADAVAN+ VPT Sbjct: 669 LQYEISHCWDMLSQRETMNAIAATKLIVQHVSRKSEALTDLLVSIRTRLADAVANLKVPT 728 Query: 2181 
WSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLSGKVLPHVR 2360 WS VV+ AVPDAAR+AAYRFG++VRLL+NICLWKD+ ++S +E+LA DELL GKVLPH+R Sbjct: 729 WSPVVLVAVPDAARVAAYRFGVSVRLLRNICLWKDVFSTSVLEKLALDELLFGKVLPHLR 788 Query: 2361 SITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLEKKHASGVS 2540 I+ N+ DAITRTER++ASL+GVW+G SV + +KLQPL+ YVL+LG+ LE+++ V Sbjct: 789 IISENVQDAITRTERVIASLSGVWAGPSVIGDKKHKLQPLLTYVLSLGRILERRN---VP 845 Query: 2541 ETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 E++T+ LARRLKK+LVDLNE+D AR + RTF LKEA+ Sbjct: 846 ESDTSYLARRLKKILVDLNEYDHARTMARTFHLKEAL 882 >ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 913 Score = 723 bits (1865), Expect = 0.0 Identities = 426/883 (48%), Positives = 550/883 (62%), Gaps = 13/883 (1%) Frame = +3 Query: 42 LLSFADE-EGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXN 218 LLSFADE E +E+P RP +HKIT++K+RI N Sbjct: 50 LLSFADEDEQTDENP--RPRASKPYRSAATAKKPSSSHKITTLKDRIAHSSSPSVPS--N 105 Query: 219 VQPQAGEYTKERLLELQKNTRTI--GXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXXX 392 VQPQAG YTKE L ELQKNTRT+ EP IVLKGL+KP+ Sbjct: 106 VQPQAGTYTKEALRELQKNTRTLVTSSSSRSDPKPSSEPVIVLKGLVKPLGSEPQGRDSY 165 Query: 393 XXXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPASDYISLD 572 R + E +L ++GI + + PD TI AIRAKRERLRQ+R A DYISLD Sbjct: 166 SEGEHR----EVEAKLATVGIQNK-EGSFYPDDETIRAIRAKRERLRQARPAAPDYISLD 220 Query: 573 AGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLRNGGXXX 752 GSNHG AEG+SDEEPEF+ RIA+FG+K G KG FE+ ++E +D+R G Sbjct: 221 GGSNHGAAEGLSDEEPEFRGRIAMFGEKVD--GGKKGVFEE----VEERIMDVRFKGGED 274 Query: 753 XXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXPGH 932 Q RKG G G Sbjct: 275 EVVDDDDDDEEKMWEEE-QFRKGLGKRMDEGSARVDVSVMQGSQSPHNFVVPSAAKVYGA 333 Query: 933 NVSPGLNIGGSAGVMKK------VLSIPQQATVASQAMRESLQRLKETHGRTMSALDRND 1094 S ++ S G + + V+ I QQA A +A+ E+++RLKE+HGRTMS+L + D Sbjct: 334 VPSAAASVSPSIGGVIESLPALDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKTD 393 Query: 1095 ENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKLHE 1274 EN+SA+L 
NI LENSL ADEK+ FMQKL+++V+ ICDFLQHKA YIEELEEQM+KLHE Sbjct: 394 ENLSASLLNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKAFYIEELEEQMKKLHE 453 Query: 1275 ERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVIN-AAQLVSSKTREQTNLP 1451 +RA+A+ ERR +N DEMIE+E + AAM SK G++ AAQ S R+Q +LP Sbjct: 454 DRALAISERRATNNDDEMIEVEEAVKAAMSVLSKKGNNMEAAKIAAQEAFSAVRKQRDLP 513 Query: 1452 VKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDI-FPYHHIEGXXXXXXXX 1628 VKLDE GRD+NL+ + V + H IEG Sbjct: 514 VKLDEFGRDLNLEKRMNMKAKTRSEACQRKRSQAFDSNKVTSMELDDHKIEGESSTDESD 573 Query: 1629 XXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSLSVPA 1808 Y+S +++LQ +++IF DA EE+ +L+LVK + E WK+ SSY+DAYMSLS+P Sbjct: 574 SESQAYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKREHSSSYKDAYMSLSLPL 633 Query: 1809 IFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANL--VPGLVE 1982 IFSPYVRLELL+WDPLH DF +M+W+ LLF YGLPE G DF DD DA+L VP LVE Sbjct: 634 IFSPYVRLELLRWDPLHNGVDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNLVE 693 Query: 1983 KVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADAVA 2162 KVALPILH++I+HCWD++S + T NA++AT L++ +V EAL DLL +I +RLADAVA Sbjct: 694 KVALPILHYEISHCWDMVSQQETVNAIAATKLMVQHVSHESEALADLLVSIQTRLADAVA 753 Query: 2163 NITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLSGK 2342 ++TVPTWS V+ AVPDAAR+AAYRFG++VRLL+NICLWKD+ + +E++A DELL K Sbjct: 754 DLTVPTWSPSVLAAVPDAARVAAYRFGVSVRLLRNICLWKDVFSMPVLEKVALDELLCRK 813 Query: 2343 VLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLEKK 2522 VLPH+R I+ N+ DAITRTERI+ASL+G+W+G SV + + KLQPLV YVL+LG+ LE++ Sbjct: 814 VLPHLRVISENVQDAITRTERIIASLSGIWAGPSVIGDKNRKLQPLVTYVLSLGRILERR 873 Query: 2523 HASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 + V E +T+ LARRLKK+L DLNE+D AR + RTF LKEA+ Sbjct: 874 N---VPENDTSHLARRLKKILADLNEYDHARNMARTFHLKEAL 913 >ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max] Length = 916 Score = 721 bits (1860), Expect = 0.0 Identities = 424/883 (48%), Positives = 550/883 (62%), Gaps = 13/883 (1%) Frame = +3 Query: 42 
LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXNV 221 LLSFAD+E DE RP +HKIT++K+RI NV Sbjct: 51 LLSFADDE-DETDENPRPRASKPHRTAATAKKPSSSHKITTLKDRIAHTSSPSVPT--NV 107 Query: 222 QPQAGEYTKERLLELQKNTRTI--GXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXXXX 395 QPQAG YTKE L ELQKNTRT+ EP IVLKG +KP+ Sbjct: 108 QPQAGTYTKEALRELQKNTRTLVSSSSSRSDPKPSSEPVIVLKGHVKPLGPETQGRDSDS 167 Query: 396 XXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPASDYISLDA 575 + + E +L ++GI + D PD+ TI AIRAKRERLR +R A DYISLD Sbjct: 168 D--SEGEHREVEAKLATVGIQNKEDS-FYPDEETIRAIRAKRERLRLARPAAPDYISLDG 224 Query: 576 GSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLRNGGXXXX 755 GSNHG AEG+SDEEPEF+ RIA+FG+K G KG FE+ ++E +DLR G Sbjct: 225 GSNHGAAEGLSDEEPEFRGRIAMFGEKVD--GGKKGVFEE----VEERRVDLRFKGGEEE 278 Query: 756 XXXXXXXXXXXXXXXXXQCRKG------RGXXXXXXXXXXXXXXXLXXXXXXXXXXXXXX 917 Q RKG G L Sbjct: 279 VLDDDDDEEEKMWEEE-QFRKGLGKRMDEGSARVDVAAAAVQGAQLQHNFVVPSAAKVYG 337 Query: 918 XXPGHNVSPGLNIGGSAGVMK--KVLSIPQQATVASQAMRESLQRLKETHGRTMSALDRN 1091 P S +IGG+ + V+ I QQA A +A+ E+++RLKE+HGRTMS+L + Sbjct: 338 AVPSAAASVSPSIGGAIESLPVLDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKT 397 Query: 1092 DENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKLH 1271 DEN+SA+L NI LENSL ADEK+ FMQKL+++V+ ICDFLQHKA YIEELEEQM+KLH Sbjct: 398 DENLSASLLNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKACYIEELEEQMKKLH 457 Query: 1272 EERAVAVLERRTADNADEMIEIEAPLSAAM-LEYSKGGSSTAVINAAQLVSSKTREQTNL 1448 ++RA A+ ERR +N DEM+E+E + AAM + KG + A AAQ + R+Q +L Sbjct: 458 QDRASAIFERRATNNDDEMVEVEEAVKAAMSVLIKKGNNMEAAKIAAQEAFAAVRKQRDL 517 Query: 1449 PVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXXXXXX 1628 PVKLDE GRD+NL+ + + H IEG Sbjct: 518 PVKLDEFGRDLNLEKRMNMKVRAEACQRKRSLAFGYNKVTSME-WDDHKIEGESSTDESD 576 Query: 1629 XXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSLSVPA 1808 Y+S +++LQ +++IF DA EE+ +L+LVK + E WK+ + S+Y+DAYMSLS+P Sbjct: 577 SESQAYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKREYSSTYKDAYMSLSLPL 636 Query: 1809 IFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANL--VPGLVE 
1982 IFSPYVRLELL+WDPLH+ DF +M+W+ LLF YGLPE G DF DD DA+L VP LVE Sbjct: 637 IFSPYVRLELLRWDPLHKGVDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNLVE 696 Query: 1983 KVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADAVA 2162 KVALPILH++I+HCWD+LS + T NA++AT L++ +V EAL LL +I +RLADAVA Sbjct: 697 KVALPILHYEISHCWDMLSQQETVNAIAATKLIVQHVSHESEALAGLLVSIRTRLADAVA 756 Query: 2163 NITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLSGK 2342 N+TVPTWS V+ AVPDAAR+AAYRFG++VRLL+NI WKD+ + + +E++A DELL GK Sbjct: 757 NLTVPTWSLPVLAAVPDAARVAAYRFGVSVRLLRNIGSWKDVFSMAVLEKVALDELLCGK 816 Query: 2343 VLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLEKK 2522 VLPH+R I+ N+ DAITRTERI+ASL+GVWSG SV + + KLQPLV YVL+LG+ LE++ Sbjct: 817 VLPHLRVISENVQDAITRTERIIASLSGVWSGPSVIGDKNRKLQPLVTYVLSLGRILERR 876 Query: 2523 HASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 + V E++T+ LARRLKK+LVDLNE+D AR++ RTF LKEA+ Sbjct: 877 N---VPESDTSHLARRLKKILVDLNEYDHARSMARTFHLKEAL 916 >ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] gi|355512167|gb|AES93790.1| GC-rich sequence DNA-binding factor-like protein [Medicago truncatula] Length = 892 Score = 720 bits (1859), Expect = 0.0 Identities = 423/881 (48%), Positives = 549/881 (62%), Gaps = 11/881 (1%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXNV 221 LLSFAD+E D ++ RP +HKIT+ K RI NV Sbjct: 40 LLSFADDEIDADNETPRPRSSKPHHHRPKPSSSS-SHKITTHKNRITSHSPSPSPS--NV 96 Query: 222 QPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPK------IVLKGLIKPIYXXXXXX 383 QPQAG YT E L ELQKNTRT+ EPK IVLKGL+KP+ Sbjct: 97 QPQAGTYTLEALRELQKNTRTLVTPTTASRPISSEPKPSSEPVIVLKGLLKPVTSEPES- 155 Query: 384 XXXXXXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPASDYI 563 D + + E + S+GI G P + I A +AKRER+R++ A A DYI Sbjct: 156 -------DSEENGEFEAKFASVGIKN-GKDSFFPGEEDIKAAKAKRERMRKAGAAAPDYI 207 Query: 564 SLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLRNGG 743 SLD GSNHG AEG+SDEEPE++ RIA+FG K + G KG FE + +V +D +G Sbjct: 208 
SLDGGSNHGAAEGLSDEEPEYRGRIAMFGGKKGD-GEKKGVFEVADERFDDVVVDEEDG- 265 Query: 744 XXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXX 923 R G G Sbjct: 266 ---LWEEEQFKKGLGKRRDEGSARVGGGGEVPVVQAAQQPNFVGPSVANVYGAVPNVVAA 322 Query: 924 PGHNVSPGLNIGGS--AGVMKKVLSIPQQATVASQAMRESLQRLKETHGRTMSALDRNDE 1097 N S IGG+ A + V+SI QQA +A +AM ++++RLKE+HGRTMS+L++ DE Sbjct: 323 ASANTS----IGGAIPATPVLDVISISQQAEIAKKAMLDNIRRLKESHGRTMSSLNKTDE 378 Query: 1098 NMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKLHEE 1277 N+SA+L I DLE+SL ADEK+ FMQKL++++S ICDFLQHKA YIEELE+QM+KLHE+ Sbjct: 379 NLSASLLKITDLESSLVVADEKYRFMQKLRNYISNICDFLQHKAYYIEELEDQMKKLHED 438 Query: 1278 RAVAVLERRTADNADEMIEIEAPLSAAMLEYS-KGGSSTAVINAAQLVSSKTREQTNLPV 1454 RA A+ E+R +N DEM+E+EA + AAML S KG + A +AAQ + R+Q + PV Sbjct: 439 RASAIFEKRATNNDDEMVEVEAAVKAAMLVLSRKGDNVEAARSAAQDAFAAVRKQRDFPV 498 Query: 1455 KLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXXXXXXXX 1634 +LDE GRD+NL+ SA +I H +EG Sbjct: 499 QLDEFGRDLNLEKRKQMKVMAEARQRRRSKAFDSKKSASMEI-DDHKVEGESSTDESDSE 557 Query: 1635 XXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSLSVPAIF 1814 Y+S R+++LQ +++IF DA EE+S+L+LVK + E WK+ + SSY +AY+SLS+P IF Sbjct: 558 SQAYQSQRDLVLQAADEIFSDASEEYSQLSLVKTRMEEWKREYSSSYNEAYISLSLPLIF 617 Query: 1815 SPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADAN--LVPGLVEKV 1988 SPYVRLELL+WDPLH+ DF DM+W+ LLF YGLPE G DF DD DA+ LVP LVEKV Sbjct: 618 SPYVRLELLRWDPLHKGLDFQDMKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNLVEKV 677 Query: 1989 ALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADAVANI 2168 ALPILH++++HCWD+LS + T NA++AT L++ +V EAL LL +I +RLADAVAN+ Sbjct: 678 ALPILHYEVSHCWDMLSQQETMNAIAATKLIVQHVSRESEALAGLLVSIRTRLADAVANL 737 Query: 2169 TVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLSGKVL 2348 TVPTWS +V+ AVPDAA+IAAYRFG++VRLL+NICLWKDI A S +E+LA DELL KVL Sbjct: 738 TVPTWSPLVLAAVPDAAKIAAYRFGVSVRLLRNICLWKDIFAMSVLEKLALDELLYAKVL 797 Query: 2349 PHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLEKKHA 2528 PH RSI+ N+ DAITRTERI+ SL+GVW+G SVT + S 
KLQPLV YVL+LG+ LE+++ Sbjct: 798 PHFRSISENVQDAITRTERIIDSLSGVWAGPSVTGDKSRKLQPLVAYVLSLGRILERRN- 856 Query: 2529 SGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 V E++ LARRLKK+LVDLNE+D AR + RTF LKEA+ Sbjct: 857 --VPESD---LARRLKKILVDLNEYDHARTMARTFHLKEAL 892 >ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-like isoform X1 [Glycine max] Length = 896 Score = 705 bits (1820), Expect = 0.0 Identities = 422/885 (47%), Positives = 540/885 (61%), Gaps = 15/885 (1%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXNV 221 LLSFAD DEE RP +HKIT++K+RI NV Sbjct: 47 LLSFAD---DEEISNPRPRSSAKPQRPSKPSS---SHKITTLKDRIAHSSSVSS----NV 96 Query: 222 QPQAGEYTKERLLELQKNTRTI--GXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXXXX 395 QPQAG YTKE L ELQKNTRT+ EP IVLKGL+KP+ Sbjct: 97 QPQAGTYTKEALRELQKNTRTLVSSSTTTTTSSSRSEPVIVLKGLVKPVVSEPQGRHSDS 156 Query: 396 XXLDRMDVDDAETRLGSMGIGGEGDKDLIPDQATINAIRAKRERLRQSRAPASDYISLDA 575 + + E +L S+GI G PD+ TI AIRAKRERLR++R A DYISLD Sbjct: 157 EGEHK----EVEGKLSSLGIQN-GKDSFFPDEETIKAIRAKRERLRKARPAAPDYISLDG 211 Query: 576 GSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVPIDLRNGGXXXX 755 GSNHG AEG+SDEEPEF+ RIA+F +K G F E +L E D Sbjct: 212 GSNHGAAEGLSDEEPEFRGRIAMFEEKGEGGGKKGVFEEVEERLRDEEEND--------- 262 Query: 756 XXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXPG-- 929 Q RKG G G Sbjct: 263 -----DDYEEEKMWEEEQFRKGLGKRMDEGAARVDVPVVQGAQQNKFVVSSAAAVYGGVP 317 Query: 930 ------HNVSPGLNIGGSAGVMKKVLSIP--QQATVASQAMRESLQRLKETHGRTMSALD 1085 +VSP +IGG+ M + +P QQA A +A+ E+++RLKE+H RTMS+L Sbjct: 318 SADARVPSVSP--SIGGATESMPALDVVPMSQQAERARKALVENVRRLKESHERTMSSLS 375 Query: 1086 RNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQK 1265 + DEN+SA+ I LENSL ADEK+ FMQKL+++VS +CDFLQHKA YIEELEEQM+K Sbjct: 376 KTDENLSASFLKITALENSLVVADEKYRFMQKLRNYVSNMCDFLQHKAFYIEELEEQMKK 435 Query: 1266 LHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSST-AVINAAQLVSSKTREQT 1442 LHE+RA A+ ERRT +N DEMIE+EA + A M +K G++ A +AAQ + R+Q Sbjct: 436 
LHEDRASAIFERRTTNNDDEMIEVEAAVKAVMSVLNKKGNNMEAAKSAAQEAFAAVRKQK 495 Query: 1443 NLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXXXX 1622 +LPVKLDE GRD+NL+ A ++ IEG Sbjct: 496 DLPVKLDEFGRDLNLEKRMQMKVRAEAHQRKRSQAFNSNKLASMELDD-PKIEGESSTDE 554 Query: 1623 XXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSLSV 1802 Y+S R+++LQ ++ IF DA EE+ +L+ VK + E WK+ + SSY+DAYMSLS+ Sbjct: 555 SDSESQAYQSQRDLVLQAADGIFSDASEEYGQLSFVKRRMEEWKREYSSSYKDAYMSLSL 614 Query: 1803 PAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANL--VPGL 1976 P +FSPYVRLELL+WDPLH+ DF +M+W+ LLF YGLPE G DF DD DA+L VP L Sbjct: 615 PLVFSPYVRLELLRWDPLHKGLDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNL 674 Query: 1977 VEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADA 2156 VEKVALPILH++I+HCWD+LS + T NA++AT L++ +V EAL DLL +I +RLADA Sbjct: 675 VEKVALPILHYEISHCWDMLSQQETVNAIAATKLIVQHVSHESEALADLLVSIRTRLADA 734 Query: 2157 VANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLS 2336 VAN+TVPTWS V+ AV DAAR+AAYRFG++VRLL+NIC WKD+ + +E LA DELL Sbjct: 735 VANLTVPTWSPPVVAAVADAARVAAYRFGVSVRLLRNICSWKDVFSMPVLENLALDELLF 794 Query: 2337 GKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLE 2516 GKVLPH+R I+ N+ DAITRTERI+ASL+GVW+G SV + KLQPL+ YVL+LG+ LE Sbjct: 795 GKVLPHLRIISENVQDAITRTERIIASLSGVWAGPSVIADRKRKLQPLLTYVLSLGRILE 854 Query: 2517 KKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 +++A E++T+ LARRLKK+LVDLNE+D AR + RTF LKEA+ Sbjct: 855 RRNA---PESDTSHLARRLKKILVDLNEYDHARTMARTFHLKEAL 896 >gb|EYU22626.1| hypothetical protein MIMGU_mgv1a001081mg [Mimulus guttatus] Length = 894 Score = 696 bits (1795), Expect = 0.0 Identities = 406/885 (45%), Positives = 529/885 (59%), Gaps = 15/885 (1%) Frame = +3 Query: 42 LLSFADEEGDEESPFARPXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXXXXXNV 221 LLSFAD+ DEESPF+RP HK+TS K+RI NV Sbjct: 59 LLSFADD--DEESPFSRPPSKPPSSSSSSRINKSSAHKLTSSKDRIAPHPPSTSLPS-NV 115 Query: 222 QPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXXXXXXX 401 QPQAG YTKE LLELQKNT+T EP ++LKG IKPI Sbjct: 116 
QPQAGLYTKEALLELQKNTKTFAAPARNKPKPDPEPVVILKGSIKPINSTDSNSEANGRG 175 Query: 402 ---LDRM------DVDDAETRLGSMGIGGE--GDKDLIPDQATINAIRAKRERLRQSRAP 548 D+ D +DAE+RL + +G + D +++PDQ I+AI+AKRERLRQ++ Sbjct: 176 EVGFDQKRQGLSADRNDAESRLKDIALGPDLGDDNEVMPDQTMIDAIKAKRERLRQAKPA 235 Query: 549 ASDYISLDAGSNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFED--GRKLIKEVP 722 A DYI+LD GSNHGEAEG+SDEEPEFQ RI FG+K KG FED R + KE Sbjct: 236 APDYIALDGGSNHGEAEGLSDEEPEFQGRIGFFGEKIGGRDSKKGVFEDFEERAMSKERG 295 Query: 723 IDLRNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXX 902 I+ + Q RKG G L Sbjct: 296 IETDDD----------EEDEEDKMWEEEQVRKGLGKR-------------LDDGVGSVNS 332 Query: 903 XXXXXXXPGHNVSPGLNIGGSAGVMKKV--LSIPQQATVASQAMRESLQRLKETHGRTMS 1076 P N+GG+ + + +SI QQA VA +A+ E+L+R+KE+HGRTM Sbjct: 333 NVSGVNSISVMHPPSKNVGGAGVDIFGIDDISISQQAEVAKKALTENLRRVKESHGRTMM 392 Query: 1077 ALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQ 1256 +L +++EN+S++L N++ LE+SLA A EKFVFMQKL++FVSV+C+FL+HK I ELEE+ Sbjct: 393 SLAKSEENLSSSLRNVLSLEDSLAAAGEKFVFMQKLREFVSVLCEFLEHKDFEIVELEER 452 Query: 1257 MQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVINAAQLVSSKTRE 1436 +Q LHEERA A+ +RR ADN DE+ EIE ++ S R Sbjct: 453 LQNLHEERARAIEKRRAADNDDEISEIEQVIAG----------------------SNARA 490 Query: 1437 QTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXX 1616 ++PV+LDE GRD+NLQ SA+ +EG Sbjct: 491 VKSVPVELDEFGRDVNLQKRMDISRRREARQRRRAKADSKRNSAMEKDGSVQQMEGELST 550 Query: 1617 XXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSL 1796 Y+S+ + LL+ ++ IF DA EE+S+ + V E+FETWKK + SSYRDAYMS+ Sbjct: 551 DESDSESTAYESHHKELLKCADDIFSDAAEEYSEFSNVVERFETWKKEYGSSYRDAYMSM 610 Query: 1797 SVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANLVPGL 1976 S+P +FSPYVRLEL+KWDPLH ++DF DM+WHSLLF+YG G+ DDAD NLVP L Sbjct: 611 SIPELFSPYVRLELVKWDPLHGDADFMDMKWHSLLFNYGENGISGENAEDDADTNLVPQL 670 Query: 1977 VEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADA 2156 VEK+A+PILHH +A+CWD+LSTR TK AVSA NLV+TYV + AL +L+ + RL A Sbjct: 671 
VEKIAIPILHHQLAYCWDILSTRETKFAVSAMNLVMTYVDHSSSALGNLIPVLRDRLTKA 730 Query: 2157 VANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLS 2336 VA++ VPTWS + +KAVP+AAR+ AYRFG VRL++NICLW IL +E++A DELL Sbjct: 731 VADLMVPTWSPLEMKAVPNAARVGAYRFGTCVRLMRNICLWNGILDKPVLEKIALDELLG 790 Query: 2337 GKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLE 2516 K+LPH+ SI+ N+HDA+ RTER++ SL GVW+G V + KLQPLV ++L +GKTLE Sbjct: 791 RKILPHLHSISSNVHDAVIRTERVIDSLCGVWTGPGVAGD-KRKLQPLVKFLLLIGKTLE 849 Query: 2517 KKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 K+ AS ETE+ L RRLKKMLVDLNE+D AR + R F LKEA+ Sbjct: 850 KRQASSAVETESGSLVRRLKKMLVDLNEYDHARELSRKFNLKEAL 894 >ref|XP_006399356.1| hypothetical protein EUTSA_v10012615mg [Eutrema salsugineum] gi|557100446|gb|ESQ40809.1| hypothetical protein EUTSA_v10012615mg [Eutrema salsugineum] Length = 909 Score = 684 bits (1765), Expect = 0.0 Identities = 406/888 (45%), Positives = 532/888 (59%), Gaps = 18/888 (2%) Frame = +3 Query: 42 LLSFADEEGDEESPFA------RPXXXXXXXXXXXXXXXXXTHKI-TSMKERIXXXXXXX 200 LLSFAD+E +E+ P + +H++ +S E Sbjct: 52 LLSFADDEEEEDGPLSVAVKPKNKSGRDRSKSSSRLGISGSSHRLNSSTMESRPSSYSST 111 Query: 201 XXXXXNVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXX 380 NV PQAG YTKE LLELQKNTRT+ EPK+VLKGLIKP Sbjct: 112 ATPLSNVLPQAGSYTKEALLELQKNTRTL---PYSRPSANTEPKVVLKGLIKP------- 161 Query: 381 XXXXXXXLDRMDVDDAETRLGSMGIGGEGDKD----LIPDQATINAIRAKRERLRQSR-A 545 ++ + D ++ + E + + + DQATI AI A + RQSR A Sbjct: 162 ----PQEQEQQSLKDVVKQVSDLDFDEEKEDERPEGMFYDQATIEAILATK---RQSRTA 214 Query: 546 PASDYISLDAGS-NHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDGRKLIKEVP 722 PA D+ISLD + NH EGISDEE +F + N G + F D + +KE Sbjct: 215 PAPDFISLDGSTANHSAVEGISDEEADFHGSLIGARQHKGN-GKSVLDFGDEKPTVKEST 273 Query: 723 IDLRNGGXXXXXXXXXXXXXXXXXXXXXQCRKGRGXXXXXXXXXXXXXXXLXXXXXXXXX 902 Q +KG G + Sbjct: 274 TS----------SYYEDEDEEDKLWEEEQFKKGIGKRMDEGSNRTANSSGIGVPLHPQQK 323 Query: 903 XXXXXXXPGHNVS--PGLNIGGSAGVMKKVLSIPQQATVASQAMRESLQRLKETHGRTMS 1076 PG ++ P + IG ++ V L + QQA +A +A+ ++++RLKE+H +T+ 
Sbjct: 324 PQMYAYHPGTPLASVPNVTIGPASSV--DTLPMSQQAELAKKALLDNVKRLKESHAKTLL 381 Query: 1077 ALDRNDENMSAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQ 1256 +L + DEN++A+L +I LE+SL+ A +K+VFMQKL+DF+SVICDF+Q K +IEE+E++ Sbjct: 382 SLTKTDENLTASLMSITALESSLSAAGDKYVFMQKLRDFISVICDFMQEKGSFIEEIEDR 441 Query: 1257 MQKLHEERAVAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVINAAQ---LVSSK 1427 M++L+E A A+LERR ADN DEM+E+ A + AAM + GSST+VI AA L +S Sbjct: 442 MKELNENHAAAILERRIADNDDEMVELGAAVKAAMAVLNTQGSSTSVIAAATSAALAASA 501 Query: 1428 TREQTNLPVKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGX 1607 + Q PVKLDELGRD NLQ SA+ IEG Sbjct: 502 SIRQQIQPVKLDELGRDENLQKRRQAEQRAAARQKRRARFENKRASAMEIDGSSLKIEGE 561 Query: 1608 XXXXXXXXXXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAY 1787 YK ++ LLQ +Q+F DA EE+S+L+ VKE+FE WK+ + S+YRDAY Sbjct: 562 SSTDESDSESSAYKELKDKLLQYGDQVFSDASEEYSQLSRVKERFERWKRDYSSTYRDAY 621 Query: 1788 MSLSVPAIFSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANLV 1967 MSL+VP+IFSPYVRLELLKWDPLH++ DFF+M WH LLFDYG PE G DF DD DANLV Sbjct: 622 MSLTVPSIFSPYVRLELLKWDPLHQDVDFFNMNWHQLLFDYGKPEDGDDFAPDDTDANLV 681 Query: 1968 PGLVEKVALPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRL 2147 P LVEKVA+PILHH I CWD+LSTR T+NAV+AT+LV YV ++ EAL +L +AI SRL Sbjct: 682 PELVEKVAIPILHHQIVRCWDILSTRETRNAVAATSLVTNYVLSSSEALAELFAAIRSRL 741 Query: 2148 ADAVANITVPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDE 2327 +A+ ITVPTW +V+K VP+A ++AAYRFG +VRL++NIC+WKDILA +E LA + Sbjct: 742 VEAIKAITVPTWDPLVLKTVPNAPQVAAYRFGTSVRLMRNICMWKDILALPVLENLALSD 801 Query: 2328 LLSGKVLPHVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGK 2507 LL GKVLPHVRSI NIHDA+TRTE+IVASL+GVW+G SVT HS LQPLVD +LTL + Sbjct: 802 LLFGKVLPHVRSIASNIHDAVTRTEKIVASLSGVWTGQSVTRTHSRPLQPLVDCILTLKR 861 Query: 2508 TLEKKHASGVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 LEK+ ASG+ + ETTGLARRLK++LV+L+EHD AR I+RTF LKEAV Sbjct: 862 ILEKRLASGLDDAETTGLARRLKRILVELHEHDHARDIVRTFNLKEAV 909 >ref|XP_002873370.1| increased level of polyploidy1-1D [Arabidopsis lyrata subsp. 
lyrata] gi|297319207|gb|EFH49629.1| increased level of polyploidy1-1D [Arabidopsis lyrata subsp. lyrata] Length = 908 Score = 633 bits (1632), Expect = e-178 Identities = 326/580 (56%), Positives = 424/580 (73%), Gaps = 4/580 (0%) Frame = +3 Query: 924 PGHNVSPGLNIGGSAGVMKKVLSIPQQATVASQAMRESLQRLKETHGRTMSALDRNDENM 1103 P N+S IG + V L + QQA +A +A+++++++LKE+H +T+S+L + DEN+ Sbjct: 331 PMPNISVAPTIGPATSV--DTLPMSQQAALAKKALQDNVKKLKESHAKTLSSLTKTDENL 388 Query: 1104 SAALSNIIDLENSLAHADEKFVFMQKLQDFVSVICDFLQHKAPYIEELEEQMQKLHEERA 1283 +A+L +I LE+SL+ A +K+VFMQKL+DF+SVICDF+Q+K IEE+E+QM++L+E+ A Sbjct: 389 TASLMSITALESSLSAAGDKYVFMQKLRDFISVICDFMQNKGSLIEEIEDQMKELNEKHA 448 Query: 1284 VAVLERRTADNADEMIEIEAPLSAAMLEYSKGGSSTAVI----NAAQLVSSKTREQTNLP 1451 +++LERR ADN DEMIE+ A + AAM +K GSST+VI +AA S+ R+Q N P Sbjct: 449 LSILERRIADNNDEMIELGAAVKAAMTVLNKQGSSTSVIAAATSAALAASASIRQQMNQP 508 Query: 1452 VKLDELGRDMNLQXXXXXXXXXXXXXXXXXXXXXXXMSAVGDIFPYHHIEGXXXXXXXXX 1631 VKLDE GRD NLQ SA+ IEG Sbjct: 509 VKLDEFGRDENLQKRREVEQRAAARQKRRARFENKRASAMEIEGSSLKIEGESSTDESDT 568 Query: 1632 XXXXYKSNREMLLQTSEQIFGDAEEEFSKLALVKEKFETWKKRFFSSYRDAYMSLSVPAI 1811 YK R+ LLQ ++++F DA EE+S+L+ VK +FE WK+ + S+YRDAYMSL+VP+I Sbjct: 569 ETSAYKETRDSLLQCADKVFSDASEEYSQLSRVKARFERWKRDYSSTYRDAYMSLTVPSI 628 Query: 1812 FSPYVRLELLKWDPLHEESDFFDMQWHSLLFDYGLPEHGGDFNSDDADANLVPGLVEKVA 1991 FSPYVRLELLKWDPLH++ DFFDM+WH LLFDYG PE G DF DD DANLVP LVEKVA Sbjct: 629 FSPYVRLELLKWDPLHQDVDFFDMKWHGLLFDYGKPEDGDDFAPDDTDANLVPELVEKVA 688 Query: 1992 LPILHHDIAHCWDVLSTRGTKNAVSATNLVITYVPANGEALKDLLSAIHSRLADAVANIT 2171 +PILHH I CWD+LSTR T+NAV+AT+LV YV A+ EAL +L +AI +RL +A+A I+ Sbjct: 689 IPILHHQIVRCWDILSTRETRNAVAATSLVTNYVSASSEALAELFAAIRARLVEAIAAIS 748 Query: 2172 VPTWSTVVIKAVPDAARIAAYRFGMAVRLLKNICLWKDILASSAIEQLAFDELLSGKVLP 2351 VPTW +V+KAVP+A ++AAYRFG +VRL++NIC+WKDILA S +E LA +LL GKVLP Sbjct: 749 VPTWDPLVLKAVPNAPQVAAYRFGTSVRLMRNICMWKDILALSVLENLALSDLLFGKVLP 808 Query: 2352 HVRSITPNIHDAITRTERIVASLNGVWSGSSVTMEHSYKLQPLVDYVLTLGKTLEKKHAS 2531 HVRSI NIHDA+TRTERIVASL+GVW+G 
SVT HS LQPLVD LTL + LEK+ AS Sbjct: 809 HVRSIASNIHDAVTRTERIVASLSGVWTGPSVTRTHSRPLQPLVDCTLTLRRILEKRLAS 868 Query: 2532 GVSETETTGLARRLKKMLVDLNEHDKARAILRTFQLKEAV 2651 G+ + ETTGLARRLK++LV+L+EHD AR I+RTF LKEAV Sbjct: 869 GLDDAETTGLARRLKRILVELHEHDHAREIVRTFNLKEAV 908 Score = 112 bits (280), Expect = 1e-21 Identities = 88/230 (38%), Positives = 113/230 (49%), Gaps = 11/230 (4%) Frame = +3 Query: 42 LLSFADEEGDEESPFAR-----PXXXXXXXXXXXXXXXXXTHKITSMKERIXXXXXXXXX 206 LLSFAD+E +EE R +H+ +S KE Sbjct: 52 LLSFADDEEEEEDGAPRVTIKPKNGRDRVKSSFRLGVSGSSHRHSSTKEH--------RP 103 Query: 207 XXXNVQPQAGEYTKERLLELQKNTRTIGXXXXXXXXXXXEPKIVLKGLIKPIYXXXXXXX 386 NV PQAG Y+KE LLELQKNTRT+ EPK+VLKGLIKP + Sbjct: 104 ASSNVLPQAGSYSKEALLELQKNTRTL---PYSRPSSNSEPKVVLKGLIKPPHQH----- 155 Query: 387 XXXXXLDRMDVDDAETRLGSMGIGGEGDK----DLIPDQATINAIRAKRERLRQSR-APA 551 ++ + D ++ + EG+K D DQA I IRAK+ER+RQSR APA Sbjct: 156 ------EQQSLKDVVKQVSDLDFDEEGEKEQPEDAFADQAAI--IRAKKERMRQSRSAPA 207 Query: 552 SDYISLDAG-SNHGEAEGISDEEPEFQTRIALFGDKSSNVGVTKGFFEDG 698 DYISLD G +NH EG+SDE+ +FQ +F + G KG F+ G Sbjct: 208 PDYISLDGGTANHSAVEGVSDEDADFQ---GIFVGARPHKGDKKGVFDFG 254