BLASTX nr result

ID: Akebia24_contig00011232 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00011232
         (3050 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...   956   0.0  
ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding fact...   905   0.0  
ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding fact...   900   0.0  
gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus nota...   879   0.0  
ref|XP_007225333.1| hypothetical protein PRUPE_ppa001044mg [Prun...   872   0.0  
ref|XP_007010500.1| GC-rich sequence DNA-binding factor-like pro...   864   0.0  
ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Popu...   858   0.0  
ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding fact...   855   0.0  
ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   855   0.0  
ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citr...   850   0.0  
ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [A...   848   0.0  
ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   811   0.0  
ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   809   0.0  
ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding fact...   808   0.0  
ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding fact...   804   0.0  
ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putativ...   802   0.0  
ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   800   0.0  
ref|XP_007160943.1| hypothetical protein PHAVU_001G030200g [Phas...   791   0.0  
ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   789   0.0  
ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like pro...   762   0.0  

>ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis
            vinifera]
          Length = 913

 Score =  956 bits (2470), Expect = 0.0
 Identities = 550/938 (58%), Positives = 624/938 (66%), Gaps = 12/938 (1%)
 Frame = -1

Query: 3032 MSSRSKNFRRRAEDED---VNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADE 2862
            MSSR +NFRRRA+D+D    NG+                             KLLSFAD+
Sbjct: 1    MSSRPRNFRRRADDDDNDDTNGD-GPPLIKPTSKPSTTTATTAAAAKPKKPPKLLSFADD 59

Query: 2861 EDEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVF-TSPSLPSNVQP 2685
            E+ E                                 HKITTTKDR+  +S SLPSNVQP
Sbjct: 60   EENESPSRSSSRSTQPPSRPSKTSSRFTKLSSSSS--HKITTTKDRLTPSSASLPSNVQP 117

Query: 2684 QAGEYTKEKLRELQKNTRTLASSTPNTSEP------VIVLKGFVKPHSVDEDRGNSRXXX 2523
            QAG YTKE LRELQKNTRTLASS P +SEP      VIVLKG VKP S  ED        
Sbjct: 118  QAGTYTKEALRELQKNTRTLASSRPASSEPKPSLEPVIVLKGLVKPISAAEDAVIDEENV 177

Query: 2522 XXXXXXXXXXXXNQLASMGIGKSRDSSG-SLIPDQATINAIRAKRERLRQSRAAAPDYIS 2346
                                 +S+D  G   IPDQATINAIRAKRERLRQSRAAAPDYIS
Sbjct: 178  EEEP-----------------ESKDKGGRDSIPDQATINAIRAKRERLRQSRAAAPDYIS 220

Query: 2345 LDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVFESVDERGIENDLRKXXXXXXX 2166
            LDGGSNHGAAEGLSDEEPEFQGRIA+ G+K +  KKGVFE VDERG+E   +K       
Sbjct: 221  LDGGSNHGAAEGLSDEEPEFQGRIAMFGEKPESGKKGVFEDVDERGMEGGFKKDAHDSDD 280

Query: 2165 XXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQQQHXXXXXXXXXXXXS 1986
                         QFRKGLGKR++DG              ++ QQ+              
Sbjct: 281  EEEEKIWEEE---QFRKGLGKRMDDGSSRVVSSSVPVVQ-KVQQQKFMYSSVTAYTSVPG 336

Query: 1985 VPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAMSSIARTDENLSS 1806
            V A   IGGAVG     + MS+S           +N+RRLKES+GR MSS+ RTDENLSS
Sbjct: 337  VSAPLNIGGAVGPLPGFDAMSLSQQAELAKKALHENLRRLKESHGRTMSSLTRTDENLSS 396

Query: 1805 SLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASA 1626
            SLSNIT LE SL+AAGEKFIFMQ LRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASA
Sbjct: 397  SLSNITTLEKSLTAAGEKFIFMQXLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASA 456

Query: 1625 ILERRAADNTDEFKEVEAAVSTAMSVLGKGGGXXXXXXXXXXXXXXXXXA-REQSNLSVQ 1449
            ILERRAADN DE  E++A+V  AMSV  K G                  A REQ+NL V+
Sbjct: 457  ILERRAADN-DEMMEIQASVDAAMSVFTKSGSNEAMVAAARTAAQAASAAMREQTNLPVK 515

Query: 1448 LDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXXXXXXX 1269
            LDE+GRD+NLQK MD                      + ++S+   IEG           
Sbjct: 516  LDEYGRDINLQKCMDKNRRSEARQRKRDRWDAKRMTFLENESSHQKIEGESSTDESDSET 575

Query: 1268 XSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSVPAIFS 1089
             +Y+SNRDLLLQTA QIF DAAEEYS LS VKER ERWKK YSSSYRDAYMSLSVPAIFS
Sbjct: 576  TAYQSNRDLLLQTAEQIFGDAAEEYSQLSAVKERIERWKKQYSSSYRDAYMSLSVPAIFS 635

Query: 1088 PYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDFNPGDADADLVPGLVEKIALP 909
            PYVRLELLKWDPLYEE DF+DM+WHSLLF+YGL ED +DF+P DADA+LVP LVE++ALP
Sbjct: 636  PYVRLELLKWDPLYEEADFDDMKWHSLLFNYGLSEDGNDFSPDDADANLVPELVERVALP 695

Query: 908  ILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHTRLADAIANLIVP 729
            ILHH++AHCWD+ STR T+NAVSA NLVI Y+PASSEAL ELL  +H RL  A+ N +VP
Sbjct: 696  ILHHELAHCWDIFSTRETKNAVSATNLVIRYIPASSEALGELLAVVHKRLYKALTNFMVP 755

Query: 728  TWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLALDELFCGKVLPHV 549
             W+ LV+KAVPNAARVAAY+FGMSIRL+RNICLWKDILALPVLE+L LD+L  G+VLPH+
Sbjct: 756  PWNILVMKAVPNAARVAAYRFGMSIRLMRNICLWKDILALPVLEKLVLDQLLSGQVLPHI 815

Query: 548  RSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTLAKTLEKKHVSGV 369
             +I +++HDAITRTERIISSLSGVW G  V  ERS KLQPLVDYVL L K LEK+H+ GV
Sbjct: 816  ENIASDVHDAITRTERIISSLSGVWAGPSVTGERSNKLQPLVDYVLRLGKRLEKRHLPGV 875

Query: 368  SESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            +ES+T  LARRLK+MLVELNEYD AR ISRTF LKEAL
Sbjct: 876  TESDTSRLARRLKRMLVELNEYDKARDISRTFHLKEAL 913


>ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 889

 Score =  905 bits (2338), Expect = 0.0
 Identities = 501/842 (59%), Positives = 587/842 (69%), Gaps = 10/842 (1%)
 Frame = -1

Query: 2750 HKITTTKDRVFTSPSL----PSNVQPQAGEYTKEKLRELQKNTRTLASSTPNT-----SE 2598
            HKIT  KDR+  S S+    PSNVQPQAG YTKE LRELQKNTRTLASS P++     +E
Sbjct: 68   HKITALKDRIAHSSSISASVPSNVQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAE 127

Query: 2597 PVIVLKGFVKPHSVDEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPDQA 2418
            PVIVLKG +KP     D                     + +S      +DSSGS IPDQA
Sbjct: 128  PVIVLKGLLKPAEQVPDSAREAK---------------ESSSEDDEAGKDSSGSSIPDQA 172

Query: 2417 TINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKK 2238
            TINAIRAKRER+RQ+  AAPDYISLD GSN  A   LSDEE EF GRIA++G K + +KK
Sbjct: 173  TINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKK 232

Query: 2237 GVFESVDERGIENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXX 2058
            GVFE VDE+GI+                        EQFRKGLGKR++DG          
Sbjct: 233  GVFEEVDEQGIDG---ARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVP 289

Query: 2057 XXXNQIVQQQHXXXXXXXXXXXXSVPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQN 1878
                 +  Q              SV  A +IGG+V  S+  + +SIS           ++
Sbjct: 290  VVP-SVQPQNLIYPTTIGYSSVPSVSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQES 348

Query: 1877 IRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFL 1698
            + RLKESY R   S+ +TDENLS+SL  ITDLE +LSAAG+KFIFMQKLRDFVSVICDFL
Sbjct: 349  MGRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSAAGDKFIFMQKLRDFVSVICDFL 408

Query: 1697 QHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGK-GGGXXX 1521
            QHKAPFIEELEEQMQKLHEERAS ++ERR ADN DE  E+E AV  A+S+L K G     
Sbjct: 409  QHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEM 468

Query: 1520 XXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXX 1341
                          +REQ+NL  +LDEFGRD+NLQKRMD+                    
Sbjct: 469  ITAATSAAQAAIALSREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLA 528

Query: 1340 SVGDDSAFSHIEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFE 1161
            S+  D     +EG            +Y+SNRDLLLQTA QIFSDAAEE+S LSVVK+RFE
Sbjct: 529  SMEVDG-HQKVEGESSTDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFE 587

Query: 1160 RWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPED 981
             WK+ YS++YRDAYMSLS+PAIFSPYVRLELLKWDPL+E  DF DM WHSLLF+YG+PED
Sbjct: 588  AWKRDYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPED 647

Query: 980  TSDFNPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASS 801
             SDF P DADA+LVP LVEK+ALPILHH+IAHCWDMLSTR TRNA  A +L+ NYVP SS
Sbjct: 648  GSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSS 707

Query: 800  EALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKD 621
            EAL ELL  I TRL+ AI +L VPTW+ LV KAVPNAAR+AAY+FGMS+RL+RNICLWK+
Sbjct: 708  EALTELLVVIRTRLSGAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKE 767

Query: 620  ILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSY 441
            I+ALP+LE+LAL+EL  GKVLPHVRSITANIHDA+TRTERII+SL+GVWTG+ +I +RS+
Sbjct: 768  IIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSH 827

Query: 440  KLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKE 261
            KLQPLVDYVL L +TLEKKH+SG++ESET GLARRLKKMLVELNEYDNAR I++TF LKE
Sbjct: 828  KLQPLVDYVLLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKE 887

Query: 260  AL 255
            AL
Sbjct: 888  AL 889


>ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 920

 Score =  900 bits (2326), Expect = 0.0
 Identities = 498/842 (59%), Positives = 584/842 (69%), Gaps = 10/842 (1%)
 Frame = -1

Query: 2750 HKITTTKDRVFTSPSL----PSNVQPQAGEYTKEKLRELQKNTRTLASSTPNT-----SE 2598
            HKIT  KDR+  S S+    PSNVQPQAG YTKE LRELQKNTRTLASS P++     +E
Sbjct: 98   HKITALKDRIAHSSSISASVPSNVQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAE 157

Query: 2597 PVIVLKGFVKPHSVDEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPDQA 2418
            PVIVLKG +KP     D                               +DSSGS IPDQA
Sbjct: 158  PVIVLKGLLKPAEQVPDSAREAKESSSEDDEAGR--------------KDSSGSSIPDQA 203

Query: 2417 TINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKK 2238
            TINAIRAKRER+RQ+  AAPDYISLD GSN  A   LSDEE EF GRIA++G K + +KK
Sbjct: 204  TINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKK 263

Query: 2237 GVFESVDERGIENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXX 2058
            GVFE VDE+GI+                        EQFRKGLGKR++DG          
Sbjct: 264  GVFEEVDEQGIDG---ARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVP 320

Query: 2057 XXXNQIVQQQHXXXXXXXXXXXXSVPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQN 1878
                 +  Q              S+  A +IGG+V  S+  + +SIS           ++
Sbjct: 321  VVP-SVQPQNLIYPTTIGYSSVPSMSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQES 379

Query: 1877 IRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFL 1698
            + RLKESY R   S+ +TDENLS+SL  ITDLE +LSAAG+KF+FMQKLRDFVSVICDFL
Sbjct: 380  MGRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSAAGDKFMFMQKLRDFVSVICDFL 439

Query: 1697 QHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGK-GGGXXX 1521
            QHKAPFIEELEEQMQKLHEERAS ++ERR ADN DE  E+E AV  A+S+L K G     
Sbjct: 440  QHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEM 499

Query: 1520 XXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXX 1341
                          +REQ+NL  +LDEFGRD+NLQKRMD+                    
Sbjct: 500  VTAATSAAQAAIALSREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLA 559

Query: 1340 SVGDDSAFSHIEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFE 1161
            S+  D     +EG            +Y+SNRDLLLQTA QIFSDAAEE+S LSVVK+RFE
Sbjct: 560  SMEVDG-HQKVEGESSTDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFE 618

Query: 1160 RWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPED 981
             WK+ YS++YRDAYMSLS+PAIFSPYVRLELLKWDPL+E  DF DM WHSLLF+YG+PED
Sbjct: 619  AWKRDYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPED 678

Query: 980  TSDFNPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASS 801
             SDF P DADA+LVP LVEK+ALPILHH+IAHCWDMLSTR TRNA  A +L+ NYVP SS
Sbjct: 679  GSDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSS 738

Query: 800  EALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKD 621
            EAL ELL  I TRL+ AI +L VPTW+ LV KAVPNAAR+AAY+FGMS+RL+RNICLWK+
Sbjct: 739  EALTELLVVIRTRLSGAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKE 798

Query: 620  ILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSY 441
            I+ALP+LE+LAL+EL  GKVLPHVRSITANIHDA+TRTERII+SL+GVWTG+ +I +RS+
Sbjct: 799  IIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSH 858

Query: 440  KLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKE 261
            KLQPLVDYVL L +TLEKKH+SG++ESET GLARRLKKMLVELNEYDNAR I++TF LKE
Sbjct: 859  KLQPLVDYVLLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKE 918

Query: 260  AL 255
            AL
Sbjct: 919  AL 920


>gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus notabilis]
          Length = 952

 Score =  879 bits (2271), Expect = 0.0
 Identities = 507/904 (56%), Positives = 594/904 (65%), Gaps = 28/904 (3%)
 Frame = -1

Query: 2882 LLSFADEEDEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRV------ 2721
            LLSFAD+ED E                                 HK+T  KDR+      
Sbjct: 67   LLSFADDEDNETPSRSKPSSSSKLSSSSSRLSKPTSS-------HKMTALKDRLPHSSSS 119

Query: 2720 ---FTSPSLPSNVQPQAGEYTKEKLRELQKNTRTLASSTPNTSEPVIVLKGFVKPHSVDE 2550
                +S SLPSNVQPQAG YTKE LRELQKNTRTLASS P+ SEPVIVLKG +KP  + +
Sbjct: 120  SPSSSSLSLPSNVQPQAGTYTKEALRELQKNTRTLASSKPS-SEPVIVLKGLLKPSELAK 178

Query: 2549 DRGNSRXXXXXXXXXXXXXXXNQLASMGIG-KSRDSSGS----LIPDQATINAIRAKRER 2385
                                  +LASM IG K RD   S    LIPDQATINAIRAKRER
Sbjct: 179  SDWKL-DSEEEDEPDELKERRGELASMEIGAKGRDRDNSSPEPLIPDQATINAIRAKRER 237

Query: 2384 LRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVFES-VDERG 2208
            LRQSRAAAPD+I+LD GSNHG AEGLSDEEPE Q RIA+ G+K +  KKGVFE  +D+RG
Sbjct: 238  LRQSRAAAPDFIALDAGSNHGEAEGLSDEEPENQTRIAMFGEKAEGPKKGVFEDDIDDRG 297

Query: 2207 IENDLRKXXXXXXXXXXXXXXXXXXXE----QFRKGLGK-RIEDGXXXXXXXXXXXXXNQ 2043
            IE  L +                        QFRKGLGK RI+DG               
Sbjct: 298  IELGLLRRKQGVLEENHEDDEDEEDKIWEEEQFRKGLGKTRIDDGGKNSVVP-------- 349

Query: 2042 IVQQQHXXXXXXXXXXXXSVPAAPTIGGAVGGSRSAE-------VMSISXXXXXXXXXXX 1884
             V ++             ++P + +IGG  GGS           +M  S           
Sbjct: 350  -VVKRETQQKFVSSVGSQTLPPSASIGGTFGGSSGGSSTGLGLGMMPFSQQAEIALNAID 408

Query: 1883 QNIRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICD 1704
             N+RRLKE++ + + S+ + D+NLS SL NIT LE SLSAA EK+ F QKLRDF+S+ICD
Sbjct: 409  DNVRRLKETHDQDLVSLNKADKNLSDSLLNITALEKSLSAADEKYKFTQKLRDFISIICD 468

Query: 1703 FLQHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGK-GGGX 1527
            FLQHKAPFIEELE+QMQKLHE+ ASAI+ERR A+N DE  EVEA V+ AMS+  K G   
Sbjct: 469  FLQHKAPFIEELEDQMQKLHEKHASAIVERRTANNDDEMMEVEAEVNAAMSIFSKKGSNV 528

Query: 1526 XXXXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXX 1347
                             REQ NL V+LDEFGRDMNLQKRM++                  
Sbjct: 529  DVVAAAKSAAQAASAALREQGNLPVKLDEFGRDMNLQKRMEMKGRAEARQCRKARFDSKR 588

Query: 1346 XXSVGDDSAFSHIEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKER 1167
              S+  D  +  +EG            ++ S+R+LLLQTAA IFSDA+EEYS LSVVKER
Sbjct: 589  LSSMDVDGPYQRMEGESSTDESDSESTAFESHRELLLQTAAHIFSDASEEYSQLSVVKER 648

Query: 1166 FERWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLP 987
            FE WK+ YSS+Y DAYMSLS P+IFSPYVRLELLKWDPL+E+TDF +M WHSLL DYG+P
Sbjct: 649  FEEWKREYSSTYSDAYMSLSAPSIFSPYVRLELLKWDPLHEKTDFLNMNWHSLLMDYGVP 708

Query: 986  EDTSDFNPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPA 807
            ED   F P DADA+LVP LVEK+AL ILHH+I HCWDMLST  TRNAV+A +LV +YVPA
Sbjct: 709  EDGGGFAPDDADANLVPELVEKVALRILHHEIVHCWDMLSTLETRNAVAATSLVTDYVPA 768

Query: 806  SSEALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLW 627
            SSEAL +LL AI TRLADA+ANL VPTWSP V++AVPNAAR+AAY+FG+S+RL++NICLW
Sbjct: 769  SSEALADLLVAIRTRLADAVANLTVPTWSPPVLQAVPNAARLAAYRFGVSVRLMKNICLW 828

Query: 626  KDILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAER 447
            K+ILALPVLE+LALDEL CGKVLPHVRSI AN+HDAI RTE+I++SLSGVW G  V  +R
Sbjct: 829  KEILALPVLEKLALDELLCGKVLPHVRSIAANVHDAIPRTEKIVASLSGVWAGPSVTGDR 888

Query: 446  SYKLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQL 267
            S KLQPLVDY++ L K LEKKH SGV+ESET GLARRLKKMLVELNEYD AR I+RTF L
Sbjct: 889  SRKLQPLVDYLMLLRKILEKKHESGVTESETSGLARRLKKMLVELNEYDKARDIARTFHL 948

Query: 266  KEAL 255
            KEAL
Sbjct: 949  KEAL 952


>ref|XP_007225333.1| hypothetical protein PRUPE_ppa001044mg [Prunus persica]
            gi|462422269|gb|EMJ26532.1| hypothetical protein
            PRUPE_ppa001044mg [Prunus persica]
          Length = 925

 Score =  872 bits (2252), Expect = 0.0
 Identities = 510/950 (53%), Positives = 605/950 (63%), Gaps = 24/950 (2%)
 Frame = -1

Query: 3032 MSSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXK-------LLS 2874
            MSSR++NFRRRA+D+D   ++                            K       LLS
Sbjct: 1    MSSRARNFRRRADDDDDKNDDPNDTGTPATIPTVKSSSKPSSSSSSKPKKPHNQAPKLLS 60

Query: 2873 FADEEDEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVF----TSPS 2706
            F D+E+                                   HK+T  KDR+      S S
Sbjct: 61   FVDDEESAAAPSRSSSSKPDKPSSRLGKPSSA---------HKMTALKDRLAHTSSVSTS 111

Query: 2705 LPSNVQPQAGEYTKEKLRELQKNTRTLASSTPNTSEPVIVLKGFVKP-----------HS 2559
            LPSNVQPQAG YTKE LRELQKNTRTLASS P+ SEP IVLKG VKP             
Sbjct: 112  LPSNVQPQAGTYTKEALRELQKNTRTLASSRPS-SEPTIVLKGLVKPTGTISDTLREARE 170

Query: 2558 VDEDRGNSRXXXXXXXXXXXXXXXN-QLASMGIGKSRDSSGSLIPDQATINAIRAKRERL 2382
            +D D    +                 +LASMGI K++ SSG L PDQATINAIRAKRERL
Sbjct: 171  LDSDNDEEQEKERASLFRRDKDDAEARLASMGIDKAKGSSG-LFPDQATINAIRAKRERL 229

Query: 2381 RQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVFESVDERGIE 2202
            R+SRAAAPD+ISLD GSNHGAAEGLSDEEPEF+GRIA+ GD  + +KKGVFE VD+R  +
Sbjct: 230  RKSRAAAPDFISLDSGSNHGAAEGLSDEEPEFRGRIAIFGDNMEGSKKGVFEDVDDRAAD 289

Query: 2201 NDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQQQHX 2022
              LR+                    QFRKGLGKR++DG               + Q +  
Sbjct: 290  AVLRQKSIDRDEDEDEEEKIWEEE-QFRKGLGKRMDDGSSIGVVSTSAPVVQSVPQPKAT 348

Query: 2021 XXXXXXXXXXXSVPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAM 1842
                       SVP  P+IGGA+G S+ + VMSI            +N+ +LKES+GR M
Sbjct: 349  YSAMAGYSSVQSVPVGPSIGGAIGASQGSNVMSIKAQAEIAKKALEENVMKLKESHGRTM 408

Query: 1841 SSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEE 1662
             S+ +TDENLSSSL NIT LE SLSAA EK+    K  +  SV       KAP IEELEE
Sbjct: 409  LSLTKTDENLSSSLLNITALEKSLSAADEKY----KGMEIGSV-------KAPLIEELEE 457

Query: 1661 QMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGK-GGGXXXXXXXXXXXXXXX 1485
            +MQK+HE+RASA LERR+AD+ DE  EVEAAV  AMS+  K G                 
Sbjct: 458  EMQKIHEQRASATLERRSADD-DEMMEVEAAVKAAMSIFSKEGSSAEIIAAAKSAAQAAT 516

Query: 1484 XXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIE 1305
               REQ+NL V+LDEFGRDMNLQKR D+                    S+  DS    IE
Sbjct: 517  TAEREQTNLPVKLDEFGRDMNLQKRRDMKGRSEAHQHRKRRYESKRLSSMEVDSTHRTIE 576

Query: 1304 GXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRD 1125
            G            +Y  +R L+L+TAAQ+FSDAAEEYS LS+VKERFE WK  Y+SSYRD
Sbjct: 577  GESSTDESDSESNAYHKHRQLVLETAAQVFSDAAEEYSKLSLVKERFEEWKTDYASSYRD 636

Query: 1124 AYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDFNPGDADAD 945
            AYMSLS PAIFSPYVRLEL+KWDPL E+TDF +M WHSLL DY LPED SDF P DADA+
Sbjct: 637  AYMSLSAPAIFSPYVRLELVKWDPLREKTDFLNMSWHSLLADYNLPEDGSDFAPDDADAN 696

Query: 944  LVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHT 765
            LVP LVEK+ALPIL HQ+ HCWD+LSTR T+NAV+A ++V +YVP SSEAL +LL AI T
Sbjct: 697  LVPDLVEKVALPILLHQVVHCWDILSTRETKNAVAATSVVTDYVPPSSEALADLLVAIRT 756

Query: 764  RLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLAL 585
            RLADA+ NL VPTWSPLV+ AVPNAAR+AAY+FG+S+RL++NICLWK+ILA PVLE+LA+
Sbjct: 757  RLADAVTNLTVPTWSPLVLTAVPNAARIAAYRFGLSVRLMKNICLWKEILAFPVLEKLAI 816

Query: 584  DELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTL 405
            +EL CGKVLPHVRSI AN+HDAITRTERI++SLSGVW G+ V  +R  KLQ LVDYVL+L
Sbjct: 817  EELLCGKVLPHVRSIAANVHDAITRTERIVASLSGVWAGSNVTGDRR-KLQSLVDYVLSL 875

Query: 404  AKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
             +TLEKKH  GV++SE  GLARRLKKMLV+LNEYD AR ++RTF LKEAL
Sbjct: 876  GRTLEKKHSLGVTQSEISGLARRLKKMLVDLNEYDKARDLTRTFNLKEAL 925


>ref|XP_007010500.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1
            [Theobroma cacao] gi|590567380|ref|XP_007010501.1|
            GC-rich sequence DNA-binding factor-like protein,
            putative isoform 1 [Theobroma cacao]
            gi|508727413|gb|EOY19310.1| GC-rich sequence DNA-binding
            factor-like protein, putative isoform 1 [Theobroma cacao]
            gi|508727414|gb|EOY19311.1| GC-rich sequence DNA-binding
            factor-like protein, putative isoform 1 [Theobroma cacao]
          Length = 934

 Score =  864 bits (2232), Expect = 0.0
 Identities = 510/943 (54%), Positives = 615/943 (65%), Gaps = 20/943 (2%)
 Frame = -1

Query: 3023 RSKNFRRRAEDEDVNG-EEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDEEX 2847
            R++NFRRR +D D +G ++                            KLLSFAD+E+EE 
Sbjct: 6    RARNFRRRGDDIDDDGNDDNNTPNIASATVTATKKPSSSKPTAKKPPKLLSFADDENEEE 65

Query: 2846 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVFTSPSLPSNVQPQAGEYT 2667
                                            HKIT+TKD   T  +LPSNVQPQAG YT
Sbjct: 66   TTKPSSNRNRDKEREKPFSSRVSKPLSA----HKITSTKD-CKTPSTLPSNVQPQAGTYT 120

Query: 2666 KEKLRELQKNTRTLASSTPN----TSEPVIVLKGFVKPHSVDEDRGNSRXXXXXXXXXXX 2499
            KE L ELQKN RTLA+ +      +SEP IVLKG +KP S +    NS            
Sbjct: 121  KEALLELQKNMRTLAAPSSRASSVSSEPKIVLKGLLKPQSQNL---NSERDNDPPEKLQK 177

Query: 2498 XXXXNQLASMGIGKSRDSSGSLIPDQATINAIRAKRERLRQSRAA-APDYISLDGGSNHG 2322
                ++LA+M  GK  D   S  PDQATI+AI+AK++R+R+S A  APDYISLD GSN G
Sbjct: 178  DDTESRLATMAAGKGVDLDFSAFPDQATIDAIKAKKDRVRKSFARPAPDYISLDRGSNLG 237

Query: 2321 AA--EGLSD-EEPEFQGRIALLGDKTDVAKKGVFESVDERGIENDLRKXXXXXXXXXXXX 2151
             A  E LSD EEPEF GR  L G+     KKGVFE ++ER +   LRK            
Sbjct: 238  GAMEEELSDDEEPEFPGR--LFGES---GKKGVFEVIEERAVGVGLRKDGIHDEDDDDNE 292

Query: 2150 XXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXXNQIV---QQQHXXXXXXXXXXXXSV- 1983
                   EQFRKGLGKR++D                +V   QQQH               
Sbjct: 293  EEKMWEEEQFRKGLGKRMDDSSNRVVSSSNNSGGVGMVHNMQQQHQQRYGYSTMGSYGSM 352

Query: 1982 -----PAAPT-IGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAMSSIARTD 1821
                 PA P+ I GA G S+  +V SIS           +N+RRLKES+ R +SS+ + D
Sbjct: 353  MPSVSPAPPSSIVGAAGASQGLDVTSISQQAEITKKALQENVRRLKESHDRTISSLTKAD 412

Query: 1820 ENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHE 1641
            ENLS+SL NIT LE SLSAAGEKFIFMQKLRDFVSVIC+FLQHKAP IEELEE MQKL+E
Sbjct: 413  ENLSASLFNITALEKSLSAAGEKFIFMQKLRDFVSVICEFLQHKAPLIEELEEHMQKLNE 472

Query: 1640 ERASAILERRAADNTDEFKEVEAAVSTAMSVLGK-GGGXXXXXXXXXXXXXXXXXAREQS 1464
            ERA ++LERR+A+N DE  EVEAAV+ AM V  + G                    R Q 
Sbjct: 473  ERALSVLERRSANNDDEMVEVEAAVTAAMLVFSECGNSAAMIEVAANAAQAAAAAIRGQV 532

Query: 1463 NLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXX 1284
            NL V+LDEFGRD+N QK +D+                    S+  DS++  IEG      
Sbjct: 533  NLPVKLDEFGRDVNRQKHLDMERRAEARQRRKARFDSKRLSSMEIDSSYQKIEGESSTDE 592

Query: 1283 XXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSV 1104
                  +YRSNRD+LLQTA +IF DA+EEYS LS+VKERFERWKK YSSSYRDAYMSLS+
Sbjct: 593  SDSESTAYRSNRDMLLQTADEIFGDASEEYSQLSLVKERFERWKKDYSSSYRDAYMSLSI 652

Query: 1103 PAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDFNPGDADADLVPGLVE 924
            PAIFSPYVRLELLKWDPL+ + DF+DM+WH+LLF+YG PED S F P DADA+LVP LVE
Sbjct: 653  PAIFSPYVRLELLKWDPLHVDEDFSDMKWHNLLFNYGFPEDGS-FAPDDADANLVPALVE 711

Query: 923  KIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHTRLADAIA 744
            K+ALP+LHH+I+HCWDMLS + T+NAVSA +L+I+YVPASSEAL ELL  I TRL++A+A
Sbjct: 712  KVALPVLHHEISHCWDMLSMQETKNAVSATSLIIDYVPASSEALAELLVTIRTRLSEAVA 771

Query: 743  NLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLALDELFCGK 564
            +++VPTWSPLV+KAVPNAARVAAY+FGMS+RL+RNICLWK+ILALP+LE+LALDEL  GK
Sbjct: 772  DIMVPTWSPLVMKAVPNAARVAAYRFGMSVRLMRNICLWKEILALPILEKLALDELLYGK 831

Query: 563  VLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTLAKTLEKK 384
            +LPHVR+IT+++HDA+TRTERI++SLSGVW GT VI + S KLQPLVDYVL L KTLE++
Sbjct: 832  ILPHVRNITSDVHDAVTRTERIVASLSGVWAGTNVIQDSSRKLQPLVDYVLLLGKTLERR 891

Query: 383  HVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            H SGV+ES T GLARRLKKMLVELNEYD+AR I+R F LKEAL
Sbjct: 892  HASGVTESGTGGLARRLKKMLVELNEYDSARDIARRFHLKEAL 934


>ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa]
            gi|550332058|gb|ERP57180.1| hypothetical protein
            POPTR_0008s00320g [Populus trichocarpa]
          Length = 972

 Score =  858 bits (2217), Expect = 0.0
 Identities = 505/982 (51%), Positives = 609/982 (62%), Gaps = 57/982 (5%)
 Frame = -1

Query: 3029 SSRSKNFRRRAE--DEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEED 2856
            SS+S+NFRRR +  DE  +                               KLLSFA++E+
Sbjct: 4    SSKSRNFRRRGDVDDEKTDANTNNTDTNAKATPSTTRKPPPPQSTKPKPKKLLSFAEDEE 63

Query: 2855 EEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVFTSPSL---PSNVQP 2685
            +E                                 HK+T ++DR+  + S     SNVQP
Sbjct: 64   DEQAVTRIPSSKSKPKPKPKPTSSSS---------HKLTVSQDRLPPTTSYLTTASNVQP 114

Query: 2684 QAGEYTKEKLRELQKNTRTLASSTPNT-----SEPVIVLKGFVKP--------------- 2565
            QAG YTKE L ELQ+NTRTLA ST  T     SEP I+LKG +KP               
Sbjct: 115  QAGTYTKEALLELQRNTRTLAKSTKTTTPASASEPKIILKGLLKPSFSPSPNPNPNYSSN 174

Query: 2564 HSVDEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPDQATINAIRAKRER 2385
            H   +D  +                 N+LASMG+GKS     S  PD+ TI  IRAKRER
Sbjct: 175  HQQQDDADDQSEDENEDKDNGADDAQNRLASMGLGKSTSDDYSCFPDEDTIKKIRAKRER 234

Query: 2384 LRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKT-DVAKKG-VFESV--- 2220
            LRQSRAAAPDYISLD GSNH    G SDEEPEF+ RIA++G  T D A  G VF++    
Sbjct: 235  LRQSRAAAPDYISLDSGSNHQG--GFSDEEPEFRTRIAMIGTMTKDTATHGGVFDAAADD 292

Query: 2219 -----DERGI-----------------ENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLG 2106
                 D+R I                 ++                       EQFRKGLG
Sbjct: 293  DEDDDDDRSIKAKALAMMGTHHHHAVVDDGNVAAAASVVHDEEDEEDRIWEEEQFRKGLG 352

Query: 2105 KRIEDGXXXXXXXXXXXXXNQIVQQQHXXXXXXXXXXXXSVPAAPTIGGAVGGSRSAEVM 1926
            KR++D                                     + P+IGGA G S+  +V+
Sbjct: 353  KRMDDASAPIANRALASTAGAAASST--IPMQPQQRPTPGYGSIPSIGGAFGSSQGLDVL 410

Query: 1925 SISXXXXXXXXXXXQNIRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFI 1746
            SI             N+RRLKES+GR +S +++TDENLS+SL N+T LE S+SAAGEKFI
Sbjct: 411  SIPQQADIAKKALQDNLRRLKESHGRTISLLSKTDENLSASLMNVTALEKSISAAGEKFI 470

Query: 1745 FMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAV 1566
            FMQKLRDFVSVIC+FLQHKA  IEELEE+MQKLHEE+AS ILERR ADN DE  EVEAAV
Sbjct: 471  FMQKLRDFVSVICEFLQHKATLIEELEERMQKLHEEQASLILERRTADNEDEMMEVEAAV 530

Query: 1565 STAMSVLG-KGGGXXXXXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXX 1389
              AMSV   +G                    ++Q+NL V+LDEFGRD+NLQKRMD+    
Sbjct: 531  KAAMSVFSARGNSAATIDAAKSAAAAALVALKDQANLPVKLDEFGRDINLQKRMDMEKRA 590

Query: 1388 XXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXXXXXXXXS---YRSNRDLLLQTAAQI 1218
                             +  DS+   IEG                Y+S RDLLL+TA +I
Sbjct: 591  KARQRRKARFDSKRLSYMEVDSSDQKIEGELSTDESDSDSEKNAAYQSTRDLLLRTAEEI 650

Query: 1217 FSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEET 1038
            FSDA+EEYS LSVVKERFE WKK Y +SYRDAYMSLS PAIFSPYVRLELLKWDPL+E++
Sbjct: 651  FSDASEEYSQLSVVKERFETWKKEYFASYRDAYMSLSAPAIFSPYVRLELLKWDPLHEDS 710

Query: 1037 DFNDMQWHSLLFDYGLPEDTSDFNPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRG 858
            DF DM+WHSLLF+YGLPED SD NP D DA+LVPGLVEKIA+PIL+H+IAHCWDMLST+ 
Sbjct: 711  DFFDMKWHSLLFNYGLPEDGSDLNPDDVDANLVPGLVEKIAIPILYHEIAHCWDMLSTQE 770

Query: 857  TRNAVSAMNLVINYVPASSEALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVA 678
            T+NA+SA +LVINYVPA+SEAL ELL AI TRLADA+A+ +VPTWS LV+KAVP+AA+VA
Sbjct: 771  TKNAISATSLVINYVPATSEALSELLAAIRTRLADAVASTVVPTWSLLVLKAVPSAAQVA 830

Query: 677  AYQFGMSIRLLRNICLWKDILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERI 498
            AY+FGMS+RL+RNICLWKDILALPVLE+L LDEL CGKVLPHVRSI +N+HDA+TRTERI
Sbjct: 831  AYRFGMSVRLMRNICLWKDILALPVLEKLVLDELLCGKVLPHVRSIASNVHDAVTRTERI 890

Query: 497  ISSLSGVWTGTKVIAER-SYKLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKML 321
            ++SLS  W G    ++  S+KLQPLVD++L++  TLEK+HVSGV+E+ET GLARRLKKML
Sbjct: 891  VASLSRAWAGPSATSDHSSHKLQPLVDFILSIGMTLEKRHVSGVTETETSGLARRLKKML 950

Query: 320  VELNEYDNARAISRTFQLKEAL 255
            VELN+YDNAR ++RTF LKEAL
Sbjct: 951  VELNDYDNARDMARTFHLKEAL 972


>ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Fragaria vesca
            subsp. vesca]
          Length = 914

 Score =  855 bits (2210), Expect = 0.0
 Identities = 491/940 (52%), Positives = 599/940 (63%), Gaps = 15/940 (1%)
 Frame = -1

Query: 3029 SSRSKNFRRRAEDEDVN-GEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDE 2853
            S+R KNFRRR +D+D +  +                             KLLSF D+E+ 
Sbjct: 3    SARPKNFRRRIDDDDDDDADTPSTTSTLKSLSKPSSSAAKPKKPQSQAPKLLSFVDDEEN 62

Query: 2852 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVFTSPS------LPSNV 2691
                                              HK+T  KDR+  S S      LPSNV
Sbjct: 63   ATPSRSSSSSSKRDKSSSSRLAKPSSA-------HKLTAAKDRLVNSTSSTASASLPSNV 115

Query: 2690 QPQAGEYTKEKLRELQKNTRTLASSTPNTS----EPVIVLKGFVKPH--SVDEDRGNSRX 2529
            QPQAG YTKE LRELQKNTRTLASS  +++    EP IVL+G +KP   S+ +    +R 
Sbjct: 116  QPQAGTYTKEALRELQKNTRTLASSRTSSAAAAAEPTIVLRGSIKPADASIADAVNGARE 175

Query: 2528 XXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPDQATINAIRAKRERLRQSRAAAPDYI 2349
                                   + +  S    PDQATI AIR KRERLR+S+ AAPD+I
Sbjct: 176  LDSDD------------------EEQQGSKDRYPDQATIEAIRKKRERLRKSKPAAPDFI 217

Query: 2348 SLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVFESVDERGIENDLRKXXXXXX 2169
            +LD GSNHGAAEGLSDEEPEF+ RIA+ G+K +  KKGVFE VD+ G++  LR+      
Sbjct: 218  ALDSGSNHGAAEGLSDEEPEFRNRIAMFGEKME-NKKGVFEDVDDTGVDGGLRRESVVVE 276

Query: 2168 XXXXXXXXXXXXXEQFRKGLGKRIE-DGXXXXXXXXXXXXXNQIVQQQHXXXXXXXXXXX 1992
                          QFRKGLGKR++ DG             +   Q +            
Sbjct: 277  DDEDEEEKIWEEE-QFRKGLGKRVDNDGASLGVSASVPRVHSAAPQPKASYNSIAGYSLA 335

Query: 1991 XSVPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAMSSIARTDENL 1812
             S+    +IGGA G S+ +  +SI+           +N+R+LKES+GR   S+ + +E+L
Sbjct: 336  QSLAGVASIGGATGASQGSNALSINEQSEIAQKALLENVRKLKESHGRTKMSLTKANESL 395

Query: 1811 SSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERA 1632
            S+SL NITDLE SLSAA EK+ FMQ+LRDFVS ICDFLQ KAP IEELEE+MQK  +ERA
Sbjct: 396  SASLLNITDLEKSLSAADEKYKFMQELRDFVSTICDFLQDKAPLIEELEEEMQKQRDERA 455

Query: 1631 SAILERRAADNTDEFKEVEAAVSTAMSVLGKGG-GXXXXXXXXXXXXXXXXXAREQSNLS 1455
            SAI ERR ADN DE  EVEAAV+ AMS+  K G                    REQ NL 
Sbjct: 456  SAIFERRIADNDDEMMEVEAAVNAAMSIFSKEGTSAGVIAVAKSAAQAASAAVREQKNLP 515

Query: 1454 VQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXXXXX 1275
            V+LDEFGRDMNL+KR+D+                    S+  DS    +EG         
Sbjct: 516  VKLDEFGRDMNLKKRLDMKGRAEARQRRRKRYEAKRESSMDVDSPDRTVEGESSTDESDG 575

Query: 1274 XXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSVPAI 1095
                Y S+R L+L TA Q+FSDAAEEYS LS+VKERFE+WK+ Y SSYRDAYMSLSVP I
Sbjct: 576  ESKEYESHRQLVLGTADQVFSDAAEEYSQLSLVKERFEKWKREYRSSYRDAYMSLSVPII 635

Query: 1094 FSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDFNPGDADADLVPGLVEKIA 915
            FSPYVRLELLKWDPL E TDF  M WH LL +YG+PED SDF   DADA+L+P LVEK+A
Sbjct: 636  FSPYVRLELLKWDPLRENTDFVKMSWHELLENYGVPEDGSDFASDDADANLIPALVEKVA 695

Query: 914  LPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHTRLADAIANLI 735
            LPILHHQI HCWD+LSTR T+NAV+A +LV +YV +SSEAL +LL AI TRLADA++ L+
Sbjct: 696  LPILHHQIVHCWDILSTRETKNAVAATSLVTDYV-SSSEALEDLLVAIRTRLADAVSKLM 754

Query: 734  VPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLALDELFCGKVLP 555
            VPTWSPLV+KAVPNAAR+AAY+FGMS+RL++NICLWK+ILALPVLE+LA++EL CGKV+P
Sbjct: 755  VPTWSPLVLKAVPNAARIAAYRFGMSVRLMKNICLWKEILALPVLEKLAINELLCGKVIP 814

Query: 554  HVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTLAKTLEKKHVS 375
            H+RSI A++HDA+TRTER+I+SLSGVW+G+ V  +RS KLQ LVDYVLTL KT+EKKH  
Sbjct: 815  HIRSIAADVHDAVTRTERVIASLSGVWSGSDVTGDRSRKLQSLVDYVLTLGKTIEKKHSL 874

Query: 374  GVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            GV++SET GLARRLKKMLVELNEYD AR ++RTF LKEAL
Sbjct: 875  GVTQSETGGLARRLKKMLVELNEYDKARDVARTFHLKEAL 914


>ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Citrus sinensis]
          Length = 913

 Score =  855 bits (2209), Expect = 0.0
 Identities = 498/940 (52%), Positives = 606/940 (64%), Gaps = 15/940 (1%)
 Frame = -1

Query: 3029 SSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDEE 2850
            SSR++NFRRRA+D++ N ++                             LLSFAD+E+E+
Sbjct: 3    SSRARNFRRRADDDEDNNDDNTPSAATTTATKKPPSSSKPKK-------LLSFADDEEEK 55

Query: 2849 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDR-----VFTSPSLPSNVQP 2685
                                             HKIT +K+R       +S SL SNVQ 
Sbjct: 56   SEIPTSNRDRTRPSSRLSKPSSS----------HKITASKERQSSSATSSSTSLLSNVQA 105

Query: 2684 QAGEYTKEKLRELQKNTRTL-ASSTPNTSEPVIVLKGFVKPHSVDEDRGNSRXXXXXXXX 2508
            QAG YT+E L EL+KNT+TL A S+   +EPV+VL+G +KP   +  R   +        
Sbjct: 106  QAGTYTEEYLLELRKNTKTLKAPSSKPPAEPVVVLRGSIKPEDSNLTRVQQKPSRDSSDS 165

Query: 2507 XXXXXXXNQ--LASMGIGKSRDSSGSLIPDQATINAIRAKRERLRQSRAAAPDYISLDGG 2334
                    +   AS+G+GK    SG +I D+A I AIRAK++RLRQS A APDYI LDGG
Sbjct: 166  DSDHKAETEKRFASLGVGKIAVQSG-VIYDEAEIKAIRAKKDRLRQSGAKAPDYIPLDGG 224

Query: 2333 SN--HGAAEGLSDEEPEFQGRIALLGDKTDVAKK--GVFESVDERGIENDLRKXXXXXXX 2166
            S+   G AEG SDEEPEF  R+A+ G++T   KK  GVFE  D   ++ D R        
Sbjct: 225  SSSLRGDAEGSSDEEPEFPRRVAMFGERTASGKKKKGVFEDDD---VDEDERPVVARVEN 281

Query: 2165 XXXXXXXXXXXXE-QFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQQQHXXXXXXXXXXXX 1989
                        E Q RKGLGKRI+DG                 QQQ             
Sbjct: 282  DYEYVDEDVMWEEEQVRKGLGKRIDDGSVRVGANTSSSVAMPQQQQQFSYSTT------- 334

Query: 1988 SVPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAMSSIARTDENLS 1809
             V   P+IGGA+G S+  + MSI+            N+ RLKES+ R MSS+ +TDE+LS
Sbjct: 335  -VTPIPSIGGAIGASQGLDTMSIAQKAESAMKALQTNVNRLKESHARTMSSLKKTDEDLS 393

Query: 1808 SSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERAS 1629
            SSL  ITDLE SLSAAGEKFIFMQKLRD+VSVICDFLQ KAP+IE LE +MQKL++ERAS
Sbjct: 394  SSLLKITDLESSLSAAGEKFIFMQKLRDYVSVICDFLQDKAPYIETLEAEMQKLNKERAS 453

Query: 1628 AILERRAADNTDEFKEVEAAVSTAMSVLGKGGGXXXXXXXXXXXXXXXXXA--REQSNLS 1455
            AILERRAADN DE  EVEAA+  A  V+G  G                  A  +EQ+NL 
Sbjct: 454  AILERRAADNDDEMTEVEAAIKAATLVIGDRGNSASKLIAASSAAQAAAAAAVKEQTNLP 513

Query: 1454 VQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXXXXX 1275
            V+LDEFGRDMNLQKR D+                    S+  D +   +EG         
Sbjct: 514  VKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKLEGESTTDESDS 573

Query: 1274 XXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSVPAI 1095
               +Y+SNR+ LL+TA  IFSDAAEEYS LSVVKERFE+WK+ YSSSYRDAYMSLS PAI
Sbjct: 574  ETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYRDAYMSLSTPAI 633

Query: 1094 FSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDFNPGDADADLVPGLVEKIA 915
             SPYVRLELLKWDPL+E+ DF++M+WH+LLF+YGLP+D  DF   DADA+LVP LVEK+A
Sbjct: 634  MSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDFAHDDADANLVPTLVEKVA 693

Query: 914  LPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHTRLADAIANLI 735
            LPILHH IA+CWDMLSTR T+NAVSA  LV+ YVP SSEAL++LL AIHTRLA+A+AN+ 
Sbjct: 694  LPILHHDIAYCWDMLSTRETKNAVSATILVMAYVPTSSEALKDLLVAIHTRLAEAVANIA 753

Query: 734  VPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLALDELFCGKVLP 555
            VPTWS L + AVPNAAR+AAY+FG+S+RL+RNICLWK++ ALP+LE+LALDEL C KVLP
Sbjct: 754  VPTWSSLAMSAVPNAARIAAYRFGVSVRLMRNICLWKEVFALPILEKLALDELLCRKVLP 813

Query: 554  HVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTLAKTLEKKHVS 375
            HVRSI +N+HDAI+RTERI++SLSGVW G  V     +KLQPLVD++L+LAKTLEKKH+ 
Sbjct: 814  HVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQPLVDFMLSLAKTLEKKHLP 873

Query: 374  GVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            GV+ESET GLARRLKKMLVELNEYDNAR I+RTF LKEAL
Sbjct: 874  GVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913


>ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citrus clementina]
            gi|557551111|gb|ESR61740.1| hypothetical protein
            CICLE_v10014191mg [Citrus clementina]
          Length = 913

 Score =  850 bits (2196), Expect = 0.0
 Identities = 494/940 (52%), Positives = 604/940 (64%), Gaps = 15/940 (1%)
 Frame = -1

Query: 3029 SSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDEE 2850
            SSR++NFRRRA+D++ N ++                             LLSFAD+E+E+
Sbjct: 3    SSRARNFRRRADDDEDNNDDNTPSVATTTATKKPPSSSKPKK-------LLSFADDEEEK 55

Query: 2849 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDR-----VFTSPSLPSNVQP 2685
                                             HKIT +K+R       +S SL SNVQ 
Sbjct: 56   SEIPTSNRDRTRPSSRLSKPSSS----------HKITASKERQSSSATSSSTSLLSNVQA 105

Query: 2684 QAGEYTKEKLRELQKNTRTL-ASSTPNTSEPVIVLKGFVKPHSVDEDRGNSRXXXXXXXX 2508
            QAG YT+E L EL+KNT+TL A S+   +EPV+VL+G +KP   +  R   +        
Sbjct: 106  QAGTYTEEYLLELRKNTKTLKAPSSKPPAEPVVVLRGSIKPEDSNLTRVQQKPSRDSSDS 165

Query: 2507 XXXXXXXNQ--LASMGIGKSRDSSGSLIPDQATINAIRAKRERLRQSRAAAPDYISLDGG 2334
                    +   AS+G+GK    SG +I D+A I AIRAK++RLRQS A APDYI LDGG
Sbjct: 166  DSDHKAETEKRFASLGVGKIAVQSG-VIYDEAEIKAIRAKKDRLRQSGAKAPDYIPLDGG 224

Query: 2333 SN--HGAAEGLSDEEPEFQGRIALLGDKTDVAKK--GVFESVDERGIENDLRKXXXXXXX 2166
            S+   G AEG SDEEPEF  R+A+ G++T   KK  GVFE  D   ++ D R        
Sbjct: 225  SSSLRGDAEGSSDEEPEFPRRVAMFGERTASGKKKKGVFEDDD---VDEDERPVVARVEN 281

Query: 2165 XXXXXXXXXXXXE-QFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQQQHXXXXXXXXXXXX 1989
                        E Q RKGLGKRI+D                  QQQ             
Sbjct: 282  DYEYVDEDVMWEEEQVRKGLGKRIDDSSVRVGANTSSSVAMPQQQQQFSYPTT------- 334

Query: 1988 SVPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAMSSIARTDENLS 1809
             V   P+IGGA+G S+  + MSI+            N+ RLKES+ R MSS+ +TDE+LS
Sbjct: 335  -VTPIPSIGGAIGASQGLDTMSIAQKAESAMKALQTNVNRLKESHARTMSSLKKTDEDLS 393

Query: 1808 SSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERAS 1629
            SSL  ITDLE SLSAAGE+FIFMQKLRD+VSVICDFLQ KAP+IE LE +MQKL++ERAS
Sbjct: 394  SSLLKITDLESSLSAAGERFIFMQKLRDYVSVICDFLQDKAPYIETLEAEMQKLNKERAS 453

Query: 1628 AILERRAADNTDEFKEVEAAVSTAMSVLGKGGGXXXXXXXXXXXXXXXXXA--REQSNLS 1455
            AILERRAADN DE  EVEAA+  A   +G  G                  A  +EQ+NL 
Sbjct: 454  AILERRAADNDDEMTEVEAAIKAATLFIGDRGNSASKLTAASSAAQAAAAAAIKEQTNLP 513

Query: 1454 VQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXXXXX 1275
            V+LDEFGRDMNLQKR D+                    S+  D +   +EG         
Sbjct: 514  VKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADISSQKLEGESTTDESDS 573

Query: 1274 XXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSVPAI 1095
               +Y+SNR+ LL+TA  IFSDAAEEYS LSVVKERFE+WK+ YSSSYRDAYMSLS PAI
Sbjct: 574  ETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYSSSYRDAYMSLSTPAI 633

Query: 1094 FSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDFNPGDADADLVPGLVEKIA 915
             SPYVRLELLKWDPL+E+ DF++M+WH+LLF+YGLP+D  DF   DADA+LVP LVEK+A
Sbjct: 634  MSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDFAHDDADANLVPTLVEKVA 693

Query: 914  LPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHTRLADAIANLI 735
            LPILHH IA+CWDMLSTR T+N VSA  LV+ YVP SSEAL++LL AIHTRLA+A+AN+ 
Sbjct: 694  LPILHHDIAYCWDMLSTRETKNVVSATILVMAYVPTSSEALKDLLVAIHTRLAEAVANIA 753

Query: 734  VPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLALDELFCGKVLP 555
            VPTWSPL + AVPN+AR+AAY+FG+S+RL+RNICLWK++ ALP+LE+LALDEL C KVLP
Sbjct: 754  VPTWSPLAMSAVPNSARIAAYRFGVSVRLMRNICLWKEVFALPILEKLALDELLCRKVLP 813

Query: 554  HVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTLAKTLEKKHVS 375
            HVRSI +N+HDAI+RTERI++SLSGVW G  V     +KLQPLVD++L+LAKTLEKKH+ 
Sbjct: 814  HVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQPLVDFMLSLAKTLEKKHLP 873

Query: 374  GVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            GV+ESET GLARRLKKMLVELNEYDNAR I+RTF LKEAL
Sbjct: 874  GVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913


>ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [Amborella trichopoda]
            gi|548841232|gb|ERN01295.1| hypothetical protein
            AMTR_s00002p00252610 [Amborella trichopoda]
          Length = 946

 Score =  848 bits (2191), Expect = 0.0
 Identities = 473/848 (55%), Positives = 577/848 (68%), Gaps = 16/848 (1%)
 Frame = -1

Query: 2750 HKITTTKDRV-FTSPSLPSNVQPQAGEYTKEKLRELQKNTRTLASSTPNT----SEPVIV 2586
            HKI   KDR    SPS+PSNVQPQAG+YTKEKL ELQKNT+TL  S P +    +EPVIV
Sbjct: 111  HKIIAGKDRTSIQSPSVPSNVQPQAGQYTKEKLLELQKNTKTLGGSKPPSETKPAEPVIV 170

Query: 2585 LKGFVKPHSVDEDRGNSRXXXXXXXXXXXXXXXNQ-------LASMGIGKSRDSSGSLIP 2427
            LKG VKP  + E+R + +                +       L  MGIG+ ++  GS + 
Sbjct: 171  LKGLVKP--ILEERKSEKTQVRESMENDREKFSREKEEAESSLGKMGIGQPKEEVGSPVL 228

Query: 2426 DQATINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAE----GLSDEEPEFQGRIALLGD 2259
            DQATINAI+AKRERLRQ+R A PDYISLD G      +    G SD+E EFQGRIALLG+
Sbjct: 229  DQATINAIKAKRERLRQARMA-PDYISLDSGGARSMRDSDGLGSSDDESEFQGRIALLGE 287

Query: 2258 KTDVAKKGVFESVDERGIENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXX 2079
              + ++KGVFE+ DE+  E  L++                   EQFRK LGKR++D    
Sbjct: 288  GNNSSRKGVFENADEKVFE--LKREERETEVDDDDEEDKKWEEEQFRKALGKRMDDNSNR 345

Query: 2078 XXXXXXXXXXNQIVQQQHXXXXXXXXXXXXSVPAAPTIGGAVGGSRSAEVMSISXXXXXX 1899
                      +    Q               + +   +G  VG +RS E M+ S      
Sbjct: 346  GSVQSVASAGSVKAVQSSVYSGGSYHGASSGLVS--NLG--VGVTRSVEFMTTSQQAEVA 401

Query: 1898 XXXXXQNIRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFV 1719
                  ++ RLKES+ R +SSI RTD NLS+SLSNI DLE SLSAAGEK++FMQKLRDFV
Sbjct: 402  TQALRDSMARLKESHDRTISSIVRTDNNLSASLSNIIDLEKSLSAAGEKYLFMQKLRDFV 461

Query: 1718 SVICDFLQHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGK 1539
            SVICDFLQ KAPFIEELEEQMQ+LHEERASAI++RRA D+ DE  E+EAAV+ A+SV  K
Sbjct: 462  SVICDFLQDKAPFIEELEEQMQRLHEERASAIVQRRADDDADEMAEIEAAVNAAISVFNK 521

Query: 1538 GGGXXXXXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXX 1359
            GG                   +EQSNL V+LDEFGRD+NLQKRMD               
Sbjct: 522  GGSVSSAASAAQAASLAA---KEQSNLPVELDEFGRDVNLQKRMDSKRRAEARKRRKAWS 578

Query: 1358 XXXXXXSVGDDSAFSHIEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSV 1179
                  +VGD S++  IEG            +YRS+ D LLQTA++IFSDAA+E+S+LSV
Sbjct: 579  ESKRIRTVGDGSSYQRIEGESSTDESDSDSTAYRSSCDELLQTASEIFSDAADEFSNLSV 638

Query: 1178 VKERFERWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFD 999
            VK RFE WK+ Y  +YRDAYMS++  AIFSPYVRLELLKWDPLY+ TDF+DM+WHSLLFD
Sbjct: 639  VKVRFEGWKRQYLPTYRDAYMSMNASAIFSPYVRLELLKWDPLYKYTDFDDMRWHSLLFD 698

Query: 998  YGLPEDTSDFNPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVIN 819
            YG+    S +   D+DADL+P LVEK+ALPILHH IAHCWDMLST+ T+NAVSA  L+I+
Sbjct: 699  YGIKAGASGYESDDSDADLIPKLVEKVALPILHHDIAHCWDMLSTKETKNAVSATKLLID 758

Query: 818  YVPASSEALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRN 639
            Y+PASSEAL+ELL ++ TRL++A++ L VPTWS LVI AVP AA++AAY+FG S+RL++N
Sbjct: 759  YIPASSEALQELLVSVRTRLSEAVSKLKVPTWSTLVINAVPQAAQIAAYRFGTSVRLMKN 818

Query: 638  ICLWKDILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKV 459
            ICLWKDI+ALPVLEQL LDEL C +VLPHVR+I  NIHDAITRTER+++SL+GVWTG  +
Sbjct: 819  ICLWKDIIALPVLEQLVLDELLCARVLPHVRNIMPNIHDAITRTERVVASLAGVWTGRDL 878

Query: 458  IAERSYKLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISR 279
            I +RS KLQPLVDY+++L KTLEKKH  GVS  ET GLARRLK MLVELNEYD  RAI R
Sbjct: 879  IGDRSSKLQPLVDYLMSLGKTLEKKHALGVSTEETTGLARRLKCMLVELNEYDKGRAILR 938

Query: 278  TFQLKEAL 255
            TFQL+EAL
Sbjct: 939  TFQLREAL 946


>ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max]
          Length = 913

 Score =  811 bits (2095), Expect = 0.0
 Identities = 455/848 (53%), Positives = 570/848 (67%), Gaps = 16/848 (1%)
 Frame = -1

Query: 2750 HKITTTKDRVF--TSPSLPSNVQPQAGEYTKEKLRELQKNTRTLASSTPN------TSEP 2595
            HKITT KDR+   +SPS+PSNVQPQAG YTKE LRELQKNTRTL +S+ +      +SEP
Sbjct: 85   HKITTLKDRIAHSSSPSVPSNVQPQAGTYTKEALRELQKNTRTLVTSSSSRSDPKPSSEP 144

Query: 2594 VIVLKGFVKPHSVDEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPDQAT 2415
            VIVLKG VKP       G+                  +LA++GI   ++  GS  PD  T
Sbjct: 145  VIVLKGLVKP------LGSEPQGRDSYSEGEHREVEAKLATVGI---QNKEGSFYPDDET 195

Query: 2414 INAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKG 2235
            I AIRAKRERLRQ+R AAPDYISLDGGSNHGAAEGLSDEEPEF+GRIA+ G+K D  KKG
Sbjct: 196  IRAIRAKRERLRQARPAAPDYISLDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVDGGKKG 255

Query: 2234 VFESVDERGIENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXX 2055
            VFE V+ER ++   +                    EQFRKGLGKR+++G           
Sbjct: 256  VFEEVEERIMDVRFKGGEDEVVDDDDDDEEKMWEEEQFRKGLGKRMDEGSARVDVSVM-- 313

Query: 2054 XXNQIVQQQHXXXXXXXXXXXXSVPAA-----PTIGGAVGGSRSAEVMSISXXXXXXXXX 1890
               Q  Q  H            +VP+A     P+IGG +    + +V+ IS         
Sbjct: 314  ---QGSQSPHNFVVPSAAKVYGAVPSAAASVSPSIGGVIESLPALDVVPISQQAEAARKA 370

Query: 1889 XXQNIRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVI 1710
              +N+RRLKES+GR MSS+++TDENLS+SL NIT LE SL  A EK+ FMQKLR++V+ I
Sbjct: 371  LLENVRRLKESHGRTMSSLSKTDENLSASLLNITALENSLVVADEKYRFMQKLRNYVTNI 430

Query: 1709 CDFLQHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGKGGG 1530
            CDFLQHKA +IEELEEQM+KLHE+RA AI ERRA +N DE  EVE AV  AMSVL K G 
Sbjct: 431  CDFLQHKAFYIEELEEQMKKLHEDRALAISERRATNNDDEMIEVEEAVKAAMSVLSKKGN 490

Query: 1529 XXXXXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXX 1350
                              R+Q +L V+LDEFGRD+NL+KRM++                 
Sbjct: 491  NMEAAKIAAQEAFSAV--RKQRDLPVKLDEFGRDLNLEKRMNMKAKTRSEACQRKRSQAF 548

Query: 1349 XXXSVGDDSAFSH-IEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVK 1173
                V       H IEG            +Y+S  DL+LQ A +IFSDA+EEY  LS+VK
Sbjct: 549  DSNKVTSMELDDHKIEGESSTDESDSESQAYQSQSDLVLQAADEIFSDASEEYGQLSLVK 608

Query: 1172 ERFERWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYG 993
             R E WK+ +SSSY+DAYMSLS+P IFSPYVRLELL+WDPL+   DF +M+W+ LLF YG
Sbjct: 609  SRMEEWKREHSSSYKDAYMSLSLPLIFSPYVRLELLRWDPLHNGVDFQEMKWYKLLFTYG 668

Query: 992  LPEDTSDF--NPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVIN 819
            LPED  DF  + GDAD +LVP LVEK+ALPILH++I+HCWDM+S + T NA++A  L++ 
Sbjct: 669  LPEDGKDFVHDDGDADLELVPNLVEKVALPILHYEISHCWDMVSQQETVNAIAATKLMVQ 728

Query: 818  YVPASSEALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRN 639
            +V   SEAL +LL +I TRLADA+A+L VPTWSP V+ AVP+AARVAAY+FG+S+RLLRN
Sbjct: 729  HVSHESEALADLLVSIQTRLADAVADLTVPTWSPSVLAAVPDAARVAAYRFGVSVRLLRN 788

Query: 638  ICLWKDILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKV 459
            ICLWKD+ ++PVLE++ALDEL C KVLPH+R I+ N+ DAITRTERII+SLSG+W G  V
Sbjct: 789  ICLWKDVFSMPVLEKVALDELLCRKVLPHLRVISENVQDAITRTERIIASLSGIWAGPSV 848

Query: 458  IAERSYKLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISR 279
            I +++ KLQPLV YVL+L + LE+++   V E++T  LARRLKK+L +LNEYD+AR ++R
Sbjct: 849  IGDKNRKLQPLVTYVLSLGRILERRN---VPENDTSHLARRLKKILADLNEYDHARNMAR 905

Query: 278  TFQLKEAL 255
            TF LKEAL
Sbjct: 906  TFHLKEAL 913


>ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max]
          Length = 916

 Score =  809 bits (2089), Expect = 0.0
 Identities = 475/942 (50%), Positives = 595/942 (63%), Gaps = 17/942 (1%)
 Frame = -1

Query: 3029 SSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDEE 2850
            +++S+NFRRR  D D    +                            KLLSFAD+EDE 
Sbjct: 3    TAKSRNFRRRGGD-DTESNDDNDGDTTSTTLPSKPPSSAKPKKKPQAPKLLSFADDEDET 61

Query: 2849 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVF--TSPSLPSNVQPQAG 2676
                                             HKITT KDR+   +SPS+P+NVQPQAG
Sbjct: 62   DENPRPRASKPHRTAATAKKPSSS---------HKITTLKDRIAHTSSPSVPTNVQPQAG 112

Query: 2675 EYTKEKLRELQKNTRTLASSTPN------TSEPVIVLKGFVKPHSVDEDRGNSRXXXXXX 2514
             YTKE LRELQKNTRTL SS+ +      +SEPVIVLKG VKP   +    +S       
Sbjct: 113  TYTKEALRELQKNTRTLVSSSSSRSDPKPSSEPVIVLKGHVKPLGPETQGRDS----DSD 168

Query: 2513 XXXXXXXXXNQLASMGIGKSRDSSGSLIPDQATINAIRAKRERLRQSRAAAPDYISLDGG 2334
                      +LA++GI    DS     PD+ TI AIRAKRERLR +R AAPDYISLDGG
Sbjct: 169  SEGEHREVEAKLATVGIQNKEDS---FYPDEETIRAIRAKRERLRLARPAAPDYISLDGG 225

Query: 2333 SNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVFESVDERGIENDLRKXXXXXXXXXXX 2154
            SNHGAAEGLSDEEPEF+GRIA+ G+K D  KKGVFE V+ER ++   +            
Sbjct: 226  SNHGAAEGLSDEEPEFRGRIAMFGEKVDGGKKGVFEEVEERRVDLRFKGGEEEVLDDDDD 285

Query: 2153 XXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQQQHXXXXXXXXXXXXSVPAA 1974
                    EQFRKGLGKR+++G              Q+   QH            +VP+A
Sbjct: 286  EEEKMWEEEQFRKGLGKRMDEGSARVDVAAAAVQGAQL---QHNFVVPSAAKVYGAVPSA 342

Query: 1973 -----PTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAMSSIARTDENLS 1809
                 P+IGGA+      +V+ IS           +N+RRLKES+GR MSS+++TDENLS
Sbjct: 343  AASVSPSIGGAIESLPVLDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKTDENLS 402

Query: 1808 SSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERAS 1629
            +SL NIT LE SL  A EK+ FMQKLR++V+ ICDFLQHKA +IEELEEQM+KLH++RAS
Sbjct: 403  ASLLNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKACYIEELEEQMKKLHQDRAS 462

Query: 1628 AILERRAADNTDEFKEVEAAVSTAMSVLGKGGGXXXXXXXXXXXXXXXXXAREQSNLSVQ 1449
            AI ERRA +N DE  EVE AV  AMSVL K G                   R+Q +L V+
Sbjct: 463  AIFERRATNNDDEMVEVEEAVKAAMSVLIKKGNNMEAAKIAAQEAFAAV--RKQRDLPVK 520

Query: 1448 LDEFGRDMNLQKRMD--IXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXXXXX 1275
            LDEFGRD+NL+KRM+  +                       DD     IEG         
Sbjct: 521  LDEFGRDLNLEKRMNMKVRAEACQRKRSLAFGYNKVTSMEWDDHK---IEGESSTDESDS 577

Query: 1274 XXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSVPAI 1095
               +Y+S  DL+LQ A +IFSDA+EEY  LS+VK R E WK+ YSS+Y+DAYMSLS+P I
Sbjct: 578  ESQAYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKREYSSTYKDAYMSLSLPLI 637

Query: 1094 FSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDF--NPGDADADLVPGLVEK 921
            FSPYVRLELL+WDPL++  DF +M+W+ LLF YGLPED  DF  + GDAD +LVP LVEK
Sbjct: 638  FSPYVRLELLRWDPLHKGVDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNLVEK 697

Query: 920  IALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHTRLADAIAN 741
            +ALPILH++I+HCWDMLS + T NA++A  L++ +V   SEAL  LL +I TRLADA+AN
Sbjct: 698  VALPILHYEISHCWDMLSQQETVNAIAATKLIVQHVSHESEALAGLLVSIRTRLADAVAN 757

Query: 740  LIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLALDELFCGKV 561
            L VPTWS  V+ AVP+AARVAAY+FG+S+RLLRNI  WKD+ ++ VLE++ALDEL CGKV
Sbjct: 758  LTVPTWSLPVLAAVPDAARVAAYRFGVSVRLLRNIGSWKDVFSMAVLEKVALDELLCGKV 817

Query: 560  LPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTLAKTLEKKH 381
            LPH+R I+ N+ DAITRTERII+SLSGVW+G  VI +++ KLQPLV YVL+L + LE+++
Sbjct: 818  LPHLRVISENVQDAITRTERIIASLSGVWSGPSVIGDKNRKLQPLVTYVLSLGRILERRN 877

Query: 380  VSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
               V ES+T  LARRLKK+LV+LNEYD+AR+++RTF LKEAL
Sbjct: 878  ---VPESDTSHLARRLKKILVDLNEYDHARSMARTFHLKEAL 916


>ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cicer
            arietinum]
          Length = 916

 Score =  808 bits (2087), Expect = 0.0
 Identities = 463/854 (54%), Positives = 566/854 (66%), Gaps = 22/854 (2%)
 Frame = -1

Query: 2750 HKITTTKDRVF--TSPSLPSNVQPQAGEYTKEKLRELQKNTRTLASSTPN---------T 2604
            HKITT KDR+    SPS  SNVQPQAG YTKE LRELQKNTRTL + + +         +
Sbjct: 83   HKITTHKDRISHSPSPSFLSNVQPQAGTYTKEALRELQKNTRTLVTGSTSRPSSTSXXPS 142

Query: 2603 SEPVIVLKGFVKPHSVDEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPD 2424
            SEPVIVLKG +KP S +     S                 + AS+GI    DS   LIPD
Sbjct: 143  SEPVIVLKGLLKPASSEPQGRES------DSEDEHKEVEAKFASVGIQNGNDS---LIPD 193

Query: 2423 QATINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVA 2244
            + TI AIRA+RERLRQ+R AA DYISLDGGSNHGAAEGLSDEEPEF+GRIAL G+K +  
Sbjct: 194  EETIKAIRARRERLRQARPAAQDYISLDGGSNHGAAEGLSDEEPEFRGRIALFGEKGEGG 253

Query: 2243 KKGVFESVDERGIENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXX 2064
            KKGVFE VDERG++                        EQFRKGLGKR+++G        
Sbjct: 254  KKGVFEDVDERGVDGRFN-GGGDVVVEEEDEEEKMWEEEQFRKGLGKRMDEGPGRVSGGD 312

Query: 2063 XXXXXNQIVQQQHXXXXXXXXXXXXSVP--------AAPTIGGAVGGSRSAEVMSISXXX 1908
                    V QQ             +VP         + +IGGA+  + + +V+SIS   
Sbjct: 313  VSVVQ---VAQQPKFVVPSAATVYGAVPNVVAAAASVSTSIGGAIPATPALDVISISQQA 369

Query: 1907 XXXXXXXXQNIRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLR 1728
                     N+RRLKES+GR MSS+ +TDENLS+SL NITDLE SL  A EK+ FMQKLR
Sbjct: 370  EIARKALLDNVRRLKESHGRTMSSLNKTDENLSASLLNITDLENSLVVADEKYRFMQKLR 429

Query: 1727 DFVSVICDFLQHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSV 1548
            ++V+ ICDFLQHKA +IEELE+QM+KLHE+RASAI E+RA +  DE  EVEAAV  AMSV
Sbjct: 430  NYVTNICDFLQHKAFYIEELEDQMKKLHEDRASAIFEKRATNIDDEMVEVEAAVKAAMSV 489

Query: 1547 LGKGGGXXXXXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXX 1368
            L + G                   R+Q +  VQLDEFGRD+NL+KRM +           
Sbjct: 490  LSRKGDNLEAARSAAQDAFSAV--RKQRDFPVQLDEFGRDLNLEKRMKMKVMAEARQRRK 547

Query: 1367 XXXXXXXXXSVGDDSAFSH-IEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYS 1191
                      +       H +EG            +Y+S RDL+LQ A +IFSDA+EEYS
Sbjct: 548  SKAFDSNK--LASMEVDDHKVEGESSTDESDSESQAYQSQRDLVLQAADEIFSDASEEYS 605

Query: 1190 HLSVVKERFERWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHS 1011
             LS+VK + E WK+ Y SSY DAY+SLS+P IFSPYVRLELL+WDPL++  DF +M+W+ 
Sbjct: 606  QLSLVKNKMEEWKREYFSSYNDAYISLSLPLIFSPYVRLELLRWDPLHKGLDFQEMKWYK 665

Query: 1010 LLFDYGLPEDTSDF--NPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSA 837
            LLF YGLPED  DF  + GDAD +LVP LVEK+ALPI H++I+HCWDMLS + T NA+SA
Sbjct: 666  LLFTYGLPEDGKDFVHDDGDADLELVPNLVEKVALPIFHYEISHCWDMLSQQETMNAISA 725

Query: 836  MNLVINYVPASSEALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMS 657
              L++ +V   SEAL ELL +I TRLADA+ANL VPTWSPLV+ AVP+AARVAAY+FG+S
Sbjct: 726  TKLIVQHVSHESEALAELLVSIRTRLADAVANLTVPTWSPLVLSAVPDAARVAAYRFGVS 785

Query: 656  IRLLRNICLWKDILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGV 477
            +RLLRNICLWKDI A+PVLE+LALDEL   KVLPH RSI+ N+HDAITRTERII+SLSGV
Sbjct: 786  VRLLRNICLWKDIFAMPVLEKLALDELLYDKVLPHFRSISENVHDAITRTERIIASLSGV 845

Query: 476  WTGTKVIAERSYKLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDN 297
            W G  V  +R+ KLQPLV YVL+L + LE+++   V ES+T  LARRLKK+LV+LNEYD+
Sbjct: 846  WAGPSVTGDRNRKLQPLVVYVLSLGRVLERRN---VPESDTSYLARRLKKILVDLNEYDH 902

Query: 296  ARAISRTFQLKEAL 255
            AR ++RTF LKEAL
Sbjct: 903  ARNMARTFHLKEAL 916


>ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Solanum
            lycopersicum]
          Length = 941

 Score =  804 bits (2076), Expect = 0.0
 Identities = 470/952 (49%), Positives = 576/952 (60%), Gaps = 26/952 (2%)
 Frame = -1

Query: 3032 MSSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDE 2853
            MS +S+NFRRR  D+    ++                             LLSFAD+E+ 
Sbjct: 1    MSGKSRNFRRRGGDD--GDDDETATKSTNGTAAKPTTTASASAAKPKKKSLLSFADDEES 58

Query: 2852 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVFTSP-SLPSNVQPQAG 2676
            +                                 HK+T+ KDR+   P S  SNVQPQAG
Sbjct: 59   DDTPFVRPSSKPSSASSRITKPSSSSSA------HKLTSGKDRITPKPTSFTSNVQPQAG 112

Query: 2675 EYTKEKLRELQKNTRTLASST---------PNTSEPVIVLKGFVKPH---SVDEDRGNSR 2532
             YTKE L ELQKNTRTL  S          P   EPVIVLKG VKP    S    +    
Sbjct: 113  TYTKEALLELQKNTRTLVGSRSSQPKPEPRPGPVEPVIVLKGLVKPPFSVSAQTQQNGKE 172

Query: 2531 XXXXXXXXXXXXXXXNQLASMGIGKS---RDSSGSLIPDQATINAIRAKRERLRQSRAAA 2361
                           N+L SM + K    +D  GS+IPD+ TI+AIRAKRERLRQ+R AA
Sbjct: 173  SEDDEMDVDQFGGTVNRLGSMALEKDSRKKDDVGSVIPDKMTIDAIRAKRERLRQARPAA 232

Query: 2360 PDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVFESVDERGIENDLRKXX 2181
             D+I+LD G NHG AEGLSDEEPEFQ RI   G+K    +KGVFE  D++ ++ D     
Sbjct: 233  QDFIALDEGGNHGEAEGLSDEEPEFQQRIGFYGEKIGSGRKGVFEDFDDKALQKD---GG 289

Query: 2180 XXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQQQHXXXXXXXX 2001
                             EQ RKGLGKR++DG               +   Q         
Sbjct: 290  FRSDDDEEDEEDKMWEEEQVRKGLGKRLDDGSNRGVMSSVVSSAAAVQNAQKANFGSSAV 349

Query: 2000 XXXXS-------VPAAPTIGGAV-GGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRA 1845
                        V   PTIGG V GG  S + +SIS           +++ RLKES+GR 
Sbjct: 350  GASVYSSVQSIDVSDGPTIGGGVVGGLPSLDALSISMKAEVAKKALYESMGRLKESHGRT 409

Query: 1844 MSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELE 1665
            ++S+ +T+ENLS+SLS +T LE SLSAAGEK++FMQKLRDFVSVIC  LQ K P+IEELE
Sbjct: 410  VTSLHKTEENLSASLSKVTTLENSLSAAGEKYMFMQKLRDFVSVICALLQDKGPYIEELE 469

Query: 1664 EQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGKGGGXXXXXXXXXXXXXXX 1485
            +QMQKLHEERA+AILERRAADN DE KE+EAAVS A  VL +GG                
Sbjct: 470  DQMQKLHEERAAAILERRAADNDDEMKELEAAVSAARQVLSRGGSNAATIEAATAAAQTS 529

Query: 1484 XXA-REQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHI 1308
              A R+  +L V+LDEFGRD NLQKRMD                     ++  DS++  I
Sbjct: 530  TAAMRKGGDLPVELDEFGRDKNLQKRMDTTRRAEARKRRRMKNDVKRMSAIKCDSSYQKI 589

Query: 1307 EGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYR 1128
            EG            +Y+SNRD LLQ + QIF DA EEYS LSVV E+F+RWKK Y+SSYR
Sbjct: 590  EGESSTDESDSESTAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYR 649

Query: 1127 DAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGL-PEDTSDFNPGDAD 951
            DAYMSLS+P IFSPYVRLELLKWDPL+E TDF DM WH+ LF YG+ PE  ++ +  D D
Sbjct: 650  DAYMSLSIPVIFSPYVRLELLKWDPLHENTDFMDMNWHNSLFSYGISPEGETEISADDTD 709

Query: 950  ADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAI 771
             +L+P LVEK+A+PILH+Q+A+CWDMLST  T  AVSAM LV+ Y P S  AL  L+  +
Sbjct: 710  VNLIPQLVEKLAIPILHNQLANCWDMLSTSETVCAVSAMRLVLRYGPFSGSALSNLIAVL 769

Query: 770  HTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQL 591
              RLADA+ANL VPTW  LV++AVP+AARVAAY+FGMSIRL+RNICL+ +I A+PVLE+L
Sbjct: 770  RDRLADAVANLKVPTWDTLVMRAVPDAARVAAYRFGMSIRLIRNICLFHEIFAMPVLEEL 829

Query: 590  ALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVL 411
             LD+L  GK++PH+RSI +NIHDA+TRTER+++SL GVW G K   + S KL+PLVDY+L
Sbjct: 830  VLDQLLSGKIVPHLRSIQSNIHDAVTRTERVVTSLHGVWAGPKATGDCSPKLRPLVDYLL 889

Query: 410  TLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            +LA+ LEKKH S   E ET   ARRLKKMLVELN+YD AR ISRTF +KEAL
Sbjct: 890  SLARVLEKKHSSSSGEIETSKFARRLKKMLVELNQYDYARDISRTFNIKEAL 941


>ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis]
            gi|223548165|gb|EEF49657.1| gc-rich sequence DNA-binding
            factor, putative [Ricinus communis]
          Length = 885

 Score =  802 bits (2072), Expect = 0.0
 Identities = 472/950 (49%), Positives = 571/950 (60%), Gaps = 25/950 (2%)
 Frame = -1

Query: 3029 SSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDEE 2850
            SS+S+NFRRR ++ + N                                LLSFAD+E+E+
Sbjct: 4    SSKSRNFRRRGDENEDNESNSNTTNPSYSSRKSSSKPKK----------LLSFADDEEED 53

Query: 2849 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVF-------TSPSLPSN- 2694
                                             HK+T  KDR+        TS +  SN 
Sbjct: 54   EETPRPSKQKPSKTKSS----------------HKLTAPKDRLSSSSTTSTTSTNTNSNN 97

Query: 2693 -VQPQAGEYTKEKLRELQKNTRTLASST-------PNTSEPVIVLKGFVKP------HSV 2556
             + PQAG YTKE L ELQK TRTLA  +       P++SEP I+LKG +KP      +  
Sbjct: 98   VLLPQAGTYTKEALLELQKKTRTLAKPSSKPPPPPPSSSEPKIILKGLLKPTLPQTLNQQ 157

Query: 2555 DEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPDQATINAIRAKRERLRQ 2376
            D D                                D   SLIPD+ TI  IRAKRERLRQ
Sbjct: 158  DADPPQDEIII------------------------DEDYSLIPDEDTIKKIRAKRERLRQ 193

Query: 2375 SRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLG--DKTDVAKKGVFESVDERGIE 2202
            SRA APDYISLDGG+    ++  SDEEPEF+ RIA++G  D T      VF+  D     
Sbjct: 194  SRATAPDYISLDGGA--ATSDAFSDEEPEFRNRIAMIGKKDNTTPTTHAVFQDFDNG--- 248

Query: 2201 NDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQQQHX 2022
            ND                      EQFRK LGKR++D              + I    + 
Sbjct: 249  NDSHVIAEETVVNDEDEEDKIWEEEQFRKALGKRMDDPSSSTPSLFPTPSTSTITTTNNH 308

Query: 2021 XXXXXXXXXXXSVPAAPTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAM 1842
                            PTIGGA G +   + +S+             N+ RLKES+ R +
Sbjct: 309  RHSHI----------VPTIGGAFGPTPGLDALSVPQQSHIARKALLDNLTRLKESHNRTV 358

Query: 1841 SSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEE 1662
            SS+ + DENLS+SL NIT LE SLSAAGEKFIFMQKLRDFVSVIC+FLQHKAP+IEELEE
Sbjct: 359  SSLTKADENLSASLMNITALEKSLSAAGEKFIFMQKLRDFVSVICEFLQHKAPYIEELEE 418

Query: 1661 QMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLG-KGGGXXXXXXXXXXXXXXX 1485
            QMQ LHE+RASAILERR ADN DE  EV+ A+  A  V   +G                 
Sbjct: 419  QMQTLHEQRASAILERRTADNDDEMMEVKTALEAAKKVFSARGSNEAAITAAMNAAQDAS 478

Query: 1484 XXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIE 1305
               +EQ NL V+LDEFGRD+N QKR+D+                     V  D +   +E
Sbjct: 479  ASMKEQINLPVKLDEFGRDINQQKRLDMKRRAEARQRRKAQKKLSS---VEVDGSNQKVE 535

Query: 1304 GXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRD 1125
            G            +Y+SNRDLLLQTA QIF DA+EEY  LSVVK+RFE WKK YS+SYRD
Sbjct: 536  GESSTDESDSESAAYQSNRDLLLQTADQIFGDASEEYCQLSVVKQRFENWKKEYSTSYRD 595

Query: 1124 AYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDFNPGDADAD 945
            AYMS+S PAIFSPYVRLELLKWDPL+E+  F  M+WHSLL DYGLP+D SD +P DADA+
Sbjct: 596  AYMSISAPAIFSPYVRLELLKWDPLHEDAGFFHMKWHSLLSDYGLPQDGSDLSPEDADAN 655

Query: 944  LVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHT 765
            LVP LVEK+A+PILHH+IAHCWDMLSTR T+NAV A NLV +YVPASSEAL ELL AI T
Sbjct: 656  LVPELVEKVAIPILHHEIAHCWDMLSTRETKNAVFATNLVTDYVPASSEALAELLLAIRT 715

Query: 764  RLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLAL 585
            RL DA+ +++VPTWSP+ +KAVP AA++AAY+FGMS+RL++NICLWKDIL+LPVLE+LAL
Sbjct: 716  RLTDAVVSIMVPTWSPIELKAVPRAAQIAAYRFGMSVRLMKNICLWKDILSLPVLEKLAL 775

Query: 584  DELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTL 405
            D+L C KVLPH++S+ +N+HDA+TRTERII+SLSGVW GT V A RS+KLQPLVD V++L
Sbjct: 776  DDLLCRKVLPHLQSVASNVHDAVTRTERIIASLSGVWAGTSVTASRSHKLQPLVDCVMSL 835

Query: 404  AKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
             K L+ KH  G SE E  GLARRLKKMLVELN+YD AR I+R F L+EAL
Sbjct: 836  GKRLKDKHPLGASEIEVSGLARRLKKMLVELNDYDKAREIARMFSLREAL 885


>ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Solanum tuberosum]
          Length = 939

 Score =  800 bits (2067), Expect = 0.0
 Identities = 470/952 (49%), Positives = 580/952 (60%), Gaps = 26/952 (2%)
 Frame = -1

Query: 3032 MSSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDE 2853
            MS +S+NFRRR  D+  + E                              LLSFAD+ED 
Sbjct: 1    MSGKSRNFRRRGGDDGDDDETSAKTTNGTAAKPTTTASATKPKKKS----LLSFADDEDS 56

Query: 2852 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVFTSP-SLPSNVQPQAG 2676
            +                                 HK+T+ KDR+   P S  SNVQPQAG
Sbjct: 57   DDTPFVRPSSKPSSASSRITKPSSSSSA------HKLTSGKDRITPKPPSFTSNVQPQAG 110

Query: 2675 EYTKEKLRELQKNTRTLASST---------PNTSEPVIVLKGFVKPH---SVDEDRGNSR 2532
             YTKE L ELQKNTRTL  S          P   EPVIVLKG VKP    +    +    
Sbjct: 111  TYTKEALLELQKNTRTLVGSRSAQPKPEPRPGPVEPVIVLKGLVKPPFSVTAQTQQNGQE 170

Query: 2531 XXXXXXXXXXXXXXXNQLASMGIGKS---RDSSGSLIPDQATINAIRAKRERLRQSRAAA 2361
                           N+L SM + K    +D  GS+IPD+ TI+AIRAKRERLRQ+R AA
Sbjct: 171  SEDDEMDVDQFGGTVNRLGSMALEKDSRKKDDVGSVIPDKMTIDAIRAKRERLRQARPAA 230

Query: 2360 PDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVFESVDERGIENDLRKXX 2181
             D+I+LD G NHG AEGLSDEEPEFQ RI   G+K    ++GVFE  +++ ++ D     
Sbjct: 231  QDFIALDEGGNHGEAEGLSDEEPEFQQRIGFYGEKIGSGRRGVFEDFEDKAMQKD---GG 287

Query: 2180 XXXXXXXXXXXXXXXXXEQFRKGLGKRIEDG--XXXXXXXXXXXXXNQIVQQQHXXXXXX 2007
                             EQ RKGLGKR++DG                Q VQ+ +      
Sbjct: 288  FRSDDDEEDEEEKMWEEEQVRKGLGKRLDDGSNRGVMSSVVSSAAAVQNVQKANFGSSAV 347

Query: 2006 XXXXXXSVPA-----APTI-GGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRA 1845
                  SV +      PTI GG VGG  S + +SIS           +++ RLKES+GR 
Sbjct: 348  GASVYSSVQSIDVSDGPTIGGGVVGGLPSLDALSISKKAEVAKKALYESMGRLKESHGRT 407

Query: 1844 MSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELE 1665
            ++S+ +T+ENLS+SLS +T LE SLSAAGEK++FMQKLRDFVSVIC  LQ K P+IEELE
Sbjct: 408  VTSLHKTEENLSASLSKVTTLENSLSAAGEKYMFMQKLRDFVSVICALLQDKGPYIEELE 467

Query: 1664 EQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGKGG-GXXXXXXXXXXXXXX 1488
            +QMQKLHEERA+AILERRAADN DE KE+EAAVS A  VL +GG                
Sbjct: 468  DQMQKLHEERAAAILERRAADNDDEMKELEAAVSAARQVLSRGGSNAATIEAATAAAQTS 527

Query: 1487 XXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHI 1308
                R+  +L ++LDEFGRD NLQKRMD                     ++  DS++  I
Sbjct: 528  TAAMRKGGDLPIELDEFGRDKNLQKRMDTTRRAEARKRRRVKNDVKRMSAIKCDSSYQKI 587

Query: 1307 EGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYR 1128
            EG            +Y+SNRD LLQ + QIF DA EEYS LSVV E+F+RWKK Y+SSYR
Sbjct: 588  EGESSTDESDSESTAYQSNRDQLLQVSEQIFGDAHEEYSQLSVVVEKFDRWKKDYASSYR 647

Query: 1127 DAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGL-PEDTSDFNPGDAD 951
            DAYMSLS+P IFSPYVRLELLKWDPL+E TDF DM WH+ LF YG+ PE  ++ +  D D
Sbjct: 648  DAYMSLSIPVIFSPYVRLELLKWDPLHENTDFMDMNWHNSLFSYGIPPEGEAEISVDDTD 707

Query: 950  ADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAI 771
             +L+P LVEK+A+PILH+Q+A+CWDMLST  T  AVSAM LV+ Y P S  AL  L+  +
Sbjct: 708  VNLIPQLVEKLAIPILHNQLANCWDMLSTSETVCAVSAMRLVLRYGPFSGSALSNLIAVL 767

Query: 770  HTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQL 591
              RLADA+ANL VPTW  LV++AVP+AARVAAY+FGMSIRL+RNICL+ +I A+PVLE+L
Sbjct: 768  RDRLADAVANLKVPTWDTLVMRAVPDAARVAAYRFGMSIRLIRNICLFHEIFAMPVLEEL 827

Query: 590  ALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVL 411
             LD+L  GK+LPH+RSI +NIHDA+TRTER+++SL GVW G K   + S KL+PLVDY+L
Sbjct: 828  VLDQLLSGKILPHLRSIQSNIHDAVTRTERVVTSLHGVWAGPKATGDFSPKLRPLVDYLL 887

Query: 410  TLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            +LA+ LEKKH S   E +T   ARRLKKMLVELN+YD AR ISRTF +KEAL
Sbjct: 888  SLARVLEKKHSSSSGEIDTSKFARRLKKMLVELNQYDYARDISRTFNIKEAL 939


>ref|XP_007160943.1| hypothetical protein PHAVU_001G030200g [Phaseolus vulgaris]
            gi|561034407|gb|ESW32937.1| hypothetical protein
            PHAVU_001G030200g [Phaseolus vulgaris]
          Length = 882

 Score =  791 bits (2042), Expect = 0.0
 Identities = 447/845 (52%), Positives = 564/845 (66%), Gaps = 13/845 (1%)
 Frame = -1

Query: 2750 HKITTTKDRVFTS-PSLPSNVQPQAGEYTKEKLRELQKNTRTLASSTPNTS-----EPVI 2589
            HKITT KDR+ +S PS+PSNVQPQAG YTKE LRELQKNTRTL +S+  +      EPVI
Sbjct: 76   HKITTLKDRIASSSPSVPSNVQPQAGTYTKETLRELQKNTRTLVTSSSRSEPKPPGEPVI 135

Query: 2588 VLKGFVKPHSVDEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLIPDQATIN 2409
            VLKG VKP + +     S                 +L  +G+   +DS     PD+ TI 
Sbjct: 136  VLKGLVKPVASEPQGRES------DSEGDHKEVEGKLGGLGLHNGKDS---FFPDEETIK 186

Query: 2408 AIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKTDVAKKGVF 2229
            AIRAKRERLRQ+R AA DYISLDGGSNHGAAEGLSDEEPEF+GRIA+ G+K +  KKGVF
Sbjct: 187  AIRAKRERLRQARPAAQDYISLDGGSNHGAAEGLSDEEPEFRGRIAMFGEKVEGGKKGVF 246

Query: 2228 ESVDERGIENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXX 2049
            E V+ER ++   ++                    QFRKGLGKR+++G             
Sbjct: 247  EEVEERRVDVRFKEEEEDDDEEEKMWEEE-----QFRKGLGKRMDEGSARVDVP------ 295

Query: 2048 NQIVQ--QQHXXXXXXXXXXXXSVPAA--PTIG-GAVGGSRSAEVMSISXXXXXXXXXXX 1884
              +VQ  QQH             VP+A  P  G G +    + +V+S+S           
Sbjct: 296  --VVQGAQQHKYV----------VPSAAVPNAGFGTIESMPALDVLSLSQQAESAKKALV 343

Query: 1883 QNIRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICD 1704
            +N+RRLKES+GR MSS+++TDENLS+SL NIT LE SL  A +K+ FMQKLR++V+ ICD
Sbjct: 344  ENVRRLKESHGRTMSSLSKTDENLSASLLNITALENSLVVADDKYRFMQKLRNYVTNICD 403

Query: 1703 FLQHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGKGGGXX 1524
            FLQHKA +IEELEEQ++KLH +RA+AI E+R  +N DE  EVEAAV  AMSVL K G   
Sbjct: 404  FLQHKAFYIEELEEQIKKLHGDRATAIFEKRTTNNDDEIVEVEAAVKAAMSVLNKKGNNM 463

Query: 1523 XXXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXX 1344
                            R+Q +L V+LDEFGRD+NL+KRM +                   
Sbjct: 464  EAAKSAAQEAYTAV--RKQKDLPVKLDEFGRDLNLEKRMQMKMRAVARQRKRSQLFDSNK 521

Query: 1343 XSVGDDSAFSHIEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERF 1164
                 +     IEG            +Y S RDL+LQ A +IF DA+EEY  LS+VK R 
Sbjct: 522  L-TSMELDDHKIEGESSTDESDSESQAYESQRDLVLQAADEIFGDASEEYGQLSLVKRRM 580

Query: 1163 ERWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPE 984
            E WK+ YSSSY+DAYMSLS+P +FSPYVRLELL+WDPL++  DF +M+W+ LLF YGLPE
Sbjct: 581  EEWKRDYSSSYKDAYMSLSLPLVFSPYVRLELLRWDPLHKGIDFQEMKWYKLLFTYGLPE 640

Query: 983  DTSDF--NPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVP 810
            D  DF  + GDAD +LVP LVEK+ALPIL ++I+HCWDMLS R T NA++A  L++ +V 
Sbjct: 641  DGKDFVHDDGDADLELVPNLVEKVALPILQYEISHCWDMLSQRETMNAIAATKLIVQHVS 700

Query: 809  ASSEALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICL 630
              SEAL +LL +I TRLADA+ANL VPTWSP+V+ AVP+AARVAAY+FG+S+RLLRNICL
Sbjct: 701  RKSEALTDLLVSIRTRLADAVANLKVPTWSPVVLVAVPDAARVAAYRFGVSVRLLRNICL 760

Query: 629  WKDILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAE 450
            WKD+ +  VLE+LALDEL  GKVLPH+R I+ N+ DAITRTER+I+SLSGVW G  VI +
Sbjct: 761  WKDVFSTSVLEKLALDELLFGKVLPHLRIISENVQDAITRTERVIASLSGVWAGPSVIGD 820

Query: 449  RSYKLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQ 270
            + +KLQPL+ YVL+L + LE+++   V ES+T  LARRLKK+LV+LNEYD+AR ++RTF 
Sbjct: 821  KKHKLQPLLTYVLSLGRILERRN---VPESDTSYLARRLKKILVDLNEYDHARTMARTFH 877

Query: 269  LKEAL 255
            LKEAL
Sbjct: 878  LKEAL 882


>ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-like isoform X1 [Glycine
            max]
          Length = 896

 Score =  789 bits (2037), Expect = 0.0
 Identities = 474/947 (50%), Positives = 586/947 (61%), Gaps = 22/947 (2%)
 Frame = -1

Query: 3029 SSRSKNFRRRAEDEDVNGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLLSFADEEDEE 2850
            +++S+NFRRR  D + N ++                             LLSFAD+E+  
Sbjct: 3    AAKSRNFRRRGGDTEANEDDGDTSTTFRSKPPSSAKPKKPQAPK-----LLSFADDEE-- 55

Query: 2849 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHKITTTKDRVFTSPSLPSNVQPQAGEY 2670
                                            SHKITT KDR+  S S+ SNVQPQAG Y
Sbjct: 56   ------------ISNPRPRSSAKPQRPSKPSSSHKITTLKDRIAHSSSVSSNVQPQAGTY 103

Query: 2669 TKEKLRELQKNTRTLASSTPNT------SEPVIVLKGFVKPHSVDEDRGNSRXXXXXXXX 2508
            TKE LRELQKNTRTL SS+  T      SEPVIVLKG VKP  V E +G           
Sbjct: 104  TKEALRELQKNTRTLVSSSTTTTTSSSRSEPVIVLKGLVKP-VVSEPQGRHSDSEGEHKE 162

Query: 2507 XXXXXXXNQLASMGIGKSRDSSGSLIPDQATINAIRAKRERLRQSRAAAPDYISLDGGSN 2328
                    +L+S+GI   +DS     PD+ TI AIRAKRERLR++R AAPDYISLDGGSN
Sbjct: 163  VEG-----KLSSLGIQNGKDS---FFPDEETIKAIRAKRERLRKARPAAPDYISLDGGSN 214

Query: 2327 HGAAEGLSDEEPEFQGRIALLGDKTDVA-KKGVFESVDER---GIENDLRKXXXXXXXXX 2160
            HGAAEGLSDEEPEF+GRIA+  +K +   KKGVFE V+ER     END            
Sbjct: 215  HGAAEGLSDEEPEFRGRIAMFEEKGEGGGKKGVFEEVEERLRDEEEND-----------D 263

Query: 2159 XXXXXXXXXXEQFRKGLGKRIEDGXXXXXXXXXXXXXNQIVQ--QQHXXXXXXXXXXXXS 1986
                      EQFRKGLGKR+++G               +VQ  QQ+             
Sbjct: 264  DYEEEKMWEEEQFRKGLGKRMDEGAARVDVP--------VVQGAQQNKFVVSSAAAVYGG 315

Query: 1985 VPAA--------PTIGGAVGGSRSAEVMSISXXXXXXXXXXXQNIRRLKESYGRAMSSIA 1830
            VP+A        P+IGGA     + +V+ +S           +N+RRLKES+ R MSS++
Sbjct: 316  VPSADARVPSVSPSIGGATESMPALDVVPMSQQAERARKALVENVRRLKESHERTMSSLS 375

Query: 1829 RTDENLSSSLSNITDLEMSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 1650
            +TDENLS+S   IT LE SL  A EK+ FMQKLR++VS +CDFLQHKA +IEELEEQM+K
Sbjct: 376  KTDENLSASFLKITALENSLVVADEKYRFMQKLRNYVSNMCDFLQHKAFYIEELEEQMKK 435

Query: 1649 LHEERASAILERRAADNTDEFKEVEAAVSTAMSVLGKGGGXXXXXXXXXXXXXXXXXARE 1470
            LHE+RASAI ERR  +N DE  EVEAAV   MSVL K G                   R+
Sbjct: 436  LHEDRASAIFERRTTNNDDEMIEVEAAVKAVMSVLNKKGNNMEAAKSAAQEAFAAV--RK 493

Query: 1469 QSNLSVQLDEFGRDMNLQKRMDIXXXXXXXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXX 1290
            Q +L V+LDEFGRD+NL+KRM +                        +     IEG    
Sbjct: 494  QKDLPVKLDEFGRDLNLEKRMQMKVRAEAHQRKRSQAFNSNKL-ASMELDDPKIEGESST 552

Query: 1289 XXXXXXXXSYRSNRDLLLQTAAQIFSDAAEEYSHLSVVKERFERWKKHYSSSYRDAYMSL 1110
                    +Y+S RDL+LQ A  IFSDA+EEY  LS VK R E WK+ YSSSY+DAYMSL
Sbjct: 553  DESDSESQAYQSQRDLVLQAADGIFSDASEEYGQLSFVKRRMEEWKREYSSSYKDAYMSL 612

Query: 1109 SVPAIFSPYVRLELLKWDPLYEETDFNDMQWHSLLFDYGLPEDTSDF--NPGDADADLVP 936
            S+P +FSPYVRLELL+WDPL++  DF +M+W+ LLF YGLPED  DF  + GDAD +LVP
Sbjct: 613  SLPLVFSPYVRLELLRWDPLHKGLDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLELVP 672

Query: 935  GLVEKIALPILHHQIAHCWDMLSTRGTRNAVSAMNLVINYVPASSEALRELLGAIHTRLA 756
             LVEK+ALPILH++I+HCWDMLS + T NA++A  L++ +V   SEAL +LL +I TRLA
Sbjct: 673  NLVEKVALPILHYEISHCWDMLSQQETVNAIAATKLIVQHVSHESEALADLLVSIRTRLA 732

Query: 755  DAIANLIVPTWSPLVIKAVPNAARVAAYQFGMSIRLLRNICLWKDILALPVLEQLALDEL 576
            DA+ANL VPTWSP V+ AV +AARVAAY+FG+S+RLLRNIC WKD+ ++PVLE LALDEL
Sbjct: 733  DAVANLTVPTWSPPVVAAVADAARVAAYRFGVSVRLLRNICSWKDVFSMPVLENLALDEL 792

Query: 575  FCGKVLPHVRSITANIHDAITRTERIISSLSGVWTGTKVIAERSYKLQPLVDYVLTLAKT 396
              GKVLPH+R I+ N+ DAITRTERII+SLSGVW G  VIA+R  KLQPL+ YVL+L + 
Sbjct: 793  LFGKVLPHLRIISENVQDAITRTERIIASLSGVWAGPSVIADRKRKLQPLLTYVLSLGRI 852

Query: 395  LEKKHVSGVSESETRGLARRLKKMLVELNEYDNARAISRTFQLKEAL 255
            LE+++     ES+T  LARRLKK+LV+LNEYD+AR ++RTF LKEAL
Sbjct: 853  LERRN---APESDTSHLARRLKKILVDLNEYDHARTMARTFHLKEAL 896


>ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like protein [Medicago
            truncatula] gi|355512167|gb|AES93790.1| GC-rich sequence
            DNA-binding factor-like protein [Medicago truncatula]
          Length = 892

 Score =  762 bits (1968), Expect = 0.0
 Identities = 451/859 (52%), Positives = 556/859 (64%), Gaps = 27/859 (3%)
 Frame = -1

Query: 2750 HKITTTKDRVFT---SPSLPSNVQPQAGEYTKEKLRELQKNTRTLA---------SSTPN 2607
            HKITT K+R+ +   SPS PSNVQPQAG YT E LRELQKNTRTL          SS P 
Sbjct: 75   HKITTHKNRITSHSPSPS-PSNVQPQAGTYTLEALRELQKNTRTLVTPTTASRPISSEPK 133

Query: 2606 -TSEPVIVLKGFVKPHSVDEDRGNSRXXXXXXXXXXXXXXXNQLASMGIGKSRDSSGSLI 2430
             +SEPVIVLKG +KP + + +  +                  + AS+GI   +DS     
Sbjct: 134  PSSEPVIVLKGLLKPVTSEPESDSEENGEFEA----------KFASVGIKNGKDS---FF 180

Query: 2429 PDQATINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIALLGDKT- 2253
            P +  I A +AKRER+R++ AAAPDYISLDGGSNHGAAEGLSDEEPE++GRIA+ G K  
Sbjct: 181  PGEEDIKAAKAKRERMRKAGAAAPDYISLDGGSNHGAAEGLSDEEPEYRGRIAMFGGKKG 240

Query: 2252 DVAKKGVFESVDERGIENDLRKXXXXXXXXXXXXXXXXXXXEQFRKGLGKRIEDGXXXXX 2073
            D  KKGVFE  DER                           EQF+KGLGKR ++G     
Sbjct: 241  DGEKKGVFEVADER------------FDDVVVDEEDGLWEEEQFKKGLGKRRDEG----S 284

Query: 2072 XXXXXXXXNQIVQ--QQHXXXXXXXXXXXXSVP-------AAPTIGGAVGGSRSAEVMSI 1920
                      +VQ  QQ             +VP       A  +IGGA+  +   +V+SI
Sbjct: 285  ARVGGGGEVPVVQAAQQPNFVGPSVANVYGAVPNVVAAASANTSIGGAIPATPVLDVISI 344

Query: 1919 SXXXXXXXXXXXQNIRRLKESYGRAMSSIARTDENLSSSLSNITDLEMSLSAAGEKFIFM 1740
            S            NIRRLKES+GR MSS+ +TDENLS+SL  ITDLE SL  A EK+ FM
Sbjct: 345  SQQAEIAKKAMLDNIRRLKESHGRTMSSLNKTDENLSASLLKITDLESSLVVADEKYRFM 404

Query: 1739 QKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASAILERRAADNTDEFKEVEAAVST 1560
            QKLR+++S ICDFLQHKA +IEELE+QM+KLHE+RASAI E+RA +N DE  EVEAAV  
Sbjct: 405  QKLRNYISNICDFLQHKAYYIEELEDQMKKLHEDRASAIFEKRATNNDDEMVEVEAAVKA 464

Query: 1559 AMSVLGKGGGXXXXXXXXXXXXXXXXXAREQSNLSVQLDEFGRDMNLQKR--MDIXXXXX 1386
            AM VL + G                   R+Q +  VQLDEFGRD+NL+KR  M +     
Sbjct: 465  AMLVLSRKG--DNVEAARSAAQDAFAAVRKQRDFPVQLDEFGRDLNLEKRKQMKVMAEAR 522

Query: 1385 XXXXXXXXXXXXXXXSVGDDSAFSHIEGXXXXXXXXXXXXSYRSNRDLLLQTAAQIFSDA 1206
                              DD     +EG            +Y+S RDL+LQ A +IFSDA
Sbjct: 523  QRRRSKAFDSKKSASMEIDD---HKVEGESSTDESDSESQAYQSQRDLVLQAADEIFSDA 579

Query: 1205 AEEYSHLSVVKERFERWKKHYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEETDFND 1026
            +EEYS LS+VK R E WK+ YSSSY +AY+SLS+P IFSPYVRLELL+WDPL++  DF D
Sbjct: 580  SEEYSQLSLVKTRMEEWKREYSSSYNEAYISLSLPLIFSPYVRLELLRWDPLHKGLDFQD 639

Query: 1025 MQWHSLLFDYGLPEDTSDF--NPGDADADLVPGLVEKIALPILHHQIAHCWDMLSTRGTR 852
            M+W+ LLF YGLPED  DF  + GDAD +LVP LVEK+ALPILH++++HCWDMLS + T 
Sbjct: 640  MKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNLVEKVALPILHYEVSHCWDMLSQQETM 699

Query: 851  NAVSAMNLVINYVPASSEALRELLGAIHTRLADAIANLIVPTWSPLVIKAVPNAARVAAY 672
            NA++A  L++ +V   SEAL  LL +I TRLADA+ANL VPTWSPLV+ AVP+AA++AAY
Sbjct: 700  NAIAATKLIVQHVSRESEALAGLLVSIRTRLADAVANLTVPTWSPLVLAAVPDAAKIAAY 759

Query: 671  QFGMSIRLLRNICLWKDILALPVLEQLALDELFCGKVLPHVRSITANIHDAITRTERIIS 492
            +FG+S+RLLRNICLWKDI A+ VLE+LALDEL   KVLPH RSI+ N+ DAITRTERII 
Sbjct: 760  RFGVSVRLLRNICLWKDIFAMSVLEKLALDELLYAKVLPHFRSISENVQDAITRTERIID 819

Query: 491  SLSGVWTGTKVIAERSYKLQPLVDYVLTLAKTLEKKHVSGVSESETRGLARRLKKMLVEL 312
            SLSGVW G  V  ++S KLQPLV YVL+L + LE+++   V ES+   LARRLKK+LV+L
Sbjct: 820  SLSGVWAGPSVTGDKSRKLQPLVAYVLSLGRILERRN---VPESD---LARRLKKILVDL 873

Query: 311  NEYDNARAISRTFQLKEAL 255
            NEYD+AR ++RTF LKEAL
Sbjct: 874  NEYDHARTMARTFHLKEAL 892

