BLASTX nr result

ID: Rauwolfia21_contig00012057 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00012057
         (1512 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   297   9e-78
ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding fact...   295   3e-77
emb|CBI27069.3| unnamed protein product [Vitis vinifera]              292   3e-76
ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...   277   8e-72
gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus pe...   270   1e-69
gb|EPS73173.1| hypothetical protein M569_01583, partial [Genlise...   256   1e-65
ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   243   1e-61
gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus...   243   2e-61
ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   241   5e-61
ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding fact...   239   2e-60
ref|XP_006583671.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   238   7e-60
ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   238   7e-60
gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein,...   234   6e-59
gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus nota...   223   2e-55
ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like pro...   222   4e-55
ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding fact...   219   3e-54
ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding fact...   218   7e-54
ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding fact...   214   8e-53
ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [A...   203   1e-49
ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citr...   197   1e-47

>ref|XP_006362530.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Solanum tuberosum]
          Length = 939

 Score =  297 bits (760), Expect = 9e-78
 Identities = 187/354 (52%), Positives = 228/354 (64%), Gaps = 1/354 (0%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR  P   S  SNVQPQAGTYTKEALLELQKNTRTL  SR A+P  KP+P     EPVI
Sbjct: 91   KDRITPKPPSFTSNVQPQAGTYTKEALLELQKNTRTLVGSRSAQP--KPEPRPGPVEPVI 148

Query: 883  VLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDV 704
            VLKGLVKP     +  +T++  Q+S+++EM     ++      RLGSM L K   +K DV
Sbjct: 149  VLKGLVKPPFSVTA--QTQQNGQESEDDEMD---VDQFGGTVNRLGSMALEKDSRKKDDV 203

Query: 703  PGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXX 524
             GSVIPD+ TI+AIRAKRERLRQAR AA D+IALD G NHG AEGLSDEEPEFQ RI   
Sbjct: 204  -GSVIPDKMTIDAIRAKRERLRQARPAAQDFIALDEGGNHGEAEGLSDEEPEFQQRIGFY 262

Query: 523  XXXXXXXXXXXXEDFEQRVVEKDARVES-GXXXXXXXXXXXXEQVRKGLGKRLDEAASRG 347
                        EDFE + ++KD    S              EQVRKGLGKRLD+ ++RG
Sbjct: 263  GEKIGSGRRGVFEDFEDKAMQKDGGFRSDDDEEDEEEKMWEEEQVRKGLGKRLDDGSNRG 322

Query: 346  VSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAV 167
            V  ++ V +A+ VQ+  +    +S  G +S+YSS Q                 LP  DA+
Sbjct: 323  V-MSSVVSSAAAVQNVQKANFGSSAVG-ASVYSSVQSIDVSDGPTIGGGVVGGLPSLDAL 380

Query: 166  SLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            S+S++AE+AKKAL+E+  RLKESH RT+TSL +T+ENLSASL KVTTLENSLSA
Sbjct: 381  SISKKAEVAKKALYESMGRLKESHGRTVTSLHKTEENLSASLSKVTTLENSLSA 434


>ref|XP_004238967.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Solanum
            lycopersicum]
          Length = 941

 Score =  295 bits (756), Expect = 3e-77
 Identities = 184/354 (51%), Positives = 224/354 (63%), Gaps = 1/354 (0%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR  P  +S  SNVQPQAGTYTKEALLELQKNTRTL  SR ++P  KP+P     EPVI
Sbjct: 93   KDRITPKPTSFTSNVQPQAGTYTKEALLELQKNTRTLVGSRSSQP--KPEPRPGPVEPVI 150

Query: 883  VLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDV 704
            VLKGLVKP     +  +   KE + D  ++     ++      RLGSM L K   +K DV
Sbjct: 151  VLKGLVKPPFSVSAQTQQNGKESEDDEMDV-----DQFGGTVNRLGSMALEKDSRKKDDV 205

Query: 703  PGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXX 524
             GSVIPD+ TI+AIRAKRERLRQAR AA D+IALD G NHG AEGLSDEEPEFQ RI   
Sbjct: 206  -GSVIPDKMTIDAIRAKRERLRQARPAAQDFIALDEGGNHGEAEGLSDEEPEFQQRIGFY 264

Query: 523  XXXXXXXXXXXXEDFEQRVVEKDARVES-GXXXXXXXXXXXXEQVRKGLGKRLDEAASRG 347
                        EDF+ + ++KD    S              EQVRKGLGKRLD+ ++RG
Sbjct: 265  GEKIGSGRKGVFEDFDDKALQKDGGFRSDDDEEDEEDKMWEEEQVRKGLGKRLDDGSNRG 324

Query: 346  VSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAV 167
            V  ++ V +A+ VQ+A +    +S  G +S+YSS Q                 LP  DA+
Sbjct: 325  V-MSSVVSSAAAVQNAQKANFGSSAVG-ASVYSSVQSIDVSDGPTIGGGVVGGLPSLDAL 382

Query: 166  SLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            S+S +AE+AKKAL+E+  RLKESH RT+TSL +T+ENLSASL KVTTLENSLSA
Sbjct: 383  SISMKAEVAKKALYESMGRLKESHGRTVTSLHKTEENLSASLSKVTTLENSLSA 436


>emb|CBI27069.3| unnamed protein product [Vitis vinifera]
          Length = 425

 Score =  292 bits (747), Expect = 3e-76
 Identities = 194/357 (54%), Positives = 217/357 (60%), Gaps = 4/357 (1%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR  P  +SLPSNVQPQAGTYTKEAL ELQKNTRTLA SRPA  E KP     S EPVI
Sbjct: 101  KDRLTPSSASLPSNVQPQAGTYTKEALRELQKNTRTLASSRPASSEPKP-----SLEPVI 155

Query: 883  VLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDV 704
            VLKGLVKP I A  D                 ++  E +D   RL SMG+GKG       
Sbjct: 156  VLKGLVKP-ISAAEDA----------------VIDEENEDTETRLASMGIGKGRDS---- 194

Query: 703  PGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXX 524
                IPDQATI AIRAKRERLRQ+RAAAPDYI+LD GSNHGAAEGLSDEEPEFQGRI   
Sbjct: 195  ----IPDQATINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIAMF 250

Query: 523  XXXXXXXXXXXXEDFEQRVVE----KDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAA 356
                        ED ++R +E    KDA                 EQ RKGLGKR+D+ +
Sbjct: 251  GEKPESGKKGVFEDVDERGMEGGFKKDAH---DSDDEEEEKIWEEEQFRKGLGKRMDDGS 307

Query: 355  SRGVSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGF 176
            SR VS+     +  VVQ   QQK   S   S + Y+S                   LPGF
Sbjct: 308  SRVVSS-----SVPVVQKVQQQKFMYS---SVTAYTS---VPGVSAPLNIGGAVGPLPGF 356

Query: 175  DAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            DA+SLSQQAELAKKALHEN +RLKESH RTM+SL RTDENLS+SL  +TTLE SL+A
Sbjct: 357  DAMSLSQQAELAKKALHENLRRLKESHGRTMSSLTRTDENLSSSLSNITTLEKSLTA 413


>ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis
            vinifera]
          Length = 913

 Score =  277 bits (709), Expect = 8e-72
 Identities = 188/358 (52%), Positives = 215/358 (60%), Gaps = 5/358 (1%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR  P  +SLPSNVQPQAGTYTKEAL ELQKNTRTLA SRPA  E KP     S EPVI
Sbjct: 101  KDRLTPSSASLPSNVQPQAGTYTKEALRELQKNTRTLASSRPASSEPKP-----SLEPVI 155

Query: 883  VLKGLVKPKIMA-DSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTD 707
            VLKGLVKP   A D+  + E  E++ ++++ G      RD                    
Sbjct: 156  VLKGLVKPISAAEDAVIDEENVEEEPESKDKGG-----RDS------------------- 191

Query: 706  VPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXX 527
                 IPDQATI AIRAKRERLRQ+RAAAPDYI+LD GSNHGAAEGLSDEEPEFQGRI  
Sbjct: 192  -----IPDQATINAIRAKRERLRQSRAAAPDYISLDGGSNHGAAEGLSDEEPEFQGRIAM 246

Query: 526  XXXXXXXXXXXXXEDFEQRVVE----KDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEA 359
                         ED ++R +E    KDA                 EQ RKGLGKR+D+ 
Sbjct: 247  FGEKPESGKKGVFEDVDERGMEGGFKKDAH---DSDDEEEEKIWEEEQFRKGLGKRMDDG 303

Query: 358  ASRGVSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPG 179
            +SR VS+     +  VVQ   QQK   S   S + Y+S                   LPG
Sbjct: 304  SSRVVSS-----SVPVVQKVQQQKFMYS---SVTAYTS---VPGVSAPLNIGGAVGPLPG 352

Query: 178  FDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            FDA+SLSQQAELAKKALHEN +RLKESH RTM+SL RTDENLS+SL  +TTLE SL+A
Sbjct: 353  FDAMSLSQQAELAKKALHENLRRLKESHGRTMSSLTRTDENLSSSLSNITTLEKSLTA 410


>gb|EMJ26532.1| hypothetical protein PRUPE_ppa001044mg [Prunus persica]
          Length = 925

 Score =  270 bits (690), Expect = 1e-69
 Identities = 178/364 (48%), Positives = 214/364 (58%), Gaps = 9/364 (2%)
 Frame = -3

Query: 1069 DRKDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEP 890
            DR    +   +SLPSNVQPQAGTYTKEAL ELQKNTRTLA SRP            SSEP
Sbjct: 100  DRLAHTSSVSTSLPSNVQPQAGTYTKEALRELQKNTRTLASSRP------------SSEP 147

Query: 889  VIVLKGLVKPKIMADSDGETEKKEQDSDNEE-----MGNLLKNERDDATARLGSMGLGKG 725
             IVLKGLVKP     SD   E +E DSDN+E       +L + ++DDA ARL SMG+   
Sbjct: 148  TIVLKGLVKPTGTI-SDTLREARELDSDNDEEQEKERASLFRRDKDDAEARLASMGI--- 203

Query: 724  FSEKTDVPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEF 545
              +K      + PDQATI AIRAKRERLR++RAAAPD+I+LD+GSNHGAAEGLSDEEPEF
Sbjct: 204  --DKAKGSSGLFPDQATINAIRAKRERLRKSRAAAPDFISLDSGSNHGAAEGLSDEEPEF 261

Query: 544  QGRIXXXXXXXXXXXXXXXEDFEQRVVE---KDARVESGXXXXXXXXXXXXEQVRKGLGK 374
            +GRI               ED + R  +   +   ++              EQ RKGLGK
Sbjct: 262  RGRIAIFGDNMEGSKKGVFEDVDDRAADAVLRQKSIDRDEDEDEEEKIWEEEQFRKGLGK 321

Query: 373  RLDEAASRGVSTNNGVGTASVVQSAHQQKVRNSP-AGSSSMYSSKQXXXXXXXXXXXXXX 197
            R+D+ +S GV +     +A VVQS  Q K   S  AG SS+ S                 
Sbjct: 322  RMDDGSSIGVVST----SAPVVQSVPQPKATYSAMAGYSSVQS-------VPVGPSIGGA 370

Query: 196  XXXLPGFDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLEN 17
                 G + +S+  QAE+AKKAL EN  +LKESH RTM SL +TDENLS+SLL +T LE 
Sbjct: 371  IGASQGSNVMSIKAQAEIAKKALEENVMKLKESHGRTMLSLTKTDENLSSSLLNITALEK 430

Query: 16   SLSA 5
            SLSA
Sbjct: 431  SLSA 434


>gb|EPS73173.1| hypothetical protein M569_01583, partial [Genlisea aurea]
          Length = 765

 Score =  256 bits (655), Expect = 1e-65
 Identities = 180/360 (50%), Positives = 214/360 (59%), Gaps = 7/360 (1%)
 Frame = -3

Query: 1063 KDRNAPYVSS--LPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEP 890
            KDRNAP+ SS  +PSNVQPQAGTYTKE LLELQ+NTRTLA   PAR   KPK E    E 
Sbjct: 103  KDRNAPHPSSSSIPSNVQPQAGTYTKETLLELQRNTRTLAA--PARH--KPKAE---QET 155

Query: 889  VIVLKGLVKPKIMADSDGET-EKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEK 713
            V+VLKGL+KP + +D  G   +    D+D +  GN+     +DAT    S   G GF   
Sbjct: 156  VVVLKGLIKPVVSSDLGGSGHDSAAHDADFD--GNIDLGAENDATLTKLS---GLGFEGG 210

Query: 712  TDVPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRI 533
            ++    VIPD+ATIEAIRAKRERLRQA+AAAPDY+ALD GSNHGAAEGLSDEEPEF+GRI
Sbjct: 211  SEGDKDVIPDRATIEAIRAKRERLRQAKAAAPDYVALDGGSNHGAAEGLSDEEPEFRGRI 270

Query: 532  -XXXXXXXXXXXXXXXEDFEQRVVEKDARVESG-XXXXXXXXXXXXEQVRKGLGKRL-DE 362
                            ED EQR + +D  VESG             EQVRKGLGKRL + 
Sbjct: 271  GFFADKAGVHDKRGVFEDLEQRAMPRDRFVESGSDAEDEEDKMWEEEQVRKGLGKRLGNG 330

Query: 361  AASRGVSTN-NGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXL 185
               +GV+ N  G G  +V      Q     P    S+ +S                    
Sbjct: 331  VGGKGVTVNIAGSGLTTVHHLGGPQ-----PTSGHSIIASSNGDRVSDAASVVGSW---- 381

Query: 184  PGFDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
             G D++S+SQQA+LAKK L  N  RLKESH +T   L + DENLS+SL +VTTLENSLSA
Sbjct: 382  -GLDSMSISQQADLAKKTLTTNLARLKESHRQTKALLDKNDENLSSSLQRVTTLENSLSA 440


>ref|XP_003528569.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max]
          Length = 913

 Score =  243 bits (621), Expect = 1e-61
 Identities = 170/358 (47%), Positives = 204/358 (56%), Gaps = 7/358 (1%)
 Frame = -3

Query: 1063 KDRNAPYVS-SLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPV 887
            KDR A   S S+PSNVQPQAGTYTKEAL ELQKNTRTL  S  +R + KP     SSEPV
Sbjct: 91   KDRIAHSSSPSVPSNVQPQAGTYTKEALRELQKNTRTLVTSSSSRSDPKP-----SSEPV 145

Query: 886  IVLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTD 707
            IVLKGLVKP         +E + +DS +E        E  +  A+L ++G+        +
Sbjct: 146  IVLKGLVKPL-------GSEPQGRDSYSE-------GEHREVEAKLATVGI-------QN 184

Query: 706  VPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXX 527
              GS  PD  TI AIRAKRERLRQAR AAPDYI+LD GSNHGAAEGLSDEEPEF+GRI  
Sbjct: 185  KEGSFYPDDETIRAIRAKRERLRQARPAAPDYISLDGGSNHGAAEGLSDEEPEFRGRIAM 244

Query: 526  XXXXXXXXXXXXXEDFEQRVVEKDARVESG------XXXXXXXXXXXXEQVRKGLGKRLD 365
                         E+ E+R++  D R + G                  EQ RKGLGKR+D
Sbjct: 245  FGEKVDGGKKGVFEEVEERIM--DVRFKGGEDEVVDDDDDDEEKMWEEEQFRKGLGKRMD 302

Query: 364  EAASRGVSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXL 185
            E ++R           SV+Q +  Q   N    S++                       L
Sbjct: 303  EGSAR--------VDVSVMQGS--QSPHNFVVPSAAKVYGAVPSAAASVSPSIGGVIESL 352

Query: 184  PGFDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSL 11
            P  D V +SQQAE A+KAL EN +RLKESH RTM+SL++TDENLSASLL +T LENSL
Sbjct: 353  PALDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKTDENLSASLLNITALENSL 410


>gb|ESW32937.1| hypothetical protein PHAVU_001G030200g [Phaseolus vulgaris]
          Length = 882

 Score =  243 bits (619), Expect = 2e-61
 Identities = 169/352 (48%), Positives = 203/352 (57%), Gaps = 1/352 (0%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR A    S+PSNVQPQAGTYTKE L ELQKNTRTL  S  +R E KP       EPVI
Sbjct: 82   KDRIASSSPSVPSNVQPQAGTYTKETLRELQKNTRTLVTSS-SRSEPKPP-----GEPVI 135

Query: 883  VLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDV 704
            VLKGLVKP         +E + ++SD+E        +  +   +LG +GL  G       
Sbjct: 136  VLKGLVKPVA-------SEPQGRESDSE-------GDHKEVEGKLGGLGLHNG------- 174

Query: 703  PGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXX 524
              S  PD+ TI+AIRAKRERLRQAR AA DYI+LD GSNHGAAEGLSDEEPEF+GRI   
Sbjct: 175  KDSFFPDEETIKAIRAKRERLRQARPAAQDYISLDGGSNHGAAEGLSDEEPEFRGRIAMF 234

Query: 523  XXXXXXXXXXXXEDFEQRVVEKDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAASRGV 344
                        E+ E+R V+   + E              EQ RKGLGKR+DE ++R  
Sbjct: 235  GEKVEGGKKGVFEEVEERRVDVRFK-EEEEDDDEEEKMWEEEQFRKGLGKRMDEGSAR-- 291

Query: 343  STNNGVGTASVVQSAHQQK-VRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAV 167
                      VVQ A Q K V  S A  ++ + + +                 +P  D +
Sbjct: 292  ------VDVPVVQGAQQHKYVVPSAAVPNAGFGTIE----------------SMPALDVL 329

Query: 166  SLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSL 11
            SLSQQAE AKKAL EN +RLKESH RTM+SL++TDENLSASLL +T LENSL
Sbjct: 330  SLSQQAESAKKALVENVRRLKESHGRTMSSLSKTDENLSASLLNITALENSL 381


>ref|XP_006605552.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Glycine max]
          Length = 916

 Score =  241 bits (616), Expect = 5e-61
 Identities = 173/358 (48%), Positives = 205/358 (57%), Gaps = 7/358 (1%)
 Frame = -3

Query: 1063 KDRNAPYVS-SLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPV 887
            KDR A   S S+P+NVQPQAGTYTKEAL ELQKNTRTL  S  +R + KP     SSEPV
Sbjct: 92   KDRIAHTSSPSVPTNVQPQAGTYTKEALRELQKNTRTLVSSSSSRSDPKP-----SSEPV 146

Query: 886  IVLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTD 707
            IVLKG VKP        ET+ ++ DSD+E        E  +  A+L ++G+      K D
Sbjct: 147  IVLKGHVKPL-----GPETQGRDSDSDSE-------GEHREVEAKLATVGI----QNKED 190

Query: 706  VPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXX 527
               S  PD+ TI AIRAKRERLR AR AAPDYI+LD GSNHGAAEGLSDEEPEF+GRI  
Sbjct: 191  ---SFYPDEETIRAIRAKRERLRLARPAAPDYISLDGGSNHGAAEGLSDEEPEFRGRIAM 247

Query: 526  XXXXXXXXXXXXXEDFEQRVVEKDARVESG------XXXXXXXXXXXXEQVRKGLGKRLD 365
                         E+ E+R V  D R + G                  EQ RKGLGKR+D
Sbjct: 248  FGEKVDGGKKGVFEEVEERRV--DLRFKGGEEEVLDDDDDEEEKMWEEEQFRKGLGKRMD 305

Query: 364  EAASRGVSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXL 185
            E ++R V        A+ VQ A  Q   N    S++                       L
Sbjct: 306  EGSAR-VDV-----AAAAVQGAQLQ--HNFVVPSAAKVYGAVPSAAASVSPSIGGAIESL 357

Query: 184  PGFDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSL 11
            P  D V +SQQAE A+KAL EN +RLKESH RTM+SL++TDENLSASLL +T LENSL
Sbjct: 358  PVLDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKTDENLSASLLNITALENSL 415


>ref|XP_004514246.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cicer
            arietinum]
          Length = 916

 Score =  239 bits (611), Expect = 2e-60
 Identities = 167/358 (46%), Positives = 200/358 (55%), Gaps = 7/358 (1%)
 Frame = -3

Query: 1063 KDR--NAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEP 890
            KDR  ++P  S L SNVQPQAGTYTKEAL ELQKNTRTL     +RP         SSEP
Sbjct: 89   KDRISHSPSPSFL-SNVQPQAGTYTKEALRELQKNTRTLVTGSTSRPS--STSXXPSSEP 145

Query: 889  VIVLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKT 710
            VIVLKGL+KP         +E + ++SD+E+       E  +  A+  S+G+  G     
Sbjct: 146  VIVLKGLLKP-------ASSEPQGRESDSED-------EHKEVEAKFASVGIQNGND--- 188

Query: 709  DVPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIX 530
                S+IPD+ TI+AIRA+RERLRQAR AA DYI+LD GSNHGAAEGLSDEEPEF+GRI 
Sbjct: 189  ----SLIPDEETIKAIRARRERLRQARPAAQDYISLDGGSNHGAAEGLSDEEPEFRGRIA 244

Query: 529  XXXXXXXXXXXXXXEDFEQRVVEKDARVESG-----XXXXXXXXXXXXEQVRKGLGKRLD 365
                          ED ++R V  D R   G                 EQ RKGLGKR+D
Sbjct: 245  LFGEKGEGGKKGVFEDVDERGV--DGRFNGGGDVVVEEEDEEEKMWEEEQFRKGLGKRMD 302

Query: 364  EAASRGVSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXL 185
            E   R        G  SVVQ A Q K     A +                          
Sbjct: 303  EGPGRVSG-----GDVSVVQVAQQPKFVVPSAATVYGAVPNVVAAAASVSTSIGGAIPAT 357

Query: 184  PGFDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSL 11
            P  D +S+SQQAE+A+KAL +N +RLKESH RTM+SL +TDENLSASLL +T LENSL
Sbjct: 358  PALDVISISQQAEIARKALLDNVRRLKESHGRTMSSLNKTDENLSASLLNITDLENSL 415


>ref|XP_006583671.1| PREDICTED: PAX3- and PAX7-binding protein 1-like isoform X2 [Glycine
            max]
          Length = 838

 Score =  238 bits (606), Expect = 7e-60
 Identities = 165/351 (47%), Positives = 196/351 (55%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR A + SS+ SNVQPQAGTYTKEAL ELQKNTRTL  S            ++ SEPVI
Sbjct: 83   KDRIA-HSSSVSSNVQPQAGTYTKEALRELQKNTRTLVSSSTTTTT-----SSSRSEPVI 136

Query: 883  VLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDV 704
            VLKGLVKP +       +E + + SD+E        E  +   +L S+G+  G       
Sbjct: 137  VLKGLVKPVV-------SEPQGRHSDSE-------GEHKEVEGKLSSLGIQNG------- 175

Query: 703  PGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXX 524
              S  PD+ TI+AIRAKRERLR+AR AAPDYI+LD GSNHGAAEGLSDEEPEF+GRI   
Sbjct: 176  KDSFFPDEETIKAIRAKRERLRKARPAAPDYISLDGGSNHGAAEGLSDEEPEFRGRIAMF 235

Query: 523  XXXXXXXXXXXXEDFEQRVVEKDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAASRGV 344
                          FE+ V E+    E              EQ RKGLGKR+DE A+R  
Sbjct: 236  EEKGEGGGKKGV--FEE-VEERLRDEEENDDDYEEEKMWEEEQFRKGLGKRMDEGAAR-- 290

Query: 343  STNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAVS 164
                      VVQ A Q K   S A +                         +P  D V 
Sbjct: 291  ------VDVPVVQGAQQNKFVVSSAAAVYGGVPSADARVPSVSPSIGGATESMPALDVVP 344

Query: 163  LSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSL 11
            +SQQAE A+KAL EN +RLKESH RTM+SL++TDENLSAS LK+T LENSL
Sbjct: 345  MSQQAERARKALVENVRRLKESHERTMSSLSKTDENLSASFLKITALENSL 395


>ref|XP_003530304.1| PREDICTED: PAX3- and PAX7-binding protein 1-like isoform X1 [Glycine
            max]
          Length = 896

 Score =  238 bits (606), Expect = 7e-60
 Identities = 165/351 (47%), Positives = 196/351 (55%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR A + SS+ SNVQPQAGTYTKEAL ELQKNTRTL  S            ++ SEPVI
Sbjct: 83   KDRIA-HSSSVSSNVQPQAGTYTKEALRELQKNTRTLVSSSTTTTT-----SSSRSEPVI 136

Query: 883  VLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDV 704
            VLKGLVKP +       +E + + SD+E        E  +   +L S+G+  G       
Sbjct: 137  VLKGLVKPVV-------SEPQGRHSDSE-------GEHKEVEGKLSSLGIQNG------- 175

Query: 703  PGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXX 524
              S  PD+ TI+AIRAKRERLR+AR AAPDYI+LD GSNHGAAEGLSDEEPEF+GRI   
Sbjct: 176  KDSFFPDEETIKAIRAKRERLRKARPAAPDYISLDGGSNHGAAEGLSDEEPEFRGRIAMF 235

Query: 523  XXXXXXXXXXXXEDFEQRVVEKDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAASRGV 344
                          FE+ V E+    E              EQ RKGLGKR+DE A+R  
Sbjct: 236  EEKGEGGGKKGV--FEE-VEERLRDEEENDDDYEEEKMWEEEQFRKGLGKRMDEGAAR-- 290

Query: 343  STNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAVS 164
                      VVQ A Q K   S A +                         +P  D V 
Sbjct: 291  ------VDVPVVQGAQQNKFVVSSAAAVYGGVPSADARVPSVSPSIGGATESMPALDVVP 344

Query: 163  LSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSL 11
            +SQQAE A+KAL EN +RLKESH RTM+SL++TDENLSAS LK+T LENSL
Sbjct: 345  MSQQAERARKALVENVRRLKESHERTMSSLSKTDENLSASFLKITALENSL 395


>gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1
            [Theobroma cacao] gi|508727414|gb|EOY19311.1| GC-rich
            sequence DNA-binding factor-like protein, putative
            isoform 1 [Theobroma cacao]
          Length = 934

 Score =  234 bits (598), Expect = 6e-59
 Identities = 167/365 (45%), Positives = 206/365 (56%), Gaps = 12/365 (3%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLA-PSRPARPELKPKPEAASSEPV 887
            KD   P  S+LPSNVQPQAGTYTKEALLELQKN RTLA PS  A         + SSEP 
Sbjct: 100  KDCKTP--STLPSNVQPQAGTYTKEALLELQKNMRTLAAPSSRA--------SSVSSEPK 149

Query: 886  IVLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTD 707
            IVLKGL+KP+       +    E+D+D  E     K ++DD  +RL +M  GKG     D
Sbjct: 150  IVLKGLLKPQ------SQNLNSERDNDPPE-----KLQKDDTESRLATMAAGKG----VD 194

Query: 706  VPGSVIPDQATIEAIRAKRERLRQARAA-APDYIALDAGSNHGAA--EGLSD-EEPEFQG 539
            +  S  PDQATI+AI+AK++R+R++ A  APDYI+LD GSN G A  E LSD EEPEF G
Sbjct: 195  LDFSAFPDQATIDAIKAKKDRVRKSFARPAPDYISLDRGSNLGGAMEEELSDDEEPEFPG 254

Query: 538  RIXXXXXXXXXXXXXXXEDFEQRVV----EKDARVESGXXXXXXXXXXXXEQVRKGLGKR 371
            R+                  E+R V     KD   +              EQ RKGLGKR
Sbjct: 255  RLFGESGKKGVFEV-----IEERAVGVGLRKDGIHDEDDDDNEEEKMWEEEQFRKGLGKR 309

Query: 370  LDEAASRGVSTNN---GVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXX 200
            +D++++R VS++N   GVG    +Q  HQQ+   S  GS   Y S               
Sbjct: 310  MDDSSNRVVSSSNNSGGVGMVHNMQQQHQQRYGYSTMGS---YGSMMPSVSPAPPSSIVG 366

Query: 199  XXXXLPGFDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLE 20
                  G D  S+SQQAE+ KKAL EN +RLKESH RT++SL + DENLSASL  +T LE
Sbjct: 367  AAGASQGLDVTSISQQAEITKKALQENVRRLKESHDRTISSLTKADENLSASLFNITALE 426

Query: 19   NSLSA 5
             SLSA
Sbjct: 427  KSLSA 431


>gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus notabilis]
          Length = 952

 Score =  223 bits (568), Expect = 2e-55
 Identities = 161/355 (45%), Positives = 195/355 (54%), Gaps = 11/355 (3%)
 Frame = -3

Query: 1036 SLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVIVLKGLVKPK 857
            SLPSNVQPQAGTYTKEAL ELQKNTRTLA S+P            SSEPVIVLKGL+KP 
Sbjct: 127  SLPSNVQPQAGTYTKEALRELQKNTRTLASSKP------------SSEPVIVLKGLLKPS 174

Query: 856  IMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLG-KGFSEKTDVPGSVIPDQ 680
             +A SD + + +E+D  +E     LK  R +    L SM +G KG       P  +IPDQ
Sbjct: 175  ELAKSDWKLDSEEEDEPDE-----LKERRGE----LASMEIGAKGRDRDNSSPEPLIPDQ 225

Query: 679  ATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRI-XXXXXXXXXX 503
            ATI AIRAKRERLRQ+RAAAPD+IALDAGSNHG AEGLSDEEPE Q RI           
Sbjct: 226  ATINAIRAKRERLRQSRAAAPDFIALDAGSNHGEAEGLSDEEPENQTRIAMFGEKAEGPK 285

Query: 502  XXXXXEDFEQRVVE------KDARVESG--XXXXXXXXXXXXEQVRKGLGK-RLDEAASR 350
                 +D + R +E      K   +E                EQ RKGLGK R+D+    
Sbjct: 286  KGVFEDDIDDRGIELGLLRRKQGVLEENHEDDEDEEDKIWEEEQFRKGLGKTRIDDGGKN 345

Query: 349  GVSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDA 170
             V          VV+   QQK  +S  GS ++  S                     G   
Sbjct: 346  SV--------VPVVKRETQQKFVSS-VGSQTLPPSASIGGTFGGSSGGSSTGL---GLGM 393

Query: 169  VSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            +  SQQAE+A  A+ +N +RLKE+H + + SL + D+NLS SLL +T LE SLSA
Sbjct: 394  MPFSQQAEIALNAIDDNVRRLKETHDQDLVSLNKADKNLSDSLLNITALEKSLSA 448


>ref|XP_003610832.1| GC-rich sequence DNA-binding factor-like protein [Medicago
            truncatula] gi|355512167|gb|AES93790.1| GC-rich sequence
            DNA-binding factor-like protein [Medicago truncatula]
          Length = 892

 Score =  222 bits (565), Expect = 4e-55
 Identities = 156/350 (44%), Positives = 196/350 (56%), Gaps = 7/350 (2%)
 Frame = -3

Query: 1039 SSLPSNVQPQAGTYTKEALLELQKNTRTLA-PSRPARP-ELKPKPEAASSEPVIVLKGLV 866
            S  PSNVQPQAGTYT EAL ELQKNTRTL  P+  +RP   +PKP   SSEPVIVLKGL+
Sbjct: 90   SPSPSNVQPQAGTYTLEALRELQKNTRTLVTPTTASRPISSEPKP---SSEPVIVLKGLL 146

Query: 865  KPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDVPGSVIP 686
            KP             E +SD+EE G           A+  S+G+  G         S  P
Sbjct: 147  KPVT----------SEPESDSEENGEF--------EAKFASVGIKNG-------KDSFFP 181

Query: 685  DQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXXXXXXXX 506
             +  I+A +AKRER+R+A AAAPDYI+LD GSNHGAAEGLSDEEPE++GRI         
Sbjct: 182  GEEDIKAAKAKRERMRKAGAAAPDYISLDGGSNHGAAEGLSDEEPEYRGRIAMFGGKKGD 241

Query: 505  XXXXXXED-----FEQRVVEKDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAASRGVS 341
                   +     F+  VV+++                  EQ +KGLGKR DE ++R   
Sbjct: 242  GEKKGVFEVADERFDDVVVDEE------------DGLWEEEQFKKGLGKRRDEGSAR--- 286

Query: 340  TNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAVSL 161
               G G   VVQ+A QQ     P+ ++   +                     P  D +S+
Sbjct: 287  -VGGGGEVPVVQAA-QQPNFVGPSVANVYGAVPNVVAAASANTSIGGAIPATPVLDVISI 344

Query: 160  SQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSL 11
            SQQAE+AKKA+ +N +RLKESH RTM+SL +TDENLSASLLK+T LE+SL
Sbjct: 345  SQQAEIAKKAMLDNIRRLKESHGRTMSSLNKTDENLSASLLKITDLESSL 394


>ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 920

 Score =  219 bits (557), Expect = 3e-54
 Identities = 156/357 (43%), Positives = 194/357 (54%), Gaps = 2/357 (0%)
 Frame = -3

Query: 1069 DRKDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEP 890
            DR   ++   +S+PSNVQPQAG YTKEAL ELQKNTRTLA SRP+  E KP     S+EP
Sbjct: 105  DRIAHSSSISASVPSNVQPQAGVYTKEALRELQKNTRTLASSRPSS-ESKP-----SAEP 158

Query: 889  VIVLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKT 710
            VIVLKGL+KP      D   E KE  S+++E G                         + 
Sbjct: 159  VIVLKGLLKPAEQVP-DSAREAKESSSEDDEAG-------------------------RK 192

Query: 709  DVPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIX 530
            D  GS IPDQATI AIRAKRER+RQA  AAPDYI+LDAGSN  A   LSDEE EF GRI 
Sbjct: 193  DSSGSSIPDQATINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIA 252

Query: 529  XXXXXXXXXXXXXXEDFEQRVVE-KDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAAS 353
                          E+ +++ ++     +               EQ RKGLGKR+D+ ++
Sbjct: 253  MIGGKLESSKKGVFEEVDEQGIDGARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGST 312

Query: 352  RGVSTNNGVGTASVVQSAHQQK-VRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGF 176
            R  ST     +  VV S   Q  +  +  G SS+ S                      G 
Sbjct: 313  RVEST-----SVPVVPSVQPQNLIYPTTIGYSSVPSMS-------TATSIGGSVSISQGL 360

Query: 175  DAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            D +S+SQQAE+AK A+ E+  RLKES+ RT  S+ +TDENLSASLLK+T LE +LSA
Sbjct: 361  DGLSISQQAEIAKTAMQESMGRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSA 417


>ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 889

 Score =  218 bits (554), Expect = 7e-54
 Identities = 156/357 (43%), Positives = 193/357 (54%), Gaps = 2/357 (0%)
 Frame = -3

Query: 1069 DRKDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEP 890
            DR   ++   +S+PSNVQPQAG YTKEAL ELQKNTRTLA SRP+  E KP     S+EP
Sbjct: 75   DRIAHSSSISASVPSNVQPQAGVYTKEALRELQKNTRTLASSRPSS-ESKP-----SAEP 128

Query: 889  VIVLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKT 710
            VIVLKGL+KP      D   E KE  S+++E G                           
Sbjct: 129  VIVLKGLLKPAEQVP-DSAREAKESSSEDDEAGK-------------------------- 161

Query: 709  DVPGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIX 530
            D  GS IPDQATI AIRAKRER+RQA  AAPDYI+LDAGSN  A   LSDEE EF GRI 
Sbjct: 162  DSSGSSIPDQATINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIA 221

Query: 529  XXXXXXXXXXXXXXEDFEQRVVE-KDARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAAS 353
                          E+ +++ ++     +               EQ RKGLGKR+D+ ++
Sbjct: 222  MIGGKLESSKKGVFEEVDEQGIDGARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGST 281

Query: 352  RGVSTNNGVGTASVVQSAHQQK-VRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGF 176
            R  ST     +  VV S   Q  +  +  G SS+ S                      G 
Sbjct: 282  RVEST-----SVPVVPSVQPQNLIYPTTIGYSSVPS-------VSTATSIGGSVSISQGL 329

Query: 175  DAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            D +S+SQQAE+AK A+ E+  RLKES+ RT  S+ +TDENLSASLLK+T LE +LSA
Sbjct: 330  DGLSISQQAEIAKTAMQESMGRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSA 386


>ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Fragaria vesca
            subsp. vesca]
          Length = 914

 Score =  214 bits (545), Expect = 8e-53
 Identities = 156/350 (44%), Positives = 195/350 (55%), Gaps = 5/350 (1%)
 Frame = -3

Query: 1039 SSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVIVLKGLVKP 860
            +SLPSNVQPQAGTYTKEAL ELQKNTRTLA SR +         AA++EP IVL+G +KP
Sbjct: 109  ASLPSNVQPQAGTYTKEALRELQKNTRTLASSRTSSA-------AAAAEPTIVLRGSIKP 161

Query: 859  KIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDVPGSVIPDQ 680
               + +D     +E DSD+EE                      +G  ++        PDQ
Sbjct: 162  ADASIADAVNGARELDSDDEEQ---------------------QGSKDR-------YPDQ 193

Query: 679  ATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAEGLSDEEPEFQGRIXXXXXXXXXXX 500
            ATIEAIR KRERLR+++ AAPD+IALD+GSNHGAAEGLSDEEPEF+ RI           
Sbjct: 194  ATIEAIRKKRERLRKSKPAAPDFIALDSGSNHGAAEGLSDEEPEFRNRI-AMFGEKMENK 252

Query: 499  XXXXEDFEQRVVEKDARVES---GXXXXXXXXXXXXEQVRKGLGKRLD-EAASRGVSTNN 332
                ED +   V+   R ES                EQ RKGLGKR+D + AS GVS + 
Sbjct: 253  KGVFEDVDDTGVDGGLRRESVVVEDDEDEEEKIWEEEQFRKGLGKRVDNDGASLGVSAS- 311

Query: 331  GVGTASVVQSAHQQKVR-NSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAVSLSQ 155
                  V  +A Q K   NS AG S   S                      G +A+S+++
Sbjct: 312  ---VPRVHSAAPQPKASYNSIAGYSLAQS-------LAGVASIGGATGASQGSNALSINE 361

Query: 154  QAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            Q+E+A+KAL EN ++LKESH RT  SL + +E+LSASLL +T LE SLSA
Sbjct: 362  QSEIAQKALLENVRKLKESHGRTKMSLTKANESLSASLLNITDLEKSLSA 411


>ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [Amborella trichopoda]
            gi|548841232|gb|ERN01295.1| hypothetical protein
            AMTR_s00002p00252610 [Amborella trichopoda]
          Length = 946

 Score =  203 bits (517), Expect = 1e-49
 Identities = 148/360 (41%), Positives = 195/360 (54%), Gaps = 7/360 (1%)
 Frame = -3

Query: 1063 KDRNAPYVSSLPSNVQPQAGTYTKEALLELQKNTRTLAPSRPARPELKPKPEAASSEPVI 884
            KDR +    S+PSNVQPQAG YTKE LLELQKNT+TL  S       KP  E   +EPVI
Sbjct: 117  KDRTSIQSPSVPSNVQPQAGQYTKEKLLELQKNTKTLGGS-------KPPSETKPAEPVI 169

Query: 883  VLKGLVKPKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDV 704
            VLKGLVKP +      +T+ +E   ++ E       E+++A + LG MG+G+   E    
Sbjct: 170  VLKGLVKPILEERKSEKTQVRESMENDREK---FSREKEEAESSLGKMGIGQPKEEV--- 223

Query: 703  PGSVIPDQATIEAIRAKRERLRQARAAAPDYIALDAGSNHGAAE----GLSDEEPEFQGR 536
             GS + DQATI AI+AKRERLRQAR A PDYI+LD+G      +    G SD+E EFQGR
Sbjct: 224  -GSPVLDQATINAIKAKRERLRQARMA-PDYISLDSGGARSMRDSDGLGSSDDESEFQGR 281

Query: 535  IXXXXXXXXXXXXXXXEDFEQRVVE--KDARVESGXXXXXXXXXXXXEQVRKGLGKRLDE 362
            I               E+ +++V E  ++ R                EQ RK LGKR+D+
Sbjct: 282  IALLGEGNNSSRKGVFENADEKVFELKREERETEVDDDDEEDKKWEEEQFRKALGKRMDD 341

Query: 361  AASRG-VSTNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXL 185
             ++RG V +    G+   VQS+       S  G+SS   S                    
Sbjct: 342  NSNRGSVQSVASAGSVKAVQSSVYSG--GSYHGASSGLVSNLGVGVTR------------ 387

Query: 184  PGFDAVSLSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
               + ++ SQQAE+A +AL ++  RLKESH RT++S+ RTD NLSASL  +  LE SLSA
Sbjct: 388  -SVEFMTTSQQAEVATQALRDSMARLKESHDRTISSIVRTDNNLSASLSNIIDLEKSLSA 446


>ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citrus clementina]
            gi|557551111|gb|ESR61740.1| hypothetical protein
            CICLE_v10014191mg [Citrus clementina]
          Length = 913

 Score =  197 bits (500), Expect = 1e-47
 Identities = 148/353 (41%), Positives = 197/353 (55%), Gaps = 8/353 (2%)
 Frame = -3

Query: 1039 SSLPSNVQPQAGTYTKEALLELQKNTRTL-APSRPARPELKPKPEAASSEPVIVLKGLVK 863
            +SL SNVQ QAGTYT+E LLEL+KNT+TL APS         KP A   EPV+VL+G +K
Sbjct: 97   TSLLSNVQAQAGTYTEEYLLELRKNTKTLKAPSS--------KPPA---EPVVVLRGSIK 145

Query: 862  PKIMADSDGETEKKEQDSDNEEMGNLLKNERDDATARLGSMGLGKGFSEKTDVPGSVIPD 683
            P+  ++     +K  +DS + +  +  + E+     R  S+G+GK       V   VI D
Sbjct: 146  PED-SNLTRVQQKPSRDSSDSDSDHKAETEK-----RFASLGVGK-----IAVQSGVIYD 194

Query: 682  QATIEAIRAKRERLRQARAAAPDYIALDAGSN--HGAAEGLSDEEPEFQGRIXXXXXXXX 509
            +A I+AIRAK++RLRQ+ A APDYI LD GS+   G AEG SDEEPEF  R+        
Sbjct: 195  EAEIKAIRAKKDRLRQSGAKAPDYIPLDGGSSSLRGDAEGSSDEEPEFPRRVAMFGERTA 254

Query: 508  XXXXXXXEDFEQRVVEKD-----ARVESGXXXXXXXXXXXXEQVRKGLGKRLDEAASRGV 344
                     FE   V++D     ARVE+             EQVRKGLGKR+D+++ R  
Sbjct: 255  SGKKKKGV-FEDDDVDEDERPVVARVENDYEYVDEDVMWEEEQVRKGLGKRIDDSSVRV- 312

Query: 343  STNNGVGTASVVQSAHQQKVRNSPAGSSSMYSSKQXXXXXXXXXXXXXXXXXLPGFDAVS 164
                G  T+S V    QQ+  + P   + + S                      G D +S
Sbjct: 313  ----GANTSSSVAMPQQQQQFSYPTTVTPIPS-------------IGGAIGASQGLDTMS 355

Query: 163  LSQQAELAKKALHENFKRLKESHSRTMTSLARTDENLSASLLKVTTLENSLSA 5
            ++Q+AE A KAL  N  RLKESH+RTM+SL +TDE+LS+SLLK+T LE+SLSA
Sbjct: 356  IAQKAESAMKALQTNVNRLKESHARTMSSLKKTDEDLSSSLLKITDLESSLSA 408

