BLASTX nr result

ID: Coptis23_contig00001199 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00001199
         (3008 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...   852   0.0  
ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding fact...   833   0.0  
ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding fact...   833   0.0  
ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putativ...   796   0.0  
ref|XP_003528569.1| PREDICTED: GC-rich sequence DNA-binding fact...   746   0.0  

>ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis
            vinifera]
          Length = 913

 Score =  852 bits (2201), Expect = 0.0
 Identities = 477/924 (51%), Positives = 606/924 (65%), Gaps = 41/924 (4%)
 Frame = +3

Query: 45   MSSRNKNFRRVRHADDEEXXXXXXXXXXXX---------------------RRLSFADEE 161
            MSSR +NFRR    DD +                                 + LSFAD+E
Sbjct: 1    MSSRPRNFRRRADDDDNDDTNGDGPPLIKPTSKPSTTTATTAAAAKPKKPPKLLSFADDE 60

Query: 162  EDDXXXXXXXXXXXXXXXLN------TRKASSSSHRLTSSKERLITPKTTASNTTSSFVT 323
            E++                +      T+ +SSSSH++T++K+RL TP      +++S  +
Sbjct: 61   ENESPSRSSSRSTQPPSRPSKTSSRFTKLSSSSSHKITTTKDRL-TP------SSASLPS 113

Query: 324  NVQPQTGQYTKEKLLELEKNTPTLSRPRHPPTPSQPSNAQPLFILKGLVKPESAXXXXXX 503
            NVQPQ G YTKE L EL+KNT TL+  R   +  +PS  +P+ +LKGLVKP SA      
Sbjct: 114  NVQPQAGTYTKEALRELQKNTRTLASSRPASSEPKPS-LEPVIVLKGLVKPISAAEDAVI 172

Query: 504  XXXXXXXGLAIL---GIDQIDVLDQAAINAIRAKRERLRKSRTVGSDYISLDGGSNHGEA 674
                           G D I   DQA INAIRAKRERLR+SR    DYISLDGGSNHG A
Sbjct: 173  DEENVEEEPESKDKGGRDSIP--DQATINAIRAKRERLRQSRAAAPDYISLDGGSNHGAA 230

Query: 675  EGLSDEEPEFQGRISLLGDK-----KGVFESVDEIEMRNG-NXXXXXXXXXXXXXXXXXX 836
            EGLSDEEPEFQGRI++ G+K     KGVFE VDE  M  G                    
Sbjct: 231  EGLSDEEPEFQGRIAMFGEKPESGKKGVFEDVDERGMEGGFKKDAHDSDDEEEEKIWEEE 290

Query: 837  QFRKGLGKRIDDXXXXXXXXXXXXXXXXXXXXXXYHGTGYQLTHSGTIGPV-IGGSLGVS 1013
            QFRKGLGKR+DD                      Y       +  G   P+ IGG++G  
Sbjct: 291  QFRKGLGKRMDDGSSRVVSSSVPVVQKVQQQKFMYSSVTAYTSVPGVSAPLNIGGAVGPL 350

Query: 1014 RSVEVMSISQKAQIATQAMQQNLKRLRESHGRTTSSIDKVEESLSSSLSNITDLEKSLSA 1193
               + MS+SQ+A++A +A+ +NL+RL+ESHGRT SS+ + +E+LSSSLSNIT LEKSL+A
Sbjct: 351  PGFDAMSLSQQAELAKKALHENLRRLKESHGRTMSSLTRTDENLSSSLSNITTLEKSLTA 410

Query: 1194 AGEKFIYMQKLRDFVSVICDFLKHKKSYLDELEDEVRKLQEDRASAIVERRKNDSTDEMY 1373
            AGEKFI+MQ LRDFVSVICDFL+HK  +++ELE++++KL E+RASAI+ERR  D+ DEM 
Sbjct: 411  AGEKFIFMQXLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASAILERRAADN-DEMM 469

Query: 1374 EIEAAISAARSVLKKGDSSXXXXXXXXXXXXX----MKDQSNLPVELDEFGRDKNLQKRL 1541
            EI+A++ AA SV  K  S+                 M++Q+NLPV+LDE+GRD NLQK +
Sbjct: 470  EIQASVDAAMSVFTKSGSNEAMVAAARTAAQAASAAMREQTNLPVKLDEYGRDINLQKCM 529

Query: 1542 DVMXXXXXXXXXXXXXDLRQVQSVGDDNAYHRVXXXXXXXXXXXXXXXYRDNRDKYLQVA 1721
            D               D +++  + +++++ ++               Y+ NRD  LQ A
Sbjct: 530  DKNRRSEARQRKRDRWDAKRMTFLENESSHQKIEGESSTDESDSETTAYQSNRDLLLQTA 589

Query: 1722 EELFSDASEEYSQLSVVKERFEKWKQLYSPSYRDAYMSLSVPSIFAPYVRLELLKWDPLN 1901
            E++F DA+EEYSQLS VKER E+WK+ YS SYRDAYMSLSVP+IF+PYVRLELLKWDPL 
Sbjct: 590  EQIFGDAAEEYSQLSAVKERIERWKKQYSSSYRDAYMSLSVPAIFSPYVRLELLKWDPLY 649

Query: 1902 EDKDFNDMQWHSLLFHYGVPKDGSDFNADDPDGNLVPGLVEKIALPILHHQIAHCWDMLS 2081
            E+ DF+DM+WHSLLF+YG+ +DG+DF+ DD D NLVP LVE++ALPILHH++AHCWD+ S
Sbjct: 650  EEADFDDMKWHSLLFNYGLSEDGNDFSPDDADANLVPELVERVALPILHHELAHCWDIFS 709

Query: 2082 TRETKNAVYATDLVINYVPASSEALRELLAEIHTRFADAIAKLTVPTWTPLEIKAVPNAA 2261
            TRETKNAV AT+LVI Y+PASSEAL ELLA +H R   A+    VP W  L +KAVPNAA
Sbjct: 710  TRETKNAVSATNLVIRYIPASSEALGELLAVVHKRLYKALTNFMVPPWNILVMKAVPNAA 769

Query: 2262 RVAAYQFGMSIRLLRNICLWKDILRLPTLEKLALDELLGGKILPHVRSVISNIHDAITRT 2441
            RVAAY+FGMSIRL+RNICLWKDIL LP LEKL LD+LL G++LPH+ ++ S++HDAITRT
Sbjct: 770  RVAAYRFGMSIRLMRNICLWKDILALPVLEKLVLDQLLSGQVLPHIENIASDVHDAITRT 829

Query: 2442 ERIVASMSGVWAGPGAMGVRSQKLQALVDYVLTLGKTLEKKHVSGVSESETSGLARRLKK 2621
            ERI++S+SGVWAGP   G RS KLQ LVDYVL LGK LEK+H+ GV+ES+TS LARRLK+
Sbjct: 830  ERIISSLSGVWAGPSVTGERSNKLQPLVDYVLRLGKRLEKRHLPGVTESDTSRLARRLKR 889

Query: 2622 MLVELNEYDRARALARGFQLKEAL 2693
            MLVELNEYD+AR ++R F LKEAL
Sbjct: 890  MLVELNEYDKARDISRTFHLKEAL 913


>ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 889

 Score =  833 bits (2153), Expect = 0.0
 Identities = 470/901 (52%), Positives = 595/901 (66%), Gaps = 17/901 (1%)
 Frame = +3

Query: 42   LMSSRNKNFRRVRHADDEEXXXXXXXXXXXXRRLSFADEEEDDXXXXXXXXXXXXXXXLN 221
            +  SR +NFRR   ADD +               S A  +                   +
Sbjct: 1    MSGSRARNFRR--RADDNDDDDEPKGSTAPSISASNASSKPSSTSSVVATKPKKFQEPSS 58

Query: 222  TRKAS-SSSHRLTSSKERLITPKTTASNTTSSFVTNVQPQTGQYTKEKLLELEKNTPTLS 398
             R A  SS+H++T+ K+R+      +S+ ++S  +NVQPQ G YTKE L EL+KNT TL+
Sbjct: 59   ARLAKPSSTHKITALKDRI----AHSSSISASVPSNVQPQAGVYTKEALRELQKNTRTLA 114

Query: 399  RPRHPPTPSQPSNAQPLFILKGLVKP-----ESAXXXXXXXXXXXXXGLAILGIDQIDVL 563
              R P + S+PS A+P+ +LKGL+KP     +SA             G    G     + 
Sbjct: 115  SSR-PSSESKPS-AEPVIVLKGLLKPAEQVPDSAREAKESSSEDDEAGKDSSGSS---IP 169

Query: 564  DQAAINAIRAKRERLRKSRTVGSDYISLDGGSNHGEAEGLSDEEPEFQGRISLLG----- 728
            DQA INAIRAKRER+R++     DYISLD GSN      LSDEE EF GRI+++G     
Sbjct: 170  DQATINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLES 229

Query: 729  DKKGVFESVDEIEMRNGNXXXXXXXXXXXXXXXXXX-QFRKGLGKRIDDXXXXXXXXXXX 905
             KKGVFE VDE  +                       QFRKGLGKR+DD           
Sbjct: 230  SKKGVFEEVDEQGIDGARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVP 289

Query: 906  XXXXXXXXXXXYHGT-GYQLTHSGTIGPVIGGSLGVSRSVEVMSISQKAQIATQAMQQNL 1082
                       Y  T GY    S +    IGGS+ +S+ ++ +SISQ+A+IA  AMQ+++
Sbjct: 290  VVPSVQPQNLIYPTTIGYSSVPSVSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQESM 349

Query: 1083 KRLRESHGRTTSSIDKVEESLSSSLSNITDLEKSLSAAGEKFIYMQKLRDFVSVICDFLK 1262
             RL+ES+ RT  S+ K +E+LS+SL  ITDLEK+LSAAG+KFI+MQKLRDFVSVICDFL+
Sbjct: 350  GRLKESYRRTAMSVLKTDENLSASLLKITDLEKALSAAGDKFIFMQKLRDFVSVICDFLQ 409

Query: 1263 HKKSYLDELEDEVRKLQEDRASAIVERRKNDSTDEMYEIEAAISAARSVL-KKGDSSXXX 1439
            HK  +++ELE++++KL E+RAS +VERR  D+ DEM EIE A+ AA S+L KKG S+   
Sbjct: 410  HKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEMI 469

Query: 1440 XXXXXXXXXXM---KDQSNLPVELDEFGRDKNLQKRLDVMXXXXXXXXXXXXXDLRQVQS 1610
                      +   ++Q+NLP +LDEFGRD NLQKR+D+              D +++ S
Sbjct: 470  TAATSAAQAAIALSREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLAS 529

Query: 1611 VGDDNAYHRVXXXXXXXXXXXXXXXYRDNRDKYLQVAEELFSDASEEYSQLSVVKERFEK 1790
            +  D  + +V               Y+ NRD  LQ AE++FSDA+EE+SQLSVVK+RFE 
Sbjct: 530  MEVDG-HQKVEGESSTDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEA 588

Query: 1791 WKQLYSPSYRDAYMSLSVPSIFAPYVRLELLKWDPLNEDKDFNDMQWHSLLFHYGVPKDG 1970
            WK+ YS +YRDAYMSLS+P+IF+PYVRLELLKWDPL+E  DF DM WHSLLF+YG+P+DG
Sbjct: 589  WKRDYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDG 648

Query: 1971 SDFNADDPDGNLVPGLVEKIALPILHHQIAHCWDMLSTRETKNAVYATDLVINYVPASSE 2150
            SDF  +D D NLVP LVEK+ALPILHH+IAHCWDMLSTRET+NA +AT L+ NYVP SSE
Sbjct: 649  SDFAPNDADANLVPELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSE 708

Query: 2151 ALRELLAEIHTRFADAIAKLTVPTWTPLEIKAVPNAARVAAYQFGMSIRLLRNICLWKDI 2330
            AL ELL  I TR + AI  LTVPTW  L  KAVPNAAR+AAY+FGMS+RL+RNICLWK+I
Sbjct: 709  ALTELLVVIRTRLSGAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKEI 768

Query: 2331 LRLPTLEKLALDELLGGKILPHVRSVISNIHDAITRTERIVASMSGVWAGPGAMGVRSQK 2510
            + LP LEKLAL+ELL GK+LPHVRS+ +NIHDA+TRTERI+AS++GVW G G +G RS K
Sbjct: 769  IALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHK 828

Query: 2511 LQALVDYVLTLGKTLEKKHVSGVSESETSGLARRLKKMLVELNEYDRARALARGFQLKEA 2690
            LQ LVDYVL LG+TLEKKH+SG++ESETSGLARRLKKMLVELNEYD AR +A+ F LKEA
Sbjct: 829  LQPLVDYVLLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEA 888

Query: 2691 L 2693
            L
Sbjct: 889  L 889


>ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 920

 Score =  833 bits (2151), Expect = 0.0
 Identities = 463/869 (53%), Positives = 589/869 (67%), Gaps = 18/869 (2%)
 Frame = +3

Query: 141  LSFADEEEDDXXXXXXXXXXXXXXXLNTRKAS--SSSHRLTSSKERLITPKTTASNTTSS 314
            LSFA +EE+D                ++ + +  SS+H++T+ K+R+      +S+ ++S
Sbjct: 61   LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRI----AHSSSISAS 116

Query: 315  FVTNVQPQTGQYTKEKLLELEKNTPTLSRPRHPPTPSQPSNAQPLFILKGLVKP-----E 479
              +NVQPQ G YTKE L EL+KNT TL+  R P + S+PS A+P+ +LKGL+KP     +
Sbjct: 117  VPSNVQPQAGVYTKEALRELQKNTRTLASSR-PSSESKPS-AEPVIVLKGLLKPAEQVPD 174

Query: 480  SAXXXXXXXXXXXXXGLAILGIDQIDVLDQAAINAIRAKRERLRKSRTVGSDYISLDGGS 659
            SA             G        I   DQA INAIRAKRER+R++     DYISLD GS
Sbjct: 175  SAREAKESSSEDDEAGRKDSSGSSIP--DQATINAIRAKRERMRQAGVAAPDYISLDAGS 232

Query: 660  NHGEAEGLSDEEPEFQGRISLLG-----DKKGVFESVDEIEMRNGNXXXXXXXXXXXXXX 824
            N      LSDEE EF GRI+++G      KKGVFE VDE  +                  
Sbjct: 233  NRTAPGELSDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQGIDGARTNIIEHSDEDEEEK 292

Query: 825  XXXX-QFRKGLGKRIDDXXXXXXXXXXXXXXXXXXXXXXYHGT-GYQLTHSGTIGPVIGG 998
                 QFRKGLGKR+DD                      Y  T GY    S +    IGG
Sbjct: 293  IWEEEQFRKGLGKRMDDGSTRVESTSVPVVPSVQPQNLIYPTTIGYSSVPSMSTATSIGG 352

Query: 999  SLGVSRSVEVMSISQKAQIATQAMQQNLKRLRESHGRTTSSIDKVEESLSSSLSNITDLE 1178
            S+ +S+ ++ +SISQ+A+IA  AMQ+++ RL+ES+ RT  S+ K +E+LS+SL  ITDLE
Sbjct: 353  SVSISQGLDGLSISQQAEIAKTAMQESMGRLKESYRRTAMSVLKTDENLSASLLKITDLE 412

Query: 1179 KSLSAAGEKFIYMQKLRDFVSVICDFLKHKKSYLDELEDEVRKLQEDRASAIVERRKNDS 1358
            K+LSAAG+KF++MQKLRDFVSVICDFL+HK  +++ELE++++KL E+RAS +VERR  D+
Sbjct: 413  KALSAAGDKFMFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADN 472

Query: 1359 TDEMYEIEAAISAARSVL-KKGDSSXXXXXXXXXXXXXM---KDQSNLPVELDEFGRDKN 1526
             DEM EIE A+ AA S+L KKG S+             +   ++Q+NLP +LDEFGRD N
Sbjct: 473  DDEMVEIETAVKAAISILNKKGSSNEMVTAATSAAQAAIALSREQANLPTKLDEFGRDLN 532

Query: 1527 LQKRLDVMXXXXXXXXXXXXXDLRQVQSVGDDNAYHRVXXXXXXXXXXXXXXXYRDNRDK 1706
            LQKR+D+              D +++ S+  D  + +V               Y+ NRD 
Sbjct: 533  LQKRMDMKRRAEARKRRRSQYDSKRLASMEVDG-HQKVEGESSTDESDSDSAAYQSNRDL 591

Query: 1707 YLQVAEELFSDASEEYSQLSVVKERFEKWKQLYSPSYRDAYMSLSVPSIFAPYVRLELLK 1886
             LQ AE++FSDA+EE+SQLSVVK+RFE WK+ YS +YRDAYMSLS+P+IF+PYVRLELLK
Sbjct: 592  LLQTAEQIFSDAAEEFSQLSVVKQRFEAWKRDYSATYRDAYMSLSIPAIFSPYVRLELLK 651

Query: 1887 WDPLNEDKDFNDMQWHSLLFHYGVPKDGSDFNADDPDGNLVPGLVEKIALPILHHQIAHC 2066
            WDPL+E  DF DM WHSLLF+YG+P+DGSDF  +D D NLVP LVEK+ALPILHH+IAHC
Sbjct: 652  WDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEIAHC 711

Query: 2067 WDMLSTRETKNAVYATDLVINYVPASSEALRELLAEIHTRFADAIAKLTVPTWTPLEIKA 2246
            WDMLSTRET+NA +AT L+ NYVP SSEAL ELL  I TR + AI  LTVPTW  L  KA
Sbjct: 712  WDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLSGAIEDLTVPTWNSLVTKA 771

Query: 2247 VPNAARVAAYQFGMSIRLLRNICLWKDILRLPTLEKLALDELLGGKILPHVRSVISNIHD 2426
            VPNAAR+AAY+FGMS+RL+RNICLWK+I+ LP LEKLAL+ELL GK+LPHVRS+ +NIHD
Sbjct: 772  VPNAARIAAYRFGMSVRLMRNICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHD 831

Query: 2427 AITRTERIVASMSGVWAGPGAMGVRSQKLQALVDYVLTLGKTLEKKHVSGVSESETSGLA 2606
            A+TRTERI+AS++GVW G G +G RS KLQ LVDYVL LG+TLEKKH+SG++ESETSGLA
Sbjct: 832  AVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRTLEKKHISGIAESETSGLA 891

Query: 2607 RRLKKMLVELNEYDRARALARGFQLKEAL 2693
            RRLKKMLVELNEYD AR +A+ F LKEAL
Sbjct: 892  RRLKKMLVELNEYDNARDIAKTFHLKEAL 920


>ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis]
            gi|223548165|gb|EEF49657.1| gc-rich sequence DNA-binding
            factor, putative [Ricinus communis]
          Length = 885

 Score =  796 bits (2057), Expect = 0.0
 Identities = 445/904 (49%), Positives = 574/904 (63%), Gaps = 22/904 (2%)
 Frame = +3

Query: 48   SSRNKNFRRVRHADDEEXXXXXXXXXXXXRR---------LSFADEEEDDXXXXXXXXXX 200
            SS+++NFRR    +++              R         LSFAD+EE+D          
Sbjct: 4    SSKSRNFRRRGDENEDNESNSNTTNPSYSSRKSSSKPKKLLSFADDEEEDEETPRPSKQK 63

Query: 201  XXXXXLNTRKASSSSHRLTSSKERLITPKTTASNTTSSFVTNVQ-PQTGQYTKEKLLELE 377
                       + SSH+LT+ K+RL +  TT++ +T++   NV  PQ G YTKE LLEL+
Sbjct: 64   P--------SKTKSSHKLTAPKDRLSSSSTTSTTSTNTNSNNVLLPQAGTYTKEALLELQ 115

Query: 378  KNTPTLSRPRHPPTPSQPSNAQPLFILKGLVKPESAXXXXXXXXXXXXXGLAILGIDQID 557
            K T TL++P   P P  PS+++P  ILKGL+KP                 + I+  D   
Sbjct: 116  KKTRTLAKPSSKPPPPPPSSSEPKIILKGLLKPTLPQTLNQQDADPPQDEI-IIDEDYSL 174

Query: 558  VLDQAAINAIRAKRERLRKSRTVGSDYISLDGGSNHGEAEGLSDEEPEFQGRISLLGDKK 737
            + D+  I  IRAKRERLR+SR    DYISLDGG+    ++  SDEEPEF+ RI+++G K 
Sbjct: 175  IPDEDTIKKIRAKRERLRQSRATAPDYISLDGGA--ATSDAFSDEEPEFRNRIAMIGKKD 232

Query: 738  GVFESVDEI--EMRNGN------XXXXXXXXXXXXXXXXXXQFRKGLGKRIDDXXXXXXX 893
                +   +  +  NGN                        QFRK LGKR+DD       
Sbjct: 233  NTTPTTHAVFQDFDNGNDSHVIAEETVVNDEDEEDKIWEEEQFRKALGKRMDDPSSSTPS 292

Query: 894  XXXXXXXXXXXXXXXYHGTGYQLTHSGTIGPVIGGSLGVSRSVEVMSISQKAQIATQAMQ 1073
                           +        HS  + P IGG+ G +  ++ +S+ Q++ IA +A+ 
Sbjct: 293  LFPTPSTSTITTTNNHR-------HSHIV-PTIGGAFGPTPGLDALSVPQQSHIARKALL 344

Query: 1074 QNLKRLRESHGRTTSSIDKVEESLSSSLSNITDLEKSLSAAGEKFIYMQKLRDFVSVICD 1253
             NL RL+ESH RT SS+ K +E+LS+SL NIT LEKSLSAAGEKFI+MQKLRDFVSVIC+
Sbjct: 345  DNLTRLKESHNRTVSSLTKADENLSASLMNITALEKSLSAAGEKFIFMQKLRDFVSVICE 404

Query: 1254 FLKHKKSYLDELEDEVRKLQEDRASAIVERRKNDSTDEMYEIEAAISAARSVLKKGDSS- 1430
            FL+HK  Y++ELE++++ L E RASAI+ERR  D+ DEM E++ A+ AA+ V     S+ 
Sbjct: 405  FLQHKAPYIEELEEQMQTLHEQRASAILERRTADNDDEMMEVKTALEAAKKVFSARGSNE 464

Query: 1431 ---XXXXXXXXXXXXXMKDQSNLPVELDEFGRDKNLQKRLDVMXXXXXXXXXXXXXDLRQ 1601
                            MK+Q NLPV+LDEFGRD N QKRLD+                ++
Sbjct: 465  AAITAAMNAAQDASASMKEQINLPVKLDEFGRDINQQKRLDMKRRAEARQRRKAQ---KK 521

Query: 1602 VQSVGDDNAYHRVXXXXXXXXXXXXXXXYRDNRDKYLQVAEELFSDASEEYSQLSVVKER 1781
            + SV  D +  +V               Y+ NRD  LQ A+++F DASEEY QLSVVK+R
Sbjct: 522  LSSVEVDGSNQKVEGESSTDESDSESAAYQSNRDLLLQTADQIFGDASEEYCQLSVVKQR 581

Query: 1782 FEKWKQLYSPSYRDAYMSLSVPSIFAPYVRLELLKWDPLNEDKDFNDMQWHSLLFHYGVP 1961
            FE WK+ YS SYRDAYMS+S P+IF+PYVRLELLKWDPL+ED  F  M+WHSLL  YG+P
Sbjct: 582  FENWKKEYSTSYRDAYMSISAPAIFSPYVRLELLKWDPLHEDAGFFHMKWHSLLSDYGLP 641

Query: 1962 KDGSDFNADDPDGNLVPGLVEKIALPILHHQIAHCWDMLSTRETKNAVYATDLVINYVPA 2141
            +DGSD + +D D NLVP LVEK+A+PILHH+IAHCWDMLSTRETKNAV+AT+LV +YVPA
Sbjct: 642  QDGSDLSPEDADANLVPELVEKVAIPILHHEIAHCWDMLSTRETKNAVFATNLVTDYVPA 701

Query: 2142 SSEALRELLAEIHTRFADAIAKLTVPTWTPLEIKAVPNAARVAAYQFGMSIRLLRNICLW 2321
            SSEAL ELL  I TR  DA+  + VPTW+P+E+KAVP AA++AAY+FGMS+RL++NICLW
Sbjct: 702  SSEALAELLLAIRTRLTDAVVSIMVPTWSPIELKAVPRAAQIAAYRFGMSVRLMKNICLW 761

Query: 2322 KDILRLPTLEKLALDELLGGKILPHVRSVISNIHDAITRTERIVASMSGVWAGPGAMGVR 2501
            KDIL LP LEKLALD+LL  K+LPH++SV SN+HDA+TRTERI+AS+SGVWAG      R
Sbjct: 762  KDILSLPVLEKLALDDLLCRKVLPHLQSVASNVHDAVTRTERIIASLSGVWAGTSVTASR 821

Query: 2502 SQKLQALVDYVLTLGKTLEKKHVSGVSESETSGLARRLKKMLVELNEYDRARALARGFQL 2681
            S KLQ LVD V++LGK L+ KH  G SE E SGLARRLKKMLVELN+YD+AR +AR F L
Sbjct: 822  SHKLQPLVDCVMSLGKRLKDKHPLGASEIEVSGLARRLKKMLVELNDYDKAREIARMFSL 881

Query: 2682 KEAL 2693
            +EAL
Sbjct: 882  REAL 885


>ref|XP_003528569.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Glycine max]
          Length = 913

 Score =  746 bits (1927), Expect = 0.0
 Identities = 424/879 (48%), Positives = 563/879 (64%), Gaps = 28/879 (3%)
 Frame = +3

Query: 141  LSFADEEEDDXXXXXXXXXXXXXXXLNTRKASSSSHRLTSSKERLITPKTTASNTTSSFV 320
            LSFADE+E                   T K  SSSH++T+ K+R+      A +++ S  
Sbjct: 51   LSFADEDEQTDENPRPRASKPYRSAA-TAKKPSSSHKITTLKDRI------AHSSSPSVP 103

Query: 321  TNVQPQTGQYTKEKLLELEKNTPTL---SRPRHPPTPSQPSNAQPLFILKGLVKP----- 476
            +NVQPQ G YTKE L EL+KNT TL   S  R  P PS    ++P+ +LKGLVKP     
Sbjct: 104  SNVQPQAGTYTKEALRELQKNTRTLVTSSSSRSDPKPS----SEPVIVLKGLVKPLGSEP 159

Query: 477  ESAXXXXXXXXXXXXXGLAILGIDQID---VLDQAAINAIRAKRERLRKSRTVGSDYISL 647
            +                LA +GI   +     D   I AIRAKRERLR++R    DYISL
Sbjct: 160  QGRDSYSEGEHREVEAKLATVGIQNKEGSFYPDDETIRAIRAKRERLRQARPAAPDYISL 219

Query: 648  DGGSNHGEAEGLSDEEPEFQGRISLLGDK-----KGVFESVDE----IEMRNGNXXXXXX 800
            DGGSNHG AEGLSDEEPEF+GRI++ G+K     KGVFE V+E    +  + G       
Sbjct: 220  DGGSNHGAAEGLSDEEPEFRGRIAMFGEKVDGGKKGVFEEVEERIMDVRFKGGEDEVVDD 279

Query: 801  XXXXXXXXXXXXQFRKGLGKRIDDXXXXXXXXXXXXXXXXXXXXXXYHGTGYQLTHSG-- 974
                        QFRKGLGKR+D+                           Y    S   
Sbjct: 280  DDDDEEKMWEEEQFRKGLGKRMDEGSARVDVSVMQGSQSPHNFVVPSAAKVYGAVPSAAA 339

Query: 975  TIGPVIGGSLGVSRSVEVMSISQKAQIATQAMQQNLKRLRESHGRTTSSIDKVEESLSSS 1154
            ++ P IGG +    +++V+ ISQ+A+ A +A+ +N++RL+ESHGRT SS+ K +E+LS+S
Sbjct: 340  SVSPSIGGVIESLPALDVVPISQQAEAARKALLENVRRLKESHGRTMSSLSKTDENLSAS 399

Query: 1155 LSNITDLEKSLSAAGEKFIYMQKLRDFVSVICDFLKHKKSYLDELEDEVRKLQEDRASAI 1334
            L NIT LE SL  A EK+ +MQKLR++V+ ICDFL+HK  Y++ELE++++KL EDRA AI
Sbjct: 400  LLNITALENSLVVADEKYRFMQKLRNYVTNICDFLQHKAFYIEELEEQMKKLHEDRALAI 459

Query: 1335 VERRKNDSTDEMYEIEAAISAARSVL-KKGDSSXXXXXXXXXXXXXMKDQSNLPVELDEF 1511
             ERR  ++ DEM E+E A+ AA SVL KKG++              ++ Q +LPV+LDEF
Sbjct: 460  SERRATNNDDEMIEVEEAVKAAMSVLSKKGNNMEAAKIAAQEAFSAVRKQRDLPVKLDEF 519

Query: 1512 GRDKNLQKRLDVMXXXXXXXXXXXXX---DLRQVQSVGDDNAYHRVXXXXXXXXXXXXXX 1682
            GRD NL+KR+++                 D  +V S+  D+  H++              
Sbjct: 520  GRDLNLEKRMNMKAKTRSEACQRKRSQAFDSNKVTSMELDD--HKIEGESSTDESDSESQ 577

Query: 1683 XYRDNRDKYLQVAEELFSDASEEYSQLSVVKERFEKWKQLYSPSYRDAYMSLSVPSIFAP 1862
             Y+   D  LQ A+E+FSDASEEY QLS+VK R E+WK+ +S SY+DAYMSLS+P IF+P
Sbjct: 578  AYQSQSDLVLQAADEIFSDASEEYGQLSLVKSRMEEWKREHSSSYKDAYMSLSLPLIFSP 637

Query: 1863 YVRLELLKWDPLNEDKDFNDMQWHSLLFHYGVPKDGSDFNADDPDGNL--VPGLVEKIAL 2036
            YVRLELL+WDPL+   DF +M+W+ LLF YG+P+DG DF  DD D +L  VP LVEK+AL
Sbjct: 638  YVRLELLRWDPLHNGVDFQEMKWYKLLFTYGLPEDGKDFVHDDGDADLELVPNLVEKVAL 697

Query: 2037 PILHHQIAHCWDMLSTRETKNAVYATDLVINYVPASSEALRELLAEIHTRFADAIAKLTV 2216
            PILH++I+HCWDM+S +ET NA+ AT L++ +V   SEAL +LL  I TR ADA+A LTV
Sbjct: 698  PILHYEISHCWDMVSQQETVNAIAATKLMVQHVSHESEALADLLVSIQTRLADAVADLTV 757

Query: 2217 PTWTPLEIKAVPNAARVAAYQFGMSIRLLRNICLWKDILRLPTLEKLALDELLGGKILPH 2396
            PTW+P  + AVP+AARVAAY+FG+S+RLLRNICLWKD+  +P LEK+ALDELL  K+LPH
Sbjct: 758  PTWSPSVLAAVPDAARVAAYRFGVSVRLLRNICLWKDVFSMPVLEKVALDELLCRKVLPH 817

Query: 2397 VRSVISNIHDAITRTERIVASMSGVWAGPGAMGVRSQKLQALVDYVLTLGKTLEKKHVSG 2576
            +R +  N+ DAITRTERI+AS+SG+WAGP  +G +++KLQ LV YVL+LG+ LE+++   
Sbjct: 818  LRVISENVQDAITRTERIIASLSGIWAGPSVIGDKNRKLQPLVTYVLSLGRILERRN--- 874

Query: 2577 VSESETSGLARRLKKMLVELNEYDRARALARGFQLKEAL 2693
            V E++TS LARRLKK+L +LNEYD AR +AR F LKEAL
Sbjct: 875  VPENDTSHLARRLKKILADLNEYDHARNMARTFHLKEAL 913

