BLASTX nr result

ID: Cephaelis21_contig00021801 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00021801
         (3651 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274643.2| PREDICTED: methyl-CpG-binding domain-contain...   543   e-151
ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-b...   514   e-143
ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-contain...   509   e-141
ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thalia...   502   e-139
ref|XP_002517349.1| DNA binding protein, putative [Ricinus commu...   489   e-135

>ref|XP_002274643.2| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Vitis
            vinifera]
          Length = 2164

 Score =  543 bits (1399), Expect = e-151
 Identities = 378/1058 (35%), Positives = 535/1058 (50%), Gaps = 63/1058 (5%)
 Frame = +1

Query: 1    AETLAQKFEDLYEKEVLSFVQKTMLPINA---GIESEKERDDIFARVNESLIPKASWEEG 171
            A TL+Q FE ++EKEVL  VQK      +     E+EKE DD     +E  IPKA W+EG
Sbjct: 1162 ARTLSQNFESMFEKEVLPLVQKFTEYAKSECLSAETEKEIDDFLVSASE--IPKAPWDEG 1219

Query: 172  ICKVCGMXXXXXXXXXXXXXXSEYHTYCLNPPLIRIPEGNWYCPSCLXXXXXXXXXXXXX 351
            +CKVCG+              +EYHTYCLNPPL RIPEGNWYCPSC+             
Sbjct: 1220 VCKVCGIDKDDDSVLLCDMCDAEYHTYCLNPPLARIPEGNWYCPSCVAGISMVDVSEHTH 1279

Query: 352  XXVKRYWKGRSQRKYLHKNLEMLAQLASAMELNEYWELTVEERVFLMKFLCDEVLNSAML 531
               +R  K   Q  + H  LE LA LA+AME  EYWEL+V++R FL KFLCDE+LN+A++
Sbjct: 1280 VIAQRQGKN-CQGDFTHAYLESLAHLAAAMEEKEYWELSVDQRTFLFKFLCDELLNTALI 1338

Query: 532  RDHIGQCSIKFGDMLQKLRSLNSERKLLKFKEDNLVANMAKAKVPA-HXXXXXXXXXXXA 708
            R H+ QC+    ++ QKLRS++ E K LK KE+NL A   K      +           +
Sbjct: 1339 RQHLEQCAESSAELQQKLRSISVEWKNLKLKEENLAARAPKVDSGMIYVAGEVGTEGGLS 1398

Query: 709  SVLPDDGKLKSKVPEGSNISPLPGCLGQMEEGEGQQVKGRSGCNSTSITKTPVSEASNSI 888
            S L ++GK  +K      +S  P   G +   + Q   G  G     + K P S  S   
Sbjct: 1399 SALTNNGKCIAKP---HTLSDRPKDFGILSNDQLQVEGGSEGIRPNGLDKHPSSNCSEGN 1455

Query: 889  NTLFQQSGRDDIQSRCTEVAGCKNELLGAFVEHKDVQNVDNVGSSVDLSQGGGSSAAMLC 1068
             TL        ++             + A V+   V +VD+    V    G       L 
Sbjct: 1456 CTLKPIDNEGQLKE------------VHAVVDETQV-SVDHFPHMVYQGNGSSCRPNELH 1502

Query: 1069 MSRQISSDTAGQLSAVDLPSSQCHQCST---QANGSSSQ-------ECNDELSYLKSEIT 1218
            +   +  +  G  +  +L  + C        Q     S        E + EL+ +K++I+
Sbjct: 1503 LQNPLQQEMDGLGTEFNLQVNMCENMEKNDLQGLHHPSDIRIVHVAEHDSELNSIKNDIS 1562

Query: 1219 FLQESIDKLDAELLRLAVRKEYLGRDSDGRIYWIYGRPGAGPWVLVNGCSNVEE------ 1380
             LQ+S+  ++++LL+L+VR+E+LG DS GR+YWI  +PG  PWVLV+G   +++      
Sbjct: 1563 DLQDSMASIESQLLKLSVRREFLGSDSAGRLYWILAKPGWHPWVLVDGSMALQKKEKMRY 1622

Query: 1381 ---------------------------------VFEPDKLFLNFTLWTSHCTRAEIEELV 1461
                                             ++ P+      + W S+ +  EI+ L+
Sbjct: 1623 LKNPGDSSVQKNSTSLSMDILSTLGGSNASCPFLYRPNASISICSQWVSYQSGEEIDALI 1682

Query: 1462 NWLGDGDIRDRELKECILHWQCNKSMDTNDT-ENDVLIRGKEISRINCSVEKAADSDLLT 1638
             WL D D R++ELKE ILH    +  D   T + D +     +SR   S  + A SD L 
Sbjct: 1683 GWLKDADPREKELKESILHLHKLRFRDWKLTGDPDQVDSQTTLSRFPNS--ENAFSDGLL 1740

Query: 1639 TKAVRALEKKFGPCPDVWAIDIRKNLH-EGSLTRLGEMCRCECLEMLWPSRFHCVSCHKS 1815
            TKA   L KK+GP  +    D  K       +T   +M RCECLE +W SR HC SCH++
Sbjct: 1741 TKAGILLGKKYGPWFEPEIADSSKKWDLRSKVTNESKMYRCECLEPIWSSRHHCPSCHRT 1800

Query: 1816 FSTGVELAQHAGGQCKTTSAVCESIQKTEDSMH---KTMLKNEKPGEKCSGSGSAIP--- 1977
            F T ++L +H  G C++     E  +  E+S H   K  +K++   E+ +G    +    
Sbjct: 1801 FFTDIQLEEHNDGSCRSGPPTSE--KSKENSSHLKGKGTMKSKISREESTGDIDMVEIPK 1858

Query: 1978 GSVNEKHDNGSSCFDQPLEPECSFNFQEILSKFKIETSLKELVKEVGLIGSNGVVSFVPS 2157
            G  ++         ++ L   C ++F+EI SKF  + S KELV+E+GLIGS GV SFV S
Sbjct: 1859 GGCSQPRSRLIKFQNEGLV--CPYDFEEICSKFVTKNSNKELVQEIGLIGSKGVPSFVSS 1916

Query: 2158 RSPYLDDPSLTLASSTNNVVNIGDVRSFSESQRLQSDXXXXXXXXXXDISAHVQSSRLDK 2337
            R PY+ D +L L  S       G++++  +    Q +            S    SSR   
Sbjct: 1917 RPPYISDATLLLVPS-------GELKATGDMMLAQGNRIPAGGSG----SFSDNSSRDSA 1965

Query: 2338 VQEVAKAEFV-KPMFPKERCQFTVKDSNSGLGVSKSSIIRESSLVPKVGKACEILRCLKI 2514
              E + A    K    ++  ++++ ++   + V +  +I +SSL P VGK  +ILR LKI
Sbjct: 1966 ANETSAASRTDKSALEQKDKKYSLNNNGPEMEVGRCCVIPQSSLRPLVGKVYQILRQLKI 2025

Query: 2515 NLLDMDAALPEASLKASRSHSDRRCAWRTFVKSASTIYEMVQATIILEDTIKADYLRNDW 2694
            NLLDMDAALPE +LK SR+  ++R AWR FVKSA TI+EMVQATI+LED IK +YL N W
Sbjct: 2026 NLLDMDAALPEEALKPSRADLEKRLAWRAFVKSAETIFEMVQATIMLEDMIKTEYLMNGW 2085

Query: 2695 WYWSSPSAAANISTLSALALRIYTLDSAILYEKPFGN-DTTERFTPDGKFEKESRLGSVP 2871
            WYWSS SAAA  ST+S+LALRIY+LD+AI YEK   N D T+   P  K +         
Sbjct: 2086 WYWSSLSAAAKTSTVSSLALRIYSLDAAIAYEKISSNLDLTDSPKPSSKPDP-------- 2137

Query: 2872 TNNLKPSDQLMQKMPDLDSGENLKPRTRASKRRKDSEG 2985
                KP       +P+LD+ E  K   + +KRRK+SEG
Sbjct: 2138 ----KP-------VPNLDTMEKSKLGRKQNKRRKESEG 2164


>ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain-containing
            protein 9-like [Cucumis sativus]
          Length = 1277

 Score =  514 bits (1324), Expect = e-143
 Identities = 360/1066 (33%), Positives = 518/1066 (48%), Gaps = 72/1066 (6%)
 Frame = +1

Query: 4    ETLAQKFEDLYEKEVLSFVQKTM---LPINAGIESEKERDDIFARVNESLIPKASWEEGI 174
            ETL++ FE LYE EVLS ++K        +   E++ E D     +NE  IPKA W+EG+
Sbjct: 305  ETLSENFERLYENEVLSLIEKLKEFSKLESLSAETKVEVDGFLVSLNE--IPKAPWDEGV 362

Query: 175  CKVCGMXXXXXXXXXXXXXXSEYHTYCLNPPLIRIPEGNWYCPSCLXXXXXXXXXXXXXX 354
            CKVCG+              +EYHTYCLNPPL RIPEGNWYCPSC+              
Sbjct: 363  CKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVMGTRMVEDPSEHTK 422

Query: 355  XVKRYWKGRSQRKYLHKN-LEMLAQLASAMELNEYWELTVEERVFLMKFLCDEVLNSAML 531
             +    KG+  R  + ++ L  LA LA+A+E  EYWE +V+ER+FL+K+LCDE+L+SA++
Sbjct: 423  HIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEYWEFSVDERLFLLKYLCDELLSSALI 482

Query: 532  RDHIGQCSIKFGDMLQKLRSLNSERKLLKFKEDNLVANMAKAKVPAHXXXXXXXXXXXAS 711
            R H+ QC     ++ QKLRS   E K LK +E+ + A  AK                   
Sbjct: 483  RQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVVAARAAKLDTTM-------------- 528

Query: 712  VLPDDGKLKSKVPEGSNISPLPGCLGQMEEGEGQQVKGRSGCNSTSITKTPVSEASNSIN 891
                                    L  + EG+G     R G +                 
Sbjct: 529  ------------------------LSAVREGQGSCDGARLGASD---------------- 548

Query: 892  TLFQQSGRDDIQSRCTEVAGCKNELLGAFVEHKDVQNVDNVGSSVDLSQGGGSSAAMLCM 1071
               Q S    ++++C   A  + ++  A     DV + ++ G +V  S G  +S   +  
Sbjct: 549  ---QYSSLTSLENKCHNHASFQEQMSSAH----DVTDNNDAGGNVLSSSGSQNSGKPVKF 601

Query: 1072 SRQISSDTAGQLSAVD----------LPSSQCHQCSTQANG----------SSSQECNDE 1191
            +    S    ++   D          LPS + +     ANG          + SQ  + E
Sbjct: 602  NEPSLSGLPQEVDGSDQSNMETEISILPSGKQYFTPCDANGVPVAPQVPPPNESQAYHSE 661

Query: 1192 LSYLKSEITFLQESIDKLDAELLRLAVRKEYLGRDSDGRIYWIYGRPGAGPWVLVNGCS- 1368
            L  +K +I  +Q+SI   + ELL+++VR+E+LG D+ GR+YW        P ++ +G S 
Sbjct: 662  LDSIKKDILQVQDSIASTELELLKISVRREFLGSDAAGRLYWASVMSNGLPQIISSGSSV 721

Query: 1369 -----------------NVEEVFEPDKLFLNFTLWTS----------------HCTRAEI 1449
                             N       +   LN  +++S                + T A+I
Sbjct: 722  HIGSESRDRVVKGRFFKNYTSTSNANSSTLNSNMYSSLLHLPKDFIGNSPCISYQTEADI 781

Query: 1450 EELVNWLGDGDIRDRELKECILHWQCNKSMDTNDTENDVLIRGKEISRINCSVEKAADSD 1629
             EL++WL D D ++RELKE IL W   K   ++ + N       + S  +  VEK   S 
Sbjct: 782  LELIDWLKDSDPKERELKESILQWLKPKLQTSSRSNNQSPEEQLKDSSSSSDVEKLECSG 841

Query: 1630 LLTTKAVRALEKKFGPCPD-VWAIDIRKNLHEGSLTRLGEMCRCECLEMLWPSRFHCVSC 1806
             L  +A   LE K+GP  + V   D+ + L +  L    +M RC C+E +WPSR+HC+SC
Sbjct: 842  FLVNRASALLESKYGPFLEFVTPDDLNRWLDKARLAEDEKMFRCVCMEPVWPSRYHCLSC 901

Query: 1807 HKSFSTGVELAQHAGGQCKTTSAVCESIQKTEDSMH-KTMLKNEKPGEKCSGSGSAIPGS 1983
            HKSFST VEL +H  GQC +  A C+ I++  DS   K  +K E   E+ S    A    
Sbjct: 902  HKSFSTDVELEEHDNGQCSSLPASCDGIKEVGDSSKSKCNIKFESKQEESSSMVIAETSR 961

Query: 1984 VNEKHDNGSSCFDQPLEPECSFNFQEILSKFKIETSLKELVKEVGLIGSNGVVSFVPSRS 2163
                H  G   +       C ++F+ I SKF  + S K+L+KE+GLI SNGV SF+ S S
Sbjct: 962  GYFNHSMGLIKYQND-GMMCPYDFELICSKFLTKDSNKDLIKEIGLISSNGVPSFLSSVS 1020

Query: 2164 PYLDDPSLTLASSTNNVVNIGDVRSFSESQRLQSDXXXXXXXXXXDISAHVQSSRLDKVQ 2343
            PY+ +       ST NV+++    S  E   L S+          +   H  SS    +Q
Sbjct: 1021 PYIME-------STLNVIDLKKDSSTPEDGTLLSEWPSLENIILEN-GCHQSSSIDSSIQ 1072

Query: 2344 EVAKAEFVKP---------MFPKERCQFTVKDSNSGLGVSKSSIIRESSLVPKVGKACEI 2496
            + A  E   P         + PK + +  + +  S  G+ +  +I +SS  P VGK  ++
Sbjct: 1073 KPAGNEISAPKTKRLAAGCLEPKSK-KSXMDNRFSEFGIGRCFVIPQSSQRPLVGKILQV 1131

Query: 2497 LRCLKINLLDMDAALPEASLKASRSHSDRRCAWRTFVKSASTIYEMVQATIILEDTIKAD 2676
            +R LK+NLLDMDAALP+ +LK S+ H +RR AWR FVKSA TIYEMVQATI LED I+ +
Sbjct: 1132 VRGLKMNLLDMDAALPDEALKPSKLHIERRWAWRAFVKSAGTIYEMVQATIALEDMIRTE 1191

Query: 2677 YLRNDWWYWSSPSAAANISTLSALALRIYTLDSAILYEKPFGNDTTERFTPDGKFEKESR 2856
            YL+N+WWYWSS SAAA IST+S+LALRI++LD+AI+YEK   N  +  +        E +
Sbjct: 1192 YLKNEWWYWSSLSAAAKISTVSSLALRIFSLDAAIIYEKISPNQDSNDYLDTTSSIPEQK 1251

Query: 2857 LGSVPTNNLKPSDQLMQKMPDLDSGENLKPRT---RASKRRKDSEG 2985
            LG V                        KPRT   ++ K+RK+ EG
Sbjct: 1252 LGGVDLTE--------------------KPRTSSRKSGKKRKEPEG 1277


>ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
            [Cucumis sativus]
          Length = 2131

 Score =  509 bits (1312), Expect = e-141
 Identities = 359/1067 (33%), Positives = 518/1067 (48%), Gaps = 73/1067 (6%)
 Frame = +1

Query: 4    ETLAQKFEDLYEKEVLSFVQKTM---LPINAGIESEKERDDIFARVNESLIPKASWEEGI 174
            ETL++ FE LYE EVLS ++K        +   E++ E D     +NE  IPKA W+EG+
Sbjct: 1158 ETLSENFERLYENEVLSLIEKLKEFSKLESLSAETKVEVDGFLVSLNE--IPKAPWDEGV 1215

Query: 175  CKVCGMXXXXXXXXXXXXXXSEYHTYCLNPPLIRIPEGNWYCPSC-LXXXXXXXXXXXXX 351
            CKVCG+              +EYHTYCLNPPL RIPEGNWYCPSC +             
Sbjct: 1216 CKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVMGTRMVEDPSEHTK 1275

Query: 352  XXVKRYWKGRSQRKYLHKN-LEMLAQLASAMELNEYWELTVEERVFLMKFLCDEVLNSAM 528
              +    KG+  R  + ++ L  LA LA+A+E  EYWE +V+ER+FL+K+LCDE+L+SA+
Sbjct: 1276 NHIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEYWEFSVDERLFLLKYLCDELLSSAL 1335

Query: 529  LRDHIGQCSIKFGDMLQKLRSLNSERKLLKFKEDNLVANMAKAKVPAHXXXXXXXXXXXA 708
            +R H+ QC     ++ QKLRS   E K LK +E+ + A  AK                  
Sbjct: 1336 IRQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVVAARAAKLDTTM------------- 1382

Query: 709  SVLPDDGKLKSKVPEGSNISPLPGCLGQMEEGEGQQVKGRSGCNSTSITKTPVSEASNSI 888
                                     L  + EG+G     R G +                
Sbjct: 1383 -------------------------LSAVREGQGSCDGARLGASD--------------- 1402

Query: 889  NTLFQQSGRDDIQSRCTEVAGCKNELLGAFVEHKDVQNVDNVGSSVDLSQGGGSSAAMLC 1068
                Q S    ++++C   A  + ++  A     DV + ++ G +V  S G  +S   + 
Sbjct: 1403 ----QYSSLTSLENKCHNHASFQEQMSSAH----DVTDNNDAGGNVLSSSGSQNSGKPVK 1454

Query: 1069 MSRQISSDTAGQLSAVD----------LPSSQCHQCSTQANG----------SSSQECND 1188
             +    S    ++   D          LPS + +     ANG          + SQ  + 
Sbjct: 1455 FNEPSLSGLPQEVDGSDQSNMETEISILPSGKQYFTPCDANGVPVAPQVPPPNESQAYHS 1514

Query: 1189 ELSYLKSEITFLQESIDKLDAELLRLAVRKEYLGRDSDGRIYWIYGRPGAGPWVLVNGCS 1368
            EL  +K +I  +Q+SI   + ELL+++VR+E+LG D+ GR+YW        P ++ +G S
Sbjct: 1515 ELDSIKKDILQVQDSIASTELELLKISVRREFLGSDAAGRLYWASVMSNGLPQIISSGSS 1574

Query: 1369 ------------------NVEEVFEPDKLFLNFTLWTS----------------HCTRAE 1446
                              N       +   LN  +++S                + T A+
Sbjct: 1575 VHIGSESRDRVVKGRFFKNYTSTSNANSSTLNSNMYSSLLHLPKDFIGNSPCISYQTEAD 1634

Query: 1447 IEELVNWLGDGDIRDRELKECILHWQCNKSMDTNDTENDVLIRGKEISRINCSVEKAADS 1626
            I EL++WL D D ++RELKE IL W   K   ++ + N       + S  +  VEK   S
Sbjct: 1635 ILELIDWLKDSDPKERELKESILQWLKPKLQTSSRSNNQSPEEQLKDSSSSSDVEKLECS 1694

Query: 1627 DLLTTKAVRALEKKFGPCPD-VWAIDIRKNLHEGSLTRLGEMCRCECLEMLWPSRFHCVS 1803
              L  +A   LE K+GP  + V   D+ + L +  L    +M RC C+E +WPSR+HC+S
Sbjct: 1695 GFLVNRASALLESKYGPFLEFVTPDDLNRWLDKARLAEDEKMFRCVCMEPVWPSRYHCLS 1754

Query: 1804 CHKSFSTGVELAQHAGGQCKTTSAVCESIQKTEDSMH-KTMLKNEKPGEKCSGSGSAIPG 1980
            CH+SFST VEL +H  GQC +  A C+ I++  DS   K  +K E   E+ S    A   
Sbjct: 1755 CHRSFSTDVELEEHDNGQCSSLPASCDGIKEVGDSSKSKCNIKFESKQEESSSMVIAETS 1814

Query: 1981 SVNEKHDNGSSCFDQPLEPECSFNFQEILSKFKIETSLKELVKEVGLIGSNGVVSFVPSR 2160
                 H  G   +       C ++F+ I SKF  + S K+L+KE+GLI SNGV SF+ S 
Sbjct: 1815 RGYFNHSMGLIKYQND-GMMCPYDFELICSKFLTKDSNKDLIKEIGLISSNGVPSFLSSV 1873

Query: 2161 SPYLDDPSLTLASSTNNVVNIGDVRSFSESQRLQSDXXXXXXXXXXDISAHVQSSRLDKV 2340
            SPY+ +       ST NV+++    S  E   L S+          +   H  SS    +
Sbjct: 1874 SPYIME-------STLNVIDLKKDSSTPEDGTLLSEWPSLENIILEN-GCHQSSSIDSSI 1925

Query: 2341 QEVAKAEFVKP---------MFPKERCQFTVKDSNSGLGVSKSSIIRESSLVPKVGKACE 2493
            Q+ A  E   P         + PK + +  + +  S  G+ +  +I +SS  P VGK  +
Sbjct: 1926 QKPAGNEISAPKTKRLAAGCLEPKSK-KICMDNRFSEFGIGRCFVIPQSSQRPLVGKILQ 1984

Query: 2494 ILRCLKINLLDMDAALPEASLKASRSHSDRRCAWRTFVKSASTIYEMVQATIILEDTIKA 2673
            ++R LK+NLLDMDAALP+ +LK S+ H +RR AWR FVKSA TIYEMVQATI LED I+ 
Sbjct: 1985 VVRGLKMNLLDMDAALPDEALKPSKLHIERRWAWRAFVKSAGTIYEMVQATIALEDMIRT 2044

Query: 2674 DYLRNDWWYWSSPSAAANISTLSALALRIYTLDSAILYEKPFGNDTTERFTPDGKFEKES 2853
            +YL+N+WWYWSS SAAA IST+S+LALRI++LD+AI+YEK   N  +  +        E 
Sbjct: 2045 EYLKNEWWYWSSLSAAAKISTVSSLALRIFSLDAAIIYEKISPNQDSNDYLDTTSSIPEQ 2104

Query: 2854 RLGSVPTNNLKPSDQLMQKMPDLDSGENLKPRT---RASKRRKDSEG 2985
            +LG V                        KPRT   ++ K+RK+ EG
Sbjct: 2105 KLGGVDLTE--------------------KPRTSSRKSGKKRKEPEG 2131


>ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana]
            gi|75337201|sp|Q9SGH2.1|MBD9_ARATH RecName:
            Full=Methyl-CpG-binding domain-containing protein 9;
            Short=AtMBD9; Short=MBD09; AltName: Full=Histone acetyl
            tranferase MBD9; AltName: Full=Methyl-CpG-binding protein
            MBD9 gi|6692266|gb|AAF24616.1|AC010870_9 unknown protein
            [Arabidopsis thaliana] gi|332640148|gb|AEE73669.1|
            methyl-CPG-binding domain 9 [Arabidopsis thaliana]
          Length = 2176

 Score =  502 bits (1293), Expect = e-139
 Identities = 342/1018 (33%), Positives = 524/1018 (51%), Gaps = 25/1018 (2%)
 Frame = +1

Query: 7    TLAQKFEDLYEKEVLSFVQKT-----MLPINAGIESEKERDDIFARVNESLIPKASWEEG 171
            TL++KF+ LYE EV+  VQK      +  ++A  E +KE  DI   VN+  +PKA W+EG
Sbjct: 1233 TLSEKFKSLYEAEVVPLVQKLKDYRKLECLSA--EMKKEIKDIVVSVNK--LPKAPWDEG 1288

Query: 172  ICKVCGMXXXXXXXXXXXXXXSEYHTYCLNPPLIRIPEGNWYCPSCLXXXXXXXXXXXXX 351
            +CKVCG+              +EYHTYCLNPPLIRIP+GNWYCPSC+             
Sbjct: 1289 VCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPDGNWYCPSCVIAKRMAQEALESY 1348

Query: 352  XXVKRYWKGRSQRKYLHKNLEMLAQLASAMELNEYWELTVEERVFLMKFLCDEVLNSAML 531
              V+R    + Q +    ++E+ A LA  ME  +YWE + EER+ L+K LCDE+L+S+++
Sbjct: 1349 KLVRRRKGRKYQGELTRASMELTAHLADVMEEKDYWEFSAEERILLLKLLCDELLSSSLV 1408

Query: 532  RDHIGQCSIKFGDMLQKLRSLNSERKLLKFKEDNLVANMAKAKVPAHXXXXXXXXXXXAS 711
              H+ QC+    +M QKLRSL+SE K  K +++ L A +AK +                S
Sbjct: 1409 HQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLTAKLAKVE---------------PS 1453

Query: 712  VLPDDGKLKSKVPEGSNISPLPGCLGQMEEGEGQQVKGRSGCNSTSITKTPVSEASNSIN 891
            +L + G+  +     S  +   GC  Q +EG G  V      +ST+       ++    +
Sbjct: 1454 ILKEVGEPHN----SSYFADQMGCDPQPQEGVGDGVTRDDETSSTAYLNKNQGKSPLETD 1509

Query: 892  TLFQQS----GRDDIQSRCTEVAGCKNELLGAFVEHKDVQNVDNVGSSVDLSQGGGSSAA 1059
            T   +S    G   I S  T  +  ++EL  A        N+    +S  L +  G +  
Sbjct: 1510 TQPGESHVNFGESKISSPETISSPGRHELPIADTSPLVTDNLPEKDTSETLLKSVGRN-- 1567

Query: 1060 MLCMSRQISSDTAGQLSAVDLPSSQCHQCSTQANGSSSQECNDELSYLKSEITFLQESID 1239
                        +   +AV+LP++  H  S+QA+    Q C  +LS   +EI  LQ+SI 
Sbjct: 1568 --------HETHSPNSNAVELPTA--HDASSQAS-QELQACQQDLSATSNEIQNLQQSIR 1616

Query: 1240 KLDAELLRLAVRKEYLGRDSDGRIYWIYGRPGAGPWVLVNGCSNVEEVFEPDKL------ 1401
             ++++LL+ ++R+++LG D+ GR+YW    P   P +LV+G  ++++  + D +      
Sbjct: 1617 SIESQLLKQSIRRDFLGTDASGRLYWGCCFPDENPRILVDGSISLQKPVQADLIGSKVPS 1676

Query: 1402 ---------FLNFTLWTSHCTRAEIEELVNWLGDGDIRDRELKECILHWQCNKSMDTNDT 1554
                      L  + WT + T  EI ELV WL D D+++R+L+E IL W   K +   D 
Sbjct: 1677 PFLHTVDHGRLRLSPWTYYETETEISELVQWLHDDDLKERDLRESILWW---KRLRYGDV 1733

Query: 1555 ENDVLIRGKEISRINCSVEKAADSDLLTTKAVRALEKKFGPCPDVWAIDIRKNLHEGSLT 1734
            + +     K+   ++  V        L TKA  ++EK++GPC  +    ++K   +  + 
Sbjct: 1734 QKE----KKQAQNLSAPVFATG----LETKAAMSMEKRYGPCIKLEMETLKKRGKKTKVA 1785

Query: 1735 RLGEMCRCECLEMLWPSRFHCVSCHKSFSTGVELAQHAGGQCKTTSAVCESIQKTEDSMH 1914
               ++CRCECLE + PS  HC+ CHK+F++  E   H   +C   S   E  +   DS  
Sbjct: 1786 EREKLCRCECLESILPSMIHCLICHKTFASDDEFEDHTESKCIPYSLATEEGKDISDSSK 1845

Query: 1915 -KTMLKNEKPGEKCSGSGSAIPGSVNEKHDNGSSCFDQPLEPECSFNFQEILSKFKIETS 2091
             K  LK++    K S        S   + D+G   + Q  E    ++F+EI SKF  +  
Sbjct: 1846 AKESLKSDYLNVKSSAGKDVAEISNVSELDSGLIRY-QEEESISPYHFEEICSKFVTKDC 1904

Query: 2092 LKELVKEVGLIGSNGVVSFVPSRSPYLDDPSLTLASSTNNVVNIGDVRSFSESQRLQSDX 2271
             ++LVKE+GLI SNG+ +F+PS S +L+D    L S+ +N  + GD          +++ 
Sbjct: 1905 NRDLVKEIGLISSNGIPTFLPSSSTHLNDS--VLISAKSNKPDGGDSGDQVIFAGPETNV 1962

Query: 2272 XXXXXXXXXDISAHVQSSRLDKVQEVAKAEFVKPMFPKERCQFTVKDSNSGLGVSKSSII 2451
                          V  S    + + +   F    F +++        +SG G+    ++
Sbjct: 1963 EGLNSESNMSFDRSVTDSHGGPLDKPSGLGF---GFSEQK-----NKKSSGSGLKSCCVV 2014

Query: 2452 RESSLVPKVGKACEILRCLKINLLDMDAALPEASLKASRSHSDRRCAWRTFVKSASTIYE 2631
             +++L    GKA    R LK NLLDMD ALPE +L+ S+SH +RR AWR FVKS+ +IYE
Sbjct: 2015 PQAALKRVTGKALPGFRFLKTNLLDMDVALPEEALRPSKSHPNRRRAWRVFVKSSQSIYE 2074

Query: 2632 MVQATIILEDTIKADYLRNDWWYWSSPSAAANISTLSALALRIYTLDSAILYEKPFGNDT 2811
            +VQATI++ED IK +YL+N+WWYWSS SAAA ISTLSAL++RI++LD+AI+Y+KP     
Sbjct: 2075 LVQATIVVEDMIKTEYLKNEWWYWSSLSAAAKISTLSALSVRIFSLDAAIIYDKP----- 2129

Query: 2812 TERFTPDGKFEKESRLGSVPTNNLKPSDQLMQKMPDLDSGENLKPRTRASKRRKDSEG 2985
                TP    ++   + S+P       DQ  Q  P  DS E      R+ K+RK+ EG
Sbjct: 2130 ---ITPSNPIDETKPIISLP-------DQKSQ--PVSDSQERSSRVRRSGKKRKEPEG 2175


>ref|XP_002517349.1| DNA binding protein, putative [Ricinus communis]
            gi|223543360|gb|EEF44891.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 2145

 Score =  489 bits (1259), Expect = e-135
 Identities = 358/1052 (34%), Positives = 516/1052 (49%), Gaps = 58/1052 (5%)
 Frame = +1

Query: 4    ETLAQKFEDLYEKEVLSFVQKT---MLPINAGIESEKERDDIFARVNESLIPKASWEEGI 174
            ETLAQ FE LYEKEV++ VQK            E++K+ D + A  NE  IPKA W+EG+
Sbjct: 1165 ETLAQNFESLYEKEVVTLVQKFEEFAKLDRLSAETKKDLDIVLASTNE--IPKAPWDEGV 1222

Query: 175  CKVCGMXXXXXXXXXXXXXXSEYHTYCLNPPLIRIPEGNWYCPSC----LXXXXXXXXXX 342
            CKVCG               +EYHTYCLNPPL RIPEGNWYCPSC    +          
Sbjct: 1223 CKVCGFDKDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVSVRMVQEASVSTQV 1282

Query: 343  XXXXXVKRYWKGRSQRKYLHKNLEMLAQLASAMELNEYWELTVEERVFLMKFLCDEVLNS 522
                  K+Y +G   R YL    E L  LASAME  +YW+  V+ER FL+KFLCDE+LNS
Sbjct: 1283 IGQNSCKKY-QGEMTRIYL----ETLVHLASAMEEKDYWDFGVDERTFLLKFLCDELLNS 1337

Query: 523  AMLRDHIGQCSIKFGDMLQKLRSLNSERKLLKFKEDNLVANMAKAKVPAHXXXXXXXXXX 702
            A++R H+ QC     ++ QKLR+L +E K LK KE+ +    AK    A           
Sbjct: 1338 ALVRQHLEQCMESTAEVQQKLRTLYAEWKNLKSKEEFMALKSAKMGTGASGEVKEGL--- 1394

Query: 703  XASVLPDDGKLKSKVPEGSNISPLPGCLGQMEEGEGQQVKGRSGCNSTSITKTPVSEASN 882
              S L D GK   + P   +  P   C                   S  ++    S   N
Sbjct: 1395 -VSALKDQGKSVGQPPVLGD-KPSDCC-----------------APSDDVSAVDGSPEGN 1435

Query: 883  SINTLFQQSGRDDIQSRCTEVAGCKNELLGAFVEHKDVQNV-DNVGSSVDLSQGGGSSAA 1059
             IN   +     + + + +      ++ + +   H  V+++ D +  S D S+       
Sbjct: 1436 GINGFDKHPSEINYEKKPSH----DSQNIDSTNNHGPVKDMHDAMEGSNDPSKENSKPLG 1491

Query: 1060 MLCMSRQISSDTAGQLSAVDLPSSQCHQCSTQANGSSSQECNDELSYLKSEITFLQESID 1239
                   +SSD    L  ++LPS   ++         SQ  + ++S +K +I  LQ  I 
Sbjct: 1492 PNHPGFSLSSDM-NALVVLNLPSVTMNE---------SQAYHTDVSAIKDDILRLQNLIS 1541

Query: 1240 KLDAELLRLAVRKEYLGRDSDGRIYWIYGRPGAGPWVLVN-------------------- 1359
             ++++L + ++R+E+LG DS G +YW    P   P ++V+                    
Sbjct: 1542 SMESQLSKQSLRREFLGSDSRGHLYWASATPNGHPQIVVDRSLTFQHRKISHHRLGNSSV 1601

Query: 1360 ----------GCSNVEE-------VFEPDKLFLNFTLWTSHCTRAEIEELVNWLGDGDIR 1488
                       C N+E        +F P+      + W S+ T AEIEEL+ WLG+ + +
Sbjct: 1602 LQHSSSSGIDACLNLEGSRACFPFLFNPNGTLSMSSAWVSYETDAEIEELIGWLGNNNQK 1661

Query: 1489 DRELKECILHW---QCNKSMDTNDTENDVLIRGKEISRINCSVEKAADSDLLTTKAVRAL 1659
            + ELKE I+ W   +  +S    D   +    G    R N   ++ A S+ LT KA   L
Sbjct: 1662 EIELKESIMQWLKLRFQESQRIRDPVQEECRAGLSTIRNN---DQTAFSNCLT-KATLLL 1717

Query: 1660 EKKFGPCPDVWAID-IRKNLHEGSLTRLGEMCRCECLEMLWPSRFHCVSCHKSFSTGVEL 1836
            EK +G   ++   D ++K   +   T   +  RC+CLE++WPSR HC SCH++ S  VE 
Sbjct: 1718 EKNYGAFVELDTSDMLKKRGKKARGTNEEKTYRCDCLELIWPSRNHCYSCHRTSSNDVEF 1777

Query: 1837 AQHAGGQCKTTSAVCESIQKTEDSMH-KTMLKNEKPGEKCSGSGSAIPGSVNEKHDNGSS 2013
              H+ G+C +     E  ++T DS+  +  +K E   ++       +  S+    +  + 
Sbjct: 1778 EGHSDGRCSSVPQSREKSEETNDSLKGRGNVKAEVTWKEKKSEIDKLHSSMGGLSELRAR 1837

Query: 2014 CFDQPLEP-ECSFNFQEILSKFKIETSLKELVKEVGLIGSNGVVSFVPSRSPYLDDPSLT 2190
                  E   C ++  +I SKF  E S KELV+++GLIGSNG+  FV S SPYL D    
Sbjct: 1838 LIKFQNEGINCPYDLLDICSKFVTEDSNKELVQDIGLIGSNGIPPFVTSISPYLSDSISV 1897

Query: 2191 LASSTNNVVNIGDVRSFSESQRLQSDXXXXXXXXXXDISAHVQSSRLDKVQEVAKAEFVK 2370
            L S  NN    GD  +  E Q                 S +  S+R   + E+   E +K
Sbjct: 1898 LISPENNTRIPGDECNVDERQVFPQGNWNENRAVLQSSSDN--STRKTSINEIG--EVLK 1953

Query: 2371 PMFPKERC-QFTVKDSNSG-----LGVSKSSIIRESSLVPKVGKACEILRCLKINLLDMD 2532
               P   C Q   K S+ G     +G     ++ ESSL+P VGK   ILR LKINLLDM+
Sbjct: 1954 TNKPPLGCLQRRGKKSSLGKCFPEMGPGCCCVVPESSLMPLVGKVSSILRQLKINLLDME 2013

Query: 2533 AALPEASLKASRSHSDRRCAWRTFVKSASTIYEMVQATIILEDTIKADYLRNDWWYWSSP 2712
            AALPE +L+ ++    RR AWR +VKSA +IY+MV+ATI+LE+ IK +YLRN+WWYWSS 
Sbjct: 2014 AALPEEALRPAKGQLGRRWAWRAYVKSAESIYQMVRATIMLEEMIKTEYLRNEWWYWSSL 2073

Query: 2713 SAAANISTLSALALRIYTLDSAILYEKPFGNDTTERFTPDGKFEKESRLGSVPTNNLKPS 2892
            SAAA  ST+++LALRIY+LD+ I+YEK   +D                    P+ NLK S
Sbjct: 2074 SAAAKTSTVASLALRIYSLDACIVYEKNSNSD--------------------PSVNLKLS 2113

Query: 2893 DQLMQK-MPDLDSGENLKPRTRASKRRKDSEG 2985
              + QK + D+D  E  +   +++K+RK+ EG
Sbjct: 2114 SLVNQKPVNDMDLVEKCRVTRKSNKKRKEPEG 2145

