BLASTX nr result
ID: Angelica22_contig00002214
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00002214 (4838 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002517349.1| DNA binding protein, putative [Ricinus commu... 764 0.0 ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-b... 722 0.0 ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-contain... 722 0.0 ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thalia... 663 0.0 ref|XP_002525350.1| DNA binding protein, putative [Ricinus commu... 662 0.0 >ref|XP_002517349.1| DNA binding protein, putative [Ricinus communis] gi|223543360|gb|EEF44891.1| DNA binding protein, putative [Ricinus communis] Length = 2145 Score = 764 bits (1972), Expect = 0.0 Identities = 454/1080 (42%), Positives = 620/1080 (57%), Gaps = 99/1080 (9%) Frame = -1 Query: 3542 QKAVITVLEDLQSAGLQQKPGKEQKKKRYISISDIIMQRCLNVLRHLAADDDSKNLNKLL 3363 +KAV++VL D+ GL QK K+K + +SDIIM++C +LRH AA DD+K LL Sbjct: 1033 KKAVLSVLADVLGEGLSQKSNNRNKRKITVPVSDIIMKQCRIILRHAAAADDAKVFCTLL 1092 Query: 3362 GRKFMNTSCGEDDGLRSSPAIVSCPLDFRTIDLKLAAGAYGGSHEAFLEDVREFWANAHI 3183 GR +N+ +D+GL PA+VS PLDFRTIDL+LA GAYGGSHE+FLEDVRE W N Sbjct: 1093 GRNLINSCDHDDEGLLGYPAMVSRPLDFRTIDLRLAVGAYGGSHESFLEDVRELWNNVRT 1152 Query: 3182 ASKDKPDQIRLVEMLSQTFDSMYQTEVLTQFQKLEKCRQSDCVSVEALNEIDEMLGIPRE 3003 A +D+PD I LVE L+Q F+S+Y+ EV+T QK E+ + D +S E ++D +L E Sbjct: 1153 AFRDQPDVIELVETLAQNFESLYEKEVVTLVQKFEEFAKLDRLSAETKKDLDIVLASTNE 1212 Query: 3002 ILKAPWEEGVCKVCEIDRDDDNVLLCDTCDAQYHIYCLDPPLAVIPEGDWHCPSCLAGTS 2823 I KAPW+EGVCKVC D+DDD+VLLCDTCDA+YH YCL+PPLA IPEG+W+CPSC++ Sbjct: 1213 IPKAPWDEGVCKVCGFDKDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVSVRM 1272 Query: 2822 ITRRAPDTW-SNLLHCRKYQGEFTRAHLDFLAHLASDMEEREYWELGVCQRSHLLTFLCD 2646 + + T C+KYQGE TR +L+ L HLAS MEE++YW+ GV +R+ LL FLCD Sbjct: 1273 VQEASVSTQVIGQNSCKKYQGEMTRIYLETLVHLASAMEEKDYWDFGVDERTFLLKFLCD 1332 Query: 2645 ELLSSALVRQHIDQCTDRSTKLMQDLRSTFTELKTLKYKEASLEAKVAESDRSFT----- 2481 ELL+SALVRQH++QC + + ++ Q LR+ + E K LK KE + K A+ + Sbjct: 1333 ELLNSALVRQHLEQCMESTAEVQQKLRTLYAEWKNLKSKEEFMALKSAKMGTGASGEVKE 1392 Query: 2480 -VVGTDMQQGR---------DKYI--------TRHYSSQSKSDFINNFSGDLFQ------ 2373 +V QG+ DK + + IN F + Sbjct: 1393 GLVSALKDQGKSVGQPPVLGDKPSDCCAPSDDVSAVDGSPEGNGINGFDKHPSEINYEKK 1452 Query: 2372 -----SNRDQTNAACYTPDL-----ASSEDLQDHSSSLATGNANVSSS----------YP 2253 N D TN D+ S++ +++S L + S S P Sbjct: 1453 PSHDSQNIDSTNNHGPVKDMHDAMEGSNDPSKENSKPLGPNHPGFSLSSDMNALVVLNLP 1512 Query: 2252 IVFLNNSKSESSELTSVRTQILQLNNLIASIYSQFLLVSMRREFLGSDSSGRLYWVLADS 2073 V +N S++ +++++++ IL+L NLI+S+ SQ S+RREFLGSDS G LYW A Sbjct: 1513 SVTMNESQAYHTDVSAIKDDILRLQNLISSMESQLSKQSLRREFLGSDSRGHLYWASATP 1572 Query: 2072 GTQPSLISSRNGEMQQ---------NGHIISTSFLGAVSPPFSCELGDNKGC-------- 1944 P ++ R+ Q N ++ S + + E ++ C Sbjct: 1573 NGHPQIVVDRSLTFQHRKISHHRLGNSSVLQHSSSSGIDACLNLE--GSRACFPFLFNPN 1630 Query: 1943 ------PKWVSYNTDAEIKKLLHHLQDNNPRERELKESILQWESLVTRGNKLSESQHLEG 1782 WVSY TDAEI++L+ L +NN +E ELKESI+QW L + ESQ + Sbjct: 1631 GTLSMSSAWVSYETDAEIEELIGWLGNNNQKEIELKESIMQWLKL-----RFQESQRIRD 1685 Query: 1781 P---QMTTSLGNCKN--VTSSTSLCTMATSLLENKYGPCVALESSELSNIRIKKAKGIVQ 1617 P + L +N T+ ++ T AT LLE YG V L++S++ R KKA+G + Sbjct: 1686 PVQEECRAGLSTIRNNDQTAFSNCLTKATLLLEKNYGAFVELDTSDMLKKRGKKARGTNE 1745 Query: 1616 QNMHRCDCLEPIWPSRLHCNLCHRTFLTDVELDGHNDGKCSKALPASDKLECLQIS-EGK 1440 + +RCDCLE IWPSR HC CHRT DVE +GH+DG+CS + +K E S +G+ Sbjct: 1746 EKTYRCDCLELIWPSRNHCYSCHRTSSNDVEFEGHSDGRCSSVPQSREKSEETNDSLKGR 1805 Query: 1439 -SMSSGLEQGGYRHETHXXXXXXXXXXXXXXXSIIHQNDKSVCPFDYEDICSKFVINDSN 1263 ++ + + + E I QN+ CP+D DICSKFV DSN Sbjct: 1806 GNVKAEVTWKEKKSEIDKLHSSMGGLSELRARLIKFQNEGINCPYDLLDICSKFVTEDSN 1865 Query: 1262 KELVKDIGLISSNGIPMFVPSISPHISDSVAML-PPKETVVVEADKLSNDVQHTSLQD-- 1092 KELV+DIGLI SNGIP FV SISP++SDS+++L P+ + D+ + D + Q Sbjct: 1866 KELVQDIGLIGSNGIPPFVTSISPYLSDSISVLISPENNTRIPGDECNVDERQVFPQGNW 1925 Query: 1091 ----------------KTTXXXXXXXXXXXXPLQCAVKHQDQKLSLHCSAPLCKVWNCCI 960 KT+ P ++ + +K SL P CC+ Sbjct: 1926 NENRAVLQSSSDNSTRKTSINEIGEVLKTNKPPLGCLQRRGKKSSLGKCFPEMGPGCCCV 1985 Query: 959 VPEPSLRPLRGRAGYILRQLKINLLDMDAALPEEAIRPSRSKLERRWAWRSFVKSAERIY 780 VPE SL PL G+ ILRQLKINLLDM+AALPEEA+RP++ +L RRWAWR++VKSAE IY Sbjct: 1986 VPESSLMPLVGKVSSILRQLKINLLDMEAALPEEALRPAKGQLGRRWAWRAYVKSAESIY 2045 Query: 779 EMVQATVALEDMIKSEYLRSTWWYWSSLTMAAKISTLSSLALRIYSLDAAIYYNKTSCSD 600 +MV+AT+ LE+MIK+EYLR+ WWYWSSL+ AAK ST++SLALRIYSLDA I Y K S SD Sbjct: 2046 QMVRATIMLEEMIKTEYLRNEWWYWSSLSAAAKTSTVASLALRIYSLDACIVYEKNSNSD 2105 >ref|XP_004167238.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain-containing protein 9-like [Cucumis sativus] Length = 1277 Score = 722 bits (1863), Expect = 0.0 Identities = 432/1071 (40%), Positives = 620/1071 (57%), Gaps = 93/1071 (8%) Frame = -1 Query: 3542 QKAVITVLEDLQSAGLQQKPGKEQKKKRYISISDIIMQRCLNVLRHLAADDDSKNLNKLL 3363 +KAV+++L D+ L K K +K+ ISISDI+M++C VLR AA DD+K LL Sbjct: 173 KKAVLSILADICGDSLPPKVEKRRKRITTISISDIVMKQCRTVLRRAAAADDAKVFCNLL 232 Query: 3362 GRKFMNTSCGEDDGLRSSPAIVSCPLDFRTIDLKLAAGAYGGSHEAFLEDVREFWANAHI 3183 GRK M +S +D+GL P +VS PLDFRTIDL+LA+G+Y GSHEAFLEDV+E W N Sbjct: 233 GRKLMASSDNDDEGLLGPPGMVSRPLDFRTIDLRLASGSYDGSHEAFLEDVQELWNNLRY 292 Query: 3182 ASKDKPDQIRLVEMLSQTFDSMYQTEVLTQFQKLEKCRQSDCVSVEALNEIDEMLGIPRE 3003 A D+PD + LVE LS+ F+ +Y+ EVL+ +KL++ + + +S E E+D L E Sbjct: 293 AYGDQPDLVELVETLSENFERLYENEVLSLIEKLKEFSKLESLSAETKVEVDGFLVSLNE 352 Query: 3002 ILKAPWEEGVCKVCEIDRDDDNVLLCDTCDAQYHIYCLDPPLAVIPEGDWHCPSCLAGTS 2823 I KAPW+EGVCKVC ID+DDD+VLLCDTCDA+YH YCL+PPLA IPEG+W+CPSC+ GT Sbjct: 353 IPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVMGTR 412 Query: 2822 ITRRAPDTWSNLLHC---RKYQGEFTRAHLDFLAHLASDMEEREYWELGVCQRSHLLTFL 2652 + + ++++ +K++GE TR L+ LA+LA+ +EE+EYWE V +R LL +L Sbjct: 413 MVEDPSEHTKHIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEYWEFSVDERLFLLKYL 472 Query: 2651 CDELLSSALVRQHIDQCTDRSTKLMQDLRSTFTELKTLKYKEASLEAKVAESDRSFTVVG 2472 CDELLSSAL+RQH++QC + +L Q LRS F E K LK +E + A+ A+ D T + Sbjct: 473 CDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVVAARAAKLD---TTML 529 Query: 2471 TDMQQGR-----------DKYIT---------RHYSSQSKSDFI------NNFSGDLFQS 2370 + +++G+ D+Y + H S Q + N+ G++ S Sbjct: 530 SAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHDVTDNNDAGGNVLSS 589 Query: 2369 NRDQTNAACYT------------PDLASSEDLQDHSSSLATG-------NAN-VSSSYPI 2250 + Q + D + +++ S L +G +AN V + + Sbjct: 590 SGSQNSGKPVKFNEPSLSGLPQEVDGSDQSNMETEISILPSGKQYFTPCDANGVPVAPQV 649 Query: 2249 VFLNNSKSESSELTSVRTQILQLNNLIASIYSQFLLVSMRREFLGSDSSGRLYWVLADSG 2070 N S++ SEL S++ ILQ+ + IAS + L +S+RREFLGSD++GRLYW S Sbjct: 650 PPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISVRREFLGSDAAGRLYWASVMSN 709 Query: 2069 TQPSLISSRNGE---MQQNGHIISTSFLG---AVSPPFSCELGDNK------------GC 1944 P +ISS + + ++ F + S S L N G Sbjct: 710 GLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANSSTLNSNMYSSLLHLPKDFIGN 769 Query: 1943 PKWVSYNTDAEIKKLLHHLQDNNPRERELKESILQWES----LVTRGNKLSESQHLEGPQ 1776 +SY T+A+I +L+ L+D++P+ERELKESILQW +R N S + L+ Sbjct: 770 SPCISYQTEADILELIDWLKDSDPKERELKESILQWLKPKLQTSSRSNNQSPEEQLKD-- 827 Query: 1775 MTTSLGNCKNVTSSTSLCTMATSLLENKYGPCVALESSELSNIRIKKAKGIVQQNMHRCD 1596 ++S + + + S L A++LLE+KYGP + + + N + KA+ + M RC Sbjct: 828 -SSSSSDVEKLECSGFLVNRASALLESKYGPFLEFVTPDDLNRWLDKARLAEDEKMFRCV 886 Query: 1595 CLEPIWPSRLHCNLCHRTFLTDVELDGHNDGKCSKALPAS-DKLECLQISEGKSMSSGLE 1419 C+EP+WPSR HC CH++F TDVEL+ H++G+CS +LPAS D ++ + S + E Sbjct: 887 CMEPVWPSRYHCLSCHKSFSTDVELEEHDNGQCS-SLPASCDGIKEVGDSSKSKCNIKFE 945 Query: 1418 QGGYRHETHXXXXXXXXXXXXXXXSIIHQNDKSVCPFDYEDICSKFVINDSNKELVKDIG 1239 + I +QND +CP+D+E ICSKF+ DSNK+L+K+IG Sbjct: 946 SKQEESSSMVIAETSRGYFNHSMGLIKYQNDGMMCPYDFELICSKFLTKDSNKDLIKEIG 1005 Query: 1238 LISSNGIPMFVPSISPHISDSV---------AMLPPKETVVVEADKLSNDVQHTSLQDKT 1086 LISSNG+P F+ S+SP+I +S + P T++ E L N + + Sbjct: 1006 LISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLLSEWPSLENIILENGCHQSS 1065 Query: 1085 TXXXXXXXXXXXXPLQCAVKHQDQKLSLHCSAPLCK------------VWNCCIVPEPSL 942 + K +L+ C P K + C ++P+ S Sbjct: 1066 SIDSSIQKPAGNEISAPKTK----RLAAGCLEPKSKKSXMDNRFSEFGIGRCFVIPQSSQ 1121 Query: 941 RPLRGRAGYILRQLKINLLDMDAALPEEAIRPSRSKLERRWAWRSFVKSAERIYEMVQAT 762 RPL G+ ++R LK+NLLDMDAALP+EA++PS+ +ERRWAWR+FVKSA IYEMVQAT Sbjct: 1122 RPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERRWAWRAFVKSAGTIYEMVQAT 1181 Query: 761 VALEDMIKSEYLRSTWWYWSSLTMAAKISTLSSLALRIYSLDAAIYYNKTS 609 +ALEDMI++EYL++ WWYWSSL+ AAKIST+SSLALRI+SLDAAI Y K S Sbjct: 1182 IALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFSLDAAIIYEKIS 1232 >ref|XP_004141185.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Cucumis sativus] Length = 2131 Score = 722 bits (1863), Expect = 0.0 Identities = 435/1072 (40%), Positives = 618/1072 (57%), Gaps = 94/1072 (8%) Frame = -1 Query: 3542 QKAVITVLEDLQSAGLQQKPGKEQKKKRYISISDIIMQRCLNVLRHLAADDDSKNLNKLL 3363 +KAV+++L D+ L K K +K+ ISISDI+M++C VLR AA DD+K LL Sbjct: 1026 KKAVLSILADICGDSLPPKVEKRRKRITTISISDIVMKQCRTVLRRAAAADDAKVFCNLL 1085 Query: 3362 GRKFMNTSCGEDDGLRSSPAIVSCPLDFRTIDLKLAAGAYGGSHEAFLEDVREFWANAHI 3183 GRK M +S +D+GL P +VS PLDFRTIDL+LA+G+Y GSHEAFLEDV+E W N Sbjct: 1086 GRKLMASSDNDDEGLLGPPGMVSRPLDFRTIDLRLASGSYDGSHEAFLEDVQELWNNLRY 1145 Query: 3182 ASKDKPDQIRLVEMLSQTFDSMYQTEVLTQFQKLEKCRQSDCVSVEALNEIDEMLGIPRE 3003 A D+PD + LVE LS+ F+ +Y+ EVL+ +KL++ + + +S E E+D L E Sbjct: 1146 AYGDQPDLVELVETLSENFERLYENEVLSLIEKLKEFSKLESLSAETKVEVDGFLVSLNE 1205 Query: 3002 ILKAPWEEGVCKVCEIDRDDDNVLLCDTCDAQYHIYCLDPPLAVIPEGDWHCPSCLAGTS 2823 I KAPW+EGVCKVC ID+DDD+VLLCDTCDA+YH YCL+PPLA IPEG+W+CPSC+ GT Sbjct: 1206 IPKAPWDEGVCKVCGIDKDDDSVLLCDTCDAEYHTYCLNPPLARIPEGNWYCPSCVMGTR 1265 Query: 2822 ITRRAPDTWSN----LLHCRKYQGEFTRAHLDFLAHLASDMEEREYWELGVCQRSHLLTF 2655 + + N L +K++GE TR L+ LA+LA+ +EE+EYWE V +R LL + Sbjct: 1266 MVEDPSEHTKNHIINLHKGKKFRGEVTRDFLNKLANLAAALEEKEYWEFSVDERLFLLKY 1325 Query: 2654 LCDELLSSALVRQHIDQCTDRSTKLMQDLRSTFTELKTLKYKEASLEAKVAESDRSFTVV 2475 LCDELLSSAL+RQH++QC + +L Q LRS F E K LK +E + A+ A+ D T + Sbjct: 1326 LCDELLSSALIRQHLEQCVEALAELQQKLRSCFIEWKNLKCREEVVAARAAKLD---TTM 1382 Query: 2474 GTDMQQGR-----------DKYIT---------RHYSSQSKSDFI------NNFSGDLFQ 2373 + +++G+ D+Y + H S Q + N+ G++ Sbjct: 1383 LSAVREGQGSCDGARLGASDQYSSLTSLENKCHNHASFQEQMSSAHDVTDNNDAGGNVLS 1442 Query: 2372 SNRDQTNAACYT------------PDLASSEDLQDHSSSLATG-------NAN-VSSSYP 2253 S+ Q + D + +++ S L +G +AN V + Sbjct: 1443 SSGSQNSGKPVKFNEPSLSGLPQEVDGSDQSNMETEISILPSGKQYFTPCDANGVPVAPQ 1502 Query: 2252 IVFLNNSKSESSELTSVRTQILQLNNLIASIYSQFLLVSMRREFLGSDSSGRLYWVLADS 2073 + N S++ SEL S++ ILQ+ + IAS + L +S+RREFLGSD++GRLYW S Sbjct: 1503 VPPPNESQAYHSELDSIKKDILQVQDSIASTELELLKISVRREFLGSDAAGRLYWASVMS 1562 Query: 2072 GTQPSLISSRNGE---MQQNGHIISTSFLG---AVSPPFSCELGDNK------------G 1947 P +ISS + + ++ F + S S L N G Sbjct: 1563 NGLPQIISSGSSVHIGSESRDRVVKGRFFKNYTSTSNANSSTLNSNMYSSLLHLPKDFIG 1622 Query: 1946 CPKWVSYNTDAEIKKLLHHLQDNNPRERELKESILQWES----LVTRGNKLSESQHLEGP 1779 +SY T+A+I +L+ L+D++P+ERELKESILQW +R N S + L+ Sbjct: 1623 NSPCISYQTEADILELIDWLKDSDPKERELKESILQWLKPKLQTSSRSNNQSPEEQLKD- 1681 Query: 1778 QMTTSLGNCKNVTSSTSLCTMATSLLENKYGPCVALESSELSNIRIKKAKGIVQQNMHRC 1599 ++S + + + S L A++LLE+KYGP + + + N + KA+ + M RC Sbjct: 1682 --SSSSSDVEKLECSGFLVNRASALLESKYGPFLEFVTPDDLNRWLDKARLAEDEKMFRC 1739 Query: 1598 DCLEPIWPSRLHCNLCHRTFLTDVELDGHNDGKCSKALPAS-DKLECLQISEGKSMSSGL 1422 C+EP+WPSR HC CHR+F TDVEL+ H++G+CS +LPAS D ++ + S + Sbjct: 1740 VCMEPVWPSRYHCLSCHRSFSTDVELEEHDNGQCS-SLPASCDGIKEVGDSSKSKCNIKF 1798 Query: 1421 EQGGYRHETHXXXXXXXXXXXXXXXSIIHQNDKSVCPFDYEDICSKFVINDSNKELVKDI 1242 E + I +QND +CP+D+E ICSKF+ DSNK+L+K+I Sbjct: 1799 ESKQEESSSMVIAETSRGYFNHSMGLIKYQNDGMMCPYDFELICSKFLTKDSNKDLIKEI 1858 Query: 1241 GLISSNGIPMFVPSISPHISDSV---------AMLPPKETVVVEADKLSNDVQHTSLQDK 1089 GLISSNG+P F+ S+SP+I +S + P T++ E L N + Sbjct: 1859 GLISSNGVPSFLSSVSPYIMESTLNVIDLKKDSSTPEDGTLLSEWPSLENIILENGCHQS 1918 Query: 1088 TTXXXXXXXXXXXXPLQCAVKHQDQKLSLHCSAPLCK------------VWNCCIVPEPS 945 ++ K +L+ C P K + C ++P+ S Sbjct: 1919 SSIDSSIQKPAGNEISAPKTK----RLAAGCLEPKSKKICMDNRFSEFGIGRCFVIPQSS 1974 Query: 944 LRPLRGRAGYILRQLKINLLDMDAALPEEAIRPSRSKLERRWAWRSFVKSAERIYEMVQA 765 RPL G+ ++R LK+NLLDMDAALP+EA++PS+ +ERRWAWR+FVKSA IYEMVQA Sbjct: 1975 QRPLVGKILQVVRGLKMNLLDMDAALPDEALKPSKLHIERRWAWRAFVKSAGTIYEMVQA 2034 Query: 764 TVALEDMIKSEYLRSTWWYWSSLTMAAKISTLSSLALRIYSLDAAIYYNKTS 609 T+ALEDMI++EYL++ WWYWSSL+ AAKIST+SSLALRI+SLDAAI Y K S Sbjct: 2035 TIALEDMIRTEYLKNEWWYWSSLSAAAKISTVSSLALRIFSLDAAIIYEKIS 2086 >ref|NP_186795.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana] gi|75337201|sp|Q9SGH2.1|MBD9_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 9; Short=AtMBD9; Short=MBD09; AltName: Full=Histone acetyl tranferase MBD9; AltName: Full=Methyl-CpG-binding protein MBD9 gi|6692266|gb|AAF24616.1|AC010870_9 unknown protein [Arabidopsis thaliana] gi|332640148|gb|AEE73669.1| methyl-CPG-binding domain 9 [Arabidopsis thaliana] Length = 2176 Score = 663 bits (1711), Expect = 0.0 Identities = 404/1054 (38%), Positives = 593/1054 (56%), Gaps = 78/1054 (7%) Frame = -1 Query: 3542 QKAVITVLEDLQSAGLQQKPGKEQKKKRYISISDIIMQRCLNVLRHLAADDDSKNLNKLL 3363 +KAV+++L D++ L Q+ K KK+ YIS+SD+IM++C VLR +AA D+ K L LL Sbjct: 1100 KKAVLSLLADIRGGDLVQRSIKGTKKRTYISVSDVIMKKCRAVLRGVAAADEDKVLCTLL 1159 Query: 3362 GRKFMNTSCGEDDGLRSSPAIVSCPLDFRTIDLKLAAGAYGGSHEAFLEDVREFWANAHI 3183 GRK +N+S +DDGL SPA+VS PLDFRTIDL+LAAGAY GS EAFLEDV E W++ + Sbjct: 1160 GRKLLNSSDNDDDGLLGSPAMVSRPLDFRTIDLRLAAGAYDGSTEAFLEDVLELWSSIRV 1219 Query: 3182 ASKDKPDQIRLVEMLSQTFDSMYQTEVLTQFQKLEKCRQSDCVSVEALNEIDEMLGIPRE 3003 D+PD + LV LS+ F S+Y+ EV+ QKL+ R+ +C+S E EI +++ + Sbjct: 1220 MYADQPDCVDLVATLSEKFKSLYEAEVVPLVQKLKDYRKLECLSAEMKKEIKDIVVSVNK 1279 Query: 3002 ILKAPWEEGVCKVCEIDRDDDNVLLCDTCDAQYHIYCLDPPLAVIPEGDWHCPSCLAGTS 2823 + KAPW+EGVCKVC +D+DDD+VLLCDTCDA+YH YCL+PPL IP+G+W+CPSC+ Sbjct: 1280 LPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYCLNPPLIRIPDGNWYCPSCVIAKR 1339 Query: 2822 ITRRAPDTWSNLLH--CRKYQGEFTRAHLDFLAHLASDMEEREYWELGVCQRSHLLTFLC 2649 + + A +++ + RKYQGE TRA ++ AHLA MEE++YWE +R LL LC Sbjct: 1340 MAQEALESYKLVRRRKGRKYQGELTRASMELTAHLADVMEEKDYWEFSAEERILLLKLLC 1399 Query: 2648 DELLSSALVRQHIDQCTDRSTKLMQDLRSTFTELKTLKYKEASLEAKVAESDRSF----- 2484 DELLSS+LV QH++QC + ++ Q LRS +E K K ++ L AK+A+ + S Sbjct: 1400 DELLSSSLVHQHLEQCAEAIIEMQQKLRSLSSEWKNAKMRQEFLTAKLAKVEPSILKEVG 1459 Query: 2483 ---------TVVGTD--MQQGRDKYITRHYSSQSKSDFINNFSG---------------- 2385 +G D Q+G +TR S + ++N G Sbjct: 1460 EPHNSSYFADQMGCDPQPQEGVGDGVTRD-DETSSTAYLNKNQGKSPLETDTQPGESHVN 1518 Query: 2384 ----------DLFQSNRDQTNAACYTPDLASSEDLQDHSSSLATG-------NANVSSSY 2256 + R + A +P + + +D S +L ++ S++ Sbjct: 1519 FGESKISSPETISSPGRHELPIADTSPLVTDNLPEKDTSETLLKSVGRNHETHSPNSNAV 1578 Query: 2255 PIVFLNNSKSESS--------ELTSVRTQILQLNNLIASIYSQFLLVSMRREFLGSDSSG 2100 + +++ S++S +L++ +I L I SI SQ L S+RR+FLG+D+SG Sbjct: 1579 ELPTAHDASSQASQELQACQQDLSATSNEIQNLQQSIRSIESQLLKQSIRRDFLGTDASG 1638 Query: 2099 RLYWVLADSGTQPSLISSRNGEMQQNGHIISTSFLGAVSP-PF--SCELGDNKGCPKWVS 1929 RLYW P ++ + +Q+ + +G+ P PF + + G + P W Sbjct: 1639 RLYWGCCFPDENPRILVDGSISLQKP---VQADLIGSKVPSPFLHTVDHGRLRLSP-WTY 1694 Query: 1928 YNTDAEIKKLLHHLQDNNPRERELKESILQWESL----VTRGNKLSESQHLEGPQMTTSL 1761 Y T+ EI +L+ L D++ +ER+L+ESIL W+ L V + K ++Q+L P T L Sbjct: 1695 YETETEISELVQWLHDDDLKERDLRESILWWKRLRYGDVQKEKK--QAQNLSAPVFATGL 1752 Query: 1760 GNCKNVTSSTSLCTMATSLLENKYGPCVALESSELSNIRIKKAKGIVQQNMHRCDCLEPI 1581 T A +E +YGPC+ LE L R KK K ++ + RC+CLE I Sbjct: 1753 E------------TKAAMSMEKRYGPCIKLEMETLKK-RGKKTKVAEREKLCRCECLESI 1799 Query: 1580 WPSRLHCNLCHRTFLTDVELDGHNDGKCSKALPASDKLECLQISEGKSMSSGLEQGGYRH 1401 PS +HC +CH+TF +D E + H + KC A+++ + + S S + + Sbjct: 1800 LPSMIHCLICHKTFASDDEFEDHTESKCIPYSLATEEGKDISDSSKAKESLKSDYLNVKS 1859 Query: 1400 ETHXXXXXXXXXXXXXXXSIIHQNDKSVCPFDYEDICSKFVINDSNKELVKDIGLISSNG 1221 I +Q ++S+ P+ +E+ICSKFV D N++LVK+IGLISSNG Sbjct: 1860 SAGKDVAEISNVSELDSGLIRYQEEESISPYHFEEICSKFVTKDCNRDLVKEIGLISSNG 1919 Query: 1220 IPMFVPSISPHISDSVAMLPPKE--------TVVVEADKLSN----DVQHTSLQDKTTXX 1077 IP F+PS S H++DSV + V+ A +N + + D++ Sbjct: 1920 IPTFLPSSSTHLNDSVLISAKSNKPDGGDSGDQVIFAGPETNVEGLNSESNMSFDRSVTD 1979 Query: 1076 XXXXXXXXXXPLQCAVKHQDQKLSLHCSAPLCKVWNCCIVPEPSLRPLRGRAGYILRQLK 897 L Q K S + +CC+VP+ +L+ + G+A R LK Sbjct: 1980 SHGGPLDKPSGLGFGFSEQKNKKSSGSG-----LKSCCVVPQAALKRVTGKALPGFRFLK 2034 Query: 896 INLLDMDAALPEEAIRPSRSKLERRWAWRSFVKSAERIYEMVQATVALEDMIKSEYLRST 717 NLLDMD ALPEEA+RPS+S RR AWR FVKS++ IYE+VQAT+ +EDMIK+EYL++ Sbjct: 2035 TNLLDMDVALPEEALRPSKSHPNRRRAWRVFVKSSQSIYELVQATIVVEDMIKTEYLKNE 2094 Query: 716 WWYWSSLTMAAKISTLSSLALRIYSLDAAIYYNK 615 WWYWSSL+ AAKISTLS+L++RI+SLDAAI Y+K Sbjct: 2095 WWYWSSLSAAAKISTLSALSVRIFSLDAAIIYDK 2128 >ref|XP_002525350.1| DNA binding protein, putative [Ricinus communis] gi|223535313|gb|EEF36988.1| DNA binding protein, putative [Ricinus communis] Length = 1794 Score = 662 bits (1709), Expect = 0.0 Identities = 409/1027 (39%), Positives = 574/1027 (55%), Gaps = 50/1027 (4%) Frame = -1 Query: 3542 QKAVITVLEDLQSAGLQQKPGKEQKKKRYISISDIIMQRCLNVLRHLAADDDSKNLNKLL 3363 +KAV++VL D+ QQKP +++K K ++ D+IM++C VLR AA D+ K LL Sbjct: 729 KKAVLSVLADVCGETPQQKPNRKRKGKHIDTLPDVIMKQCRKVLRRAAAADEEKIFCNLL 788 Query: 3362 GRKFMNTSCGEDDGLRSSPAIVSCPLDFRTIDLKLAAGAYGGSHEAFLEDVREFWANAHI 3183 GR +NTS +D+GL P +VS PLDFRTIDL+LA GAYGGSHEAFLEDVRE W + Sbjct: 789 GRTLLNTSDNDDEGLLGFPTMVSRPLDFRTIDLRLAFGAYGGSHEAFLEDVREVWHHIRT 848 Query: 3182 ASKDKPDQIRLVEMLSQTFDSMYQTEVLTQFQKLEKCRQSDCVSVEALNEIDEMLGIPRE 3003 A D+ D + L E LSQ F+++Y+ EVLT QKL +C + EA E++++L + Sbjct: 849 AYADQSDLVHLAEKLSQNFEALYKNEVLTLVQKLTDYAAVECSNSEAKKEMEDILEHASQ 908 Query: 3002 ILKAPWEEGVCKVCEIDRDDDNVLLCDTCDAQYHIYCLDPPLAVIPEGDWHCPSCLAGTS 2823 + KAPW+EGVCKVC +D+DDDNVLLCD CD+ YH YCL+PPLA IPEG+W+CPSC Sbjct: 909 MPKAPWDEGVCKVCGVDKDDDNVLLCDKCDSGYHTYCLNPPLARIPEGNWYCPSC----- 963 Query: 2822 ITRRAPDTWSNLLHCRK--YQGEFTRAHLDFLAHLASDMEEREYWELGVCQRSHLLTFLC 2649 IT+ A + HCRK QGEFT L+ LAHL + ME +YW+ V +R LL FL Sbjct: 964 ITQGASQVPQFVSHCRKKRRQGEFTHGVLEALAHLGTTMEITDYWDYSVEERIFLLKFLG 1023 Query: 2648 DELLSSALVRQHIDQCTDRSTKLMQDLRSTFTELKTLKYKEASLEAKVAESDRSFTVVGT 2469 DE+L+SA +R+H+DQC S L Q LRS E + LK+KE + V +S + T T Sbjct: 1024 DEVLNSANIREHLDQCASVSADLQQKLRSLSMEWRNLKFKEELMLNGVGKSGKEGT---T 1080 Query: 2468 DMQQGRDKYITRHYS-----SQSKSDFINNFSGDLFQSNRDQTNAAC--YTPDLASSEDL 2310 + DK + + +S S S D + G F D T C Y + + + Sbjct: 1081 TVLPNYDKLLGQTHSRSSLCSTSFIDLEHLKDGPRFPRTNDFTKRPCWVYPKGVQVQQPI 1140 Query: 2309 QDHSSSLATGNANVSSSYPIVFLNNSKSESSELTSVRTQILQLNNLIASIYSQFLLVSMR 2130 + S + + P V N ++ + E +R + L + + S+ Q S+R Sbjct: 1141 SNGSQVFTISDTECQVNQPDV--NQLQTSNLESIFIRDKASVLQDSVTSLELQLQKASLR 1198 Query: 2129 REFLGSDSSGRLYWVLADSGTQPSLISSRNGEMQQN-----------GHIISTSFLGA-- 1989 +EFLG DS+GR+YW + +G+ P ++ +QQ+ ++ S +GA Sbjct: 1199 KEFLGRDSAGRVYWAFSRTGSLPWVVIDGTTVVQQSSIAEENRVLRFNNLTFRSSIGAQD 1258 Query: 1988 --------VSPPFSCELGDNKGCP-KWVSYNTDAEIKKLLHHLQDNNPRERELKESILQW 1836 V P++ +L +W S+ + AEI++L+ L+DN+P +REL ES+LQ Sbjct: 1259 LLRFKGSNVFSPYASDLTSGISVYFQWFSHQSYAEIEELIKWLRDNDPMQRELIESLLQR 1318 Query: 1835 ESLVTRGNKLSESQHLEGPQMTTSLGNCKNVTSSTSLCTMATSLLENKYGPCVALESSEL 1656 + + + + LE Q + N + SL T A + LE KYGPC+ L+ + + Sbjct: 1319 LNFGYSNSNKAANYVLEMNQPASMPVNIEKTLKPKSLETRALTALEKKYGPCMELDVTNI 1378 Query: 1655 SNIRIKKAKGIVQQNMHRCDCLEPIWPSRLHCNLCHRTFLTDVELDGHNDGKC------- 1497 S + K M RC+CLE IWPSR HC CHR+F + EL+ HNDGKC Sbjct: 1379 SVKFSRNLKVTYDDRMCRCECLEAIWPSRHHCLSCHRSFSSRCELEEHNDGKCGAGAHTP 1438 Query: 1496 --SKALPASDKLECLQISEGKSMSSGLEQGGYRHETHXXXXXXXXXXXXXXXSIIHQNDK 1323 S+ K + L +E + GG HE +I + Sbjct: 1439 QNSRVTDDVSKEKVLMRAEHGEWQ--CKAGGAGHEIE--------------FGLIGFRKE 1482 Query: 1322 SVCPFDYEDICSKFVINDSNKELVKDIGLISSNGIPMFVPSISPHISDSVAMLPPKETVV 1143 + P++ E+I +KFV SNKELVK+IGL+ SNGIP VP SP++ D L Sbjct: 1483 FMSPYNLEEISAKFVTRSSNKELVKEIGLLGSNGIPSLVPCSSPYLIDPTLKLVLPCVNE 1542 Query: 1142 VEADKLSNDVQHTSLQDKTTXXXXXXXXXXXXPLQCAVKHQDQKLSLHCSAPLCKV---W 972 V S +V++ SLQ TT AV ++ + S + + + Sbjct: 1543 VCQSVQSTNVENGSLQGDTTTSKRHANKSNATKDCTAVDLYEELQEIGRSYLMNQSSLRF 1602 Query: 971 NCCIVPEP-------SLRPLRGRAGYILRQLKINLLDMDAALPEEAIRPSRSKLERRWAW 813 +C + P +LRPL G+ +ILRQLKINLLDMDAALPEEA++ S LE+R AW Sbjct: 1603 SCTKLGNPLSEIRGSALRPLVGKGAHILRQLKINLLDMDAALPEEAVKSSNIYLEKRCAW 1662 Query: 812 RSFVKSAERIYEMVQATVALEDMIKSEYLRSTWWYWSSLTMAAKISTLSSLALRIYSLDA 633 R+FVKSA+ ++EMVQAT+ LE+MIK+++LR+ WWYWSSL+ AAKI+T+SSLALRIY+LDA Sbjct: 1663 RAFVKSAKSVFEMVQATIVLENMIKTDFLRNEWWYWSSLSAAAKIATISSLALRIYTLDA 1722 Query: 632 AIYYNKT 612 AI Y KT Sbjct: 1723 AIVYEKT 1729