BLASTX nr result

ID: Angelica22_contig00004763 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00004763
         (5227 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002510762.1| set domain protein, putative [Ricinus commun...   805   0.0  
ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805...   749   0.0  
ref|XP_002300607.1| SET domain protein [Populus trichocarpa] gi|...   738   0.0  
ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [M...   736   0.0  
ref|XP_004146799.1| PREDICTED: uncharacterized protein LOC101220...   725   0.0  

>ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
            gi|223551463|gb|EEF52949.1| set domain protein, putative
            [Ricinus communis]
          Length = 1258

 Score =  805 bits (2080), Expect = 0.0
 Identities = 518/1226 (42%), Positives = 674/1226 (54%), Gaps = 49/1226 (3%)
 Frame = +2

Query: 611  DDKLNSGVSVDTSCQLNGGNESISHTSIGG---TSYPVKAEAAYASPAFVNSWMYVNAEG 781
            D K  S   ++ SCQLNG +  I  +S  G    SY  K    Y  PAF + WMY+N  G
Sbjct: 73   DKKTCSSSVLEMSCQLNGNSSGIPESSNAGGSVKSYQDKNFPGYMPPAFASGWMYLNVNG 132

Query: 782  QMCGPYIQEQLYEGLSSGFLPDDLPVYPILNGSLINPVPLNYFKQFPDHVATGFAYLPGS 961
            QMCGPYIQ+QLYEGLS+GFL +DLPVYP+LNG+L+NPVPL YF QFPDHVATGFAYL   
Sbjct: 133  QMCGPYIQQQLYEGLSTGFLHEDLPVYPVLNGTLVNPVPLKYFNQFPDHVATGFAYLGIG 192

Query: 962  FSGVKVPANSQTYPGSDFLSRTQELATTSGSYIS-------QTAYPXXXXXXXXXXXPQD 1120
             SG  +P +  T    D     QE      + +S         ++            P  
Sbjct: 193  ISGTSMPMSHFTSVSMDSAIHRQEGCVPHAAQVSLCSDAQEMVSHSHVPHNTCGSNQPVS 252

Query: 1121 LNADEANSSKPYMPVSGEEPCWLFDDDEGRKHGPHSLVELYSWHHYGYLRDSLMITHADN 1300
             N+  A+   P+  +SGE+ CW+F+DD GRKHGPHSL ELYSWH +GYLR+SL I H  N
Sbjct: 253  -NSMAASHDIPFSLLSGEDSCWMFEDDGGRKHGPHSLSELYSWHRHGYLRNSLTIYHIQN 311

Query: 1301 KFKPFILKSVVDTWRTGEKTLSVSNDKDHTTGPIQS----ISEDLCSQLHSGIMKTARRV 1468
            KF+PF L SV+D W T +    +++D +   G + S    ISE++  QLH+GIMK ARRV
Sbjct: 312  KFRPFPLLSVIDAWSTDKHESVLASDAEGEMGSLCSFVSEISEEVSCQLHAGIMKAARRV 371

Query: 1469 VLDEVISHVIAECIASKKAIKYQKFEAIN------QNVNTLSASTNMSESCSANVVVTSS 1630
             LDE+IS+V++E   +KK+ +  K   I       Q+  T     +    C      + +
Sbjct: 372  ALDEIISNVMSEFFDTKKSHRNLKRSPITTLCLFYQSEVTGERRNHAVPECKP-AAFSHN 430

Query: 1631 HETALSDCVSEYTLPASRSPVRTLSSMKSVGSAENFLDACAIIYRMLFDSSMQVVWNAVF 1810
             + A  D +SE  LP          + KSVG+ +NF  + A++ R+LFD  M+V+WNAVF
Sbjct: 431  SDQACVDGMSEL-LP---------KNTKSVGTIDNFWGSYAVVCRILFDYCMEVMWNAVF 480

Query: 1811 HDPIVEYSSVWRRTKLWLGHSAVLGQKNSIKQYEGEVEKVHFAALQSEPELSGGEADYPP 1990
            +D I +YS+ WRR KLW   S +     SIK Y GE+EK     L SE            
Sbjct: 481  YDAIADYSNSWRRRKLWSARSNIR-LPASIKDYGGEIEK-----LSSE------------ 522

Query: 1991 GFELPSMSSDVHILSPSVSSCFCNGEESNGRNLQSKDHIFKDVGSILVGVEHDLHSSAMM 2170
              EL  +  D H  S ++S      E ++  N  S    ++ +  IL  V+++LH S   
Sbjct: 523  -LELVCLKKDNHAQSHNLSPFLHVRERASKLNALSHK-AYRGIRRILEYVKNELHMSTKP 580

Query: 2171 SLTQYFESIVDEEVKKITGSPKDDQSNEVEVDPPIEQPHIISLSGSS---EALLDLEKIS 2341
              ++Y E ++D+EV KI    +DD+ NE  V+    +      S S    E   D  K+ 
Sbjct: 581  FFSEYVEFLIDKEVGKIVRVSEDDKLNEETVESFSRRCQTTDYSSSEFQDELTTDSVKL- 639

Query: 2342 DDNNQVSSESEKQEDYQKNVSVL--RNPVSNVLTNTLVKLCESLRGTDVVKDIDMLDAPG 2515
              N + S +++      K +  L   +  SN + +   K    +    V ++ID    PG
Sbjct: 640  --NVETSDDTQSLVQAGKPLGSLAPEDLFSNFVASAFAKSQVDVDFVMVDQNIDEPPPPG 697

Query: 2516 SEIKSETLAPLQISTIRSSRSHERFPRITLYVVLAMFRQKLHDDVLRECSSSLIDGAFNK 2695
                + TL P  I   R ++  E  P+I  YV +A+ RQKLHDDVL E  S  IDG  N+
Sbjct: 698  FGDNARTLVPSPIHKFRPTQPEESIPKIREYVAMAICRQKLHDDVLSEWKSFFIDGILNQ 757

Query: 2696 FWFSHRPXXXXXXXXTVVKRASRTNDERPSNFPAAVDKYSEK-VRSKHRTEPSDISLMTG 2872
            F  S           +  K    +N  +  N  A    Y  K  R  + ++ + +S +  
Sbjct: 758  FLRSIHTLRQHCQPGS--KMGGTSNANKDHNGTALTSLYKLKGTREFNSSDSAGVSSVCD 815

Query: 2873 RYTYSRKKGL-RNKSGSFSEWANFANSGSHKQSVGLSNVPNISVEATQNTEVNASVLSKI 3049
            +YTY RKK L R K GS S+     ++G     V      N+  +     E   + L K 
Sbjct: 816  KYTYYRKKKLVRKKLGSSSQSITPVDTGLQHHPVEKLQKQNVVKDI--EVEPVVATLKKK 873

Query: 3050 IDLDCSTEICADANE-QTMQNEHLPSVHT-----ANHKALKIAHGIQVIEKSSTRTKKLL 3211
                  TE+  D    +++    LPS  +      + K +K  H +     + T    + 
Sbjct: 874  KQKKGQTELSDDRRAIKSIVKSSLPSDQSMAKNGTHQKVIKYKHAVPRPSINVT-IDTIK 932

Query: 3212 PVRPN------DSTKVKK--DANKKRRNLE---TQEHVNQN---KVLNTKRKHTADNNTP 3349
            P R N      D  KVKK  D+N     +E   T ++  +N   K+   KRKH+AD  + 
Sbjct: 933  PNRKNSSDVSKDHAKVKKVSDSNNHDGGIEEVPTHDYSKKNLATKISKLKRKHSADGRSV 992

Query: 3350 T--AKLLKVEKGPTGQVPFKQXXXXXXXXXXYRTSSAYPKSYGCARTSINGWEWHKWSIT 3523
            +   K LKV    + Q   +Q           R S++ P+S GCAR+SI GWEWHKWS +
Sbjct: 993  SHPMKFLKVTTSGSKQAASRQVTAGKAKSRKSRASNSCPRSDGCARSSITGWEWHKWSHS 1052

Query: 3524 ATPSERTRVRGSTLRHAQSRGSEMNVSQISNSKGLSARTNRVKMRNXXXXXXXXXXXKAT 3703
            A+P++R RVRG    HA    SE   SQ+SN K LSARTNRVKMRN           KAT
Sbjct: 1053 ASPADRARVRGIHCLHANYSVSEAYTSQLSNGKVLSARTNRVKMRNLLAAAEGADLLKAT 1112

Query: 3704 QLKARKKRLRFQQSKIHDWGLVALEPIDAEDFVIEYVGELIRSSISDIRERQYEKMGIGS 3883
            QLKARKKRLRFQQSKIHDWGLVALEPI+AEDFVIEYVGELIR  ISDIRER YEKMGIGS
Sbjct: 1113 QLKARKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERLYEKMGIGS 1172

Query: 3884 SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRQIVAGEELT 4063
            SYLFRLDDGYVVDATKRGG+ARFINHSCEPNCYTKVISVEGQKKIFIYAKR I AGEE+T
Sbjct: 1173 SYLFRLDDGYVVDATKRGGVARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEIT 1232

Query: 4064 YDYKFPVEEIKIPCNCGSNRCRRSLN 4141
            Y+YKFP+EE KIPCNCGS +CR SLN
Sbjct: 1233 YNYKFPLEEKKIPCNCGSRKCRGSLN 1258


>ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805708 [Glycine max]
          Length = 1213

 Score =  749 bits (1934), Expect = 0.0
 Identities = 494/1205 (40%), Positives = 655/1205 (54%), Gaps = 28/1205 (2%)
 Frame = +2

Query: 611  DDKLNSGVSVDTSCQLN--GGNESISHTSIGGTSYPVKAEAAYAS-PAFVNSWMYVNAEG 781
            DDK++    V+ SC  N   G   +  T+ G  S+  ++   Y   PAFV+ WMYVN  G
Sbjct: 66   DDKVDPDSGVEMSCPSNVKSGYVPVCSTT-GHISHMDQSFCGYVQQPAFVSGWMYVNENG 124

Query: 782  QMCGPYIQEQLYEGLSSGFLPDDLPVYPILNGSLINPVPLNYFKQFPDHVATGFAYLPGS 961
            QMCGPYI+EQLYEGL++GFLP +LPVYP++NG+L++PVPLNYFKQFPDHV+TGFAYL   
Sbjct: 125  QMCGPYIKEQLYEGLTTGFLPSELPVYPVINGTLMSPVPLNYFKQFPDHVSTGFAYLSMG 184

Query: 962  FSGVKVPANSQTYPGSDFLSRTQELATTSGSYISQTAYPXXXXXXXXXXXPQDLNADEAN 1141
            FSG +VP  +  Y           LA    S                    Q ++    N
Sbjct: 185  FSGTRVPTMA-AYEQDRSFEHAAPLAVNPDS--------------------QPVSQSHVN 223

Query: 1142 SSKPYMPVSGEEPCWLFDDDEGRKHGPHSLVELYSWHHYGYLRDSLMITHADNKFKPFIL 1321
                     G E CWL++D++G KHGPHS+ EL SW+ +GYL+DS +I+H+DNK+  F+L
Sbjct: 224  YCIKESNHLGVECCWLYEDEKGMKHGPHSINELISWNRHGYLKDSTVISHSDNKYDTFVL 283

Query: 1322 KSVVDTWR-----TGEKTLSVSNDKDHTTGPIQSISEDLCSQLHSGIMKTARRVVLDEVI 1486
             S V+  +     T  ++ S SN+       I  ISED+ SQLH GIMK ARRVVLD +I
Sbjct: 284  LSAVNALKGDISGTICRSGSPSNEVGDMVNLIGEISEDISSQLHMGIMKAARRVVLDGII 343

Query: 1487 SHVIAECIASKKAIKYQKFEAINQNVNTLSA-STNMSESCSANVVVTSSHETALSDCVSE 1663
              +IAE +  KK  +++   A     N +S  S  +S   + +    SSH      C   
Sbjct: 344  GDIIAEFVTEKKRTRHKLESADCTPGNNMSKFSAEISRGSAISSDPASSHTLDDQTCHES 403

Query: 1664 YTLPASRSPVRTLSSMKSVGSAENFLDACAIIYRMLFDSSMQVVWNAVFHDPIVEYSSVW 1843
              LP +         +KSVGS ENF  + A++ ++L D SMQV+WNAVF D + EY S W
Sbjct: 404  SRLPPA--------IIKSVGSIENFWWSYAVVRKVLLDYSMQVMWNAVFFDTLAEYLSSW 455

Query: 1844 RRTKLWLGHSAVLGQKNSIKQYEGEVEKVHFAALQSEPELSGGEADYPPGFELPSMSSDV 2023
            R+ KLW         + S  + E   EK+   AL   P+ S    D    F + +   + 
Sbjct: 456  RKKKLWSHRKP----QPSANECEDHTEKIESEALVINPDTSESNVDGYNQFGVLATEKNC 511

Query: 2024 HILSPSVSSCFCNGEESNGRNLQSKDHIFKDVGSILVGVEHDLHSSAMMSLTQYFESIVD 2203
              L  S SS    G    G+ +       +D+  IL  VE++LH S+ +SL  Y +S ++
Sbjct: 512  PRLFSS-SSSLKGGNLLEGQKVSCLYDNSRDLTCILESVENELHFSSKVSLADYIQSFIE 570

Query: 2204 EEVKKITGSPKDDQSNEVEVDPPIEQPHIISLSGSSEALLDLEKISDDNNQVSSESEKQE 2383
            +EV K+   P++++ NEV V            +  SE L D   + +  N  S +  K  
Sbjct: 571  KEVNKLIPFPEENKFNEVAVGD----------TRFSEKLADKTSVKEILNDKSVDPAKAG 620

Query: 2384 DYQKNVSVLRNPVSNVLTNTLVKLCESLRGTDVVKDIDMLDAPGSEIKSETLAPLQISTI 2563
            +     S   N +S+V +    +LC  +   DVV++ ++ D P    KS+T+A    S  
Sbjct: 621  N-SFGESASGNHMSDVFSKAFKELCGYV--DDVVEE-EIDDLPPGLEKSQTVALHYNSKF 676

Query: 2564 RSSRSHERFPRITLYVVLAMFRQKLHDDVLRECSSSLIDGAFNKFWFSHRPXXXXXXXXT 2743
            R SRS E   +IT YV  A+ RQKLHD+VL +  S  +D    + + S            
Sbjct: 677  RPSRSAECNLKITEYVATALCRQKLHDEVLEKWRSLFLDSVPKQVFISSSTIKKHFKSDG 736

Query: 2744 VVKRAS-RTNDERPSNFPAAVDKYSEKVRSKHRTEPSDISLMTGRYTYSRKKGLRNKSGS 2920
              KR +   + E  ++  + + +  E  +S     P     + G+YTY RKK  R +  S
Sbjct: 737  HKKRKTVNASKEHLNSATSGLGRVKEGAKSSSEVPP-----VIGKYTYCRKKLSRKELIS 791

Query: 2921 FSEWANFANSGSHKQSVGLSNVPNISVEATQNTEVN-ASVL---SKII----DLDCSTEI 3076
                A   +S   KQ V      + S +  +  EV  ASV+   +K+I    D     + 
Sbjct: 792  SKSVAE-NDSRPGKQPVAKLR-KHFSGDVGEAAEVKIASVIHGKTKMIKGKKDTTSKGKS 849

Query: 3077 CADANEQTMQNEHLPSVHTANHKALKIAHGIQ--VIEKSSTRTKKLLPVRPNDSTK---V 3241
                N  +  N+ L   + A  K LK +  +Q  V +   +  KKL     N       V
Sbjct: 850  SVSVNSSS-HNDQLSLKNKAGQKVLKFSGEVQNDVKDFVKSNVKKLSASTDNSVVMKKIV 908

Query: 3242 KKDANKKRRNLETQEHVNQN---KVLNTKRKHTADNNTPT--AKLLKVEKGPTGQVPFKQ 3406
            K D   K +         QN   KV  +KRKH  D    +   K+LK+  G       KQ
Sbjct: 909  KSDGTVKEKVTSHCSREIQNATMKVSKSKRKHQMDGTASSHPTKVLKISNGGAYLGASKQ 968

Query: 3407 XXXXXXXXXXYRTSSAYPKSYGCARTSINGWEWHKWSITATPSERTRVRGSTLRHAQSRG 3586
                       +  +  P+S GCARTSI+GWEWHKWS +A+P+ + RVRG      +   
Sbjct: 969  VTVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWSRSASPAYKARVRGLPCVQNKCID 1028

Query: 3587 SEMNVSQISNSKGLSARTNRVKMRNXXXXXXXXXXXKATQLKARKKRLRFQQSKIHDWGL 3766
            SE N+SQ+SN KGLSARTNRVK+RN           K  QLKARKK LRFQ+SKIHDWGL
Sbjct: 1029 SENNLSQLSNGKGLSARTNRVKLRNLLAAAEGADLLKVPQLKARKKHLRFQRSKIHDWGL 1088

Query: 3767 VALEPIDAEDFVIEYVGELIRSSISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIA 3946
            +ALEPI+AEDFVIEY+GELIR  ISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIA
Sbjct: 1089 LALEPIEAEDFVIEYIGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIA 1148

Query: 3947 RFINHSCEPNCYTKVISVEGQKKIFIYAKRQIVAGEELTYDYKFPVEEIKIPCNCGSNRC 4126
            RF+NHSCEPNCYTKVISVEGQKKIFIYAKR I AGEE+TY+YKFP+EE KIPCNCGS +C
Sbjct: 1149 RFVNHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRKC 1208

Query: 4127 RRSLN 4141
            R SLN
Sbjct: 1209 RGSLN 1213


>ref|XP_002300607.1| SET domain protein [Populus trichocarpa] gi|222842333|gb|EEE79880.1|
            SET domain protein [Populus trichocarpa]
          Length = 1390

 Score =  738 bits (1905), Expect = 0.0
 Identities = 499/1260 (39%), Positives = 656/1260 (52%), Gaps = 99/1260 (7%)
 Frame = +2

Query: 641  DTSCQLNGGNESISHTSIGGTSYPVKAEAAYASPAFVNSWMYVNAEGQMCGPYIQEQLYE 820
            + SC+ NG +E + +T  GG SY  +  + ++ PAFV+ WMY+N  GQMCGPYIQ+QLYE
Sbjct: 67   EMSCKSNGNSEGMPNT--GGASYGGENCSGHSPPAFVSGWMYLNENGQMCGPYIQQQLYE 124

Query: 821  GLSSGFLPDDLPVYPILNGSLINPVPLNYFKQFPDHVATGFAYLPGSFSGVKVPANSQTY 1000
            GLS+GFLP+DLPVYPI NG LINPVPLNYFKQFPDHV+TGF YL    SG  +P N  T 
Sbjct: 125  GLSTGFLPEDLPVYPIANGILINPVPLNYFKQFPDHVSTGFTYLCLGTSGTTMPTNHPTD 184

Query: 1001 PGSDFLSRTQELATTSGSYISQTAYPXXXXXXXXXXXPQDLNADEANSSKPYMPVSGEEP 1180
              +      Q  A  S     ++                  N++ A+   P   VSGE+ 
Sbjct: 185  LAAHRQEGVQYAAPVSAHPDIESISDSRVRNHTYSFNQPISNSEAADYVTPVSLVSGEDS 244

Query: 1181 CWLFDDDEGRKHGPHSLVELYSWHHYGYLRDSLMITHADNKFKPFILKSVVDTWRTGEKT 1360
            CWLF DD+GRKHGPHSL+ELYSW+ YGYL+DSLMI HA NKF+P  L S+++ WR  +  
Sbjct: 245  CWLFKDDDGRKHGPHSLLELYSWYQYGYLKDSLMIYHAQNKFRPLPLLSIMNAWRLDKPE 304

Query: 1361 LSVSNDKDHTTGPIQS----ISEDLCSQLHSGIMKTARRVVLDEVISHVIAECIASKKAI 1528
                 D    TG  QS    ISE++ SQLHSGI+K ARR  LDE+I  VI+E + +K+A 
Sbjct: 305  SFSMTDATTETGSSQSFISVISEEVSSQLHSGILKAARRFALDEIICDVISEFVRTKRAE 364

Query: 1529 KYQKFEAINQNVNTLSASTNMSESCSANVVV-TSSHETALSDCVSEYTLPASRSPVRTLS 1705
            +Y   +  NQ   T S    MS+S S  ++  T   + A  + +S+ T  A    V+   
Sbjct: 365  RYLMLD--NQAAKTCSVDGKMSQSASERMIFSTPECDAAACNYISDQTW-ADELSVQLPR 421

Query: 1706 SMKSVGSAENFLDACAIIYRMLFDSSMQVVWNAVFHDPIVEYSSVWRRTKLWLGHSAVLG 1885
            S KSVG+A++F  + A+I R L D  M+V+WNAVF+D I EY+  WR++KLW  H  +  
Sbjct: 422  STKSVGNADDFWGSYAVICRCLSDYCMEVMWNAVFYDTIAEYTISWRKSKLWFHHPYLC- 480

Query: 1886 QKNSIKQYEGEVEKVHFAALQSEPELSGGEADYPPGFELPSMSSDVHILSPSVSSCFCNG 2065
                      ++E++      S  E      D PPGFEL    SD  + S   SSC   G
Sbjct: 481  ---------MKIEELPSETYFSGQESPASSVDCPPGFELLKTKSDHTVPSSITSSCAHVG 531

Query: 2066 EESNGRN-LQSKDHIFKDVGSILVGVEHDLHSSAMMSLTQYFESIVDEEVKKITGSPKDD 2242
            +E   +N L  KD    D+  IL  V ++LH S  +SL +Y E +V E+VKK+    +D 
Sbjct: 532  QEPCEQNSLSFKDCPDDDMKCILESVAYELHKSTKVSLLEYVEILVKEKVKKLVNFSEDK 591

Query: 2243 QSNEVEVD---PPIEQPHIISLSGSSEALLDLEKISDDNNQVSSESEKQEDYQKNVSVLR 2413
            + NE   D   P  +     S+    E ++D  +I  +    SS+ +     QK+    +
Sbjct: 592  RLNEEIFDFSIPSSQASEYGSIEMKDEKMIDSNQIPAE-IMFSSKPQSSLQVQKSFFPFQ 650

Query: 2414 --NPVSNVLTNTLVKLCESLRGTDVVKDIDMLDAPGSEIKSETLAPLQISTIRSSRSHER 2587
              N +SN L     +L  S+      ++ID    PG   K   L P  I+  R S+S + 
Sbjct: 651  SENEISNFLAIAFKRLRPSVVNAIDDENIDGPPPPG--FKDTALFPSAINKFRPSKSLKL 708

Query: 2588 FPRITLYVVLAMFRQKLHDDVLRECSSSLIDGAFNKFWFSHRPXXXXXXXXTVVKRASRT 2767
             P++  YV +AM  QKLHDDVL    S  +D         HR               S  
Sbjct: 709  TPKVGAYVTIAMCMQKLHDDVLNVWKSIFVDEIL------HRSPRLC---------CSSE 753

Query: 2768 NDERPSNFPAAVDKYSEKVRSKHRTEPSDISLMTGRYTYSRKKGLRNKS-GSFSEWANFA 2944
                P        K++E     H  + S +SL++G+YTY RK+ L  K  GS S      
Sbjct: 754  KHTEPGINEEGAFKFTEGSNKFHSPDSSVLSLVSGKYTYHRKRKLVGKKLGSSSHSTTTV 813

Query: 2945 NSGSHKQSVGLSNVPNISVEATQNTEVNASVLSKIIDLDCSTE---ICADANEQTMQNEH 3115
            +SG  KQ V  S   ++  + ++N  V      K      S +   + A   E ++    
Sbjct: 814  DSGLLKQPVEKSRKQDVLSDVSENVVVQPVKTPKKKGQASSVDAKPLKATIAESSVNARP 873

Query: 3116 L------PSVHTANHKAL--KIAHGIQVIEKSSTRTKKLLPVRPNDSTKVKKDANKKRRN 3271
            L       SV+    KA         Q + K+ +R K +   R  +  K  KD+ K  R+
Sbjct: 874  LKATIAESSVNVGPSKAAVKSTLKRDQSLPKNISRRKVMKIARAVNDDKDAKDSIKTSRD 933

Query: 3272 LE--------------------TQEHVNQNKVLNTKRKHTADNNTPT--AKLLKVEKGPT 3385
            +                     +++ +N  KV N+KRK T D  + +   K+LKVE    
Sbjct: 934  VVGLIDCNGRDAGIKKSGTTECSKKTLNSTKVSNSKRKSTVDGGSVSHPMKILKVENDVN 993

Query: 3386 GQVPFKQXXXXXXXXXXYRTSSAYPKSYGCARTSINGWE-------------------WH 3508
             Q    Q              +A   S    ++++NG                       
Sbjct: 994  KQAATGQVMARKTKSDHVFLCTATKVSKLKRKSTVNGGSVSHPMKILKVENGANKQTATG 1053

Query: 3509 KWSITATPSERTR-------------------------VRGSTLRHAQSRG--------- 3586
            +++   T S ++R                         V+ S    A+ RG         
Sbjct: 1054 QFTARKTKSSKSRMLIPCPRSDGCARSSINGWEWHAWSVKASPAERARVRGVRCIHAKYS 1113

Query: 3587 -SEMNVSQISNSKGLSARTNRVKMRNXXXXXXXXXXXKATQLKARKKRLRFQQSKIHDWG 3763
             SE   SQ+SN K LSARTNRVK+RN           KATQLKARKKRL FQ+SKIHDWG
Sbjct: 1114 GSEAYASQLSNGKVLSARTNRVKLRNLLAAAEGVDLLKATQLKARKKRLCFQRSKIHDWG 1173

Query: 3764 LVALEPIDAEDFVIEYVGELIRSSISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGI 3943
            LVALE I+AEDFVIEYVGELIR  ISDIRER YEKMGIGSSYLFRLDDGYVVDATKRGGI
Sbjct: 1174 LVALESIEAEDFVIEYVGELIRPQISDIRERLYEKMGIGSSYLFRLDDGYVVDATKRGGI 1233

Query: 3944 ARFINHSCEPNCYTKVISVEGQKKIFIYAKRQIVAGEELTYDYKFPVEEIKIPCNCGSNR 4123
            ARFINHSCEPNCYTKVISVEGQKKIFIYAKR I AGEE+TY+YKFP+E+ KIPCNCGS +
Sbjct: 1234 ARFINHSCEPNCYTKVISVEGQKKIFIYAKRYIAAGEEITYNYKFPLEDKKIPCNCGSRK 1293


>ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
            gi|355483953|gb|AES65156.1| Histone-lysine
            N-methyltransferase SETD1B [Medicago truncatula]
          Length = 1232

 Score =  736 bits (1901), Expect = 0.0
 Identities = 481/1216 (39%), Positives = 667/1216 (54%), Gaps = 45/1216 (3%)
 Frame = +2

Query: 611  DDKLNSGVSVDTSCQLNGGNE-SISHTSIGGTSYPVKAEAAYAS-PAFVNSWMYVNAEGQ 784
            DDK N    ++ SC  N  ++ S+  TS    S+  ++   +   P FV+ WMYVN  GQ
Sbjct: 37   DDKANPNCGMEMSCPSNVNSDVSVCSTSTVNISHSHQSFRGFVQQPDFVSGWMYVNEHGQ 96

Query: 785  MCGPYIQEQLYEGLSSGFLPDDLPVYPILNGSLINPVPLNYFKQFPDHVATGFAYLPGSF 964
            MCGPYI+EQL+EGL++GFLP +LPVYP++NG+++N VPLNYFKQ+PDHV+TGFAYL   F
Sbjct: 97   MCGPYIKEQLHEGLTTGFLPFELPVYPVINGTIMNSVPLNYFKQYPDHVSTGFAYLSMDF 156

Query: 965  SGVKVPAN--SQTYPGSDFLSRTQELATTSGSYISQTAYPXXXXXXXXXXXPQDLN---- 1126
            S  ++  N  S +    D   R+ ELA            P            ++ N    
Sbjct: 157  SNARMSKNCSSSSQDMVDGQDRSVELAAV------MAVNPDSKSVSHVNDCNKESNHVDL 210

Query: 1127 ADEANSSKPYMPVSGEEPCWLFDDDEGRKHGPHSLVELYSWHHYGYLRDSLMITHADNKF 1306
            + EA S      + G E CWL++D +G KHGPHS+ EL SWHH+GYL DS +I+H DNK+
Sbjct: 211  SSEAFSRIISSQMLGGECCWLYEDKKGMKHGPHSISELISWHHHGYLEDSTVISHFDNKY 270

Query: 1307 KPFILKSVVDTWRTGE-KTLSVSNDKDHTTGPIQS----ISEDLCSQLHSGIMKTARRVV 1471
              F+L S V+  +     T+  S+ K +  G + +    ISED+ SQLH+G+MK++R+VV
Sbjct: 271  GTFVLLSAVNAMKGDTCGTICGSDSKSNGVGDVMNLICEISEDISSQLHTGVMKSSRKVV 330

Query: 1472 LDEVISHVIAECIASKKAIKYQKFEAINQNVNTLSASTNMSESCSANVVVTSSHETALSD 1651
            LD +I  +IAE I  KK  K QK E+ +Q   T + +  M    ++     ++       
Sbjct: 331  LDGIIGDIIAEFITEKKC-KKQKLESADQTSETCTLNNKMMNKGASIPSEPAASRILNGQ 389

Query: 1652 CVSEYTLPASRSPVRTLSSMKSVGSAENFLDACAIIYRMLFDSSMQVVWNAVFHDPIVEY 1831
               E + P+S       +++KSVGS ENF  + A++ ++LFD S+QV+WNAVF D + E 
Sbjct: 390  ACHEISRPSS-------TNVKSVGSIENFWWSYAVVRKVLFDHSLQVMWNAVFFDTVTEV 442

Query: 1832 SSVWRRTKLWLG---HSAVLGQKNSIKQYEGEVEKVHFAALQSEPELSGGEADYPPGFEL 2002
               WR+ K W      S+V   K+S+++ + E       AL +   +   EAD   G   
Sbjct: 443  LFSWRKKKYWSHPKPQSSVNESKDSVEKLKSEA-----LALGTGSSVCNVEADIQSG--A 495

Query: 2003 PSMSSDVH---ILSPSVSSCFCNGEESNGRNLQSKDHIFKDVGSILVGVEHDLHSSAMMS 2173
             +   D H   +LSP+       G  + G+ +       +D+  IL  VE++LH SA  S
Sbjct: 496  MATERDCHPELLLSPNNIKI---GNIAEGQRVSCSYGNSEDLTRILESVENELHCSAKAS 552

Query: 2174 LTQYFESIVDEEVKKITGSPKDDQSNEVEVDPPIEQPHIISLSGSSEALLD--LEKISDD 2347
            L  Y  S+V++EV ++  SP+ D  +EV+V        +   +   E L D  ++ + + 
Sbjct: 553  LADYVRSVVEKEVNQLIPSPEKDIFSEVDVSDCRISKMVAGKTSVKETLSDKSIDPVKNG 612

Query: 2348 NNQVSSESEKQEDYQKNVSVLRNPVSNVLTNTLVKLCESLRGTDVVKDIDMLDAPGSEIK 2527
            ++     SE             N +S+V +    +LC  L   DVV D ++ D P    +
Sbjct: 613  DSICVPSSE-------------NHMSDVFSKAFQELCGHLN--DVVDDEEIGDLPPG-FE 656

Query: 2528 SETLAPLQISTIRSSRSHERFPRITLYVVLAMFRQKLHDDVLRECSSSLIDGAFNKFWFS 2707
              ++ P   S  R SR+ E  P+IT YV  A+ RQKLHD+VL++   S++D  F K   S
Sbjct: 657  KNSIFPHCNSKFRPSRTVECNPKITEYVTAALCRQKLHDEVLKDWKLSILDSTFKKVMSS 716

Query: 2708 HRPXXXXXXXXTVVKRASRTNDERPSNFPAAVDKYSEKVRS-----KHRTEPSDISLMTG 2872
                           ++   N E  ++    + K  E  +      K   + S + L   
Sbjct: 717  CTIKKNLQSRGHGKGKSFSANKEHLNDATLGLGKVKEGTKLGLGKVKEGAKSSGVPLAIE 776

Query: 2873 RYTYSRKKGLRNKSGSFSEWANFANSGSHKQSVGLSNVPNISVEATQNTEVNASVL---- 3040
            +YTY RK  L  K    S+     NSG  K+ +      ++S +  ++ EV  + +    
Sbjct: 777  KYTYHRKN-LSRKELCSSKPVVDDNSGPGKKPLAKLR-KDVSGDVKESAEVKVTAIKRGK 834

Query: 3041 SKIID--LDCSTEICADAN-EQTMQNEHLPSVHTANHKALKIAHGIQ--VIEKSSTRTKK 3205
            +K+I    D S++  +  N + +  +  L   +    K  K AH +Q  V +   +  K+
Sbjct: 835  AKMIKGKKDTSSKKSSPVNVDNSSPSVQLSLKNKTCQKVSKFAHTVQNGVTDVLKSNKKR 894

Query: 3206 LLPVRPND--STKVKKDANKKRRNLETQEHVNQNK------VLNTKRKHTADNNTPT--A 3355
            LL    N      VK++        +T  H+++ K      V  +KRKH  D  T +  A
Sbjct: 895  LLVSSDNSVGMKVVKRNNTDVTIQRKTTGHISKEKLNATNTVSKSKRKHQPDGVTSSHPA 954

Query: 3356 KLLKVEKGPTGQVPFKQXXXXXXXXXXYRTSSAYPKSYGCARTSINGWEWHKWSITATPS 3535
            K+LK+          KQ           ++    P+S GCARTSI+GWEWHKWS +A+P+
Sbjct: 955  KVLKISNSGASLEASKQVTEARRNSAKSKSLDLCPRSIGCARTSIDGWEWHKWSQSASPT 1014

Query: 3536 ERTRVRGSTLRHAQSRGSEMNVSQISNSKGLSARTNRVKMRNXXXXXXXXXXXKATQLKA 3715
             R RVRG      +   SE N SQ+SNSKGLSARTNRVK+RN           K  QLKA
Sbjct: 1015 SRARVRGLPRLQNKFINSEKNPSQLSNSKGLSARTNRVKLRNLLAAAEGADLLKVPQLKA 1074

Query: 3716 RKKRLRFQQSKIHDWGLVALEPIDAEDFVIEYVGELIRSSISDIRERQYEKMGIGSSYLF 3895
            RKKRLRFQ+SKIHDWGLVALEPI+AEDFVIEY+GELIR  ISDIRE QYEKMGIGSSYLF
Sbjct: 1075 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIREVQYEKMGIGSSYLF 1134

Query: 3896 RLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRQIVAGEELTYDYK 4075
            RLDDGYVVDATKRGGIARFINHSCEPNCY KVIS EGQKKIFIYAKR I AGEE+TY+YK
Sbjct: 1135 RLDDGYVVDATKRGGIARFINHSCEPNCYPKVISFEGQKKIFIYAKRHINAGEEITYNYK 1194

Query: 4076 FPVEEIKIPCNCGSNR 4123
            FP+EE KIPCNCGS +
Sbjct: 1195 FPLEEKKIPCNCGSKK 1210


>ref|XP_004146799.1| PREDICTED: uncharacterized protein LOC101220062 [Cucumis sativus]
          Length = 1289

 Score =  725 bits (1872), Expect = 0.0
 Identities = 486/1250 (38%), Positives = 658/1250 (52%), Gaps = 73/1250 (5%)
 Frame = +2

Query: 611  DDKLNSGVSVDTSCQLNGGNESISHT-SIGGTSYPVKAEAAYASPAFVNSWMYVNAEGQM 787
            D+K  S  SVD SCQLNG +  +    S  G+S+  K  + Y+ P  V+ WMYVN +GQM
Sbjct: 73   DEKNGSYSSVDMSCQLNGTSPDLPECCSSEGSSFRDKGFSGYSFPTCVSGWMYVNEQGQM 132

Query: 788  CGPYIQEQLYEGLSSGFLPDDLPVYPILNGSLINPVPLNYFKQFPDHVATGFAYLPGSFS 967
            CGPYIQEQL+EGLS+GFLPD+L VYP+ NG+L NPVPL YFKQFPDH+ATGFAYL    S
Sbjct: 133  CGPYIQEQLHEGLSTGFLPDELLVYPVFNGALTNPVPLKYFKQFPDHIATGFAYLSVDIS 192

Query: 968  GVKVPANSQTYPGSDFLSRTQELATTSGS-----YISQTAYPXXXXXXXXXXXPQDLNAD 1132
             + +  N       D     QE     G+     + SQ++              Q  N++
Sbjct: 193  NMGLNGNHSDACKIDLAMHRQEGLVECGNPPTPCHDSQSS--PLSFGYENGGSKQASNSE 250

Query: 1133 EANSSKPYMPVSGEEPCWLFDDDEGRKHGPHSLVELYSWHHYGYLRDSLMITHADNKFKP 1312
                +   +P S E  CWL  D  GRKHGP+SL++LYSWH +GYL+DS+MI H ++KFKP
Sbjct: 251  LFCLTTSNLPSSVEGSCWLIMDHTGRKHGPYSLLQLYSWHQHGYLKDSVMIYHIESKFKP 310

Query: 1313 FILKSVVDTWRTGEKTLSVSND-KDHTTGP----IQSISEDLCSQLHSGIMKTARRVVLD 1477
            F L S V+ W+        S+D K + +G     I   SE + SQLH+GIMK AR+VVLD
Sbjct: 311  FTLFSAVNAWKAAIPLPLFSSDLKTNESGSLLKFISETSEGVSSQLHAGIMKAARKVVLD 370

Query: 1478 EVISHVIAECIASKKAIKYQKFEAINQNVNTLSASTNMSESCSANVVVTSSHETALSDCV 1657
            E++  +I E +  KK+ +  K E  NQ +   S  + MSE          S    + +  
Sbjct: 371  EIVGSIIGEFVTVKKSERQIKVEQTNQIMKVCSLDSRMSEVTRGGDFPADS----MPETQ 426

Query: 1658 SEYTLPASRS-PVRTLSSMKSVGSAENFLDACAIIYRMLFDSSMQVVWNAVFHDPIVEYS 1834
              +++P   S  V  + S+K VGS +NF +  A+I +MLFD S+QVVWNAV +D + EYS
Sbjct: 427  GFFSVPEKVSTDVVPVQSLKLVGSIDNFREVHAVICQMLFDYSLQVVWNAVSYDTVAEYS 486

Query: 1835 SVWRRTKLWLGHS----AVLGQKNSIKQYE---------GEVEKVHFAALQSEPELSGGE 1975
            S WRR + W        A  G ++ +K+ E          +   +H  +  S  +  G +
Sbjct: 487  SAWRRKRFWSYRPHYSLASSGYRDRVKKIEKTPAEASLPRKESSLHGVSSLSVSKFKGAQ 546

Query: 1976 ADYPPGFELPSMSSDV-HILSPSVSSCFCNGEESNGRNLQSKDHIFKDVGSILVGVEHDL 2152
             +      + S+S  V H  S   S   C   +             +D+  ++  +E +L
Sbjct: 547  TENCARSAVISLSVPVGHKSSRPTSHSCCERPK-------------EDLKWMVEYLEKEL 593

Query: 2153 HSSAMMSLTQYFESIVDEEVKKITGSPKDDQSNEVEVDPPIEQPHIISLSGSSEALLDLE 2332
            HSSA +S+ +Y + I++EEV     +  D + ++V +D  I+     S    S +  +L+
Sbjct: 594  HSSAKVSMAEYIQDILEEEVISSCNASTDVKLDKVALDVSIQ---CSSTDNYSNSFGELQ 650

Query: 2333 KISDDNNQVSSESEKQEDYQKNVSVLRNPVSNVLTNTLVKL---------CESLRGTDVV 2485
              S+D +   +  E +      V++  +   N + N+L ++         C         
Sbjct: 651  CDSNDTHGDRNSGELKLALLPEVNLSNDTALNSVANSLYEVFKEICTNEGCAFNEDCAFN 710

Query: 2486 KDIDMLDAPGSEIKSETLAPLQISTIRSSRSHERFPRITLYVVLAMFRQKLHDDVLRECS 2665
            +D + L APG E       P      R S S++ + +I  Y++LA+ RQKLHD VL+E +
Sbjct: 711  EDCNELLAPGLEEHPTFQIPSPACKFRPSSSNKCYSKIEGYIMLAICRQKLHDAVLKEWT 770

Query: 2666 SSLIDGAFNKFWFSHRPXXXXXXXXTVVKRASRTNDERPSNFPAAVDKYSEKVRSKHRTE 2845
            SS  D    +F  S            +V+ A   +    S  P  + + SE+        
Sbjct: 771  SSYKDDLLRQFVSSWIASKKHCNSNRIVEGA--CDGGEASKVPDKLREGSERF------- 821

Query: 2846 PSDISLMTGRYTYSRKKGLRNKSGSFSEWANFANSGSHKQSVGLSNVPNISVEATQNTEV 3025
              + SL+TG YTY RKK  + K GS S+ A   +     Q    S   NISV   + T+ 
Sbjct: 822  -LESSLVTGNYTYYRKKSSKRKLGS-SDCATEGSPVVRNQPSEKSRKENISVGVCETTDS 879

Query: 3026 N-ASVLSKII-------DLDCSTEICADANEQTMQNEHLPSVHTANHKALKIAHGIQ--- 3172
              AS+  K I       DL           E T+ + H         K LK +  ++   
Sbjct: 880  EIASLTLKSIAKNKRKKDLSIKATCKRTCAEVTLPSSHSSGKTICGTKKLKFSPPVKDDN 939

Query: 3173 VIEKSSTRTKKLLPVRPNDSTKVKKDANKKRRNLETQEHVNQNKVL-------------- 3310
              + S    K  +   P     V +  NK  R +  QE +    +L              
Sbjct: 940  AKKDSVKHGKGRMIGSPLMIKNVDQVMNKCDRGVGAQEKLCSTSLLLSSLIFSLSLSSFF 999

Query: 3311 ------------NTKRKHTADNNTPTA-KLLKVEKGPTGQVPFKQXXXXXXXXXXYRTSS 3451
                          KRK   D  +    K+L V    + Q   K+           R  +
Sbjct: 1000 PGLYLCAAVNVSKIKRKQKVDEASLLGNKVLTVADDFSKQAASKRVVAQKKKSDKSRKLN 1059

Query: 3452 AYPKSYGCARTSINGWEWHKWSITATPSERTRVRGSTLRHAQSRGSEMNVSQISNSKGLS 3631
                S GCAR+SINGWEW +W++ A+P+ER R RG    ++   G +++ S + N KGLS
Sbjct: 1060 ISIISDGCARSSINGWEWRRWTLKASPAERARNRGFQYFYSDPIGPDVSTSHLLNGKGLS 1119

Query: 3632 ARTNRVKMRNXXXXXXXXXXXKATQLKARKKRLRFQQSKIHDWGLVALEPIDAEDFVIEY 3811
            ARTNRVK+RN           KA+QLKARKKRLRFQ+SKIHDWGLVALEPI+AEDFVIEY
Sbjct: 1120 ARTNRVKLRNLLAAADGADLLKASQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 1179

Query: 3812 VGELIRSSISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKV 3991
            VGELIR  ISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGG+ARFINHSCEPNCYTKV
Sbjct: 1180 VGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGVARFINHSCEPNCYTKV 1239

Query: 3992 ISVEGQKKIFIYAKRQIVAGEELTYDYKFPVEEIKIPCNCGSNRCRRSLN 4141
            I+VEGQKKIFIYAKR I AGEE+TY+YKFP+EE KIPCNC S RCR SLN
Sbjct: 1240 ITVEGQKKIFIYAKRHISAGEEITYNYKFPLEEKKIPCNCRSRRCRGSLN 1289


Top