BLASTX nr result

ID: Cimicifuga21_contig00001245

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00001245
         (1552 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferas...   358   3e-96
emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]   358   3e-96
ref|XP_002299366.1| SET domain protein [Populus trichocarpa] gi|...   341   3e-91
ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferas...   331   4e-88
ref|XP_002512709.1| protein with unknown function [Ricinus commu...   313   8e-83

>ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
            vinifera]
          Length = 405

 Score =  358 bits (918), Expect = 3e-96
 Identities = 195/384 (50%), Positives = 238/384 (61%), Gaps = 20/384 (5%)
 Frame = -2

Query: 1539 EIEGRGRGVIATRSVKAGEILVTDSPILLYSANPLKID---FCSNCFRKLDXXXXXXXXX 1369
            EIEGRGR ++A++S++ G+I++TDSPILLYSA+PL      +CSNCFR L          
Sbjct: 18   EIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTCSTLVSCS 77

Query: 1368 XXXXSYHALFCSPNCRSTALASSHSPWVCQALNXXXXXXXXXXXXSLALDDVSDAHFLVS 1189
                    LFCSP+C + AL+SSHSPW C  L+            S   +    A FLV+
Sbjct: 78   SCP----CLFCSPDCLTHALSSSHSPWACLTLSLLRASPSLSLSHS---ERQVQARFLVA 130

Query: 1188 AYNLAVISPSHFKLLLSLEG-STPTPTDPRXXXXXXXXXXXSPPQNFAGISLDLTAVLLA 1012
            AYNLA++SPSHF +LLSL+G + P+                SPPQ  AG S++LT  LLA
Sbjct: 131  AYNLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTALLA 190

Query: 1011 KDKRNAFALMEP--FQENGERNVRAYGIYPNASFFNHDCLPNAARFDYVDTSVVAGRNTD 838
            KDK NAF LMEP      GER+VRAYGIYP ASFFNHDCLPNA RFDYVDT+  +  NTD
Sbjct: 191  KDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTA--SHHNTD 248

Query: 837  IIVRAIHDFPQGREICLSYFPVNWSYADRQKRLMEDYGFVCDCDRCKVEVQWXXXXXXXX 658
            I +R IHD P+G EICLSYFPVN +YADRQKRL+EDYGF C CDRC+VE  W        
Sbjct: 249  ITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEE 308

Query: 657  XXXXXEQMV------------XXXXXXXXXXXXDFPHAYFFLRYVCNRDDCGGTLAPLPP 514
                  Q++                        DFPHAYFFLRY+C R++C GTLAPLPP
Sbjct: 309  EQDDEGQVMDEDQDEQMIGSENEIEIGDGGGENDFPHAYFFLRYMCTRENCWGTLAPLPP 368

Query: 513  --SDGAPSDVMECNVCGQLRTGEE 448
              SD +PS++MECNVCG  +  +E
Sbjct: 369  SDSDASPSNLMECNVCGNSKKSDE 392


>emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
          Length = 405

 Score =  358 bits (918), Expect = 3e-96
 Identities = 196/384 (51%), Positives = 239/384 (62%), Gaps = 20/384 (5%)
 Frame = -2

Query: 1539 EIEGRGRGVIATRSVKAGEILVTDSPILLYSANPLKID---FCSNCFRKLDXXXXXXXXX 1369
            EIEGRGR ++A++S++ G+I++TDSPILLYSA+PL      +CSNCFR L          
Sbjct: 18   EIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTCSTLVSCS 77

Query: 1368 XXXXSYHALFCSPNCRSTALASSHSPWVCQALNXXXXXXXXXXXXSLALDDVSDAHFLVS 1189
                    LFCSP+C + AL+SSHSPW C  L+            S   +    A FLV+
Sbjct: 78   SCP----CLFCSPDCLTXALSSSHSPWACLTLSLLRASPSLSLSHS---ERQVQARFLVA 130

Query: 1188 AYNLAVISPSHFKLLLSLEG-STPTPTDPRXXXXXXXXXXXSPPQNFAGISLDLTAVLLA 1012
            AYNLA++SPSHF +LLSL+G + P+                SPPQ  AG S++LT  LLA
Sbjct: 131  AYNLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTALLA 190

Query: 1011 KDKRNAFALMEP--FQENGERNVRAYGIYPNASFFNHDCLPNAARFDYVDTSVVAGRNTD 838
            KDK NAF LMEP      GER+VRAYGIYP ASFFNHDCLPNA RFDYVDT+  +  NTD
Sbjct: 191  KDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTA--SHHNTD 248

Query: 837  IIVRAIHDFPQGREICLSYFPVNWSYADRQKRLMEDYGFVCDCDRCKVEVQWXXXXXXXX 658
            I +R IHD P+G EICLSYFPVN +YADRQKRL+EDYGF C CDRC+VE  W        
Sbjct: 249  ITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEE 308

Query: 657  XXXXXEQMV------------XXXXXXXXXXXXDFPHAYFFLRYVCNRDDCGGTLAPLPP 514
                  Q++                        DFPHAYFFLRY+C R++C GTLAPLPP
Sbjct: 309  EQDDEGQVMDEDQDEQMIGSENEIEIGDGGGXNDFPHAYFFLRYMCTRENCWGTLAPLPP 368

Query: 513  --SDGAPSDVMECNVCGQLRTGEE 448
              SD +PS++MECNVCG  +  +E
Sbjct: 369  SDSDASPSNLMECNVCGNSKKXDE 392


>ref|XP_002299366.1| SET domain protein [Populus trichocarpa] gi|222846624|gb|EEE84171.1|
            SET domain protein [Populus trichocarpa]
          Length = 391

 Score =  341 bits (875), Expect = 3e-91
 Identities = 184/388 (47%), Positives = 230/388 (59%), Gaps = 21/388 (5%)
 Frame = -2

Query: 1551 VAIKEIEGRGRGVIATRSVKAGEILVTDSPILLYSANPLKID------FCSNCFRKLDXX 1390
            V ++EI+GRGRG+++T+ ++ G+I++ DSPILLYSA PL         +C  CF+ +   
Sbjct: 11   VRVEEIQGRGRGLVSTQPLRGGQIVLIDSPILLYSALPLTKQQHSTFLYCDKCFKTIQSA 70

Query: 1389 XXXXXXXXXXXSYHALFCSPNCRSTALASSHSPWVCQALNXXXXXXXXXXXXSLALDDVS 1210
                         H  FCSP C S ALASSH+PWVCQ+L+              +++   
Sbjct: 71   SVSCPTCS-----HQRFCSPTCLSAALASSHTPWVCQSLSRLRDCQDFLQHH--SVERQI 123

Query: 1209 DAHFLVSAYNLAVISPSHFKLLLSLEGSTPTPTDPRXXXXXXXXXXXSPPQNFAGIS--L 1036
             A FLV+AYNLA +SPS F++LLSL+G                     PP    G S  L
Sbjct: 124  QAQFLVAAYNLAFVSPSDFQILLSLQGRAEDEDPAIVQSLHSVISSLCPPPPIEGFSFSL 183

Query: 1035 DLTAVLLAKDKRNAFALMEPFQEN----GERNVRAYGIYPNASFFNHDCLPNAARFDYVD 868
            +L A L+AKD+ NAF LMEP   N    G+R+VRAYGIYP AS FNHDCLPNA RFDYVD
Sbjct: 184  ELIAALVAKDRFNAFGLMEPLNLNEENGGQRSVRAYGIYPKASLFNHDCLPNACRFDYVD 243

Query: 867  TSVVAGRNTDIIVRAIHDFPQGREICLSYFPVNWSYADRQKRLMEDYGFVCDCDRCKVEV 688
            T+     NTDI+VR IHD PQGREICLSYFPVN +Y+ R+KRL+EDYGF CDCDRCKVE 
Sbjct: 244  TN--NSGNTDIVVRMIHDVPQGREICLSYFPVNSNYSTRRKRLLEDYGFTCDCDRCKVEA 301

Query: 687  QW---------XXXXXXXXXXXXXEQMVXXXXXXXXXXXXDFPHAYFFLRYVCNRDDCGG 535
             W                       +              DFPHAYFFLRY+CNR++C G
Sbjct: 302  TWSDDEGDGDDNDNEVMEEDVDEPMEAESDGEEIGNDNSTDFPHAYFFLRYMCNRNNCWG 361

Query: 534  TLAPLPPSDGAPSDVMECNVCGQLRTGE 451
            TLAP PPSD  PS+++ECN CG ++  E
Sbjct: 362  TLAPFPPSDAKPSNLLECNACGDIKNDE 389


>ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
            max]
          Length = 419

 Score =  331 bits (848), Expect = 4e-88
 Identities = 189/403 (46%), Positives = 234/403 (58%), Gaps = 33/403 (8%)
 Frame = -2

Query: 1545 IKEIEGRGRGVIATRSVKAGEILVTDSPILLYSANPLKID-------------FCSNCFR 1405
            ++EI+GRGRG++A++ +KAG+I++ DSPILLYSA PL                FC +CFR
Sbjct: 12   VEEIQGRGRGMVASQPLKAGQIVLRDSPILLYSALPLVRQSLSSSSSSASTSCFCDHCFR 71

Query: 1404 KLDXXXXXXXXXXXXXS---YHALFCSPNCRSTALASSHSPWVCQALNXXXXXXXXXXXX 1234
             L                   H  FC+ NC S AL SSHS WVCQAL+            
Sbjct: 72   ILSPSLQGDSSSSTVLCPNCRHHCFCNSNCLSNALNSSHSSWVCQALSHLRANSLLLEQP 131

Query: 1233 SLALDDVSDAHFLVSAYNLAVISPSHFKLLLSLEGSTPTPTDPRXXXXXXXXXXXSP--- 1063
               L+     +FLV+AYNLA ISPS F+++LSL+GS    T                   
Sbjct: 132  ---LEHQVQVNFLVAAYNLANISPSDFQIMLSLQGSPDDSTIAAAQFLHPLISSLCSLAL 188

Query: 1062 --PQNFAGISLDLTAVLLAKDKRNAFALMEPFQENGE-RNVRAYGIYPNASFFNHDCLPN 892
              PQN  G SL+LT+ +LAKDK NAF +M+PF E+ + R+VRAYGIYP ASFFNHDCLPN
Sbjct: 189  IGPQN--GFSLELTSAILAKDKLNAFGIMQPFSEHDDQRSVRAYGIYPYASFFNHDCLPN 246

Query: 891  AARFDYVDTSVVA-GRNTDIIVRAIHDFPQGREICLSYFPVNWSYADRQKRLMEDYGFVC 715
            A RFDYVD +      NTD I+R IHD PQGREICLSYFPVN  Y+ RQKRL+EDYGF C
Sbjct: 247  ACRFDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNEKYSSRQKRLIEDYGFTC 306

Query: 714  DCDRCKVEVQWXXXXXXXXXXXXXEQMV----------XXXXXXXXXXXXDFPHAYFFLR 565
            +CDRC VE  W             E+++                      DFPHAYFFL+
Sbjct: 307  NCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDDHPHEDNNDFPHAYFFLK 366

Query: 564  YVCNRDDCGGTLAPLPPSDGAPSDVMECNVCGQLRTGEEVDGD 436
            Y+C+R +C GTLAPLPP    P +VMECNVCG+L++    D D
Sbjct: 367  YMCDRTNCWGTLAPLPPQGDTPCNVMECNVCGKLKSDNTFDID 409


>ref|XP_002512709.1| protein with unknown function [Ricinus communis]
            gi|223548670|gb|EEF50161.1| protein with unknown function
            [Ricinus communis]
          Length = 379

 Score =  313 bits (802), Expect = 8e-83
 Identities = 176/380 (46%), Positives = 221/380 (58%), Gaps = 22/380 (5%)
 Frame = -2

Query: 1524 GRGVIATRSVKAGEILVTDSPILLYSANPLKID-----------FCSNCFRKLDXXXXXX 1378
            GRGV++ + ++ G+++V DSP+LLYSA PL              +C NC+RK+       
Sbjct: 12   GRGVVSYQPLRGGDVVVRDSPLLLYSAFPLSRSSITSTPTSTSVYCDNCYRKIHSAVICC 71

Query: 1377 XXXXXXXSYHALFCSPNCRSTALASSHSPWVCQALNXXXXXXXXXXXXSLALDDVSDAHF 1198
                     H  FCSPNC S   ASS++PWVCQAL+             L  +    A F
Sbjct: 72   PTCS-----HHKFCSPNCVS---ASSNAPWVCQALSRLHDCSSLVVHQPL--ERQVQARF 121

Query: 1197 LVSAYNLAVISPSHFKLLLSLEGSTPTPTDPRXXXXXXXXXXXSPPQNFAGISLDLTAVL 1018
            L++AYNL ++SPS+F++LLSL+G      D              PP      SL+LT+ L
Sbjct: 122  LIAAYNLFLVSPSNFQILLSLQGQGGGGDDEDAQFLHSLISSLCPPTGGVPFSLELTSAL 181

Query: 1017 LAKDKRNAFALMEPFQEN-GERNVRAYGIYPNASFFNHDCLPNAARFDYVDTSVVAGRNT 841
            LAKDK NAF LMEPF  N G+R+VRAYGIYP A+ FNHDCLPNA RFDYVDT     ++T
Sbjct: 182  LAKDKLNAFGLMEPFDINDGKRSVRAYGIYPKAALFNHDCLPNACRFDYVDT-----QDT 236

Query: 840  DIIVRAIHDFPQGREICLSYFPVNWSYADRQKRLMEDYGFVCDCDRCKVEVQWXXXXXXX 661
            D+I+R IHD PQGREICLSYFPVN+ Y+ RQKRL EDYGF+CDCDRCKVE  W       
Sbjct: 237  DLIIRMIHDVPQGREICLSYFPVNYDYSTRQKRLREDYGFICDCDRCKVEANWSDQEHDD 296

Query: 660  XXXXXXEQ----------MVXXXXXXXXXXXXDFPHAYFFLRYVCNRDDCGGTLAPLPPS 511
                  E           M             DFPHAYFFLRY+C+  +C GTLAPL  S
Sbjct: 297  ADDDENENEAMEEDSEEAMADEKDDAPSDNDSDFPHAYFFLRYMCDGTNCWGTLAPLAQS 356

Query: 510  DGAPSDVMECNVCGQLRTGE 451
            +     ++ECNVCG+++  E
Sbjct: 357  N-VNITLLECNVCGKIKRDE 375

