BLASTX nr result

ID: Dioscorea21_contig00021707

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00021707
         (1481 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabi...   240   8e-61
ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferas...   233   1e-58
emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]   232   2e-58
ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arab...   226   9e-57
ref|XP_002299366.1| SET domain protein [Populus trichocarpa] gi|...   221   4e-55

>ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
            gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName:
            Full=Histone-lysine N-methyltransferase ASHR2; AltName:
            Full=ASH1-related protein 2; AltName: Full=Protein SET
            DOMAIN GROUP 39 gi|28393236|gb|AAO42047.1| putative
            SET-domain transcriptional regulator [Arabidopsis
            thaliana] gi|330251813|gb|AEC06907.1| histone-lysine
            N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 398

 Score =  240 bits (612), Expect = 8e-61
 Identities = 151/391 (38%), Positives = 187/391 (47%), Gaps = 43/391 (10%)
 Frame = -1

Query: 1445 PEPLVKLAEIPGRGRALVATRAIKPGEVLLSESPLLLYPA-----TVASNYCSQCYRSLG 1281
            PE L+++AEI GRGR+LVA ++++ G+V+L ESPLLLY A     +  S YC  C+R L 
Sbjct: 9    PETLLRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLA 68

Query: 1280 PGGHR--------------CLAGPPPSPWLSETIGRLRSHVGEPDFFS--------QACF 1167
               H+              C A    +PWL E++ RL  H      FS        QA F
Sbjct: 69   SSAHQKCQSCSLVSFCSPNCFASH--TPWLCESLRRL--HQSSSSAFSDQPSDRQVQARF 124

Query: 1166 LXXXXXXXXXXXXXXXXXXXLQGSHT----------DXXXXXXXXXXXXXXXXXXXPGFS 1017
            L                   LQGS +          D                      S
Sbjct: 125  LLSAYNLAAASPSDFQILLSLQGSGSSNGDPSCSAGDSAAAGFLHSLLSSVCPSLPVSIS 184

Query: 1016 PXXXXXXXXXXXXXAFGLMEPFREEDSGERRVRAYGIYPKASFFNHDCLPNACRFDYVDG 837
            P             AFGLMEP    +  +R VRAYGIYPK SFFNHDCLPNACRFDYVD 
Sbjct: 185  PDLTAALLSKDKVNAFGLMEPCSVSNE-KRSVRAYGIYPKTSFFNHDCLPNACRFDYVDS 243

Query: 836  DGENNTAIVVRAIHEIPEGREVCLSYFPVNWNYKERQARLLQDYGFKCECDRCQVEKNW- 660
              + NT I++R IH++PEGREVCLSYFPVN NY  RQ RLL+DYGFKC+CDRC+VE +W 
Sbjct: 244  ASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVEFSWS 303

Query: 659  -----KXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDFPHAYFFIRYVCDRENC 495
                 +                                    +FPHAYFF+RY+C++ENC
Sbjct: 304  EGEEDENEIMEEMEDQDEQEEMEDSVGENEEEVCGNGVDDESNFPHAYFFVRYMCEKENC 363

Query: 494  XXXXXXXXXXXXXTISSVMECNVCGRCKTDD 402
                           S V+ECNVCG  K D+
Sbjct: 364  -FGTLAPLPPKTHDASRVLECNVCGSVKEDE 393


>ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
            vinifera]
          Length = 405

 Score =  233 bits (593), Expect = 1e-58
 Identities = 153/389 (39%), Positives = 185/389 (47%), Gaps = 35/389 (8%)
 Frame = -1

Query: 1436 LVKLAEIPGRGRALVATRAIKPGEVLLSESPLLLY---PATVASN-YCSQCYRSLGP--- 1278
            LVK  EI GRGRALVA+++++ G+++L++SP+LLY   P + +SN YCS C+R L     
Sbjct: 13   LVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTCST 72

Query: 1277 ----GGHRCLAGPPP----------SPWLSETIGRLRSH----VGEPDFFSQACFLXXXX 1152
                    CL   P           SPW   T+  LR+     +   +   QA FL    
Sbjct: 73   LVSCSSCPCLFCSPDCLTHALSSSHSPWACLTLSLLRASPSLSLSHSERQVQARFLVAAY 132

Query: 1151 XXXXXXXXXXXXXXXLQG----SHTDXXXXXXXXXXXXXXXXXXXPGFSPXXXXXXXXXX 984
                           LQG    S                       GFS           
Sbjct: 133  NLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTALLAKD 192

Query: 983  XXXAFGLMEPFREEDSGERRVRAYGIYPKASFFNHDCLPNACRFDYVDGDGENNTAIVVR 804
               AFGLMEP      GER VRAYGIYPKASFFNHDCLPNACRFDYVD    +NT I +R
Sbjct: 193  KLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIR 252

Query: 803  AIHEIPEGREVCLSYFPVNWNYKERQARLLQDYGFKCECDRCQVEKNWK----XXXXXXX 636
             IH++PEG E+CLSYFPVN  Y +RQ RLL+DYGF C CDRC+VE NWK           
Sbjct: 253  LIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEEEQDD 312

Query: 635  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXDFPHAYFFIRYVCDRENCXXXXXXXXXXXXX 456
                                         DFPHAYFF+RY+C RENC             
Sbjct: 313  EGQVMDEDQDEQMIGSENEIEIGDGGGENDFPHAYFFLRYMCTRENCWGTLAPLPPSDSD 372

Query: 455  TI-SSVMECNVCGRC-KTDDGFGGDASML 375
               S++MECNVCG   K+D+ F  D   L
Sbjct: 373  ASPSNLMECNVCGNSKKSDEDFNSDEDRL 401


>emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
          Length = 405

 Score =  232 bits (591), Expect = 2e-58
 Identities = 154/389 (39%), Positives = 185/389 (47%), Gaps = 35/389 (8%)
 Frame = -1

Query: 1436 LVKLAEIPGRGRALVATRAIKPGEVLLSESPLLLY---PATVASN-YCSQCYRSLGP--- 1278
            LVK  EI GRGRALVA+++++ G+++L++SP+LLY   P + +SN YCS C+R L     
Sbjct: 13   LVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTCST 72

Query: 1277 ----GGHRCLAGPPP----------SPWLSETIGRLRSH----VGEPDFFSQACFLXXXX 1152
                    CL   P           SPW   T+  LR+     +   +   QA FL    
Sbjct: 73   LVSCSSCPCLFCSPDCLTXALSSSHSPWACLTLSLLRASPSLSLSHSERQVQARFLVAAY 132

Query: 1151 XXXXXXXXXXXXXXXLQG----SHTDXXXXXXXXXXXXXXXXXXXPGFSPXXXXXXXXXX 984
                           LQG    S                       GFS           
Sbjct: 133  NLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTALLAKD 192

Query: 983  XXXAFGLMEPFREEDSGERRVRAYGIYPKASFFNHDCLPNACRFDYVDGDGENNTAIVVR 804
               AFGLMEP      GER VRAYGIYPKASFFNHDCLPNACRFDYVD    +NT I +R
Sbjct: 193  KLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIR 252

Query: 803  AIHEIPEGREVCLSYFPVNWNYKERQARLLQDYGFKCECDRCQVEKNWK----XXXXXXX 636
             IH++PEG E+CLSYFPVN  Y +RQ RLL+DYGF C CDRC+VE NWK           
Sbjct: 253  LIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEEEQDD 312

Query: 635  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXDFPHAYFFIRYVCDRENCXXXXXXXXXXXXX 456
                                         DFPHAYFF+RY+C RENC             
Sbjct: 313  EGQVMDEDQDEQMIGSENEIEIGDGGGXNDFPHAYFFLRYMCTRENCWGTLAPLPPSDSD 372

Query: 455  TI-SSVMECNVCGRC-KTDDGFGGDASML 375
               S++MECNVCG   K D+ F  D   L
Sbjct: 373  ASPSNLMECNVCGNSKKXDEDFNSDEDRL 401


>ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
            lyrata] gi|297331869|gb|EFH62288.1| hypothetical protein
            ARALYDRAFT_343234 [Arabidopsis lyrata subsp. lyrata]
          Length = 424

 Score =  226 bits (577), Expect = 9e-57
 Identities = 139/356 (39%), Positives = 174/356 (48%), Gaps = 39/356 (10%)
 Frame = -1

Query: 1445 PEPLVKLAEIPGRGRALVATRAIKPGEVLLSESPLLLYPA-----TVASNYCSQCYRSLG 1281
            PE L+++AEI GRGR+LVA ++++ G+V+L ESPLLLY A     +  S YC  C+R L 
Sbjct: 8    PETLLRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLA 67

Query: 1280 PGGHR--------------CLAGPPPSPWLSETIGRLRSHVGEPDFFSQ-------ACFL 1164
               H+              C A    +PWL E++  LR H     F  Q       A FL
Sbjct: 68   SSAHQKCQSCSLVSFCSPNCFASH--TPWLCESL--LRLHQSSSAFSDQPSDRQVQARFL 123

Query: 1163 XXXXXXXXXXXXXXXXXXXLQGS-------HTDXXXXXXXXXXXXXXXXXXXPGFSPXXX 1005
                               LQGS        +                       SP   
Sbjct: 124  LSAYNLAAASPSDFQILLSLQGSGCSNGDPSSSATDSGFLHSLLSAVCPPLPVCISPELT 183

Query: 1004 XXXXXXXXXXAFGLMEPFREEDSGERRVRAYGIYPKASFFNHDCLPNACRFDYVDGDGEN 825
                      AFGLMEPF   +  +R VRAYGIYPK SFFNHDCLPNACRFDYVD   + 
Sbjct: 184  AALLAKDKVNAFGLMEPFSVSND-KRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASDG 242

Query: 824  NTAIVVRAIHEIPEGREVCLSYFPVNWNYKERQARLLQDYGFKCECDRCQVEKNW----- 660
            NT I++R IH++PEGREVCLSYFPVN NY  RQ RLL+DYGFKC+CDRC+VE +W     
Sbjct: 243  NTDIIIRTIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVESSWSEGEE 302

Query: 659  -KXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDFPHAYFFIRYVCDRENC 495
             +                                    +FPHAYFF+RY+C++E+C
Sbjct: 303  DENEVMEEMGNEDDEEEMEDSEGENVEEVCGNGVDDESNFPHAYFFVRYMCEKESC 358


>ref|XP_002299366.1| SET domain protein [Populus trichocarpa] gi|222846624|gb|EEE84171.1|
            SET domain protein [Populus trichocarpa]
          Length = 391

 Score =  221 bits (563), Expect = 4e-55
 Identities = 140/386 (36%), Positives = 180/386 (46%), Gaps = 39/386 (10%)
 Frame = -1

Query: 1442 EPLVKLAEIPGRGRALVATRAIKPGEVLLSESPLLLYPATVASN-------YCSQCYRSL 1284
            E LV++ EI GRGR LV+T+ ++ G+++L +SP+LLY A   +        YC +C++++
Sbjct: 8    ETLVRVEEIQGRGRGLVSTQPLRGGQIVLIDSPILLYSALPLTKQQHSTFLYCDKCFKTI 67

Query: 1283 GPGGHRC----------------LAGPPPSPWLSETIGRLRSHVGEPDFFS--------Q 1176
                  C                      +PW+ +++ RLR      DF          Q
Sbjct: 68   QSASVSCPTCSHQRFCSPTCLSAALASSHTPWVCQSLSRLRDC---QDFLQHHSVERQIQ 124

Query: 1175 ACFLXXXXXXXXXXXXXXXXXXXLQGSHTDXXXXXXXXXXXXXXXXXXXP-----GFSPX 1011
            A FL                   LQG   D                   P      FS  
Sbjct: 125  AQFLVAAYNLAFVSPSDFQILLSLQGRAEDEDPAIVQSLHSVISSLCPPPPIEGFSFSLE 184

Query: 1010 XXXXXXXXXXXXAFGLMEPFR--EEDSGERRVRAYGIYPKASFFNHDCLPNACRFDYVDG 837
                        AFGLMEP    EE+ G+R VRAYGIYPKAS FNHDCLPNACRFDYVD 
Sbjct: 185  LIAALVAKDRFNAFGLMEPLNLNEENGGQRSVRAYGIYPKASLFNHDCLPNACRFDYVDT 244

Query: 836  DGENNTAIVVRAIHEIPEGREVCLSYFPVNWNYKERQARLLQDYGFKCECDRCQVEKNWK 657
            +   NT IVVR IH++P+GRE+CLSYFPVN NY  R+ RLL+DYGF C+CDRC+VE  W 
Sbjct: 245  NNSGNTDIVVRMIHDVPQGREICLSYFPVNSNYSTRRKRLLEDYGFTCDCDRCKVEATWS 304

Query: 656  -XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDFPHAYFFIRYVCDRENCXXXXX 480
                                                 DFPHAYFF+RY+C+R NC     
Sbjct: 305  DDEGDGDDNDNEVMEEDVDEPMEAESDGEEIGNDNSTDFPHAYFFLRYMCNRNNCWGTLA 364

Query: 479  XXXXXXXXTISSVMECNVCGRCKTDD 402
                      S+++ECN CG  K D+
Sbjct: 365  PFPPSDAKP-SNLLECNACGDIKNDE 389

