BLASTX nr result

ID: Dioscorea21_contig00019177 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00019177
         (1466 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002306713.1| SET domain protein [Populus trichocarpa] gi|...   602   e-169
ref|XP_002528669.1| set domain protein, putative [Ricinus commun...   595   e-167
gb|AEL16989.1| ASH1-like protein [Phaseolus vulgaris]                 594   e-167
ref|XP_003536414.1| PREDICTED: histone-lysine N-methyltransferas...   585   e-165
ref|XP_003634540.1| PREDICTED: histone-lysine N-methyltransferas...   585   e-164

>ref|XP_002306713.1| SET domain protein [Populus trichocarpa] gi|222856162|gb|EEE93709.1|
            SET domain protein [Populus trichocarpa]
          Length = 495

 Score =  602 bits (1551), Expect = e-169
 Identities = 307/485 (63%), Positives = 365/485 (75%), Gaps = 18/485 (3%)
 Frame = +3

Query: 3    HRKHKKQKEEDIAVCVCRYDANDPESACGGSCLNVLTSTECTPGYCPCGNFCKNQKFQKC 182
            +RKHKKQKEEDIA+C C+++ +DP+SACG  CLN+LTSTECTPGYCPCG +CKNQ+FQK 
Sbjct: 21   YRKHKKQKEEDIAICECKFNGDDPDSACGERCLNLLTSTECTPGYCPCGVYCKNQRFQKF 80

Query: 183  EYAKLQLFKTEGRGWGLLAAEDIKAGQFVIEYCGEVISCQEAKWRAQRYEVEALKDAFII 362
            EYAK QLFKTEGRGWGLLA E+IKAGQF+IEYCGEVIS +EAK R+Q YE + LKDAFII
Sbjct: 81   EYAKTQLFKTEGRGWGLLADEEIKAGQFIIEYCGEVISWKEAKKRSQVYENQGLKDAFII 140

Query: 363  SLDAYESIDATSKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDINAGTELAYDYN 542
            SL++ ESIDAT KGSLARFINHSCQPNCETRKW VLGE+RVGIFAKQ+I+ GTELAYDYN
Sbjct: 141  SLNSTESIDATKKGSLARFINHSCQPNCETRKWTVLGEIRVGIFAKQNISIGTELAYDYN 200

Query: 543  FEWYGGAKVRCLCGAPSCSGFLGAKSRGFQEATYLWEDGDTRYSIENVPVYDSEEDEPAT 722
            FEWYGGAKVRCLCGA +CSGFLGAKSRGFQE TYLWED D RYSIE +P+YDS EDEP++
Sbjct: 201  FEWYGGAKVRCLCGAVNCSGFLGAKSRGFQEDTYLWEDDDDRYSIEKIPLYDSAEDEPSS 260

Query: 723  KILKAIVPVKDEMLADDGSSYLLG------------VDGMEPILFPHSQPISVEPLNSLP 866
            K LK         +A+  S Y +G            V+  +P+    S  +SV+PL+S P
Sbjct: 261  KFLK---------IANSDSEYDIGGKIEYSTVMNFDVESDKPL---ESTVLSVQPLDSFP 308

Query: 867  ME---MDGLKNEMPGEDNNYSEDAHQRLTQKSTMISRIRSNSACRNYHIDSSPT--KNSA 1031
            ME   M+ +K E   E   YS+   Q    K+ MISRIRSNSACRNYHI S P   K S 
Sbjct: 309  MEGVVMNAVKAEANEEMALYSQGTPQSFAPKNAMISRIRSNSACRNYHIGSGPVPKKRSK 368

Query: 1032 FYPNAKSKTNVRRQVNIKSVTDRLASEDARQEIFSCEESRNLAASQLDALYDEIRPAIEE 1211
             Y   K K  +++QV+ K VT  LA ++A++E+ + EE +N AAS+L  LY+EIRP IEE
Sbjct: 369  QYSTGKLKHLMQKQVDAKRVTKLLAVKEAQEEVLTYEEMKNDAASELSLLYNEIRPVIEE 428

Query: 1212 HERDTQDSVPTSVAEKWIEASCNKLKAEFDLYSSIIKNITCTPRRAENNMGP-QGVATEN 1388
            HERD+QDSVPT+VAEKWI+  C KLKAEFDLYSSIIKNI CTP+R      P +    +N
Sbjct: 429  HERDSQDSVPTTVAEKWIQVCCTKLKAEFDLYSSIIKNIACTPQRTLEQARPSEEPGNDN 488

Query: 1389 GIKLL 1403
             +K L
Sbjct: 489  EVKFL 493


>ref|XP_002528669.1| set domain protein, putative [Ricinus communis]
            gi|223531892|gb|EEF33708.1| set domain protein, putative
            [Ricinus communis]
          Length = 495

 Score =  595 bits (1533), Expect = e-167
 Identities = 298/473 (63%), Positives = 359/473 (75%), Gaps = 6/473 (1%)
 Frame = +3

Query: 3    HRKHKKQKEEDIAVCVCRYDANDPESACGGSCLNVLTSTECTPGYCPCGNFCKNQKFQKC 182
            +RKH+KQKEEDIA+C CR+DA+DPESACG  CLNVLTSTECTPGYC CG FCKNQ+FQKC
Sbjct: 21   YRKHRKQKEEDIAICECRFDASDPESACGERCLNVLTSTECTPGYCRCGIFCKNQRFQKC 80

Query: 183  EYAKLQLFKTEGRGWGLLAAEDIKAGQFVIEYCGEVISCQEAKWRAQRYEVEALKDAFII 362
            EY K +LFKTEGRGWGLLA EDIKAGQF+IEYCGEVIS +EAK R+Q YE + LKDAFII
Sbjct: 81   EYFKTRLFKTEGRGWGLLADEDIKAGQFIIEYCGEVISWKEAKRRSQAYERQGLKDAFII 140

Query: 363  SLDAYESIDATSKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDINAGTELAYDYN 542
            SL++ ESIDAT KGSLARFINHSCQPNCETRKWNVLGE+RVGIFAKQDI+ GTELAYDYN
Sbjct: 141  SLNSSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKQDISIGTELAYDYN 200

Query: 543  FEWYGGAKVRCLCGAPSCSGFLGAKSRGFQEATYLWEDGDTRYSIENVPVYDSEEDEPAT 722
            FEWYGGAKVRCLCG+ SCSGFLGAKSRGFQE TYLWED D RYS+E +P+YDS EDEP++
Sbjct: 201  FEWYGGAKVRCLCGSASCSGFLGAKSRGFQEDTYLWEDDDDRYSVEKIPLYDSAEDEPSS 260

Query: 723  KILKAIVPVKDEMLADDGSSYLLGVDGMEPILFPHSQPISVEPLNSLPME---MDGLKNE 893
            K+LK +    ++ +        +    + P        ++++P++S+P+E   M+ +K E
Sbjct: 261  KLLKTMNSNFEDEIGRSAEYPTMMNFSVGPEHHVECAALTIKPVDSIPIEGAAMNPVKTE 320

Query: 894  MPGEDNNYSEDAHQRLTQKSTMISRIRSNSACRNYHIDSSPT--KNSAFYPNAKSKTNVR 1067
               E + YS+D+ Q   QKST+IS I  +S C N H    P   K S    N K K   +
Sbjct: 321  ASEEISLYSQDSEQNFVQKSTVISLIEGSSGCGNCHTGRGPVSKKLSKHSSNGKLKHLPQ 380

Query: 1068 RQVNIKSVTDRLASEDARQEIFSCEESRNLAASQLDALYDEIRPAIEEHERDTQDSVPTS 1247
            +QV++K   + LA ++A+ E+ + EE +N AASQL +LY++IRPAIEEHERD QDSV TS
Sbjct: 381  KQVDVKHFANLLAVKEAQDEVLTYEERKNEAASQLSSLYNQIRPAIEEHERDNQDSVATS 440

Query: 1248 VAEKWIEASCNKLKAEFDLYSSIIKNITCTPRRA-ENNMGPQGVATENGIKLL 1403
            VAEKWIE  C KLKAEFDLYSSIIKN+ CTPRRA E    P+    +N +K L
Sbjct: 441  VAEKWIEVCCLKLKAEFDLYSSIIKNVACTPRRAPELPQPPEIGENDNEVKYL 493


>gb|AEL16989.1| ASH1-like protein [Phaseolus vulgaris]
          Length = 481

 Score =  594 bits (1531), Expect = e-167
 Identities = 301/472 (63%), Positives = 353/472 (74%), Gaps = 6/472 (1%)
 Frame = +3

Query: 6    RKHKKQKEEDIAVCVCRYDANDPESACGGSCLNVLTSTECTPGYCPCGNFCKNQKFQKCE 185
            R+ KKQKEEDIA+C C+YDAND +SACG SCLNVLTSTECTPGYCPC   CKNQKFQKCE
Sbjct: 22   RRQKKQKEEDIAICECKYDANDTDSACGDSCLNVLTSTECTPGYCPCDILCKNQKFQKCE 81

Query: 186  YAKLQLFKTEGRGWGLLAAEDIKAGQFVIEYCGEVISCQEAKWRAQRYEVEALKDAFIIS 365
            YAK +LFKTEGRGWGLLA ED+KAGQFVIEYCGEVIS +EAK R+Q YE + LKDAFII 
Sbjct: 82   YAKTKLFKTEGRGWGLLAGEDLKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIIC 141

Query: 366  LDAYESIDATSKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDINAGTELAYDYNF 545
            L+A ESIDAT KGSLARFINHSC+PNCETRKWNVLGE+RVGIFAK D+  GTELAYDYNF
Sbjct: 142  LNASESIDATRKGSLARFINHSCRPNCETRKWNVLGEIRVGIFAKHDVPIGTELAYDYNF 201

Query: 546  EWYGGAKVRCLCGAPSCSGFLGAKSRGFQEATYLWEDGDTRYSIENVPVYDSEEDEPATK 725
            EW+GGAKVRCLCGA  CSGFLGAKSRGFQE TYLWED D RYS+E +PVYDS EDEP + 
Sbjct: 202  EWFGGAKVRCLCGALKCSGFLGAKSRGFQEDTYLWEDDDDRYSVEKIPVYDSAEDEPVSN 261

Query: 726  ILKAIVPVKDEMLADDGSSYLLGVDGMEPILFPHSQPISVEPLNSLPM---EMDGLKNEM 896
            +        D ML D+  S               S   +V+ L+S+ M   ++  +K E+
Sbjct: 262  VNGRTESPLDVMLKDEQLS--------------ESTGFNVQSLDSVQMKGLDVKKIKTEV 307

Query: 897  PGEDNN-YSEDAHQRLTQKSTMISRIRSNSACRNYHID--SSPTKNSAFYPNAKSKTNVR 1067
              ED + Y+ D  Q L+QK+ MISRIRSN+A RNYHI   S  TK S  Y   + K  V 
Sbjct: 308  TDEDMHLYNHDTEQTLSQKNAMISRIRSNAAGRNYHIGPRSMSTKRSRAYNGGRFKNLVE 367

Query: 1068 RQVNIKSVTDRLASEDARQEIFSCEESRNLAASQLDALYDEIRPAIEEHERDTQDSVPTS 1247
            ++++ K     LAS++A++EI +CE+ ++ A S LD+LYDEIRPAIEEHERD+QDSV T+
Sbjct: 368  KKIDAKFAAGLLASKEAQEEILNCEKRKDDATSTLDSLYDEIRPAIEEHERDSQDSVSTT 427

Query: 1248 VAEKWIEASCNKLKAEFDLYSSIIKNITCTPRRAENNMGPQGVATENGIKLL 1403
            VAEKWI+  C KLKAEFDLYSSI+KN+ CT +RA     P  V  EN IKLL
Sbjct: 428  VAEKWIQVCCLKLKAEFDLYSSIVKNVACTAQRAPGQAKPTEVDNENEIKLL 479


>ref|XP_003536414.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
            max] gi|34529091|dbj|BAC85636.1| unnamed protein product
            [Homo sapiens]
          Length = 480

 Score =  585 bits (1509), Expect = e-165
 Identities = 298/472 (63%), Positives = 354/472 (75%), Gaps = 6/472 (1%)
 Frame = +3

Query: 6    RKHKKQKEEDIAVCVCRYDANDPESACGGSCLNVLTSTECTPGYCPCGNFCKNQKFQKCE 185
            R+HKKQKEEDIA+C C+YDA+DP++ACG SCLNVLTSTECTPGYC C   CKNQKFQKCE
Sbjct: 22   RRHKKQKEEDIAICECKYDADDPDNACGDSCLNVLTSTECTPGYCHCDILCKNQKFQKCE 81

Query: 186  YAKLQLFKTEGRGWGLLAAEDIKAGQFVIEYCGEVISCQEAKWRAQRYEVEALKDAFIIS 365
            YAK +LFKTEGRGWGLLA EDIKAGQFVIEYCGEVIS +EAK R+Q YE + LKDAFII 
Sbjct: 82   YAKTKLFKTEGRGWGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIIF 141

Query: 366  LDAYESIDATSKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDINAGTELAYDYNF 545
            L+  ESIDAT KGSLARFINHSCQPNCETRKWNVLGE+RVGIFAK DI  GTELAYDYNF
Sbjct: 142  LNVSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIPIGTELAYDYNF 201

Query: 546  EWYGGAKVRCLCGAPSCSGFLGAKSRGFQEATYLWEDGDTRYSIENVPVYDSEEDEPATK 725
            EW+GGAKVRCLCGA  CSGFLGAKSRGFQE TYLWED D RYS+E +PVYDS EDEP + 
Sbjct: 202  EWFGGAKVRCLCGALKCSGFLGAKSRGFQEDTYLWEDDDGRYSVEKIPVYDSAEDEPVSN 261

Query: 726  ILKAIVPVKDEMLADDGSSYLLGVDGMEPILFPHSQPISVEPLNSLPM---EMDGLKNEM 896
                  P  D ++  +  S               S    V+PL+S+ M   ++  +K ++
Sbjct: 262  FNGRTEPSLDVIVKAEQLS--------------ESTAFHVQPLDSVQMKDLDVKKIKTDV 307

Query: 897  PGEDNN-YSEDAHQRLTQKSTMISRIRSNSACRNYHID--SSPTKNSAFYPNAKSKTNVR 1067
              ED N YS+D+   L+QK+  IS IRSN+A RNY +   S  TK S  Y   + K  + 
Sbjct: 308  ADEDMNFYSQDSEHTLSQKNA-ISHIRSNTAGRNYCLGPRSMSTKRSRAYNGGRFKNLIE 366

Query: 1068 RQVNIKSVTDRLASEDARQEIFSCEESRNLAASQLDALYDEIRPAIEEHERDTQDSVPTS 1247
            +++++K     LAS++A++EIF+CE+ ++ A S LD+LYDEIRPAIEEHERD+QDSV T+
Sbjct: 367  KKIDVKFAAALLASKEAQEEIFNCEKMKDDATSALDSLYDEIRPAIEEHERDSQDSVSTT 426

Query: 1248 VAEKWIEASCNKLKAEFDLYSSIIKNITCTPRRAENNMGPQGVATENGIKLL 1403
            VAEKWI+A C KLKAEFDLYSSI+KN+ CT +RA   + P  V  EN IKLL
Sbjct: 427  VAEKWIQACCLKLKAEFDLYSSIVKNVACTAQRASGQVKPTEVDNENEIKLL 478


>ref|XP_003634540.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
            vinifera]
          Length = 515

 Score =  585 bits (1507), Expect = e-164
 Identities = 294/492 (59%), Positives = 366/492 (74%), Gaps = 11/492 (2%)
 Frame = +3

Query: 3    HRKHKKQKEEDIAVCVCRYDANDPESACGGSCLNVLTSTECTPGYCPCGNFCKNQKFQKC 182
            +RKH KQ+EEDIA+C C+YDANDP+SACG +CLNVLTSTECTPGYC CG FCKNQ+FQKC
Sbjct: 35   YRKHIKQQEEDIAICECKYDANDPDSACGEACLNVLTSTECTPGYCRCGLFCKNQRFQKC 94

Query: 183  EYAKLQLFKTEGRGWGLLAAEDIKAGQFVIEYCGEVISCQEAKWRAQRYEVEALKDAFII 362
            EYAK +LF+TEGRGWGLLA E+IKAG+FVIEYCGEVIS +EA+ R+Q Y    LKDAFII
Sbjct: 95   EYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYASLGLKDAFII 154

Query: 363  SLDAYESIDATSKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDINAGTELAYDYN 542
            SL+  E IDAT KGSL RFINHSCQPNCETRKW VLGEVRVGIFAKQDI+ GTELAY+YN
Sbjct: 155  SLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDISIGTELAYNYN 214

Query: 543  FEWYGGAKVRCLCGAPSCSGFLGAKSRGFQEATYLWEDGDTRYSIENVPVYDSEEDEPAT 722
            FEWYGGAKVRCLCGA SCSGFLGAKSRGFQE TYLWEDGD RYS+E +P+YDS EDEP++
Sbjct: 215  FEWYGGAKVRCLCGAISCSGFLGAKSRGFQEDTYLWEDGDDRYSVEKIPLYDSAEDEPSS 274

Query: 723  KILKAIVPVKDEMLADDGSSYLLGVDGM----EPILFPH---SQPISVEPLNSLPMEM-- 875
            K+ + +   K E ++     Y   VD        + + H   S  + VE ++S+P+++  
Sbjct: 275  KLPRVMDYSKPEFISHGKVEYTTAVDASVEYDTSVRYEHQLESTELVVEAVDSVPVDLVI 334

Query: 876  DGLKNEMPGEDNNYSEDAHQRLTQKSTMISRIRSNSACRNYHIDSS--PTKNSAFYPNAK 1049
            + +K E+  E   +++   Q   QK+ MI  I+SNSA +N HI       K S  +PN +
Sbjct: 335  NEIKTEVSEETKLFTDGTQQAFPQKNAMIPHIQSNSASQNNHIGPGHVAKKRSKHFPNGR 394

Query: 1050 SKTNVRRQVNIKSVTDRLASEDARQEIFSCEESRNLAASQLDALYDEIRPAIEEHERDTQ 1229
            SK   ++QV+ K V   L SE+AR+E+F  EE +N A+S+LD++YDEIRPAIEEHERD+Q
Sbjct: 395  SKPVAQKQVDAKFVAQFLGSEEAREEVFKYEEEKNQASSRLDSIYDEIRPAIEEHERDSQ 454

Query: 1230 DSVPTSVAEKWIEASCNKLKAEFDLYSSIIKNITCTPRRAENNMGPQGVATENGIKLLEN 1409
            DSVPT VA KWI A+C+K+KA+F+LYSSII+NI C PR+      PQG A     K  E 
Sbjct: 455  DSVPTEVARKWIGANCSKMKADFNLYSSIIRNIVCNPRK------PQGEA-----KASEG 503

Query: 1410 GTVKSEMQGVVS 1445
            G  ++E + +++
Sbjct: 504  GDNENETKDLIT 515

