BLASTX nr result

ID: Dioscorea21_contig00007275 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00007275
         (1573 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_001141813.1| uncharacterized protein LOC100273952 precurs...   565   e-158
ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   563   e-158
ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] g...   557   e-156
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          556   e-156
gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi...   556   e-156

>ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
            gi|194706024|gb|ACF87096.1| unknown [Zea mays]
            gi|413945958|gb|AFW78607.1| hypothetical protein
            ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  565 bits (1456), Expect = e-158
 Identities = 272/423 (64%), Positives = 318/423 (75%), Gaps = 16/423 (3%)
 Frame = -3

Query: 1454 FESWCREFNKSYASEEEKLARFKVFEDNLAFVNRHNS-------------AGNSTYELGL 1314
            F++WC E  K+YA+ EE+ AR  VF DN AFV  HN+             A   +Y L L
Sbjct: 36   FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLAL 95

Query: 1313 NAFSDLAHHEFRAARFG-LSFGLLQPSGDRIVFRGSAGG--VPDSVDWRKSGAVTAVKDQ 1143
            NAF+DL H EFRAAR G ++ G    S    V+ G  GG  VPD++DWRKSGAVT VKDQ
Sbjct: 96   NAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVKDQ 155

Query: 1142 GSCGACWAFSATGAIEGINKIVTGSLVSLSEQELCDCDRTYNSGCGGGLMDYAFKWVIQN 963
            GSCGACW+FSATGA+EGINKI TGSLVSLSEQEL DCDR+YNSGCGGGLMDYA+K+VI+N
Sbjct: 156  GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215

Query: 962  HGIDSEDDYPFKGAERTCLKNKLNRRVVSIDGYTDVPANNEDLLLQAVAKQPVSVGICGS 783
             GID+E+DYP++ A+ TC KNKL +RVV+IDGYTDVP+N EDLLLQAVA+QPVSVGICGS
Sbjct: 216  GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275

Query: 782  ERAFQSYAKGIFNGPCSTNLDHAVLIVGYGSQNGEDYWIVKNSWGTSWGMDGYMHMQRNS 603
             RAFQ Y +GIF+GPC T+LDHAVLIVGYGS+ G+DYWIVKNSWG SWGM GYMHM RN+
Sbjct: 276  ARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNT 335

Query: 602  GSSQGVCGINMLASFXXXXXXXXXXXXXXXXTKCSLLTYCPAGNTCCCTWRILGLCLSWS 423
            G S+GVCGINM+ASF                TKCSLLTYCP G+TCCC+WR+LG CLSWS
Sbjct: 336  GDSKGVCGINMMASFPTKTSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGFCLSWS 395

Query: 422  CCELDSAVCCKDHRYCCPSDYPVCDNKSKQCFKGSRNSTGVNGFKRKTSFMNFKGLKPFL 243
            CCELD+AVCCKD+RYCCP DYPVCD    QC K S N + + G +RK SF        +L
Sbjct: 396  CCELDNAVCCKDNRYCCPHDYPVCDTGRGQCLKASGNFSAIEGIRRKQSFSKAPSWTGWL 455

Query: 242  EAL 234
            E +
Sbjct: 456  ELM 458


>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  563 bits (1452), Expect = e-158
 Identities = 259/393 (65%), Positives = 301/393 (76%)
 Frame = -3

Query: 1454 FESWCREFNKSYASEEEKLARFKVFEDNLAFVNRHNSAGNSTYELGLNAFSDLAHHEFRA 1275
            FE+WC+E  KSY S+EE+  R KVFEDN  FV +HNS GNS+Y L LNAF+DL HHEF+ 
Sbjct: 29   FETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKT 88

Query: 1274 ARFGLSFGLLQPSGDRIVFRGSAGGVPDSVDWRKSGAVTAVKDQGSCGACWAFSATGAIE 1095
            +R GLS   L  +   +   G  G +P S+DWR  G VT VKDQGSCGACW+FSATGAIE
Sbjct: 89   SRLGLSAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIE 148

Query: 1094 GINKIVTGSLVSLSEQELCDCDRTYNSGCGGGLMDYAFKWVIQNHGIDSEDDYPFKGAER 915
            GINKIVTGSLVSLSEQEL +CD++YN GCGGGLMDYAF++VI NHGID+E+DYP++  + 
Sbjct: 149  GINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDG 208

Query: 914  TCLKNKLNRRVVSIDGYTDVPANNEDLLLQAVAKQPVSVGICGSERAFQSYAKGIFNGPC 735
            TC K+++ RRVV+ID Y DVP NNE  LLQAVA QPVSVGICGSERAFQ Y+KGIF GPC
Sbjct: 209  TCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPC 268

Query: 734  STNLDHAVLIVGYGSQNGEDYWIVKNSWGTSWGMDGYMHMQRNSGSSQGVCGINMLASFX 555
            ST+LDHAVLIVGYGS+NG DYWIVKNSWGT WGM GYMHMQRNSG+SQGVCGINMLAS+ 
Sbjct: 269  STSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYP 328

Query: 554  XXXXXXXXXXXXXXXTKCSLLTYCPAGNTCCCTWRILGLCLSWSCCELDSAVCCKDHRYC 375
                           TKC+LLTYC AG TCCC  +  G+C+SW CC LDSAVCCKD  +C
Sbjct: 329  VKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHC 388

Query: 374  CPSDYPVCDNKSKQCFKGSRNSTGVNGFKRKTS 276
            CP DYPVCD     CFK + N+T +   + KTS
Sbjct: 389  CPHDYPVCDTDKNMCFKRAGNATRMEAIEGKTS 421


>ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] gi|48475189|gb|AAT44258.1|
            hypothetical protein [Oryza sativa Japonica Group]
            gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa
            Japonica Group]
          Length = 450

 Score =  557 bits (1436), Expect = e-156
 Identities = 267/412 (64%), Positives = 310/412 (75%), Gaps = 5/412 (1%)
 Frame = -3

Query: 1454 FESWCREFNKSYASEEEKLARFKVFEDNLAFVNRHNSAGNSTYELGLNAFSDLAHHEFRA 1275
            FE+WC E  +SYA+  E+ AR   F DN AFV  HN A  S Y L LNAF+DL H EFRA
Sbjct: 38   FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPAS-YALALNAFADLTHDEFRA 96

Query: 1274 ARFGLSFGLLQPSGDR----IVFRGSAGGVPDSVDWRKSGAVTAVKDQGSCGACWAFSAT 1107
            AR G       P  D     +   G  G VPD+VDWR+SGAVT VKDQGSCGACW+FSAT
Sbjct: 97   ARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSAT 156

Query: 1106 GAIEGINKIVTGSLVSLSEQELCDCDRTYNSGCGGGLMDYAFKWVIQNHGIDSEDDYPFK 927
            GA+EGINKI TGSL+SLSEQEL DCDR+YNSGCGGGLMDYA+K+V++N GID+E DYP++
Sbjct: 157  GAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYR 216

Query: 926  GAERTCLKNKLNRRVVSIDGYTDVPANNEDLLLQAVAKQPVSVGICGSERAFQSYAKGIF 747
              + TC KNKL RRVV+IDGY DVPANNED+LLQAVA+QPVSVGICGS RAFQ Y+KGIF
Sbjct: 217  ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIF 276

Query: 746  NGPCSTNLDHAVLIVGYGSQNGEDYWIVKNSWGTSWGMDGYMHMQRNSGSSQGVCGINML 567
            +GPC T+LDHA+LIVGYGS+ G+DYWIVKNSWG SWGM GYM+M RN+G+S GVCGIN +
Sbjct: 277  DGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 336

Query: 566  ASFXXXXXXXXXXXXXXXXTKCSLLTYCPAGNTCCCTWRILGLCLSWSCCELDSAVCCKD 387
             SF                TKCSLLTYCP G+TCCC+WR+LGLCLSWSCCELD+AVCCKD
Sbjct: 337  PSFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKD 396

Query: 386  HRYCCPSDYPVCDNKSKQCFKGSR-NSTGVNGFKRKTSFMNFKGLKPFLEAL 234
            +RYCCP DYPVCD  S++CFK +  N + + G  RK  F     L   LE L
Sbjct: 397  NRYCCPHDYPVCDTASQRCFKANNGNFSVMEGGSRKQPFSKVPSLGGLLELL 448


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  556 bits (1433), Expect = e-156
 Identities = 262/408 (64%), Positives = 305/408 (74%), Gaps = 3/408 (0%)
 Frame = -3

Query: 1454 FESWCREFNKSYASEEEKLARFKVFEDNLAFVNRHNSAGNSTYELGLNAFSDLAHHEFRA 1275
            FE+WC++  K+YAS+EEKL R KVF+DN  FV  HNS GNS+Y L LNAF+DL HHEF+A
Sbjct: 30   FETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFKA 89

Query: 1274 ARFGLSFGL---LQPSGDRIVFRGSAGGVPDSVDWRKSGAVTAVKDQGSCGACWAFSATG 1104
            +R GLS      L               VP SVDWRK+GAVT VKDQG+CGACW+FSATG
Sbjct: 90   SRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSATG 149

Query: 1103 AIEGINKIVTGSLVSLSEQELCDCDRTYNSGCGGGLMDYAFKWVIQNHGIDSEDDYPFKG 924
            AIEGINKIVTGSLVSLSEQEL DCD++YN+GC GG+MDYAF++VI NHGID+E+DYP++G
Sbjct: 150  AIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQG 209

Query: 923  AERTCLKNKLNRRVVSIDGYTDVPANNEDLLLQAVAKQPVSVGICGSERAFQSYAKGIFN 744
             +R+C K KL R VV+IDGY DVP NNE  LL+AVA QPVSVGICGSERAFQ Y+KGIF 
Sbjct: 210  RDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFT 269

Query: 743  GPCSTNLDHAVLIVGYGSQNGEDYWIVKNSWGTSWGMDGYMHMQRNSGSSQGVCGINMLA 564
            GPCST+LDHAVLIVGYGS+NG DYWIVKNSWG+ WGMDGYMHMQRNSGSS+G+CGINMLA
Sbjct: 270  GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLA 329

Query: 563  SFXXXXXXXXXXXXXXXXTKCSLLTYCPAGNTCCCTWRILGLCLSWSCCELDSAVCCKDH 384
            S+                T+C L T+C  G TCCC   I G+CLSW CCELDSAVCCKD 
Sbjct: 330  SYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVCCKDG 389

Query: 383  RYCCPSDYPVCDNKSKQCFKGSRNSTGVNGFKRKTSFMNFKGLKPFLE 240
            R+CCP DYPVCD     C K   N+T +  F + +S   F+     LE
Sbjct: 390  RHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSSLLE 437


>gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  556 bits (1432), Expect = e-156
 Identities = 267/412 (64%), Positives = 310/412 (75%), Gaps = 5/412 (1%)
 Frame = -3

Query: 1454 FESWCREFNKSYASEEEKLARFKVFEDNLAFVNRHNSAGNSTYELGLNAFSDLAHHEFRA 1275
            FE+WC E  +SYA+  E+ AR   F DN AFV  HN A  S Y L LNAF+DL H EFRA
Sbjct: 38   FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPAS-YALALNAFADLTHDEFRA 96

Query: 1274 ARFGLSFGLLQPSGDR----IVFRGSAGGVPDSVDWRKSGAVTAVKDQGSCGACWAFSAT 1107
            AR G       P  D     +   G  G VPD+VDWR+SGAVT VKDQGSCGACW+FSAT
Sbjct: 97   ARLG-RLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSAT 155

Query: 1106 GAIEGINKIVTGSLVSLSEQELCDCDRTYNSGCGGGLMDYAFKWVIQNHGIDSEDDYPFK 927
            GA+EGINKI TGSL+SLSEQEL DCDR+YNSGCGGGLMDYA+K+V++N GID+E DYP++
Sbjct: 156  GAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYR 215

Query: 926  GAERTCLKNKLNRRVVSIDGYTDVPANNEDLLLQAVAKQPVSVGICGSERAFQSYAKGIF 747
              + TC KNKL RRVV+IDGY DVPANNED+LLQAVA+QPVSVGICGS RAFQ Y+KGIF
Sbjct: 216  ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIF 275

Query: 746  NGPCSTNLDHAVLIVGYGSQNGEDYWIVKNSWGTSWGMDGYMHMQRNSGSSQGVCGINML 567
            +GPC T+LDHA+LIVGYGS+ G+DYWIVKNSWG SWGM GYM+M RN+G+S GVCGIN +
Sbjct: 276  DGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 335

Query: 566  ASFXXXXXXXXXXXXXXXXTKCSLLTYCPAGNTCCCTWRILGLCLSWSCCELDSAVCCKD 387
             SF                TKCSLLTYCP G+TCCC+WR+LGLCLSWSCCELD+AVCCKD
Sbjct: 336  PSFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKD 395

Query: 386  HRYCCPSDYPVCDNKSKQCFKGSR-NSTGVNGFKRKTSFMNFKGLKPFLEAL 234
            +RYCCP DYPVCD  S++CFK +  N + + G  RK  F     L   LE L
Sbjct: 396  NRYCCPHDYPVCDTASQRCFKANNGNFSVMEGGSRKQPFSKVPSLGGLLELL 447