BLASTX nr result

ID: Dioscorea21_contig00031382 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00031382
         (524 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_001174806.1| Os06g0499301 [Oryza sativa Japonica Group] g...   182   3e-44
gb|EAZ01063.1| hypothetical protein OsI_23091 [Oryza sativa Indi...   182   3e-44
tpg|DAA52557.1| TPA: hypothetical protein ZEAMMB73_743775 [Zea m...   180   1e-43
ref|XP_003565323.1| PREDICTED: pentatricopeptide repeat-containi...   176   2e-42
ref|XP_002304600.1| predicted protein [Populus trichocarpa] gi|2...   173   2e-41

>ref|NP_001174806.1| Os06g0499301 [Oryza sativa Japonica Group]
           gi|52076487|dbj|BAD45366.1| putative fertility restorer
           [Oryza sativa Japonica Group]
           gi|125597333|gb|EAZ37113.1| hypothetical protein
           OsJ_21452 [Oryza sativa Japonica Group]
           gi|255677074|dbj|BAH93534.1| Os06g0499301 [Oryza sativa
           Japonica Group]
          Length = 642

 Score =  182 bits (461), Expect = 3e-44
 Identities = 90/174 (51%), Positives = 113/174 (64%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G+F+ G+  RAL+ W++M+  GC PNAV YS LI+GLC  GRL + MM W++ML +GC P
Sbjct: 427 GYFKIGDTSRALSVWEEMIGAGCVPNAVSYSILINGLCNVGRLKDAMMVWKHMLDRGCAP 486

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           D +AY SMI              LF DMLA    +PD + YN+L DGLL    L +AMDL
Sbjct: 487 DTIAYTSMIKGLCVSGMVDGGLRLFYDMLASGHADPDVISYNVLLDGLLLAKDLPRAMDL 546

Query: 164 LRSMLDRGCDPDEVTCNTFLREFGGEDKKVREFMEGLVVRLSKLERAEGAAGIV 3
           L  MLD+GCDPD VTCN FLREFG  ++K REF+EGLVVRL    R   A  ++
Sbjct: 547 LNRMLDQGCDPDTVTCNIFLREFGAGERKGREFLEGLVVRLCDRRRNMAAGEVL 600



 Score = 89.0 bits (219), Expect = 4e-16
 Identities = 48/141 (34%), Positives = 74/141 (52%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G  +SG I  AL  W+ MV     PN V YS +I GL   G++ E  + +R M+   C P
Sbjct: 357 GFCKSGEIDCALKVWEAMVASPVRPNVVLYSAMIGGLANFGKMTEAELLFREMIHSKCAP 416

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           +++ YGS+I              ++ +M+      P+AV Y+IL +GL   G+L  AM +
Sbjct: 417 NIITYGSIIQGYFKIGDTSRALSVWEEMIG-AGCVPNAVSYSILINGLCNVGRLKDAMMV 475

Query: 164 LRSMLDRGCDPDEVTCNTFLR 102
            + MLDRGC PD +   + ++
Sbjct: 476 WKHMLDRGCAPDTIAYTSMIK 496


>gb|EAZ01063.1| hypothetical protein OsI_23091 [Oryza sativa Indica Group]
          Length = 552

 Score =  182 bits (461), Expect = 3e-44
 Identities = 90/174 (51%), Positives = 113/174 (64%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G+F+ G+  RAL+ W++M+  GC PNAV YS LI+GLC  GRL + MM W++ML +GC P
Sbjct: 337 GYFKIGDTSRALSVWEEMIGAGCMPNAVSYSILINGLCNVGRLKDAMMVWKHMLDRGCAP 396

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           D +AY SMI              LF DMLA    +PD + YN+L DGLL    L +AMDL
Sbjct: 397 DTIAYTSMIKGLCVSGMVDGGLRLFYDMLASGHADPDVISYNVLLDGLLLAKDLPRAMDL 456

Query: 164 LRSMLDRGCDPDEVTCNTFLREFGGEDKKVREFMEGLVVRLSKLERAEGAAGIV 3
           L  MLD+GCDPD VTCN FLREFG  ++K REF+EGLVVRL    R   A  ++
Sbjct: 457 LNRMLDQGCDPDTVTCNIFLREFGAGERKGREFLEGLVVRLCDRRRNMAAGEVL 510



 Score = 90.5 bits (223), Expect = 1e-16
 Identities = 49/141 (34%), Positives = 74/141 (52%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G  +SG I  AL  W+ MV     PN V YS +I GL   G++ E  + +R M+   C P
Sbjct: 267 GFCKSGEIDCALKVWEAMVASPVRPNVVLYSAMIGGLANFGKMTEAELLFREMIDSKCAP 326

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           +++ YGSMI              ++ +M+      P+AV Y+IL +GL   G+L  AM +
Sbjct: 327 NIITYGSMIQGYFKIGDTSRALSVWEEMIG-AGCMPNAVSYSILINGLCNVGRLKDAMMV 385

Query: 164 LRSMLDRGCDPDEVTCNTFLR 102
            + MLDRGC PD +   + ++
Sbjct: 386 WKHMLDRGCAPDTIAYTSMIK 406


>tpg|DAA52557.1| TPA: hypothetical protein ZEAMMB73_743775 [Zea mays]
          Length = 630

 Score =  180 bits (456), Expect = 1e-43
 Identities = 85/174 (48%), Positives = 114/174 (65%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G+F   N  RAL+TW++M++ GC P A+ YS LI GLC+ GRL + MM W+NM+ +GC P
Sbjct: 415 GYFHIANSSRALSTWEEMIKVGCVPTAISYSILISGLCDVGRLKDAMMVWKNMIGRGCAP 474

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           D +AY SM+              LFNDMLA+ + +PD + YN+L D L++   L +AMDL
Sbjct: 475 DTIAYTSMMKGLCMSGMVDGGLRLFNDMLAKGDAKPDVISYNVLLDALIRTNDLPRAMDL 534

Query: 164 LRSMLDRGCDPDEVTCNTFLREFGGEDKKVREFMEGLVVRLSKLERAEGAAGIV 3
           L  MLD+ CDPD +TCN FLREFG  + K REF+EGLV+RL   +R   A  +V
Sbjct: 535 LNQMLDQMCDPDTITCNIFLREFGVLEGKGREFLEGLVMRLCYRDRYRAAGDVV 588



 Score = 85.1 bits (209), Expect = 6e-15
 Identities = 44/141 (31%), Positives = 70/141 (49%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G  +SG + RAL  W+ MV    +PN V YS +I G    GR+ E    +  M+   C P
Sbjct: 345 GFCKSGEVDRALMVWETMVAARVKPNVVLYSAMIDGFARSGRMTEAEKLFEEMVDAKCIP 404

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           ++V Y SM+               + +M+ +    P A+ Y+IL  GL   G+L  AM +
Sbjct: 405 NIVTYSSMVRGYFHIANSSRALSTWEEMI-KVGCVPTAISYSILISGLCDVGRLKDAMMV 463

Query: 164 LRSMLDRGCDPDEVTCNTFLR 102
            ++M+ RGC PD +   + ++
Sbjct: 464 WKNMIGRGCAPDTIAYTSMMK 484



 Score = 58.9 bits (141), Expect = 4e-07
 Identities = 32/138 (23%), Positives = 65/138 (47%)
 Frame = -1

Query: 509 GNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFPDVVAY 330
           G  + A+   + M   G  P  + Y  ++ GL + GR+++     + M  +G  P    +
Sbjct: 280 GEARAAMNIMRRMENEGIVPGLMTYGAVVDGLVKCGRVEDAWKVAQEMGGQGLAPSEFVF 339

Query: 329 GSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDLLRSML 150
            ++I              ++  M+A    +P+ V+Y+ + DG  + G++T+A  L   M+
Sbjct: 340 SAVITGFCKSGEVDRALMVWETMVAARV-KPNVVLYSAMIDGFARSGRMTEAEKLFEEMV 398

Query: 149 DRGCDPDEVTCNTFLREF 96
           D  C P+ VT ++ +R +
Sbjct: 399 DAKCIPNIVTYSSMVRGY 416


>ref|XP_003565323.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20090-like
            [Brachypodium distachyon]
          Length = 746

 Score =  176 bits (445), Expect = 2e-42
 Identities = 88/174 (50%), Positives = 114/174 (65%)
 Frame = -1

Query: 524  GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
            G+F+ G+  +AL+ W+DM+R GC PNAV YS LI+GLC  GR  + MM W++ML +GC P
Sbjct: 532  GYFQIGDSSQALSFWEDMLRIGCTPNAVTYSVLINGLCNVGRSKDAMMVWKHMLGRGCVP 591

Query: 344  DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
            D +AY SMI              LF DMLAR +  PD + YN+L DGLL+   L +AMDL
Sbjct: 592  DTIAYTSMIKGFCVSGMVDAGLRLFYDMLARGDTHPDVICYNVLLDGLLRAKDLPRAMDL 651

Query: 164  LRSMLDRGCDPDEVTCNTFLREFGGEDKKVREFMEGLVVRLSKLERAEGAAGIV 3
            L  MLD+ CDPD VTCNTFLRE     +K +EF+EGLVVRL   +R + A  ++
Sbjct: 652  LNQMLDQACDPDTVTCNTFLREI-EVGQKGQEFLEGLVVRLCNRKRNKAAGEVL 704



 Score = 89.0 bits (219), Expect = 4e-16
 Identities = 52/160 (32%), Positives = 79/160 (49%), Gaps = 3/160 (1%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G  + G + RA   W  MV  G +PN V YS +I GL   G++ E  + +R M+   C P
Sbjct: 462 GFCKLGEVDRASRVWDTMVAAGIKPNVVLYSAMIDGLARCGKMTEAELLFREMIEAKCVP 521

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           +++ Y SM+               + DML R    P+AV Y++L +GL   G+   AM +
Sbjct: 522 NIMTYSSMVRGYFQIGDSSQALSFWEDML-RIGCTPNAVTYSVLINGLCNVGRSKDAMMV 580

Query: 164 LRSMLDRGCDPDEVTCNTFLREF---GGEDKKVREFMEGL 54
            + ML RGC PD +   + ++ F   G  D  +R F + L
Sbjct: 581 WKHMLGRGCVPDTIAYTSMIKGFCVSGMVDAGLRLFYDML 620


>ref|XP_002304600.1| predicted protein [Populus trichocarpa] gi|222842032|gb|EEE79579.1|
           predicted protein [Populus trichocarpa]
          Length = 641

 Score =  173 bits (438), Expect = 2e-41
 Identities = 92/178 (51%), Positives = 114/178 (64%), Gaps = 4/178 (2%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G FE+GN  +A+  WKDM +H    N VCYS LIHGLC+ G++ E MM W  ML KGC P
Sbjct: 423 GFFEAGNGHKAIEMWKDMAKHNFTQNEVCYSVLIHGLCKDGKVKEAMMVWAQMLGKGCKP 482

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLARE-EPEPDAVIYNILFDGLLKGGKLTQAMD 168
           DVVAYGSMI              L+N+ML +E + +PD V YNIL + L K   +++A+D
Sbjct: 483 DVVAYGSMINGLSNAGLVEDALQLYNEMLCQEPDSQPDVVTYNILLNALCKQSSISRAID 542

Query: 167 LLRSMLDRGCDPDEVTCNTF---LREFGGEDKKVREFMEGLVVRLSKLERAEGAAGIV 3
           LL SMLDRGCDPD VTC  F   LRE     +  REF++GLVVRL K +R  GA+ IV
Sbjct: 543 LLNSMLDRGCDPDLVTCIIFLRTLREKLDPPQDGREFLDGLVVRLLKRQRVLGASKIV 600



 Score = 79.3 bits (194), Expect = 3e-13
 Identities = 44/140 (31%), Positives = 67/140 (47%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G F+ G  Q A+  +K+M    CE N + YS +I GLC  G+ DE +     M    C P
Sbjct: 353 GLFKEGKSQEAMQLFKEMTVKECELNTIVYSAVIDGLCRDGKPDEALEVLSEMTNNRCKP 412

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           +   Y S++              ++ DM A+     + V Y++L  GL K GK+ +AM +
Sbjct: 413 NAYTYSSLMKGFFEAGNGHKAIEMWKDM-AKHNFTQNEVCYSVLIHGLCKDGKVKEAMMV 471

Query: 164 LRSMLDRGCDPDEVTCNTFL 105
              ML +GC PD V   + +
Sbjct: 472 WAQMLGKGCKPDVVAYGSMI 491



 Score = 71.2 bits (173), Expect = 8e-11
 Identities = 43/135 (31%), Positives = 64/135 (47%)
 Frame = -1

Query: 509 GNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFPDVVAY 330
           G +  A+  ++DM    C+P+   Y TL+ GLC+  R+DE +     M   GCFP  V +
Sbjct: 183 GLVDDAVQMFRDMPVSKCQPDVYTYCTLMDGLCKADRIDEAVSLLDEMQIDGCFPSPVTF 242

Query: 329 GSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDLLRSML 150
             +I              L ++M  +    P+ V YN L  GL   GKL +A+ LL  M+
Sbjct: 243 NVLINGLCKKGDLARVAKLVDNMFLK-GCAPNEVTYNTLIHGLCLKGKLEKAISLLDRMV 301

Query: 149 DRGCDPDEVTCNTFL 105
              C P+ VT  T +
Sbjct: 302 SSKCVPNVVTYGTII 316



 Score = 70.1 bits (170), Expect = 2e-10
 Identities = 39/130 (30%), Positives = 63/130 (48%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G  + G++ R      +M   GC PN V Y+TLIHGLC  G+L++ +     M++  C P
Sbjct: 248 GLCKKGDLARVAKLVDNMFLKGCAPNEVTYNTLIHGLCLKGKLEKAISLLDRMVSSKCVP 307

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           +VV YG++I              +   ++       +  +Y+ L  GL K GK  +AM L
Sbjct: 308 NVVTYGTIINGLVKQGRALDGARVL-ALMEERGYHVNEYVYSALISGLFKEGKSQEAMQL 366

Query: 164 LRSMLDRGCD 135
            + M  + C+
Sbjct: 367 FKEMTVKECE 376



 Score = 61.6 bits (148), Expect = 7e-08
 Identities = 34/116 (29%), Positives = 60/116 (51%)
 Frame = -1

Query: 452 PNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFPDVVAYGSMIXXXXXXXXXXXXXXL 273
           PN + ++ +I  +C+ G +D+ +  +R+M    C PDV  Y +++              L
Sbjct: 167 PNVLTFNLVIKTMCKVGLVDDAVQMFRDMPVSKCQPDVYTYCTLMDGLCKADRIDEAVSL 226

Query: 272 FNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDLLRSMLDRGCDPDEVTCNTFL 105
            ++M   +   P  V +N+L +GL K G L +   L+ +M  +GC P+EVT NT +
Sbjct: 227 LDEMQI-DGCFPSPVTFNVLINGLCKKGDLARVAKLVDNMFLKGCAPNEVTYNTLI 281



 Score = 58.5 bits (140), Expect = 6e-07
 Identities = 40/148 (27%), Positives = 65/148 (43%)
 Frame = -1

Query: 524 GHFESGNIQRALATWKDMVRHGCEPNAVCYSTLIHGLCEGGRLDEGMMAWRNMLAKGCFP 345
           G  ++  I  A++   +M   GC P+ V ++ LI+GLC+ G L        NM  KGC P
Sbjct: 213 GLCKADRIDEAVSLLDEMQIDGCFPSPVTFNVLINGLCKKGDLARVAKLVDNMFLKGCAP 272

Query: 344 DVVAYGSMIXXXXXXXXXXXXXXLFNDMLAREEPEPDAVIYNILFDGLLKGGKLTQAMDL 165
           + V Y ++I              L  D +   +  P+ V Y  + +GL+K G+      +
Sbjct: 273 NEVTYNTLIHGLCLKGKLEKAISLL-DRMVSSKCVPNVVTYGTIINGLVKQGRALDGARV 331

Query: 164 LRSMLDRGCDPDEVTCNTFLREFGGEDK 81
           L  M +RG   +E   +  +     E K
Sbjct: 332 LALMEERGYHVNEYVYSALISGLFKEGK 359

