BLASTX nr result

ID: Dioscorea21_contig00009670 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00009670
         (826 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

tpg|DAA61768.1| TPA: heat shock factor protein 4 [Zea mays]           232   9e-59
ref|NP_001150318.1| heat shock factor protein 4 [Zea mays] gi|19...   231   2e-58
tpg|DAA40235.1| TPA: hypothetical protein ZEAMMB73_110006 [Zea m...   221   2e-55
ref|XP_002460330.1| hypothetical protein SORBIDRAFT_02g026590 [S...   220   3e-55
tpg|DAA61767.1| TPA: hypothetical protein ZEAMMB73_394338 [Zea m...   219   8e-55

>tpg|DAA61768.1| TPA: heat shock factor protein 4 [Zea mays]
          Length = 298

 Score =  232 bits (591), Expect = 9e-59
 Identities = 134/276 (48%), Positives = 168/276 (60%), Gaps = 16/276 (5%)
 Frame = +3

Query: 30  FLMKTYEMVEDAETDEVLSWGEMGKSFVVWKPVEFARDILPAHFKHNNFSSFVRQLNTYG 209
           FL KT++MVE+  TDEV+SW E G+SFVVWKPVE ARD+LP HFKH NFSSFVRQLNTYG
Sbjct: 20  FLTKTHQMVEERGTDEVISWAEQGRSFVVWKPVELARDLLPLHFKHCNFSSFVRQLNTYG 79

Query: 210 FRKIVPDRWEFANDYFRRGEQNLLSEIRRRKSIQTQSATKG--------XXXXXXXXXXX 365
           FRK+VPDRWEFAND FRRGEQ LLS IRRRKS   Q +  G                   
Sbjct: 80  FRKVVPDRWEFANDNFRRGEQGLLSGIRRRKSTALQMSKSGSGGSGGVNATFPPPLPPPP 139

Query: 366 XXXXXXXXXXXXXRLMSQNPQHFLDLSNENKKLKEDNQLLSSELAQAKLKCEELLASLST 545
                           + +P    DL++EN++LK+DN  LS+ELAQA+  CEELL  LS 
Sbjct: 140 PASATTSGVHERSSSSASSPPRAPDLASENEQLKKDNHTLSAELAQARRHCEELLGFLSR 199

Query: 546 YADARELDTRHLVQEAARQGV-------RITSSVSEVTQWXXXXXXXXXXXXCLKLFGVL 704
           + D R+LD R L+QE  R G        R  +  S++ +              +KLFGVL
Sbjct: 200 FLDVRQLDLRLLMQEDVRAGASDDGAQRRAHAVASQLER------GGGEEGKSVKLFGVL 253

Query: 705 FKVFEGRKKRGRCEEGSSSPGQPMKM-RLGAPWMGI 809
            K  +  +KRGRCEE ++S  +P+KM R+G PW+G+
Sbjct: 254 LK--DAARKRGRCEEAAASE-RPIKMIRVGEPWVGV 286


>ref|NP_001150318.1| heat shock factor protein 4 [Zea mays] gi|195638334|gb|ACG38635.1|
           heat shock factor protein 4 [Zea mays]
          Length = 299

 Score =  231 bits (588), Expect = 2e-58
 Identities = 134/276 (48%), Positives = 167/276 (60%), Gaps = 16/276 (5%)
 Frame = +3

Query: 30  FLMKTYEMVEDAETDEVLSWGEMGKSFVVWKPVEFARDILPAHFKHNNFSSFVRQLNTYG 209
           FL KT++MVE+  TDEV+SW E G+SFVVWKPVE ARD+LP HFKH NFSSFVRQLNTYG
Sbjct: 21  FLTKTHQMVEERGTDEVISWAEQGRSFVVWKPVELARDLLPLHFKHCNFSSFVRQLNTYG 80

Query: 210 FRKIVPDRWEFANDYFRRGEQNLLSEIRRRKSIQTQSATKG--------XXXXXXXXXXX 365
           FRK+VPDRWEFAND FRRGEQ LLS IRRRKS   Q +  G                   
Sbjct: 81  FRKVVPDRWEFANDNFRRGEQGLLSGIRRRKSTALQMSKSGSGGSGGVNATFPPPLPPPP 140

Query: 366 XXXXXXXXXXXXXRLMSQNPQHFLDLSNENKKLKEDNQLLSSELAQAKLKCEELLASLST 545
                           + +P    DL++EN++LK+DN  LS ELAQA+  CEELL  LS 
Sbjct: 141 PASATTSGVHERSSSSASSPPRAPDLASENEQLKKDNHTLSVELAQARRHCEELLGFLSR 200

Query: 546 YADARELDTRHLVQEAARQGV-------RITSSVSEVTQWXXXXXXXXXXXXCLKLFGVL 704
           + D R+LD R L+QE  R G        R  +  S++ +              +KLFGVL
Sbjct: 201 FLDVRQLDLRLLMQEDVRAGASDDGAQRRAHAVASQLER------GGGEEGKSVKLFGVL 254

Query: 705 FKVFEGRKKRGRCEEGSSSPGQPMKM-RLGAPWMGI 809
            K  +  +KRGRCEE ++S  +P+KM R+G PW+G+
Sbjct: 255 LK--DAARKRGRCEEAAASE-RPIKMIRVGEPWVGV 287


>tpg|DAA40235.1| TPA: hypothetical protein ZEAMMB73_110006 [Zea mays]
          Length = 298

 Score =  221 bits (563), Expect = 2e-55
 Identities = 128/274 (46%), Positives = 162/274 (59%), Gaps = 14/274 (5%)
 Frame = +3

Query: 30  FLMKTYEMVEDAETDEVLSWGEMGKSFVVWKPVEFARDILPAHFKHNNFSSFVRQLNTYG 209
           FL KT++MVE+  TDEV+SW E G+SFVVWKPVE ARD+LP HFKH NFSSFVRQLNTYG
Sbjct: 18  FLSKTHQMVEERGTDEVISWAEQGRSFVVWKPVELARDLLPLHFKHCNFSSFVRQLNTYG 77

Query: 210 FRKIVPDRWEFANDYFRRGEQNLLSEIRRRKSIQTQSATKG-------------XXXXXX 350
           FRK+VPDRWEFAN+ FRRGEQ LLS IRRRKS   Q +  G                   
Sbjct: 78  FRKVVPDRWEFANENFRRGEQGLLSGIRRRKSTTPQPSKYGGGSVVNTAFPPPLPLPPPA 137

Query: 351 XXXXXXXXXXXXXXXXXXRLMSQNPQHFLDLSNENKKLKEDNQLLSSELAQAKLKCEELL 530
                                + +P    DL++EN++LK+DN+ LS+ELAQA+  CEELL
Sbjct: 138 SVTTSGGGGAGGAGNERSSSSASSPPRTDDLTSENEQLKKDNRTLSTELAQARRHCEELL 197

Query: 531 ASLSTYADARELDTRHLVQEAARQGVRITSSVSEVTQWXXXXXXXXXXXXCLKLFGVLFK 710
             LS + D R+LD   L+QE  R G       +                  +KLFGVL  
Sbjct: 198 GFLSRFLDVRQLDLGLLMQEDVRAGA--GDDAAPRRAMVSQLERGGEEGKSVKLFGVL-- 253

Query: 711 VFEGRKKRGRCEEGSSSPGQPMKM-RLGAPWMGI 809
           + +  +KR RCEE ++S  +P+KM R+G PW+G+
Sbjct: 254 LTDAARKRARCEEAAASE-RPIKMIRIGEPWIGV 286


>ref|XP_002460330.1| hypothetical protein SORBIDRAFT_02g026590 [Sorghum bicolor]
           gi|241923707|gb|EER96851.1| hypothetical protein
           SORBIDRAFT_02g026590 [Sorghum bicolor]
          Length = 315

 Score =  220 bits (560), Expect = 3e-55
 Identities = 131/281 (46%), Positives = 161/281 (57%), Gaps = 21/281 (7%)
 Frame = +3

Query: 30  FLMKTYEMVEDAETDEVLSWGEMGKSFVVWKPVEFARDILPAHFKHNNFSSFVRQLNTYG 209
           FL KT++MVE+  TDEV+SW E G+SFVVWKPVE ARD+LP HFKH NFSSFVRQLNTYG
Sbjct: 27  FLTKTHQMVEERATDEVISWAEQGRSFVVWKPVELARDLLPLHFKHCNFSSFVRQLNTYG 86

Query: 210 FRKIVPDRWEFANDYFRRGEQNLLSEIRRRKSIQTQSATKGXXXXXXXXXXXXXXXXXXX 389
           FRK+VPDRWEFAND FRRGEQ LLS IRRRK    QS+ K                    
Sbjct: 87  FRKVVPDRWEFANDNFRRGEQGLLSGIRRRKPTTPQSSNKSGGSGGVNVAFPPPLPPPPA 146

Query: 390 XXXXXRL-----------MSQNPQHFLDLSNENKKLKEDNQLLSSELAQAKLKCEELLAS 536
                              + +P     L++EN++LK+DN  LS+ELAQA+  CEELL  
Sbjct: 147 PPASGTTSGGGGNERSSSSASSPPRADQLTSENEQLKKDNHTLSTELAQARRHCEELLGF 206

Query: 537 LSTYADARELDTRHLVQEAARQGVRITSSVSE---------VTQWXXXXXXXXXXXXCLK 689
           LS + D R+LD   L+Q    + VR   +  +         V                +K
Sbjct: 207 LSRFLDVRQLDLGLLMQ--GEEDVRAAGAAGDGALQAQRRAVVNHQLERGRGGEEGKSVK 264

Query: 690 LFGVLFKVFEGRKKRGRCEEGSSSPGQPMKM-RLGAPWMGI 809
           LFGVL K    R KRGRCEE  +S  +P+KM R+G PW+G+
Sbjct: 265 LFGVLLKDAAAR-KRGRCEEAVASE-RPIKMIRVGEPWVGV 303


>tpg|DAA61767.1| TPA: hypothetical protein ZEAMMB73_394338 [Zea mays]
          Length = 321

 Score =  219 bits (557), Expect = 8e-55
 Identities = 134/299 (44%), Positives = 168/299 (56%), Gaps = 39/299 (13%)
 Frame = +3

Query: 30  FLMKTYEMVEDAETDEVLSWGEMGKSFVVWKPVEFARDILPAHFKHNNFSSFVRQLNTY- 206
           FL KT++MVE+  TDEV+SW E G+SFVVWKPVE ARD+LP HFKH NFSSFVRQLNTY 
Sbjct: 20  FLTKTHQMVEERGTDEVISWAEQGRSFVVWKPVELARDLLPLHFKHCNFSSFVRQLNTYL 79

Query: 207 ----------------------GFRKIVPDRWEFANDYFRRGEQNLLSEIRRRKSIQTQS 320
                                 GFRK+VPDRWEFAND FRRGEQ LLS IRRRKS   Q 
Sbjct: 80  CYVVDERAFQAATVPSSKEYMRGFRKVVPDRWEFANDNFRRGEQGLLSGIRRRKSTALQM 139

Query: 321 ATKG--------XXXXXXXXXXXXXXXXXXXXXXXXRLMSQNPQHFLDLSNENKKLKEDN 476
           +  G                                   + +P    DL++EN++LK+DN
Sbjct: 140 SKSGSGGSGGVNATFPPPLPPPPPASATTSGVHERSSSSASSPPRAPDLASENEQLKKDN 199

Query: 477 QLLSSELAQAKLKCEELLASLSTYADARELDTRHLVQEAARQGV-------RITSSVSEV 635
             LS+ELAQA+  CEELL  LS + D R+LD R L+QE  R G        R  +  S++
Sbjct: 200 HTLSAELAQARRHCEELLGFLSRFLDVRQLDLRLLMQEDVRAGASDDGAQRRAHAVASQL 259

Query: 636 TQWXXXXXXXXXXXXCLKLFGVLFKVFEGRKKRGRCEEGSSSPGQPMKM-RLGAPWMGI 809
            +              +KLFGVL K  +  +KRGRCEE ++S  +P+KM R+G PW+G+
Sbjct: 260 ER------GGGEEGKSVKLFGVLLK--DAARKRGRCEEAAASE-RPIKMIRVGEPWVGV 309