BLASTX nr result

ID: Dioscorea21_contig00007529 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00007529
         (1873 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mira...   154   1e-34
ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1...   152   4e-34
ref|XP_001774247.1| predicted protein [Physcomitrella patens sub...   149   2e-33
ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group] g...   149   3e-33
ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1...   149   3e-33

>gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  154 bits (388), Expect = 1e-34
 Identities = 109/376 (28%), Positives = 173/376 (46%), Gaps = 8/376 (2%)
 Frame = -3

Query: 1199 SSTGGYVVFALIGTQHYPMKLSLDTTSDLTWVQCKPCEQCYPKYNAVYNPANSSTFMNTM 1020
            + +G Y++   IGT    +   +DT SDL W QC+PC QC+ +   ++NP +SS+F    
Sbjct: 91   AGSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLP 150

Query: 1019 CQDNYCKIEADNYYKIEADKVMRFCTDDRRCLFGHAYRDDKNIMGSLVSDYFEFDQEMAG 840
            C+  YC+           D     C +D  C + + Y D  +  G + ++ F F+     
Sbjct: 151  CESQYCQ-----------DLPSESCYND--CQYTYGYGDGSSTQGYMATETFTFETSSV- 196

Query: 839  TEKVFHSHLTFGCAHSTIGSFNREVDGVLGLGEGPFSIISQLEVSGFSHCLTPPNSGKTS 660
                   ++ FGC     G       G++G+G GP S+ SQL V  FS+C+T   S   S
Sbjct: 197  ------PNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSSGSSSPS 250

Query: 659  YILFGDAAQ--TRGAPGAYMIVNPRYPSRYYLKLHSIALLDHQKMITIDGIPSNTFAIDA 486
             +  G AA     G+P   +I +   P+ YY+ L  I +          GIPS+TF +  
Sbjct: 251  TLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNL-----GIPSSTFQLQD 305

Query: 485  DRFGGFYLDIGTPFINLPQVAYHELRKVLQSTLLAYNIRPI-VSSNPSDFCFEASFD--- 318
            D  GG  +D GT    LPQ AY+ + +     +   N+ P+  SS+    CF+   D   
Sbjct: 306  DGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI---NLSPVDESSSGLSTCFQLPSDGST 362

Query: 317  -DVKHISLVFTVSLHDLIITGNQLFYENFDSKGVICLGVVESKTKE-TILGRNAQINRNI 144
              V  IS+ F   + +L   G +    +  ++GVICL +  S  +  +I G   Q    +
Sbjct: 363  VQVPEISMQFDGGVLNL---GEENVLIS-PAEGVICLAMGSSSQQGISIFGNIQQQETQV 418

Query: 143  GYDFEDRLVTFRDMEC 96
             YD ++  V+F   +C
Sbjct: 419  LYDLQNLAVSFVPTQC 434


>ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  152 bits (383), Expect = 4e-34
 Identities = 113/374 (30%), Positives = 168/374 (44%), Gaps = 6/374 (1%)
 Frame = -3

Query: 1199 SSTGGYVVFALIGTQHYPMKLSLDTTSDLTWVQCKPCEQCYPKYNAVYNPANSSTFMNTM 1020
            +  G Y++   IGT        LDT SDL W QCKPC +CY +   +++P  SS+F    
Sbjct: 103  AGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVS 162

Query: 1019 CQDNYCKIEADNYYKIEADKVMRFCTDDRRCLFGHAYRDDKNIMGSLVSDYFEFDQEMAG 840
            C  + C     +            C+D   C + ++Y D     G L ++ F F +    
Sbjct: 163  CGSSLCSALPSS-----------TCSDG--CEYVYSYGDYSMTQGVLATETFTFGK---S 206

Query: 839  TEKVFHSHLTFGCAHSTIGSFNREVDGVLGLGEGPFSIISQLEVSGFSHCLTPPNSGKTS 660
              KV   ++ FGC     G    +  G++GLG GP S++SQL+   FS+CLTP +  K S
Sbjct: 207  KNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQRFSYCLTPIDDTKES 266

Query: 659  YILFGDAAQTRGAPGAY---MIVNPRYPSRYYLKLHSIALLDHQKMITIDGIPSNTFAID 489
             +L G   + + A       ++ NP  PS YYL L +I++ D     T   I  +TF + 
Sbjct: 267  VLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGD-----TRLSIEKSTFEVG 321

Query: 488  ADRFGGFYLDIGTPFINLPQVAYHELRK-VLQSTLLAYNIRPIVSSNPSDFCFE--ASFD 318
             D  GG  +D GT    + Q AY  L+K  +  T LA +     SS   D CF   +   
Sbjct: 322  DDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALD---KTSSTGLDLCFSLPSGST 378

Query: 317  DVKHISLVFTVSLHDLIITGNQLFYENFDSKGVICLGVVESKTKETILGRNAQINRNIGY 138
             V+   LVF     DL +        +  + GV CL +  S +  +I G   Q N  + +
Sbjct: 379  QVEIPKLVFHFKGGDLELPAENYMIGD-SNLGVACLAMGAS-SGMSIFGNVQQQNILVNH 436

Query: 137  DFEDRLVTFRDMEC 96
            D E   ++F    C
Sbjct: 437  DLEKETISFVPTSC 450


>ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
            gi|162674374|gb|EDQ60883.1| predicted protein
            [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  149 bits (376), Expect = 2e-33
 Identities = 113/376 (30%), Positives = 175/376 (46%), Gaps = 8/376 (2%)
 Frame = -3

Query: 1199 SSTGGYVVFALIGTQHYPMKLSLDTTSDLTWVQCKPCEQCYPKYNAVYNPANSSTFMNTM 1020
            S  G Y++    G+      + +DT SDL W QC PCE C    + +++P  SST+    
Sbjct: 75   SGNGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVS 134

Query: 1019 CQDNYCKIEADNYYKIEADKVMRFCTDDRRCLFGHAYRDDKNIMGSLVSDYFEFDQEMAG 840
            C  N+C           +    + CT    C + + Y D  +  G+L +     +    G
Sbjct: 135  CASNFC-----------SSLPFQSCTTS--CKYDYMYGDGSSTSGALST-----ETVTVG 176

Query: 839  TEKVFHSHLTFGCAHSTIGSFNREVDGVLGLGEGPFSIISQ---LEVSGFSHCLTPPNSG 669
            T  +   ++ FGC H+ +GSF     G++GLG+GP S+ISQ   +    FS+CL P  S 
Sbjct: 177  TGTI--PNVAFGCGHTNLGSF-AGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGST 233

Query: 668  KTSYILFGDAAQTRGAPGAYMIVNPRYPSRYYLKLHSIALLDHQKMITIDGIPSNTFAID 489
            KTS +L GD+A   G     ++ N   P+ YY  L  I++    K +T    P  TF+ID
Sbjct: 234  KTSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISV--SGKAVT---YPVGTFSID 288

Query: 488  ADRFGGFYLDIGTPFINLPQVAYHELRKVLQSTLLAYNIRPIVSSNPS----DFCFE-AS 324
            A   GGF LD GT    L   A++ L   L++ +      P   ++ S    D+CF  A 
Sbjct: 289  ASGQGGFILDSGTTLTYLETGAFNALVAALKAEV------PFPEADGSLYGLDYCFSTAG 342

Query: 323  FDDVKHISLVFTVSLHDLIITGNQLFYENFDSKGVICLGVVESKTKETILGRNAQINRNI 144
              +  + ++ F     D  +    +F    D+ G ICL +  S T  +I+G   Q N  I
Sbjct: 343  VANPTYPTMTFHFKGADYELPPENVFVA-LDTGGSICLAMAAS-TGFSIMGNIQQQNHLI 400

Query: 143  GYDFEDRLVTFRDMEC 96
             +D  ++ V F++  C
Sbjct: 401  VHDLVNQRVGFKEANC 416


>ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
            gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding
            protein [Oryza sativa Japonica Group]
            gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa
            Japonica Group] gi|125603713|gb|EAZ43038.1| hypothetical
            protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  149 bits (375), Expect = 3e-33
 Identities = 119/384 (30%), Positives = 176/384 (45%), Gaps = 16/384 (4%)
 Frame = -3

Query: 1199 SSTGGYVVFALIGTQHYPMKLSLDTTSDLTWVQCKPCEQCYPKYNAVYNPANSSTFMNTM 1020
            +S G Y++   IGT        +DT SDL W QC PC  C  +    + PA S+T+    
Sbjct: 87   ASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVP 146

Query: 1019 CQDNYCKIEADNYYKIEADKVMRFCTDDRRCLFGHAYRDDKNIMGSLVSDYFEFDQEMAG 840
            C+   C           A      C     C++ + Y D+ +  G L S+ F F    A 
Sbjct: 147  CRSPLC-----------AALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFG--AAN 193

Query: 839  TEKVFHSHLTFGCAHSTIGSFNREVDGVLGLGEGPFSIISQLEVSGFSHCLT---PPNSG 669
            + KV  S + FGC +   G       G++GLG GP S++SQL  S FS+CLT    P   
Sbjct: 194  SSKVMVSDVAFGCGNINSGQLANS-SGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPS 252

Query: 668  KTSYILF----GDAAQTRGAP--GAYMIVNPRYPSRYYLKLHSIALLDHQKMITIDGIPS 507
            + ++ +F    G  A + G+P     ++VN   PS Y++ L  I+L   QK + ID +  
Sbjct: 253  RLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISL--GQKRLPIDPL-- 308

Query: 506  NTFAIDADRFGGFYLDIGTPFINLPQVAYHELRKVLQSTLLAYNIRPIVSSNPSDFCFEA 327
              FAI+ D  GG ++D GT    L Q AY  +R+ L S L     RP+  +N ++   E 
Sbjct: 309  -VFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVL-----RPLPPTNDTEIGLET 362

Query: 326  SFDDVKHISLVFTVSLHDLIITGNQLFY---ENF----DSKGVICLGVVESKTKETILGR 168
             F      S+  TV   +L   G        EN+     + G +CL ++ S    TI+G 
Sbjct: 363  CFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRS-GDATIIGN 421

Query: 167  NAQINRNIGYDFEDRLVTFRDMEC 96
              Q N +I YD  + L++F    C
Sbjct: 422  YQQQNMHILYDIANSLLSFVPAPC 445


>ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  149 bits (375), Expect = 3e-33
 Identities = 111/374 (29%), Positives = 166/374 (44%), Gaps = 6/374 (1%)
 Frame = -3

Query: 1199 SSTGGYVVFALIGTQHYPMKLSLDTTSDLTWVQCKPCEQCYPKYNAVYNPANSSTFMNTM 1020
            +  G Y++   IGT        LDT SDL W QCKPC QCY +   +++P  SS+F    
Sbjct: 103  AGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVS 162

Query: 1019 CQDNYCKIEADNYYKIEADKVMRFCTDDRRCLFGHAYRDDKNIMGSLVSDYFEFDQEMAG 840
            C  + C     +            C+D   C + ++Y D     G L ++ F F +    
Sbjct: 163  CGSSLCSAVPSS-----------TCSDG--CEYVYSYGDYSMTQGVLATETFTFGK---S 206

Query: 839  TEKVFHSHLTFGCAHSTIGSFNREVDGVLGLGEGPFSIISQLEVSGFSHCLTPPNSGKTS 660
              KV   ++ FGC     G    +  G++GLG GP S++SQL+   FS+CLTP +  K S
Sbjct: 207  KNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCLTPMDDTKES 266

Query: 659  YILFGDAAQTRGAPGAY---MIVNPRYPSRYYLKLHSIALLDHQKMITIDGIPSNTFAID 489
             +L G   + + A       ++ NP  PS YYL L  I++ D     T   I  +TF + 
Sbjct: 267  ILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGD-----TRLSIEKSTFEVG 321

Query: 488  ADRFGGFYLDIGTPFINLPQVAYHELRK-VLQSTLLAYNIRPIVSSNPSDFCFE--ASFD 318
             D  GG  +D GT    + Q A+  L+K  +  T L  +     SS   D CF   +   
Sbjct: 322  DDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLD---KTSSTGLDLCFSLPSGST 378

Query: 317  DVKHISLVFTVSLHDLIITGNQLFYENFDSKGVICLGVVESKTKETILGRNAQINRNIGY 138
             V+   +VF     DL +        +  + GV CL +  S +  +I G   Q N  + +
Sbjct: 379  QVEIPKIVFHFKGGDLELPAENYMIGD-SNLGVACLAMGAS-SGMSIFGNVQQQNILVNH 436

Query: 137  DFEDRLVTFRDMEC 96
            D E   ++F    C
Sbjct: 437  DLEKETISFVPTSC 450


Top