BLASTX nr result

ID: Scutellaria22_contig00000269 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00000269
         (1699 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor,...   478   e-132
ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1...   473   e-131
ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis tha...   468   e-129
gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putativ...   468   e-129
ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1...   465   e-128

>ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537425|gb|EEF39053.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 469

 Score =  478 bits (1229), Expect = e-132
 Identities = 246/372 (66%), Positives = 282/372 (75%), Gaps = 3/372 (0%)
 Frame = +3

Query: 81   YRTFVPSSLPRPQPFSWSSDQEYDELRALPFDESSFSVDLHHVDNLSPALNSSPEYLFKL 260
            Y+T V + L      SW+  +   +        ++FSV LHHVD LS   NS+PE LF  
Sbjct: 28   YQTLVANPLRSQPTLSWTDSESPTDTAE---SSATFSVQLHHVDALS--FNSTPETLFTT 82

Query: 261  RLGRDXXXXXXXXXXXGRNVSGKP--RDFSSSVVSGLAQGSGEYFTRLGVGTPTKYVYMV 434
            RL RD               +GK     FSSSV+SGLAQGSGEYFTR+GVGTP +YVYMV
Sbjct: 83   RLQRDAARVEAISYLAETAGTGKRVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMV 142

Query: 435  LDTGSDVVWIQCSPCRKCYTQSDPVFDPRKSTTFSGVSCASPLCRRLDSPGCNSRKK-CL 611
            LDTGSD+VWIQC+PC++CY QSDPVFDPRKS +F+ ++C SPLC RLDSPGCN++K+ C+
Sbjct: 143  LDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCM 202

Query: 612  YQVSYGDGSFTVGDFSTETLTFRRTRVKNVALGCGHDNEXXXXXXXXXXXXXXXXXSFPI 791
            YQVSYGDGSFT GDFSTETLTFRRTRV  VALGCGHDNE                 SFP 
Sbjct: 203  YQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPS 262

Query: 792  QAGRRFGRKFSYCLVDRTASSKPSAILFGESAVSRKAVFTPLLTNPKLDTFYYVGLNGIS 971
            Q GRRF  KFSYCLVDR+ASSKPS+++FG+SAVSR A FTPL++NPKLDTFYYV L GIS
Sbjct: 263  QTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGIS 322

Query: 972  VGGRRVPGITASLFKLDAASGNGGVIVDSGTSVTRLTRPAYIALRNAFRAGASNLKRSPE 1151
            VGG RVPGITASLFKLD  +GNGGVI+DSGTSVTRLTRPAYIA R+AFRAGASNLKR+P+
Sbjct: 323  VGGTRVPGITASLFKLD-QTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQ 381

Query: 1152 FSLFDTCFDLSG 1187
            FSLFDTCFDLSG
Sbjct: 382  FSLFDTCFDLSG 393



 Score =  104 bits (260), Expect = 6e-20
 Identities = 50/58 (86%), Positives = 52/58 (89%)
 Frame = +2

Query: 1310 SLPASNYLIPVDTDGKFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGNRVGFAQGGCA 1483
            SLPASNYLIPVDT G FC AFAGTM GLSIIGNIQQQGFRVV+DLAG+RVGFA  GCA
Sbjct: 412  SLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469


>ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  473 bits (1217), Expect = e-131
 Identities = 243/367 (66%), Positives = 280/367 (76%), Gaps = 5/367 (1%)
 Frame = +3

Query: 102  SLPRPQPFSW-SSDQEYDELRALPFDESSFSVDLHHVDNLSPALNSSPEYLFKLRLGRDX 278
            SLP P   SW  S+ E D       +E + S+ LHH+D LS   N +PE LF+LRL RD 
Sbjct: 39   SLPHPPAISWPESESEPDP------EEEALSLHLHHIDALSS--NKTPEQLFQLRLQRDA 90

Query: 279  XXXXXXXXXXGRNVSGKPRD---FSSSVVSGLAQGSGEYFTRLGVGTPTKYVYMVLDTGS 449
                        N S   R    FSSS++SGLAQGSGEYFTR+GVGTP +YVYMVLDTGS
Sbjct: 91   KRVEGVVALAALNQSHARRSGSSFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGS 150

Query: 450  DVVWIQCSPCRKCYTQSDPVFDPRKSTTFSGVSCASPLCRRLDSPGCNSRKK-CLYQVSY 626
            DVVW+QC+PCRKCYTQ+DPVFDP KS T++G+ C +PLCRRLDSPGCN++ K C YQVSY
Sbjct: 151  DVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSY 210

Query: 627  GDGSFTVGDFSTETLTFRRTRVKNVALGCGHDNEXXXXXXXXXXXXXXXXXSFPIQAGRR 806
            GDGSFT GDFSTETLTFRRTRV  VALGCGHDNE                 SFP+Q GRR
Sbjct: 211  GDGSFTFGDFSTETLTFRRTRVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRR 270

Query: 807  FGRKFSYCLVDRTASSKPSAILFGESAVSRKAVFTPLLTNPKLDTFYYVGLNGISVGGRR 986
            F +KFSYCLVDR+AS+KPS+++FG+SAVSR A FTPL+ NPKLDTFYY+ L GISVGG  
Sbjct: 271  FNQKFSYCLVDRSASAKPSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSP 330

Query: 987  VPGITASLFKLDAASGNGGVIVDSGTSVTRLTRPAYIALRNAFRAGASNLKRSPEFSLFD 1166
            V G++ASLF+LDAA GNGGVI+DSGTSVTRLTRPAYIALR+AFR GAS+LKR+ EFSLFD
Sbjct: 331  VRGLSASLFRLDAA-GNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFD 389

Query: 1167 TCFDLSG 1187
            TCFDLSG
Sbjct: 390  TCFDLSG 396



 Score =  102 bits (255), Expect = 2e-19
 Identities = 49/57 (85%), Positives = 51/57 (89%)
 Frame = +2

Query: 1310 SLPASNYLIPVDTDGKFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGNRVGFAQGGC 1480
            SLPA+NYLIPVD  G FCFAFAGTMSGLSIIGNIQQQGFRV FDLAG+RVGFA  GC
Sbjct: 415  SLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein
            [Arabidopsis thaliana] gi|22135930|gb|AAM91547.1|
            chloroplast nucleoid DNA binding protein, putative
            [Arabidopsis thaliana] gi|30387595|gb|AAP31963.1|
            At1g01300 [Arabidopsis thaliana]
            gi|332189147|gb|AEE27268.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 485

 Score =  468 bits (1205), Expect = e-129
 Identities = 245/384 (63%), Positives = 286/384 (74%), Gaps = 16/384 (4%)
 Frame = +3

Query: 81   YRTFVPSS--LPRPQPFSWSSDQEYDELRALPFDE-------SSFSVDLHHVDNLSPALN 233
            ++T  P+S  LP   P S+  D + + L    F+        SS +++L H+D LS   N
Sbjct: 28   FQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESGSDSESSSSITLNLDHIDALSS--N 85

Query: 234  SSPEYLFKLRLGRDXXXXXXXXXXX----GRNVSGKPRD--FSSSVVSGLAQGSGEYFTR 395
             +P+ LF  RL RD               GRNV+  PR   FSSSVVSGL+QGSGEYFTR
Sbjct: 86   KTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTR 145

Query: 396  LGVGTPTKYVYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPRKSTTFSGVSCASPLCRRL 575
            LGVGTP +YVYMVLDTGSD+VW+QC+PCR+CY+QSDP+FDPRKS T++ + C+SP CRRL
Sbjct: 146  LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRL 205

Query: 576  DSPGCNSRKK-CLYQVSYGDGSFTVGDFSTETLTFRRTRVKNVALGCGHDNEXXXXXXXX 752
            DS GCN+R+K CLYQVSYGDGSFTVGDFSTETLTFRR RVK VALGCGHDNE        
Sbjct: 206  DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAG 265

Query: 753  XXXXXXXXXSFPIQAGRRFGRKFSYCLVDRTASSKPSAILFGESAVSRKAVFTPLLTNPK 932
                     SFP Q G RF +KFSYCLVDR+ASSKPS+++FG +AVSR A FTPLL+NPK
Sbjct: 266  LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPK 325

Query: 933  LDTFYYVGLNGISVGGRRVPGITASLFKLDAASGNGGVIVDSGTSVTRLTRPAYIALRNA 1112
            LDTFYYVGL GISVGG RVPG+TASLFKLD   GNGGVI+DSGTSVTRL RPAYIA+R+A
Sbjct: 326  LDTFYYVGLLGISVGGTRVPGVTASLFKLDQI-GNGGVIIDSGTSVTRLIRPAYIAMRDA 384

Query: 1113 FRAGASNLKRSPEFSLFDTCFDLS 1184
            FR GA  LKR+P+FSLFDTCFDLS
Sbjct: 385  FRVGAKTLKRAPDFSLFDTCFDLS 408



 Score =  108 bits (271), Expect = 3e-21
 Identities = 51/58 (87%), Positives = 55/58 (94%)
 Frame = +2

Query: 1310 SLPASNYLIPVDTDGKFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGNRVGFAQGGCA 1483
            SLPA+NYLIPVDT+GKFCFAFAGTM GLSIIGNIQQQGFRVV+DLA +RVGFA GGCA
Sbjct: 428  SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
            thaliana]
          Length = 485

 Score =  468 bits (1203), Expect = e-129
 Identities = 245/384 (63%), Positives = 285/384 (74%), Gaps = 16/384 (4%)
 Frame = +3

Query: 81   YRTFVPSS--LPRPQPFSWSSDQEYDELRALPFDE-------SSFSVDLHHVDNLSPALN 233
            ++T  P+S  LP   P S+  D + + L    F+        SS +++L H+D LS   N
Sbjct: 28   FQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESGSDSESSSSITLNLDHIDALSS--N 85

Query: 234  SSPEYLFKLRLGRDXXXXXXXXXXX----GRNVSGKPRD--FSSSVVSGLAQGSGEYFTR 395
             +P+ LF  RL RD               GRNV+  PR   FSSSVVSGL+QGSGEYFTR
Sbjct: 86   KTPQELFSSRLQRDSRRVRSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTR 145

Query: 396  LGVGTPTKYVYMVLDTGSDVVWIQCSPCRKCYTQSDPVFDPRKSTTFSGVSCASPLCRRL 575
            LGVGTP +YVYMVLDTGSD+VW+QC+PCR+CY+QSDP+FDPRKS T++ + C+SP CRRL
Sbjct: 146  LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRL 205

Query: 576  DSPGCNSRKK-CLYQVSYGDGSFTVGDFSTETLTFRRTRVKNVALGCGHDNEXXXXXXXX 752
            DS GCN+R+K CLYQVSYGDGSFTVGDFSTETLTFRR RVK VALGCGHDNE        
Sbjct: 206  DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAG 265

Query: 753  XXXXXXXXXSFPIQAGRRFGRKFSYCLVDRTASSKPSAILFGESAVSRKAVFTPLLTNPK 932
                     SFP Q G RF +KFSYCLVDR+ASSKPS+++FG +AVSR A FTPLL+NPK
Sbjct: 266  LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPK 325

Query: 933  LDTFYYVGLNGISVGGRRVPGITASLFKLDAASGNGGVIVDSGTSVTRLTRPAYIALRNA 1112
            LDTFYYVGL GISVGG RVPG+TASLFKLD   GNGGVI+DSGTSVTRL RPAYIA+R+A
Sbjct: 326  LDTFYYVGLLGISVGGTRVPGVTASLFKLDQI-GNGGVIIDSGTSVTRLIRPAYIAMRDA 384

Query: 1113 FRAGASNLKRSPEFSLFDTCFDLS 1184
            FR GA  LKR+P FSLFDTCFDLS
Sbjct: 385  FRVGAKTLKRAPNFSLFDTCFDLS 408



 Score =  108 bits (271), Expect = 3e-21
 Identities = 51/58 (87%), Positives = 55/58 (94%)
 Frame = +2

Query: 1310 SLPASNYLIPVDTDGKFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGNRVGFAQGGCA 1483
            SLPA+NYLIPVDT+GKFCFAFAGTM GLSIIGNIQQQGFRVV+DLA +RVGFA GGCA
Sbjct: 428  SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
            gi|147788999|emb|CAN64659.1| hypothetical protein
            VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  465 bits (1197), Expect = e-128
 Identities = 242/365 (66%), Positives = 278/365 (76%), Gaps = 11/365 (3%)
 Frame = +3

Query: 126  SWSSDQEYDELRALPFDESSFSVDLH--HVDNLSPALNSSPEYLFKLRLGRDXXXXXXXX 299
            SW+  +   ++  LP  E+  ++ +H  H D L  A N++PE LF LRL RD        
Sbjct: 54   SWTETET--QISTLPVSETDPTMTMHLEHRDVL--AFNATPEALFNLRLQRDAFRVEALS 109

Query: 300  XXX--------GRN-VSGKPRDFSSSVVSGLAQGSGEYFTRLGVGTPTKYVYMVLDTGSD 452
                       GRN    +   FSSSV SGLAQGSGEYFTRLGVGTP KYVYMVLDTGSD
Sbjct: 110  KMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSD 169

Query: 453  VVWIQCSPCRKCYTQSDPVFDPRKSTTFSGVSCASPLCRRLDSPGCNSRKKCLYQVSYGD 632
            VVWIQC+PCRKCY+Q+DPVFDP+KS +FS +SC SPLC RLDSPGCNSR+ CLYQV+YGD
Sbjct: 170  VVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGD 229

Query: 633  GSFTVGDFSTETLTFRRTRVKNVALGCGHDNEXXXXXXXXXXXXXXXXXSFPIQAGRRFG 812
            GSFT G+FSTETLTFR TRV  VALGCGHDNE                 SFP Q G RFG
Sbjct: 230  GSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFG 289

Query: 813  RKFSYCLVDRTASSKPSAILFGESAVSRKAVFTPLLTNPKLDTFYYVGLNGISVGGRRVP 992
            RKFSYCLVDR+ASSKPS+++FG+SAVSR AVFTPL+TNPKLDTFYY+ L GISVGG RV 
Sbjct: 290  RKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVA 349

Query: 993  GITASLFKLDAASGNGGVIVDSGTSVTRLTRPAYIALRNAFRAGASNLKRSPEFSLFDTC 1172
            GITASLFKLD A GNGGVI+DSGTSVTRLTR AY++LR+AFRAGA++LKR+P++SLFDTC
Sbjct: 350  GITASLFKLDTA-GNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTC 408

Query: 1173 FDLSG 1187
            FDLSG
Sbjct: 409  FDLSG 413



 Score =  104 bits (259), Expect = 8e-20
 Identities = 49/58 (84%), Positives = 54/58 (93%)
 Frame = +2

Query: 1310 SLPASNYLIPVDTDGKFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGNRVGFAQGGCA 1483
            SLPA+NYLIPVDT+G FCFAFAGTMSGLSIIGNIQQQGFRVVFD+A +R+GFA  GCA
Sbjct: 432  SLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGCA 489


Top