BLASTX nr result

ID: Scutellaria22_contig00005471 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00005471
         (1729 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1...   626   e-177
ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor,...   622   e-175
ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|2...   605   e-171
ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   601   e-169
ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1...   600   e-169

>ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
            gi|147788999|emb|CAN64659.1| hypothetical protein
            VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  626 bits (1614), Expect = e-177
 Identities = 314/425 (73%), Positives = 348/425 (81%), Gaps = 11/425 (2%)
 Frame = +1

Query: 202  LPDSETXXXXXXXXXXXXXPPLNSTPESLFTLRLRRDAARVEALSTVAAAAN-------- 357
            LP SET                N+TPE+LF LRL+RDA RVEALS +AAAA         
Sbjct: 65   LPVSETDPTMTMHLEHRDVLAFNATPEALFNLRLQRDAFRVEALSKMAAAAGGRRAGRNG 124

Query: 358  ---VSGDFSSSIISGLAQGSGEYFTRIGIGTPAKYVYMVLDTGSDVVWVQCSPCRKCYTQ 528
                 G FSSS+ SGLAQGSGEYFTR+G+GTP KYVYMVLDTGSDVVW+QC+PCRKCY+Q
Sbjct: 125  THAQGGGFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQ 184

Query: 529  TDPVFDPKASTSFLGVSCVSPLCRRLDSPGCNSRQKCLYQVSYGDGSFTVGEFSTETLTF 708
            TDPVFDPK S SF  +SC SPLC RLDSPGCNSRQ CLYQV+YGDGSFT GEFSTETLTF
Sbjct: 185  TDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTF 244

Query: 709  RRTKVNNVALGCGHDNEGLFVXXXXXXXXXXXXXSFPTQAGPRFGRKFSYCLVDRSASSK 888
            R T+V  VALGCGHDNEGLFV             SFPTQ G RFGRKFSYCLVDRSASSK
Sbjct: 245  RGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSK 304

Query: 889  PSSLVFGESAVSRNAVFTPLLTNPKLDTFYYVGLNGISVGGTRVPSITASLFKLDRAGNG 1068
            PSS+VFG+SAVSR AVFTPL+TNPKLDTFYY+ L GISVGG RV  ITASLFKLD AGNG
Sbjct: 305  PSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNG 364

Query: 1069 GVIVDSGTSVTRLTRPAYIALRDAFRAGASNLKRSTEFSLFDTCFDLSGKTEVKVPTVVL 1248
            GVI+DSGTSVTRLTR AY++LRDAFRAGA++LKR+ ++SLFDTCFDLSGKTEVKVPTVV+
Sbjct: 365  GVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVM 424

Query: 1249 HFAGADVSLPASNYLIPVDSDGKFCFAFAGTTGGLSIIGNIQQQGYRVVFDLASNRVGFA 1428
            HF GADVSLPA+NYLIPVD++G FCFAFAGT  GLSIIGNIQQQG+RVVFD+A++R+GFA
Sbjct: 425  HFRGADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFA 484

Query: 1429 PRGCA 1443
             RGCA
Sbjct: 485  ARGCA 489


>ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537425|gb|EEF39053.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 469

 Score =  622 bits (1603), Expect = e-175
 Identities = 321/470 (68%), Positives = 359/470 (76%), Gaps = 7/470 (1%)
 Frame = +1

Query: 55   MEGK--RASFLLSIFATIVVSFSAPLRYXXXXXXXXXXXXXXXXXXXXDEALPDSETXXX 228
            MEGK  R +FLL    TI  S S  L Y                    +     +E+   
Sbjct: 1    MEGKAGRNAFLLFFSFTIFFSHSTSLNYQTLVANPLRSQPTLSWTDS-ESPTDTAESSAT 59

Query: 229  XXXXXXXXXXPPLNSTPESLFTLRLRRDAARVEALSTVAAAAN----VSGDFSSSIISGL 396
                         NSTPE+LFT RL+RDAARVEA+S +A  A     V   FSSS+ISGL
Sbjct: 60   FSVQLHHVDALSFNSTPETLFTTRLQRDAARVEAISYLAETAGTGKRVGTGFSSSVISGL 119

Query: 397  AQGSGEYFTRIGIGTPAKYVYMVLDTGSDVVWVQCSPCRKCYTQTDPVFDPKASTSFLGV 576
            AQGSGEYFTRIG+GTP +YVYMVLDTGSD+VW+QC+PC++CY Q+DPVFDP+ S SF  +
Sbjct: 120  AQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASI 179

Query: 577  SCVSPLCRRLDSPGCNS-RQKCLYQVSYGDGSFTVGEFSTETLTFRRTKVNNVALGCGHD 753
            +C SPLC RLDSPGCN+ +Q C+YQVSYGDGSFT G+FSTETLTFRRT+V  VALGCGHD
Sbjct: 180  ACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHD 239

Query: 754  NEGLFVXXXXXXXXXXXXXSFPTQAGPRFGRKFSYCLVDRSASSKPSSLVFGESAVSRNA 933
            NEGLFV             SFP+Q G RF  KFSYCLVDRSASSKPSS+VFG+SAVSR A
Sbjct: 240  NEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTA 299

Query: 934  VFTPLLTNPKLDTFYYVGLNGISVGGTRVPSITASLFKLDRAGNGGVIVDSGTSVTRLTR 1113
             FTPL++NPKLDTFYYV L GISVGGTRVP ITASLFKLD+ GNGGVI+DSGTSVTRLTR
Sbjct: 300  RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTR 359

Query: 1114 PAYIALRDAFRAGASNLKRSTEFSLFDTCFDLSGKTEVKVPTVVLHFAGADVSLPASNYL 1293
            PAYIA RDAFRAGASNLKR+ +FSLFDTCFDLSGKTEVKVPTVVLHF GADVSLPASNYL
Sbjct: 360  PAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYL 419

Query: 1294 IPVDSDGKFCFAFAGTTGGLSIIGNIQQQGYRVVFDLASNRVGFAPRGCA 1443
            IPVD+ G FC AFAGT GGLSIIGNIQQQG+RVV+DLA +RVGFAP GCA
Sbjct: 420  IPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469


>ref|XP_002302634.1| predicted protein [Populus trichocarpa] gi|222844360|gb|EEE81907.1|
            predicted protein [Populus trichocarpa]
          Length = 490

 Score =  605 bits (1561), Expect = e-171
 Identities = 296/400 (74%), Positives = 338/400 (84%), Gaps = 8/400 (2%)
 Frame = +1

Query: 268  NSTPESLFTLRLRRDAARVEALSTVAAAANVSG-------DFSSSIISGLAQGSGEYFTR 426
            + TP+ LF  RL RDA+RV++L+++AAA   +         FSSS+ SGLAQGSGEYFTR
Sbjct: 91   DETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGLAQGSGEYFTR 150

Query: 427  IGIGTPAKYVYMVLDTGSDVVWVQCSPCRKCYTQTDPVFDPKASTSFLGVSCVSPLCRRL 606
            +G+GTPA+YV+MVLDTGSDVVW+QC+PC+KCY+QTDPVF+P  S SF  + C SPLCRRL
Sbjct: 151  LGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPLCRRL 210

Query: 607  DSPGCNSRQK-CLYQVSYGDGSFTVGEFSTETLTFRRTKVNNVALGCGHDNEGLFVXXXX 783
            DSPGC++++  CLYQVSYGDGSFT GEFSTETLTFR T+V  VALGCGHDNEGLF+    
Sbjct: 211  DSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAG 270

Query: 784  XXXXXXXXXSFPTQAGPRFGRKFSYCLVDRSASSKPSSLVFGESAVSRNAVFTPLLTNPK 963
                     SFP+Q G RF RKFSYCLVDRSASSKPS +VFG+SA+SR A FTPL++NPK
Sbjct: 271  LLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFTPLVSNPK 330

Query: 964  LDTFYYVGLNGISVGGTRVPSITASLFKLDRAGNGGVIVDSGTSVTRLTRPAYIALRDAF 1143
            LDTFYYV L G+SVGGTRVP ITASLFKLD  GNGGVI+DSGTSVTRLTRPAY+ALRDAF
Sbjct: 331  LDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAF 390

Query: 1144 RAGASNLKRSTEFSLFDTCFDLSGKTEVKVPTVVLHFAGADVSLPASNYLIPVDSDGKFC 1323
            R GASNLKR+ EFSLFDTCFDLSGKTEVKVPTVVLHF GADVSLPASNYLIPVD+ G FC
Sbjct: 391  RVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVDNSGSFC 450

Query: 1324 FAFAGTTGGLSIIGNIQQQGYRVVFDLASNRVGFAPRGCA 1443
            FAFAGT  GLSI+GNIQQQG+RVV+DLA++RVGFAPRGCA
Sbjct: 451  FAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGCA 490


>ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
            sativus]
          Length = 471

 Score =  601 bits (1550), Expect = e-169
 Identities = 302/398 (75%), Positives = 333/398 (83%), Gaps = 6/398 (1%)
 Frame = +1

Query: 268  NSTPESLFTLRLRRDAARVEALSTVAAAA-NVSGD-----FSSSIISGLAQGSGEYFTRI 429
            N TPE LF LRL+RDA RV+ LS++ A + N+S       FSSS+ISGLAQGSGEYFTRI
Sbjct: 74   NRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRI 133

Query: 430  GIGTPAKYVYMVLDTGSDVVWVQCSPCRKCYTQTDPVFDPKASTSFLGVSCVSPLCRRLD 609
            G+GTP KYVYMVLDTGSD+VW+QC+PC+ CY+QTDPVF+P  S SF  V C +PLCRRL+
Sbjct: 134  GVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLE 193

Query: 610  SPGCNSRQKCLYQVSYGDGSFTVGEFSTETLTFRRTKVNNVALGCGHDNEGLFVXXXXXX 789
            SPGCN RQ CLYQVSYGDGS+T GEF TETLTFRRTKV  VALGCGHDNEGLFV      
Sbjct: 194  SPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLL 253

Query: 790  XXXXXXXSFPTQAGPRFGRKFSYCLVDRSASSKPSSLVFGESAVSRNAVFTPLLTNPKLD 969
                   SFP+QAG  F +KFSYCLVDRSASSKPSS+VFG SAVSR A FTPLLTNP+LD
Sbjct: 254  GLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLD 313

Query: 970  TFYYVGLNGISVGGTRVPSITASLFKLDRAGNGGVIVDSGTSVTRLTRPAYIALRDAFRA 1149
            TFYYV L GISVGGT V  ITAS FKLDR GNGGVI+D GTSVTRL +PAYIALRDAFRA
Sbjct: 314  TFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRA 373

Query: 1150 GASNLKRSTEFSLFDTCFDLSGKTEVKVPTVVLHFAGADVSLPASNYLIPVDSDGKFCFA 1329
            GAS+LK + EFSLFDTC+DLSGKT VKVPTVVLHF GADVSLPASNYLIPVD  G+FCFA
Sbjct: 374  GASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFA 433

Query: 1330 FAGTTGGLSIIGNIQQQGYRVVFDLASNRVGFAPRGCA 1443
            FAGTT GLSIIGNIQQQG+RVV+DLAS+RVGF+PRGCA
Sbjct: 434  FAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471


>ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  600 bits (1546), Expect = e-169
 Identities = 311/472 (65%), Positives = 351/472 (74%), Gaps = 10/472 (2%)
 Frame = +1

Query: 55   MEGKRAS----FLLSIFATIVVSFSAPLRYXXXXXXXXXXXXXXXXXXXXDEALPDSETX 222
            MEGK+ +      L +  T+ +S SA L+                      E+ PD E  
Sbjct: 1    MEGKKKTTNCLLFLFLSLTLSLSLSAALQLQTQTLPLHSLPHPPAISWPESESEPDPEEE 60

Query: 223  XXXXXXXXXXXXPPLNSTPESLFTLRLRRDAARVEALSTVAA-----AANVSGDFSSSII 387
                           N TPE LF LRL+RDA RVE +  +AA     A      FSSSII
Sbjct: 61   ALSLHLHHIDALSS-NKTPEQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFSSSII 119

Query: 388  SGLAQGSGEYFTRIGIGTPAKYVYMVLDTGSDVVWVQCSPCRKCYTQTDPVFDPKASTSF 567
            SGLAQGSGEYFTRIG+GTPA+YVYMVLDTGSDVVW+QC+PCRKCYTQ DPVFDP  S ++
Sbjct: 120  SGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTY 179

Query: 568  LGVSCVSPLCRRLDSPGCNSRQK-CLYQVSYGDGSFTVGEFSTETLTFRRTKVNNVALGC 744
             G+ C +PLCRRLDSPGCN++ K C YQVSYGDGSFT G+FSTETLTFRRT+V  VALGC
Sbjct: 180  AGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVTRVALGC 239

Query: 745  GHDNEGLFVXXXXXXXXXXXXXSFPTQAGPRFGRKFSYCLVDRSASSKPSSLVFGESAVS 924
            GHDNEGLF+             SFP Q G RF +KFSYCLVDRSAS+KPSS+VFG+SAVS
Sbjct: 240  GHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGDSAVS 299

Query: 925  RNAVFTPLLTNPKLDTFYYVGLNGISVGGTRVPSITASLFKLDRAGNGGVIVDSGTSVTR 1104
            R A FTPL+ NPKLDTFYY+ L GISVGG+ V  ++ASLF+LD AGNGGVI+DSGTSVTR
Sbjct: 300  RTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSVTR 359

Query: 1105 LTRPAYIALRDAFRAGASNLKRSTEFSLFDTCFDLSGKTEVKVPTVVLHFAGADVSLPAS 1284
            LTRPAYIALRDAFR GAS+LKR+ EFSLFDTCFDLSG TEVKVPTVVLHF GADVSLPA+
Sbjct: 360  LTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRGADVSLPAT 419

Query: 1285 NYLIPVDSDGKFCFAFAGTTGGLSIIGNIQQQGYRVVFDLASNRVGFAPRGC 1440
            NYLIPVD+ G FCFAFAGT  GLSIIGNIQQQG+RV FDLA +RVGFAPRGC
Sbjct: 420  NYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


Top