BLASTX nr result

ID: Lithospermum22_contig00016030

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00016030
         (1724 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   379   e-102
ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha...   379   e-102
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   377   e-102
dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]     366   9e-99
ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein ...   363   1e-97

>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  379 bits (972), Expect = e-102
 Identities = 216/526 (41%), Positives = 292/526 (55%), Gaps = 7/526 (1%)
 Frame = -1

Query: 1724 DIHHRYSDTVKKFLDNDLLPKKDSVEYFNALVHRDHL-RGRLLAGEPEPTTPLTFAGGNL 1548
            D+HHRYSD VK  L  D LP+K S+ Y+ ++ HRD L  GR L  +   +TPLTF  GN 
Sbjct: 44   DLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRDILIHGRKLVSD-NTSTPLTFFSGNE 102

Query: 1547 TIRIDNLGFLHYGAVSVGTPGTEFIVGLDTGSDLFWIPCECTD--CPRHLTALSGQRIDL 1374
            T R  +LGFLHY  VS+GTP   ++V LDTGSDLFW+PC+CT+  C + L   SG++ID 
Sbjct: 103  TYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDF 162

Query: 1373 NIYALNGSSTEKPVTCDNAICGSSYQCLTEQNVCSYQTSYLTVNTSTSGIFLEDVLQLAT 1194
            NIY  N SST + + C+N +C    +C + Q+ C YQ  YL+  TS++G+ +ED+L L T
Sbjct: 163  NIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTT 222

Query: 1193 NENQPSSVNTTVIFGCGVVQTGSFXXXXXXXXXXXXXLEDISVPSALATKGLTANSFSMC 1014
            ++ Q  +++  +IFGCG VQTGSF             + +ISVPS LA +G T+NSFSMC
Sbjct: 223  DDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMC 282

Query: 1013 FGRDTFGRIVFGDKGSLDQGETPFIISATNPTYXXXXXXXXXXXXXXXXXXINLTFSALF 834
            FGRD  GRI FGD GS  QGETPF +   +PTY                   +L FSA+F
Sbjct: 283  FGRDGIGRISFGDTGSSGQGETPFNLRQLHPTY-----NVSITKINVGGRDADLEFSAIF 337

Query: 833  DSGSSFTMFPDAIYTSITDSFDSQVLDPRRNG-SDLAFEYCYSLRSSSSGFESPKAPNLT 657
            DSG+SFT   D  YT I++SF+    + R +  SD+ FEYCY + S+ +  E    P + 
Sbjct: 338  DSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLE---IPTVN 394

Query: 656  FTMRGGDEFNVTAPLIIFHLEDXXXXXXXXXXXXXXXSIIGQNFMTGYRLVYDREKLVLG 477
              M+GG +FNVT P++I  L+                +IIGQNFMTGYR+V++RE+ VLG
Sbjct: 395  LVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMTGYRIVFNRERNVLG 454

Query: 476  WKKSDCYDTNQAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTQTSPPPS 297
            WK SDCYD                                            T  +P  +
Sbjct: 455  WKASDCYDDMDT-------------------------TTFPVDPISPGIPPATAVNPQAT 489

Query: 296  SAAGNTTNDSPAGPGFRLPGTGNA---AQLNSFTWKLVMVFLSLFS 168
            + +GNTT  S   P    P   NA    +LNS T+ ++MV +  F+
Sbjct: 490  AGSGNTTEVSGTPP----PVGNNAPKLPKLNSLTFAIIMVLIPFFT 531


>ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  379 bits (972), Expect = e-102
 Identities = 205/426 (48%), Positives = 263/426 (61%), Gaps = 3/426 (0%)
 Frame = -1

Query: 1724 DIHHRYSDTVKKFLDNDLLPKKDSVEYFNALVHRDHL-RGRLLAGEPEPTTPLTFAGGNL 1548
            + HHR+SD V   L  D LP +DS +Y+  + HRD L RGR LA E +    +TF+ GN 
Sbjct: 36   EFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSL--VTFSDGNE 93

Query: 1547 TIRIDNLGFLHYGAVSVGTPGTEFIVGLDTGSDLFWIPCECTDCPRHLTALSGQRIDLNI 1368
            T+R+D LGFLHY  V+VGTP   F+V LDTGSDLFW+PC+CT+C R L A  G  +DLNI
Sbjct: 94   TVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153

Query: 1367 YALNGSSTEKPVTCDNAICGSSYQCLTEQNVCSYQTSYLTVNTSTSGIFLEDVLQLATNE 1188
            Y+ N SST   V C++ +C    +C + ++ C YQ  YL+  TS++G+ +EDVL L +N+
Sbjct: 154  YSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213

Query: 1187 NQPSSVNTTVIFGCGVVQTGSFXXXXXXXXXXXXXLEDISVPSALATKGLTANSFSMCFG 1008
                ++   V FGCG VQTG F             LEDISVPS LA +G+ ANSFSMCFG
Sbjct: 214  KSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273

Query: 1007 RDTFGRIVFGDKGSLDQGETPFIISATNPTYXXXXXXXXXXXXXXXXXXINLTFSALFDS 828
             D  GRI FGDKGS+DQ ETP  I   +PTY                   +L F A+FDS
Sbjct: 274  NDGAGRISFGDKGSVDQRETPLNIRQPHPTY-----NITVTKISVGGNTGDLEFDAVFDS 328

Query: 827  GSSFTMFPDAIYTSITDSFDSQVLDPR--RNGSDLAFEYCYSLRSSSSGFESPKAPNLTF 654
            G+SFT   DA YT I++SF+S  LD R     S+L FEYCY+L  +   F+ P A NL  
Sbjct: 329  GTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYP-AVNL-- 385

Query: 653  TMRGGDEFNVTAPLIIFHLEDXXXXXXXXXXXXXXXSIIGQNFMTGYRLVYDREKLVLGW 474
            TM+GG  + V  PL++  ++D               SIIGQNFMTGYR+V+DREKL+LGW
Sbjct: 386  TMKGGSSYPVYHPLVVIPMKD-TDVYCLAIMKIEDISIIGQNFMTGYRVVFDREKLILGW 444

Query: 473  KKSDCY 456
            K+SDCY
Sbjct: 445  KESDCY 450


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  377 bits (968), Expect = e-102
 Identities = 205/426 (48%), Positives = 262/426 (61%), Gaps = 3/426 (0%)
 Frame = -1

Query: 1724 DIHHRYSDTVKKFLDNDLLPKKDSVEYFNALVHRDHL-RGRLLAGEPEPTTPLTFAGGNL 1548
            + HHR+SD V   L  D LP +DS +Y+  + HRD L RGR LA E +    +TF+ GN 
Sbjct: 36   EFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSL--VTFSDGNE 93

Query: 1547 TIRIDNLGFLHYGAVSVGTPGTEFIVGLDTGSDLFWIPCECTDCPRHLTALSGQRIDLNI 1368
            TIR+D LGFLHY  V+VGTP   F+V LDTGSDLFW+PC+CT+C R L A  G  +DLNI
Sbjct: 94   TIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153

Query: 1367 YALNGSSTEKPVTCDNAICGSSYQCLTEQNVCSYQTSYLTVNTSTSGIFLEDVLQLATNE 1188
            Y+ N SST   V C++ +C    +C + ++ C YQ  YL+  TS++G+ +EDVL L +N+
Sbjct: 154  YSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213

Query: 1187 NQPSSVNTTVIFGCGVVQTGSFXXXXXXXXXXXXXLEDISVPSALATKGLTANSFSMCFG 1008
                ++   V  GCG VQTG F             LEDISVPS LA +G+ ANSFSMCFG
Sbjct: 214  KSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273

Query: 1007 RDTFGRIVFGDKGSLDQGETPFIISATNPTYXXXXXXXXXXXXXXXXXXINLTFSALFDS 828
             D  GRI FGDKGS+DQ ETP  I   +PTY                   +L F A+FDS
Sbjct: 274  NDGAGRISFGDKGSVDQRETPLNIRQPHPTY-----NITVTKISVEGNTGDLEFDAVFDS 328

Query: 827  GSSFTMFPDAIYTSITDSFDSQVLDPR--RNGSDLAFEYCYSLRSSSSGFESPKAPNLTF 654
            G+SFT   DA YT I++SF+S  LD R     S+L FEYCY+L  +   F+ P A NL  
Sbjct: 329  GTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYP-AVNL-- 385

Query: 653  TMRGGDEFNVTAPLIIFHLEDXXXXXXXXXXXXXXXSIIGQNFMTGYRLVYDREKLVLGW 474
            TM+GG  + V  PL++  ++D               SIIGQNFMTGYR+V+DREKL+LGW
Sbjct: 386  TMKGGSSYPVYHPLVVIPMKD-TDVYCLAILKIEDISIIGQNFMTGYRVVFDREKLILGW 444

Query: 473  KKSDCY 456
            K+SDCY
Sbjct: 445  KESDCY 450


>dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  366 bits (940), Expect = 9e-99
 Identities = 201/425 (47%), Positives = 259/425 (60%), Gaps = 3/425 (0%)
 Frame = -1

Query: 1724 DIHHRYSDTVKKFLDNDLLPKKDSVEYFNALVHRDHL-RGRLLAGEPEPTTPLTFAGGNL 1548
            + HHR+SD V   L  D LP +DS +Y+  + HRD L RGR LA E +    +TFA GN 
Sbjct: 36   EFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLASEDQSL--VTFADGNE 93

Query: 1547 TIRIDNLGFLHYGAVSVGTPGTEFIVGLDTGSDLFWIPCEC-TDCPRHLTALSGQRIDLN 1371
            TIR++ LGFLHY  V+VGTP   F+V LDTGSDLFW+PC+C T+C R L A  G  +DLN
Sbjct: 94   TIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLN 153

Query: 1370 IYALNGSSTEKPVTCDNAICGSSYQCLTEQNVCSYQTSYLTVNTSTSGIFLEDVLQLATN 1191
            IY+ N SST   V C++ +C    +C +  + C YQ  YL+  TS++G+ +EDVL L + 
Sbjct: 154  IYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSM 213

Query: 1190 ENQPSSVNTTVIFGCGVVQTGSFXXXXXXXXXXXXXLEDISVPSALATKGLTANSFSMCF 1011
            E     +   +  GCG+VQTG F             LEDISVPS LA +G+ ANSFSMCF
Sbjct: 214  EKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCF 273

Query: 1010 GRDTFGRIVFGDKGSLDQGETPFIISATNPTYXXXXXXXXXXXXXXXXXXINLTFSALFD 831
            G D  GRI FGDKGS+DQ ETP  I   +PTY                   +L F A+FD
Sbjct: 274  GDDGAGRISFGDKGSVDQRETPLNIRQPHPTY-----NVTVTQISVGGNTGDLEFDAVFD 328

Query: 830  SGSSFTMFPDAIYTSITDSFDSQVLDPR-RNGSDLAFEYCYSLRSSSSGFESPKAPNLTF 654
            +G+SFT   DA YT I++SF+S  LD R +  S+L FEYCY++  +   FE    P++  
Sbjct: 329  TGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAVSPNKKSFE---YPDVNL 385

Query: 653  TMRGGDEFNVTAPLIIFHLEDXXXXXXXXXXXXXXXSIIGQNFMTGYRLVYDREKLVLGW 474
            TM+GG  + V  PLI+  +ED               SIIGQNFMTGYR+V+DREKL+LGW
Sbjct: 386  TMKGGSSYPVYHPLIVVPIED-TVVYCLAIMKSEDISIIGQNFMTGYRVVFDREKLILGW 444

Query: 473  KKSDC 459
            K+SDC
Sbjct: 445  KESDC 449


>ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score =  363 bits (931), Expect = 1e-97
 Identities = 202/430 (46%), Positives = 259/430 (60%), Gaps = 7/430 (1%)
 Frame = -1

Query: 1721 IHHRYSDTVKKFLDNDLL-----PKKDSVEYFNALVHRDHL-RGRLLAGEPEPTTPLTFA 1560
            +HHR+S+ V+K+  +        P+K +VEY+  L  RD L RGR L+   +    L F+
Sbjct: 25   MHHRHSEPVRKWSHSTASGIPAPPEKGTVEYYAELADRDRLLRGRKLS---QIDDGLAFS 81

Query: 1559 GGNLTIRIDNLGFLHYGAVSVGTPGTEFIVGLDTGSDLFWIPCECTDCPRHLTALSGQRI 1380
             GN T RI +LGFLHY  V +GTPG +F+V LDTGSDLFW+PC+CT C    ++      
Sbjct: 82   DGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRCAATDSSAFASDF 141

Query: 1379 DLNIYALNGSSTEKPVTCDNAICGSSYQCLTEQNVCSYQTSYLTVNTSTSGIFLEDVLQL 1200
            DLN+Y  NGSST K VTC+N++C    QCL   + C Y  SY++  TSTSGI +EDVL L
Sbjct: 142  DLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHL 201

Query: 1199 ATNENQPSSVNTTVIFGCGVVQTGSFXXXXXXXXXXXXXLEDISVPSALATKGLTANSFS 1020
               +N    V   VIFGCG +Q+GSF             +E ISVPS L+ +G TA+SFS
Sbjct: 202  TQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFS 261

Query: 1019 MCFGRDTFGRIVFGDKGSLDQGETPFIISATNPTYXXXXXXXXXXXXXXXXXXINLTFSA 840
            MCFGRD  GRI FGDKGS DQ ETPF ++ ++PTY                  I++ F+A
Sbjct: 262  MCFGRDGIGRISFGDKGSFDQDETPFNLNPSHPTY-----NITVTQVRVGTTLIDVEFTA 316

Query: 839  LFDSGSSFTMFPDAIYTSITDSFDSQVLDPR-RNGSDLAFEYCYSLRSSSSGFESPKAPN 663
            LFDSG+SFT   D  YT +T+SF SQV D R R+ S + FEYCY +   S    +   P+
Sbjct: 317  LFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDM---SPDANTSLIPS 373

Query: 662  LTFTMRGGDEFNVTAPLIIFHLEDXXXXXXXXXXXXXXXSIIGQNFMTGYRLVYDREKLV 483
            ++ TM GG  F V  P+II   +                +IIGQNFMTGYR+V+DREKLV
Sbjct: 374  VSLTMGGGSHFAVYDPIIIISTQS-ELVYCLAVVKTAELNIIGQNFMTGYRVVFDREKLV 432

Query: 482  LGWKKSDCYD 453
            LGWKK DCYD
Sbjct: 433  LGWKKFDCYD 442

