BLASTX nr result

ID: Lithospermum22_contig00004840 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00004840
         (1534 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]                75   6e-11
gb|AAA87849.1| preprocathepsin cathepsin L [Schistosoma japonicum]     71   7e-10
ref|XP_002102312.1| GD19566 [Drosophila simulans] gi|194198239|g...    71   7e-10
ref|XP_001979023.1| GG10644 [Drosophila erecta] gi|190650726|gb|...    71   7e-10
ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thali...    71   7e-10

>dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score = 74.7 bits (182), Expect = 6e-11
 Identities = 56/201 (27%), Positives = 100/201 (49%), Gaps = 4/201 (1%)
 Frame = -3

Query: 911 DKDILPQVAHQGLSCTCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTY 732
           ++  +  V +QG   +C  F+TVA+         G  ++ S Q +VD  +   S      
Sbjct: 136 ERGAVSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSG----- 190

Query: 731 CKGLRMGIVLDFI-KREGLVLESDYSPYDGY--IHSPLKDKKAVFKIVDFQSVSTLDGRT 561
           C G  M     FI    G+  ESDY PY G   +  P+++K  +  I  ++ V  ++ + 
Sbjct: 191 CNGGSMDYAFQFIVSNGGIDSESDY-PYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKA 249

Query: 560 IERGIKNEMV-VGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDY 384
           + + + ++ V VG+  + ++F  +T   +  S G +L      HGV + G+G E NG DY
Sbjct: 250 LMKAVAHQPVSVGIEASGRAFQLYTSGVLTGSCGTNLD-----HGVVVVGYGSE-NGKDY 303

Query: 383 YEVMNSHGVEYCDNGFLKVAR 321
           + V NS G E+ ++G++++ R
Sbjct: 304 WIVRNSWGPEWGEDGYIRMER 324


>gb|AAA87849.1| preprocathepsin cathepsin L [Schistosoma japonicum]
          Length = 331

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 52/197 (26%), Positives = 88/197 (44%)
 Frame = -3

Query: 911 DKDILPQVAHQGLSCTCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTY 732
           D   +  V HQGL  +C  F+   A    + +++   V  S Q +VD      +      
Sbjct: 124 DHGAVTAVKHQGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDCRYNYGNDG---- 179

Query: 731 CKGLRMGIVLDFIKREGLVLESDYSPYDGYIHSPLKDKKAVFKIVDFQSVSTLDGRTIER 552
           C+G  M +  +++++  +  E+DY       +   +  K V K+  F  +   D +T+E+
Sbjct: 180 CEGGTMDLAFNYLEKHYIESENDYKYLGHDANCHYRKSKGVVKVKKFGDLPARDEKTLEK 239

Query: 551 GIKNEMVVGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDYYEVM 372
            +     + V            +GIY+S   D K A + HGV   G+G E NG DY+ + 
Sbjct: 240 AVYQYGPISVGIVALDSLILYKSGIYES--KDCKYADINHGVLAVGYGRE-NGKDYWLIK 296

Query: 371 NSHGVEYCDNGFLKVAR 321
           NS G  +  NG+ K+ R
Sbjct: 297 NSWGDLWGMNGYFKLRR 313


>ref|XP_002102312.1| GD19566 [Drosophila simulans] gi|194198239|gb|EDX11815.1| GD19566
           [Drosophila simulans]
          Length = 336

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 51/198 (25%), Positives = 94/198 (47%), Gaps = 4/198 (2%)
 Frame = -3

Query: 902 ILPQVAHQGLSC-TCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTYCK 726
           ++  V  QG  C +C  F+T     A +AK+YG     S +++VD + P P++     C 
Sbjct: 129 LISPVGDQGTECLSCWAFSTSGVLEAHLAKKYGKLEPLSPKHLVDCV-PYPNNG----CS 183

Query: 725 GLRMGIVLDFIKREGLVLESDYSPYDGYIHSPL-KDKKAVFKIVDFQSVSTLDGRTIERG 549
           G  + +  ++ +  G+  +  Y PY+      L K  ++   +  + ++S  D R +   
Sbjct: 184 GGWVSVAFNYTRDHGIATKESY-PYEPVSGECLWKSDRSTGNLSGYVTLSNYDERELAEV 242

Query: 548 IKN--EMVVGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDYYEV 375
           + N   + V +    + F ++ G GI        K   + H V + GFG      DY+ +
Sbjct: 243 VYNIGPVAVSIDHLHEEFDQYFG-GILSIPACRSKRQDLTHSVLLVGFGTHPKWGDYWII 301

Query: 374 MNSHGVEYCDNGFLKVAR 321
            NS+G E+ ++G+LK+AR
Sbjct: 302 KNSYGTEWGESGYLKLAR 319


>ref|XP_001979023.1| GG10644 [Drosophila erecta] gi|190650726|gb|EDV47981.1| GG10644
           [Drosophila erecta]
          Length = 344

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 49/197 (24%), Positives = 94/197 (47%), Gaps = 4/197 (2%)
 Frame = -3

Query: 899 LPQVAHQGLSC-TCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTYCKG 723
           + +V +QG  C +C  F+T     A +AK+    V  S Q++VD + P P++     C G
Sbjct: 137 ISEVGNQGTQCLSCWAFSTSGVLEAHLAKKNKKLVPLSPQHLVDCV-PYPNNG----CSG 191

Query: 722 LRMGIVLDFIKREGLVLESDYSPYDGYIHSPL-KDKKAVFKIVDFQSVSTLDGRTIERGI 546
             + +   ++ ++G+  +  Y PY+      L     +   + D+ ++S+ D + +   +
Sbjct: 192 GWVSVAFKYMMKKGIATKESY-PYEPKARDCLWNSTNSAGTLTDYVTLSSYDEKELAEVV 250

Query: 545 KN--EMVVGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDYYEVM 372
            N   + V +    + F ++ G GI            + H V + GFG      DY+ + 
Sbjct: 251 YNVGPVAVSIDHLHEEFDQYFG-GILSIPACRSSRTDLTHSVLVVGFGTHPKWGDYWLIK 309

Query: 371 NSHGVEYCDNGFLKVAR 321
           NS+G+E+ +NG+ K+AR
Sbjct: 310 NSYGIEWGENGYFKLAR 326


>ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
           gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName:
           Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
           Precursor gi|4678354|emb|CAB41164.1| cysteine
           endopeptidase-like protein [Arabidopsis thaliana]
           gi|332644882|gb|AEE78403.1| putative cysteine proteinase
           [Arabidopsis thaliana]
          Length = 361

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 3/199 (1%)
 Frame = -3

Query: 908 KDILPQVAHQGLSCTCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTYC 729
           K  + ++ +QG   +C  F+TVAA       +    V+ S Q +VD  + +        C
Sbjct: 137 KGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQNEG-----C 191

Query: 728 KGLRMGIVLDFIKREGLVLESDYSPYDGYIH--SPLKDKKAVFKIVDFQSVSTLDGRTIE 555
            G  M I  +FIK+ G +   D  PY+G        KD   +  I   + V   D   + 
Sbjct: 192 NGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALL 251

Query: 554 RGIKNEMVVGVMRAWKSFSKFTGNGIYK-SMGDDLKNAPVLHGVFITGFGGEGNGDDYYE 378
           + + N+ V   + A  S  +F   G++  S G +L      HGV   G+G E  G  Y+ 
Sbjct: 252 KAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELN-----HGVAAVGYGSE-RGKKYWI 305

Query: 377 VMNSHGVEYCDNGFLKVAR 321
           V NS G E+ + G++K+ R
Sbjct: 306 VRNSWGAEWGEGGYIKIER 324


Top