BLASTX nr result
ID: Lithospermum22_contig00004840
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00004840
         (1534 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]                75   6e-11
gb|AAA87849.1| preprocathepsin cathepsin L [Schistosoma japonicum]     71   7e-10
ref|XP_002102312.1| GD19566 [Drosophila simulans] gi|194198239|g...    71   7e-10
ref|XP_001979023.1| GG10644 [Drosophila erecta] gi|190650726|gb|...    71   7e-10
ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thali...    71   7e-10

>dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score = 74.7 bits (182), Expect = 6e-11
 Identities = 56/201 (27%), Positives = 100/201 (49%), Gaps = 4/201 (1%)
 Frame = -3

Query: 911 DKDILPQVAHQGLSCTCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTY 732
           ++  +  V +QG   +C  F+TVA+         G  ++ S Q +VD  +   S
Sbjct: 136 ERGAVSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSG----- 190

Query: 731 CKGLRMGIVLDFI-KREGLVLESDYSPYDGY--IHSPLKDKKAVFKIVDFQSVSTLDGRT 561
           C G  M     FI    G+  ESDY PY G   +  P+++K  +  I  ++ V  ++ +
Sbjct: 191 CNGGSMDYAFQFIVSNGGIDSESDY-PYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKA 249

Query: 560 IERGIKNEMV-VGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDY 384
           + + + ++ V VG+  + ++F  +T   +  S G +L      HGV + G+G E NG DY
Sbjct: 250 LMKAVAHQPVSVGIEASGRAFQLYTSGVLTGSCGTNLD-----HGVVVVGYGSE-NGKDY 303

Query: 383 YEVMNSHGVEYCDNGFLKVAR 321
           + V NS G E+ ++G++++ R
Sbjct: 304 WIVRNSWGPEWGEDGYIRMER 324

>gb|AAA87849.1| preprocathepsin cathepsin L [Schistosoma japonicum]
          Length = 331

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 52/197 (26%), Positives = 88/197 (44%)
 Frame = -3

Query: 911 DKDILPQVAHQGLSCTCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTY 732
           D   +  V HQGL  +C  F+   A    + +++   V  S Q +VD      +
Sbjct: 124 DHGAVTAVKHQGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDCRYNYGNDG---- 179

Query: 731 CKGLRMGIVLDFIKREGLVLESDYSPYDGYIHSPLKDKKAVFKIVDFQSVSTLDGRTIER 552
           C+G  M +  +++++  +  E+DY       +   +  K V K+  F  +   D +T+E+
Sbjct: 180 CEGGTMDLAFNYLEKHYIESENDYKYLGHDANCHYRKSKGVVKVKKFGDLPARDEKTLEK 239

Query: 551 GIKNEMVVGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDYYEVM 372
            +     + V            +GIY+S   D K A + HGV   G+G E NG DY+ +
Sbjct: 240 AVYQYGPISVGIVALDSLILYKSGIYES--KDCKYADINHGVLAVGYGRE-NGKDYWLIK 296

Query: 371 NSHGVEYCDNGFLKVAR 321
           NS G  +  NG+ K+ R
Sbjct: 297 NSWGDLWGMNGYFKLRR 313

>ref|XP_002102312.1| GD19566 [Drosophila simulans]
 gi|194198239|gb|EDX11815.1| GD19566 [Drosophila simulans]
          Length = 336

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 51/198 (25%), Positives = 94/198 (47%), Gaps = 4/198 (2%)
 Frame = -3

Query: 902 ILPQVAHQGLSC-TCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTYCK 726
           ++  V  QG  C +C  F+T     A +AK+YG     S +++VD + P P++     C
Sbjct: 129 LISPVGDQGTECLSCWAFSTSGVLEAHLAKKYGKLEPLSPKHLVDCV-PYPNNG----CS 183

Query: 725 GLRMGIVLDFIKREGLVLESDYSPYDGYIHSPL-KDKKAVFKIVDFQSVSTLDGRTIERG 549
           G  + +  ++ +  G+  +  Y PY+      L K  ++   +  + ++S  D R +
Sbjct: 184 GGWVSVAFNYTRDHGIATKESY-PYEPVSGECLWKSDRSTGNLSGYVTLSNYDERELAEV 242

Query: 548 IKN--EMVVGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDYYEV 375
           + N   + V +    + F ++ G GI        K   + H V + GFG      DY+ +
Sbjct: 243 VYNIGPVAVSIDHLHEEFDQYFG-GILSIPACRSKRQDLTHSVLLVGFGTHPKWGDYWII 301

Query: 374 MNSHGVEYCDNGFLKVAR 321
            NS+G E+ ++G+LK+AR
Sbjct: 302 KNSYGTEWGESGYLKLAR 319

>ref|XP_001979023.1| GG10644 [Drosophila erecta]
 gi|190650726|gb|EDV47981.1| GG10644 [Drosophila erecta]
          Length = 344

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 49/197 (24%), Positives = 94/197 (47%), Gaps = 4/197 (2%)
 Frame = -3

Query: 899 LPQVAHQGLSC-TCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTYCKG 723
           + +V +QG  C +C  F+T     A +AK+    V  S Q++VD + P P++     C G
Sbjct: 137 ISEVGNQGTQCLSCWAFSTSGVLEAHLAKKNKKLVPLSPQHLVDCV-PYPNNG----CSG 191

Query: 722 LRMGIVLDFIKREGLVLESDYSPYDGYIHSPL-KDKKAVFKIVDFQSVSTLDGRTIERGI 546
             + +   ++ ++G+  +  Y PY+      L     +   + D+ ++S+ D + +   +
Sbjct: 192 GWVSVAFKYMMKKGIATKESY-PYEPKARDCLWNSTNSAGTLTDYVTLSSYDEKELAEVV 250

Query: 545 KN--EMVVGVMRAWKSFSKFTGNGIYKSMGDDLKNAPVLHGVFITGFGGEGNGDDYYEVM 372
            N   + V +    + F ++ G GI            + H V + GFG      DY+ +
Sbjct: 251 YNVGPVAVSIDHLHEEFDQYFG-GILSIPACRSSRTDLTHSVLVVGFGTHPKWGDYWLIK 309

Query: 371 NSHGVEYCDNGFLKVAR 321
           NS+G+E+ +NG+ K+AR
Sbjct: 310 NSYGIEWGENGYFKLAR 326

>ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags: Precursor
 gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
 gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 3/199 (1%)
 Frame = -3

Query: 908 KDILPQVAHQGLSCTCTVFATVAACSAFVAKQYGVKVNFSAQYIVDNMSPKPSSSVPTYC 729
           K  + ++ +QG   +C  F+TVAA       +    V+ S Q +VD  + +        C
Sbjct: 137 KGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQNEG-----C 191

Query: 728 KGLRMGIVLDFIKREGLVLESDYSPYDGYIH--SPLKDKKAVFKIVDFQSVSTLDGRTIE 555
            G  M I  +FIK+ G +   D  PY+G        KD   +  I   + V   D   +
Sbjct: 192 NGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALL 251

Query: 554 RGIKNEMVVGVMRAWKSFSKFTGNGIYK-SMGDDLKNAPVLHGVFITGFGGEGNGDDYYE 378
           + + N+ V   + A  S  +F   G++  S G +L      HGV   G+G E  G  Y+
Sbjct: 252 KAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELN-----HGVAAVGYGSE-RGKKYWI 305

Query: 377 VMNSHGVEYCDNGFLKVAR 321
           V NS G E+ + G++K+ R
Sbjct: 306 VRNSWGAEWGEGGYIKIER 324