BLASTX nr result

ID: Cnidium21_contig00000778 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00000778
         (1487 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_563648.1| putative cathepsin B-like cysteine protease [Ar...   494   e-137
dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]     492   e-137
ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arab...   491   e-136
ref|XP_002301457.1| predicted protein [Populus trichocarpa] gi|2...   491   e-136
ref|NP_567215.1| cathepsin B [Arabidopsis thaliana] gi|13877861|...   489   e-135

>ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
            gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10
            [Arabidopsis thaliana] gi|14532526|gb|AAK63991.1|
            At1g02300/T6A9_10 [Arabidopsis thaliana]
            gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis
            thaliana] gi|332189292|gb|AEE27413.1| putative cathepsin
            B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  494 bits (1273), Expect = e-137
 Identities = 226/311 (72%), Positives = 254/311 (81%)
 Frame = -1

Query: 1220 VKSVNNNPKAGWKASMNDRFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLKLPSQF 1041
            VK VN NP AGWKAS NDRF+N TV++FK LLGVKPTP  E  G+P+  H   LKLP +F
Sbjct: 51   VKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEF 110

Query: 1040 DARTAWPKCSTIGSILDQGHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXX 861
            DARTAW +C++IG ILDQGHCGSCWAF AVESLSDRFCI+++MN+SLSVN          
Sbjct: 111  DARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLC 170

Query: 860  XXXXXXGYPIAAWRYFKRSGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLW 681
                  GYPIAAWRYFK  GVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV GN LW
Sbjct: 171  GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLW 230

Query: 680  KNSKHFSVSAYKVQSDPSNIMAEVYKNGPVEVSFTVYEDFAHYQSGVYKHITGEEMGGHA 501
            + SKH+ VSAYKV+S P +IMAEVYKNGPVEV+FTVYEDFAHY+SGVYKHITG  +GGHA
Sbjct: 231  RESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA 290

Query: 500  VKLIGWGTSDEGEDYWLMANQWNRSWGDDGYFKIRRGTNECGIEEDVVAGLPSSKNVVLE 321
            VKLIGWGTSD+GEDYWL+ANQWNRSWGDDGYFKIRRGTNECGIE  VVAGLPS +NVV  
Sbjct: 291  VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKG 350

Query: 320  ITNVNDAFLDA 288
            IT  +D  + +
Sbjct: 351  ITTSDDLLVSS 361


>dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score =  492 bits (1267), Expect = e-137
 Identities = 225/306 (73%), Positives = 252/306 (82%)
 Frame = -1

Query: 1220 VKSVNNNPKAGWKASMNDRFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLKLPSQF 1041
            VK VN NP AGWKA++NDRFSN TV++FK LLGVKPTP     G+P+  H   LKLP +F
Sbjct: 51   VKKVNQNPDAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDRSLKLPKEF 110

Query: 1040 DARTAWPKCSTIGSILDQGHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXX 861
            DARTAWP+C++IG+ILDQGHCGSCWAF AVESLSDRFCI+F MNISLSVN          
Sbjct: 111  DARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRC 170

Query: 860  XXXXXXGYPIAAWRYFKRSGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLW 681
                  GYPIAAW+YF  SGVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV GN LW
Sbjct: 171  GDGCDGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQLW 230

Query: 680  KNSKHFSVSAYKVQSDPSNIMAEVYKNGPVEVSFTVYEDFAHYQSGVYKHITGEEMGGHA 501
              SKH+SVS Y V+S+P +IMAEVYKNGPVEVSFTVYEDFAHY+SGVYKHITG  +GGHA
Sbjct: 231  SQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHA 290

Query: 500  VKLIGWGTSDEGEDYWLMANQWNRSWGDDGYFKIRRGTNECGIEEDVVAGLPSSKNVVLE 321
            VKLIGWGT+DEGEDYWL+ANQWNRSWGDDGYF IRRGTNECGIE++ VAGLPSS+NV   
Sbjct: 291  VKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFKV 350

Query: 320  ITNVND 303
            IT  +D
Sbjct: 351  ITGSDD 356


>ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
            lyrata] gi|297335237|gb|EFH65654.1| hypothetical protein
            ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata]
          Length = 360

 Score =  491 bits (1265), Expect = e-136
 Identities = 224/311 (72%), Positives = 253/311 (81%)
 Frame = -1

Query: 1220 VKSVNNNPKAGWKASMNDRFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLKLPSQF 1041
            VK VN NP AGWKA+ NDRF+N TV++FK LLGVKPTP  E  G+P+  H   LKLP +F
Sbjct: 49   VKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEF 108

Query: 1040 DARTAWPKCSTIGSILDQGHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXX 861
            DARTAW +C+++G ILDQGHCGSCWAF AVESLSDRFCI+++MNISLSVN          
Sbjct: 109  DARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLC 168

Query: 860  XXXXXXGYPIAAWRYFKRSGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLW 681
                  GYPIAAWRYFK  GVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV GN LW
Sbjct: 169  GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLW 228

Query: 680  KNSKHFSVSAYKVQSDPSNIMAEVYKNGPVEVSFTVYEDFAHYQSGVYKHITGEEMGGHA 501
            + SKH+ VSAYKV+S P +IMAEVYKNGPVEV+FTVYEDFAHY+SGVYKHITG  +GGHA
Sbjct: 229  RESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA 288

Query: 500  VKLIGWGTSDEGEDYWLMANQWNRSWGDDGYFKIRRGTNECGIEEDVVAGLPSSKNVVLE 321
            VKLIGWGTSD+GEDYWL+ANQWNRSWGDDGYFKIRRGTNECGIE  VVAGLPS +NV   
Sbjct: 289  VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKG 348

Query: 320  ITNVNDAFLDA 288
            IT  +D  + +
Sbjct: 349  ITTSDDLLVSS 359


>ref|XP_002301457.1| predicted protein [Populus trichocarpa] gi|222843183|gb|EEE80730.1|
            predicted protein [Populus trichocarpa]
          Length = 357

 Score =  491 bits (1265), Expect = e-136
 Identities = 227/312 (72%), Positives = 255/312 (81%)
 Frame = -1

Query: 1220 VKSVNNNPKAGWKASMNDRFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLKLPSQF 1041
            +K VN NPKAGWKA+MN  FSNYTV+QFK+LLGVKPTP  EL+GIPV  H + L+LP +F
Sbjct: 46   LKKVNGNPKAGWKATMNHHFSNYTVAQFKYLLGVKPTPKEELRGIPVISHPKSLRLPEEF 105

Query: 1040 DARTAWPKCSTIGSILDQGHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXX 861
            DARTAWP+CSTIG ILDQGHCGSCWAF AVESLSDRFCI + MNISLSVN          
Sbjct: 106  DARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLC 165

Query: 860  XXXXXXGYPIAAWRYFKRSGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLW 681
                  GYPI+AWRYF   GVVTEECDPYFD  GCSHPGCEP YPTPKC ++CV  N LW
Sbjct: 166  GSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLW 225

Query: 680  KNSKHFSVSAYKVQSDPSNIMAEVYKNGPVEVSFTVYEDFAHYQSGVYKHITGEEMGGHA 501
            K SKH+ V  Y++ SDP +IMAE+YKNGPVEV+FTVYEDFAHY+SGVYKHITG  MGGHA
Sbjct: 226  KKSKHYGVKPYRIDSDPESIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHA 285

Query: 500  VKLIGWGTSDEGEDYWLMANQWNRSWGDDGYFKIRRGTNECGIEEDVVAGLPSSKNVVLE 321
            VKLIGWGTS++GE YWL+ANQWNR WGDDGYFKIRRGTNECGIE DVVAGLPS++N+V E
Sbjct: 286  VKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIRRGTNECGIEGDVVAGLPSTRNLVRE 345

Query: 320  ITNVNDAFLDAS 285
            + +V DA  DAS
Sbjct: 346  VVSV-DAREDAS 356


>ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
            gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B
            cysteine protease [Arabidopsis thaliana]
            gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis
            thaliana] gi|21281113|gb|AAM45063.1| putative cathepsin B
            cysteine protease [Arabidopsis thaliana]
            gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine
            protease, putative [Arabidopsis thaliana]
            gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
            gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis
            thaliana] gi|51968702|dbj|BAD43043.1| cathepsin B-like
            cysteine protease [Arabidopsis thaliana]
            gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis
            thaliana]
          Length = 359

 Score =  489 bits (1258), Expect = e-135
 Identities = 226/306 (73%), Positives = 247/306 (80%)
 Frame = -1

Query: 1220 VKSVNNNPKAGWKASMNDRFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLKLPSQF 1041
            VK VN NP AGWKA++NDRFSN TV++FK LLGVKPTP     G+P+  H   LKLP  F
Sbjct: 48   VKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDPSLKLPKAF 107

Query: 1040 DARTAWPKCSTIGSILDQGHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXX 861
            DARTAWP+C++IG+ILDQGHCGSCWAF AVESLSDRFCIQF MNISLSVN          
Sbjct: 108  DARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRC 167

Query: 860  XXXXXXGYPIAAWRYFKRSGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLW 681
                  GYPIAAW+YF  SGVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV  N LW
Sbjct: 168  GDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLW 227

Query: 680  KNSKHFSVSAYKVQSDPSNIMAEVYKNGPVEVSFTVYEDFAHYQSGVYKHITGEEMGGHA 501
              SKH+SVS Y V+S+P +IMAEVYKNGPVEVSFTVYEDFAHY+SGVYKHITG  +GGHA
Sbjct: 228  SESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHA 287

Query: 500  VKLIGWGTSDEGEDYWLMANQWNRSWGDDGYFKIRRGTNECGIEEDVVAGLPSSKNVVLE 321
            VKLIGWGTS EGEDYWLMANQWNR WGDDGYF IRRGTNECGIE++ VAGLPSSKNV   
Sbjct: 288  VKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVFRV 347

Query: 320  ITNVND 303
             T  ND
Sbjct: 348  DTGSND 353

