BLASTX nr result

ID: Angelica22_contig00001944 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00001944
         (1459 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002301457.1| predicted protein [Populus trichocarpa] gi|2...   512   e-143
dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]     509   e-142
ref|NP_567215.1| cathepsin B [Arabidopsis thaliana] gi|13877861|...   506   e-141
ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arab...   506   e-141
gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]                    504   e-140

>ref|XP_002301457.1| predicted protein [Populus trichocarpa] gi|222843183|gb|EEE80730.1|
            predicted protein [Populus trichocarpa]
          Length = 357

 Score =  512 bits (1319), Expect = e-143
 Identities = 242/345 (70%), Positives = 275/345 (79%), Gaps = 1/345 (0%)
 Frame = -3

Query: 1349 LLLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNGRFSNYTVS 1170
            LLL+G +  F  QV+A++  S  +L S ILQ+SI+K VN NPKAGWKA+MN  FSNYTV+
Sbjct: 12   LLLIGAIFTFQSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSNYTVA 71

Query: 1169 QFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQGHCGSCWA 990
            QFK+LLGVKPTP  EL+GIPV  H + L LP +FDARTAWP+CSTIG ILDQGHCGSCWA
Sbjct: 72   QFKYLLGVKPTPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWA 131

Query: 989  FAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKRSGVVTEEC 810
            F AVESLSDRFCI + MNISLSVN                GYPI+AWRYF   GVVTEEC
Sbjct: 132  FGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEEC 191

Query: 809  DPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPSNIMAEVYK 630
            DPYFD  GCSHPGCEP YPTPKC ++CV  N LWK SKH+ V  Y++ SDP +IMAE+YK
Sbjct: 192  DPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPESIMAEIYK 251

Query: 629  NGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLMANQWNRSW 450
            NGPVEVAFTVYEDFAHYKSGVYKHITG  MGGHAVKLIGWGTS++GE YWL+ANQWNR W
Sbjct: 252  NGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGW 311

Query: 449  GDDGYFKIRRGTNECGIEAEVVAGLPSSKNVVITNVN-DAFLDAA 318
            GDDGYFKIRRGTNECGIE +VVAGLPS++N+V   V+ DA  DA+
Sbjct: 312  GDDGYFKIRRGTNECGIEGDVVAGLPSTRNLVREVVSVDAREDAS 356


>dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score =  509 bits (1310), Expect = e-142
 Identities = 245/350 (70%), Positives = 278/350 (79%), Gaps = 4/350 (1%)
 Frame = -3

Query: 1373 TTGFFLASL-LLVGVLSC-FHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASM 1200
            TT   L S+ LL+G++S  F LQ V  ++ S ++L S ILQE IVK VN NP AGWKA++
Sbjct: 7    TTKLCLVSVFLLLGLVSSSFDLQGVKAENLSKQKLNSKILQEEIVKKVNQNPDAGWKAAI 66

Query: 1199 NGRFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSIL 1020
            N RFSN TV++FK LLGVKPTP     G+P+  H   L LP +FDARTAWP+C++IG+IL
Sbjct: 67   NDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDRSLKLPKEFDARTAWPQCTSIGNIL 126

Query: 1019 DQGHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYF 840
            DQGHCGSCWAF AVESLSDRFCI+F MNISLSVN                GYPIAAW+YF
Sbjct: 127  DQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYF 186

Query: 839  KRSGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSD 660
              SGVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV GN LW  SKH+SVS Y V+S+
Sbjct: 187  SYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQLWSQSKHYSVSTYTVKSN 246

Query: 659  PSNIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYW 480
            P +IMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+DEGEDYW
Sbjct: 247  PQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYW 306

Query: 479  LMANQWNRSWGDDGYFKIRRGTNECGIEAEVVAGLPSSKNV--VITNVND 336
            L+ANQWNRSWGDDGYF IRRGTNECGIE E VAGLPSS+NV  VIT  +D
Sbjct: 307  LLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFKVITGSDD 356


>ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
            gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B
            cysteine protease [Arabidopsis thaliana]
            gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis
            thaliana] gi|21281113|gb|AAM45063.1| putative cathepsin B
            cysteine protease [Arabidopsis thaliana]
            gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine
            protease, putative [Arabidopsis thaliana]
            gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
            gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis
            thaliana] gi|51968702|dbj|BAD43043.1| cathepsin B-like
            cysteine protease [Arabidopsis thaliana]
            gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine
            protease [Arabidopsis thaliana]
            gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis
            thaliana]
          Length = 359

 Score =  506 bits (1304), Expect = e-141
 Identities = 243/348 (69%), Positives = 273/348 (78%), Gaps = 3/348 (0%)
 Frame = -3

Query: 1370 TGFFLASL-LLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNG 1194
            T   LAS+ LL+G+L  F L+ +  +S + ++L+S ILQ+ IVK VN NP AGWKA++N 
Sbjct: 6    TKLCLASVFLLLGLLLAFDLKGIEAESLTKQKLDSKILQDEIVKKVNENPNAGWKAAIND 65

Query: 1193 RFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQ 1014
            RFSN TV++FK LLGVKPTP     G+P+  H   L LP  FDARTAWP+C++IG+ILDQ
Sbjct: 66   RFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILDQ 125

Query: 1013 GHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKR 834
            GHCGSCWAF AVESLSDRFCIQF MNISLSVN                GYPIAAW+YF  
Sbjct: 126  GHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSY 185

Query: 833  SGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPS 654
            SGVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV  N LW  SKH+SVS Y V+S+P 
Sbjct: 186  SGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQ 245

Query: 653  NIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLM 474
            +IMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTS EGEDYWLM
Sbjct: 246  DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305

Query: 473  ANQWNRSWGDDGYFKIRRGTNECGIEAEVVAGLPSSKNV--VITNVND 336
            ANQWNR WGDDGYF IRRGTNECGIE E VAGLPSSKNV  V T  ND
Sbjct: 306  ANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVFRVDTGSND 353


>ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
            lyrata] gi|297335237|gb|EFH65654.1| hypothetical protein
            ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata]
          Length = 360

 Score =  506 bits (1302), Expect = e-141
 Identities = 240/350 (68%), Positives = 274/350 (78%), Gaps = 2/350 (0%)
 Frame = -3

Query: 1364 FFLASLLLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNGRFS 1185
            FF   LL+    S F+LQ +A ++ S ++L S ILQ  IVK VN NP AGWKA+ N RF+
Sbjct: 14   FFFLGLLI----SSFNLQGIAAENLSKQKLTSWILQNEIVKEVNENPNAGWKAAFNDRFA 69

Query: 1184 NYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQGHC 1005
            N TV++FK LLGVKPTP  E  G+P+  H   L LP +FDARTAW +C+++G ILDQGHC
Sbjct: 70   NATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSVGRILDQGHC 129

Query: 1004 GSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKRSGV 825
            GSCWAF AVESLSDRFCI+++MNISLSVN                GYPIAAWRYFK  GV
Sbjct: 130  GSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGV 189

Query: 824  VTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPSNIM 645
            VTEECDPYFD TGCSHPGCEPAYPTPKC ++CV GN LW+ SKH+ VSAYKV+S P +IM
Sbjct: 190  VTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIM 249

Query: 644  AEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLMANQ 465
            AEVYKNGPVEVAFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTSD+GEDYWL+ANQ
Sbjct: 250  AEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQ 309

Query: 464  WNRSWGDDGYFKIRRGTNECGIEAEVVAGLPSSKNVV--ITNVNDAFLDA 321
            WNRSWGDDGYFKIRRGTNECGIE  VVAGLPS +NV   IT  +D  + +
Sbjct: 310  WNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKGITTSDDLLVSS 359


>gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score =  504 bits (1299), Expect = e-140
 Identities = 239/346 (69%), Positives = 276/346 (79%), Gaps = 1/346 (0%)
 Frame = -3

Query: 1349 LLLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNGRFSNYTVS 1170
            LLL+G  S   LQVVA +  S  + ES ILQ+SIVK VN N KAGWKA++N RFSN+TVS
Sbjct: 12   LLLIGA-SVLVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVS 70

Query: 1169 QFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQGHCGSCWA 990
            QFK LLGVKPT  G+L+GIP+  H + L LP +FDAR AWP CSTIG ILDQGHCGSCWA
Sbjct: 71   QFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIGRILDQGHCGSCWA 130

Query: 989  FAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKRSGVVTEEC 810
            F AVESLSDRFCI + +NISLS N                GYP+ AW+YF R GVVT+EC
Sbjct: 131  FGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDEC 190

Query: 809  DPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPSNIMAEVYK 630
            DPYFD  GCSHPGCEPAYPTPKC ++CV  NLLW  SKHF V+AY + SDP +IM E+YK
Sbjct: 191  DPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSDPHSIMTELYK 250

Query: 629  NGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLMANQWNRSW 450
            NGPVEV+FTVYEDFAHYKSGVYKH+TG+ MGGHAVKLIGWGTS++GEDYWL+ANQWNR W
Sbjct: 251  NGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGW 310

Query: 449  GDDGYFKIRRGTNECGIEAEVVAGLPSSKNV-VITNVNDAFLDAAV 315
            GDDGYFKIRRGT+EC IE EVVAGLPS++N+ +  +V+DAFLDAA+
Sbjct: 311  GDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELDVSDAFLDAAM 356


Top