BLASTX nr result
ID: Angelica22_contig00001944
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00001944
         (1459 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002301457.1| predicted protein [Populus trichocarpa] gi|2...   512   e-143
dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]     509   e-142
ref|NP_567215.1| cathepsin B [Arabidopsis thaliana] gi|13877861|...   506   e-141
ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arab...   506   e-141
gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]                    504   e-140

>ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 512 bits (1319), Expect = e-143
 Identities = 242/345 (70%), Positives = 275/345 (79%), Gaps = 1/345 (0%)
 Frame = -3

Query: 1349 LLLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNGRFSNYTVS 1170
            LLL+G + F QV+A++ S +L S ILQ+SI+K VN NPKAGWKA+MN FSNYTV+
Sbjct: 12   LLLIGAIFTFQSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSNYTVA 71

Query: 1169 QFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQGHCGSCWA 990
            QFK+LLGVKPTP EL+GIPV H + L LP +FDARTAWP+CSTIG ILDQGHCGSCWA
Sbjct: 72   QFKYLLGVKPTPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWA 131

Query: 989  FAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKRSGVVTEEC 810
            F AVESLSDRFCI + MNISLSVN GYPI+AWRYF GVVTEEC
Sbjct: 132  FGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEEC 191

Query: 809  DPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPSNIMAEVYK 630
            DPYFD GCSHPGCEP YPTPKC ++CV N LWK SKH+ V Y++ SDP +IMAE+YK
Sbjct: 192  DPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPESIMAEIYK 251

Query: 629  NGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLMANQWNRSW 450
            NGPVEVAFTVYEDFAHYKSGVYKHITG MGGHAVKLIGWGTS++GE YWL+ANQWNR W
Sbjct: 252  NGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGW 311

Query: 449  GDDGYFKIRRGTNECGIEAEVVAGLPSSKNVVITNVN-DAFLDAA 318
            GDDGYFKIRRGTNECGIE +VVAGLPS++N+V V+ DA DA+
Sbjct: 312  GDDGYFKIRRGTNECGIEGDVVAGLPSTRNLVREVVSVDAREDAS 356

>dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score = 509 bits (1310), Expect = e-142
 Identities = 245/350 (70%), Positives = 278/350 (79%), Gaps = 4/350 (1%)
 Frame = -3

Query: 1373 TTGFFLASL-LLVGVLSC-FHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASM 1200
            TT L S+ LL+G++S F LQ V ++ S ++L S ILQE IVK VN NP AGWKA++
Sbjct: 7    TTKLCLVSVFLLLGLVSSSFDLQGVKAENLSKQKLNSKILQEEIVKKVNQNPDAGWKAAI 66

Query: 1199 NGRFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSIL 1020
            N RFSN TV++FK LLGVKPTP G+P+ H L LP +FDARTAWP+C++IG+IL
Sbjct: 67   NDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDRSLKLPKEFDARTAWPQCTSIGNIL 126

Query: 1019 DQGHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYF 840
            DQGHCGSCWAF AVESLSDRFCI+F MNISLSVN GYPIAAW+YF
Sbjct: 127  DQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYF 186

Query: 839  KRSGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSD 660
            SGVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV GN LW SKH+SVS Y V+S+
Sbjct: 187  SYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQLWSQSKHYSVSTYTVKSN 246

Query: 659  PSNIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYW 480
            P +IMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGT+DEGEDYW
Sbjct: 247  PQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYW 306

Query: 479  LMANQWNRSWGDDGYFKIRRGTNECGIEAEVVAGLPSSKNV--VITNVND 336
            L+ANQWNRSWGDDGYF IRRGTNECGIE E VAGLPSS+NV VIT +D
Sbjct: 307  LLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFKVITGSDD 356

>ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 506 bits (1304), Expect = e-141
 Identities = 243/348 (69%), Positives = 273/348 (78%), Gaps = 3/348 (0%)
 Frame = -3

Query: 1370 TGFFLASL-LLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNG 1194
            T LAS+ LL+G+L F L+ + +S + ++L+S ILQ+ IVK VN NP AGWKA++N
Sbjct: 6    TKLCLASVFLLLGLLLAFDLKGIEAESLTKQKLDSKILQDEIVKKVNENPNAGWKAAIND 65

Query: 1193 RFSNYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQ 1014
            RFSN TV++FK LLGVKPTP G+P+ H L LP FDARTAWP+C++IG+ILDQ
Sbjct: 66   RFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILDQ 125

Query: 1013 GHCGSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKR 834
            GHCGSCWAF AVESLSDRFCIQF MNISLSVN GYPIAAW+YF
Sbjct: 126  GHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSY 185

Query: 833  SGVVTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPS 654
            SGVVTEECDPYFD TGCSHPGCEPAYPTPKC ++CV N LW SKH+SVS Y V+S+P
Sbjct: 186  SGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQ 245

Query: 653  NIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLM 474
            +IMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTS EGEDYWLM
Sbjct: 246  DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305

Query: 473  ANQWNRSWGDDGYFKIRRGTNECGIEAEVVAGLPSSKNV--VITNVND 336
            ANQWNR WGDDGYF IRRGTNECGIE E VAGLPSSKNV V T ND
Sbjct: 306  ANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVFRVDTGSND 353

>ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp. lyrata]
          Length = 360

 Score = 506 bits (1302), Expect = e-141
 Identities = 240/350 (68%), Positives = 274/350 (78%), Gaps = 2/350 (0%)
 Frame = -3

Query: 1364 FFLASLLLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNGRFS 1185
            FF LL+ S F+LQ +A ++ S ++L S ILQ IVK VN NP AGWKA+ N RF+
Sbjct: 14   FFFLGLLI----SSFNLQGIAAENLSKQKLTSWILQNEIVKEVNENPNAGWKAAFNDRFA 69

Query: 1184 NYTVSQFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQGHC 1005
            N TV++FK LLGVKPTP E G+P+ H L LP +FDARTAW +C+++G ILDQGHC
Sbjct: 70   NATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSVGRILDQGHC 129

Query: 1004 GSCWAFAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKRSGV 825
            GSCWAF AVESLSDRFCI+++MNISLSVN GYPIAAWRYFK GV
Sbjct: 130  GSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGV 189

Query: 824  VTEECDPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPSNIM 645
            VTEECDPYFD TGCSHPGCEPAYPTPKC ++CV GN LW+ SKH+ VSAYKV+S P +IM
Sbjct: 190  VTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIM 249

Query: 644  AEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLMANQ 465
            AEVYKNGPVEVAFTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSD+GEDYWL+ANQ
Sbjct: 250  AEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQ 309

Query: 464  WNRSWGDDGYFKIRRGTNECGIEAEVVAGLPSSKNVV--ITNVNDAFLDA 321
            WNRSWGDDGYFKIRRGTNECGIE VVAGLPS +NV IT +D + +
Sbjct: 310  WNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKGITTSDDLLVSS 359

>gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score = 504 bits (1299), Expect = e-140
 Identities = 239/346 (69%), Positives = 276/346 (79%), Gaps = 1/346 (0%)
 Frame = -3

Query: 1349 LLLVGVLSCFHLQVVALKSASLEQLESDILQESIVKSVNNNPKAGWKASMNGRFSNYTVS 1170
            LLL+G S LQVVA + S + ES ILQ+SIVK VN N KAGWKA++N RFSN+TVS
Sbjct: 12   LLLIGA-SVLVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVS 70

Query: 1169 QFKHLLGVKPTPPGELQGIPVKIHSERLNLPSQFDARTAWPKCSTIGSILDQGHCGSCWA 990
            QFK LLGVKPT G+L+GIP+ H + L LP +FDAR AWP CSTIG ILDQGHCGSCWA
Sbjct: 71   QFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIGRILDQGHCGSCWA 130

Query: 989  FAAVESLSDRFCIQFDMNISLSVNXXXXXXXXXXXXXXXXGYPIAAWRYFKRSGVVTEEC 810
            F AVESLSDRFCI + +NISLS N GYP+ AW+YF R GVVT+EC
Sbjct: 131  FGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDEC 190

Query: 809  DPYFDQTGCSHPGCEPAYPTPKCKKQCVGGNLLWKNSKHFSVSAYKVQSDPSNIMAEVYK 630
            DPYFD GCSHPGCEPAYPTPKC ++CV NLLW SKHF V+AY + SDP +IM E+YK
Sbjct: 191  DPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSDPHSIMTELYK 250

Query: 629  NGPVEVAFTVYEDFAHYKSGVYKHITGEEMGGHAVKLIGWGTSDEGEDYWLMANQWNRSW 450
            NGPVEV+FTVYEDFAHYKSGVYKH+TG+ MGGHAVKLIGWGTS++GEDYWL+ANQWNR W
Sbjct: 251  NGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGW 310

Query: 449  GDDGYFKIRRGTNECGIEAEVVAGLPSSKNV-VITNVNDAFLDAAV 315
            GDDGYFKIRRGT+EC IE EVVAGLPS++N+ + +V+DAFLDAA+
Sbjct: 311  GDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELDVSDAFLDAAM 356