BLASTX nr result
ID: Papaver22_contig00002200
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver22_contig00002200
         (2799 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]        96   4e-17
dbj|BAA22544.1| FBSB precursor [Ananas comosus]                        93   4e-16
gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]                  91   1e-15
sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B gi...    88   1e-14
gb|ADV41672.1| cysteine protease [Nicotiana tabacum]                   87   2e-14

>gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score = 96.3 bits (238), Expect = 4e-17
 Identities = 64/200 (32%), Positives = 96/200 (48%), Gaps = 7/200 (3%)
 Frame = +2

Query: 1964 GPYEKYVDWRLRVDKDKLPLIYNQFTLPCCWACAAATSLEWKHLIESGKLVPLSYQQLID 2143
            GP VDWR K + I NQ CW+ + S+E H I +G LV LS QQL+D
Sbjct: 114  GPNAGSVDWR---QKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVD 170

Query: 2144 SVPG------KDEIIWNVFSALDLVRNEGITSGKLYPFINKKGMKQQLPPVLDIKGIDGW 2305
            ++ N F ++ N G+ + + YP+ + G+ + I G+
Sbjct: 171  CSGSFGNQGCNGGLMDNAFKY--IISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGY 228

Query: 2306 VEVSEDNYFKLIALLQTGPAIVNMRTD-DAFMGERKGLFRHPCYGNPDHSMVCVGYDFRA 2482
            +V ++N +L A ++ GP V + D +F G+F PC N DH ++ VGY
Sbjct: 229  KDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGY---T 285

Query: 2483 EPYWILQNSWGQDDGNAGYI 2542
            YWI++NSWG G+ GYI
Sbjct: 286  SDYWIVKNSWGASWGDQGYI 305

>dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score = 92.8 bits (229), Expect = 4e-16
 Identities = 59/190 (31%), Positives = 95/190 (50%), Gaps = 3/190 (1%)
 Frame = +2

Query: 1982 VDWRLRVDKDKLPLIYNQFTLPCCWACAAATSLEWKHLIESGKLVPLSYQQLIDSVPGKD 2161
            +DWR D + + NQ CWA AA ++E + I+ G L PLS QQ++D G
Sbjct: 128  IDWR---DYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKGYG 184

Query: 2162 -EIIWNVFSALDLVRNEGITSGKLYPFINKKGMKQQLPPVLDIKGIDGWVEVSEDNYFKL 2338
            + W + ++ N+G+ SG +YP+ KG + V + I G+ V +N +
Sbjct: 185  CKGGWEFRAFEFIISNKGVASGAIYPYKAAKG-TCKTNGVPNSAYITGYARVPRNNESSM 243

Query: 2339 IALLQTGPAIVNMRTDDAFMGERKGLFRHPCYGNPDHSMVCVGY--DFRAEPYWILQNSW 2512
            + + P V + + F + G+F PC + +H++ +GY D + YWI++NSW
Sbjct: 244  MYAVSKQPITVAVDANANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKNSW 303

Query: 2513 GQDDGNAGYI 2542
            G G AGYI
Sbjct: 304  GARWGEAGYI 313

>gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score = 91.3 bits (225), Expect = 1e-15
 Identities = 63/203 (31%), Positives = 94/203 (46%), Gaps = 7/203 (3%)
 Frame = +2

Query: 1964 GPYEKYVDWRLRVDKDKLPLIYNQFTLPCCWACAAATSLEWKHLIESGKLVPLSYQQLID 2143
            GP VDWR K + I NQ CW+ + S+E H I +G LV LS QQL+D
Sbjct: 104  GPNAGSVDWR---QKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVD 160

Query: 2144 SVPG------KDEIIWNVFSALDLVRNEGITSGKLYPFINKKGMKQQLPPVLDIKGIDGW 2305
            ++ N F ++ N G+ + + YP+ + G+ + I G+
Sbjct: 161  CSGSFGNQGCNGGLMDNAFKY--IISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGY 218

Query: 2306 VEVSEDNYFKLIALLQTGPAIVNMRTD-DAFMGERKGLFRHPCYGNPDHSMVCVGYDFRA 2482
            +V ++N +L A ++ GP V + D +F G+F PC N DH ++ VGY
Sbjct: 219  KDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGY---T 275

Query: 2483 EPYWILQNSWGQDDGNAGYIHYG 2551
            YWI++NSWG G H G
Sbjct: 276  SDYWIVKNSWGASWVTRGGCHSG 298

>sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
 gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63
 Angstrom X-ray Crystal Structure Of A Plant Cysteine Protease Ervatamin B:
 Insight Into The Structural Basis Of Its Stability And Substrate Specificity
          Length = 215

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 61/192 (31%), Positives = 98/192 (51%), Gaps = 4/192 (2%)
 Frame = +2

Query: 1979 YVDWRLRVDKDKLPLIYNQFTLPCCWACAAATSLEWKHLIESGKLVPLSYQQLIDSVPGK 2158
            +VDWR K + I NQ CWA +A ++E + I +G+L+ LS Q+L+D
Sbjct: 4    FVDWR---SKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60

Query: 2159 DEII--WNVFSALDLVRNEGITSGKLYPFINKKGMKQQLPPVLDIKGIDGWVEVSEDNYF 2332
            W + ++ N GI + + YP+ +G + P L + I+G+ V+ +N
Sbjct: 61   HGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCK--PYRLRVVSINGFQRVTRNNES 118

Query: 2333 KLIALLQTGPAIVNMRTDDA-FMGERKGLFRHPCYGNPDHSMVCVGYDFRA-EPYWILQN 2506
            L + + + P V + A F G+F PC +H +V VGY ++ + YWI++N
Sbjct: 119  ALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVRN 178

Query: 2507 SWGQDDGNAGYI 2542
            SWGQ+ GN GYI
Sbjct: 179  SWGQNWGNQGYI 190

>gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score = 87.4 bits (215), Expect = 2e-14
 Identities = 59/194 (30%), Positives = 101/194 (52%), Gaps = 7/194 (3%)
 Frame = +2

Query: 1982 VDWRLRVDKDKLPLIYNQFTLPCCWACAAATSLEWKHLIESGKLVPLSYQQLID-SVPGK 2158
            +DWR K + I +Q CCWA +A + E H +++GKL+PLS Q+L+D V G+
Sbjct: 135  MDWR---KKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGE 191

Query: 2159 DEIIWN--VFSALD-LVRNEGITSGKLYPFINKKGMKQQLPPVLDIKGIDGWVEVSEDNY 2329
            DE + +A D +++N+G+T+ YP+ + G+ + L I G+ +V ++
Sbjct: 192  DEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSE 251

Query: 2330 FKLIALLQTGPAIVNM-RTDDAFMGERKGLFRHPCYGNPDHSMVCVGYDFRAE--PYWIL 2500
            L+ + P V + + F G+F C +H++ VGY + YWI+
Sbjct: 252  KALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYWII 311

Query: 2501 QNSWGQDDGNAGYI 2542
            +NSWG G++GY+
Sbjct: 312  KNSWGSKWGDSGYM 325