BLASTX nr result
ID: Coptis24_contig00000223
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00000223 (981 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinot... 89 1e-15 gb|AAN77406.1| digestive cysteine protease intestain [Leptinotar... 88 3e-15 gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinot... 87 6e-15 ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus] 86 2e-14 gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CA... 86 2e-14 >gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata] Length = 322 Score = 89.4 bits (220), Expect = 1e-15 Identities = 67/230 (29%), Positives = 100/230 (43%), Gaps = 10/230 (4%) Frame = -1 Query: 855 DTSLPTQGEFQAR-ETCTSPPQGECNWWVDDCSWAYHATSMVESALAIKNKRQPDRLAVK 679 D LP Q ++ + + QG C WA+ T +E AI NK + L+ + Sbjct: 107 DLELPEQIDWTEKGAVLPAKNQGNCR-----SCWAFSTTGSLEGQNAIHNKVKTP-LSEQ 160 Query: 678 ELLDCCPKQSQSSCGDQDPIDCCAKHSCGGSVLEALAYIKQNGLSLEREY-------ESR 520 +LLDC C D GG + EA YI NG+ E Y E + Sbjct: 161 QLLDCSASYGNGDCDD------------GGLMTEAFDYIIDNGIEAESSYPYVEQMTECQ 208 Query: 519 QVSRKAM--IKDFGERITNAETLKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVNK 346 ++K + IK + + + + + LK L Y GGV D + F ++ Sbjct: 209 YDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGVLDDQCYFGMD- 267 Query: 345 TLHPTEIRHAMLLVGYAEKYGKPVWIVQNSWGSDWGDEGYIYVKRGSNIL 196 HA+L+VGY E GK W V+NSWG+ WG++GY ++R +N L Sbjct: 268 --------HAVLVVGYGEANGKKFWKVKNSWGTTWGEDGYFRIERDANNL 309 >gb|AAN77406.1| digestive cysteine protease intestain [Leptinotarsa decemlineata] Length = 196 Score = 88.2 bits (217), Expect = 3e-15 Identities = 64/216 (29%), Positives = 94/216 (43%), Gaps = 10/216 (4%) Frame = -1 Query: 795 QGECNWWVDDCSWAYHATSMVESALAIKNKRQPDRLAVKELLDCCPKQSQSSCGDQDPID 616 QGEC WA+ T VE AI NK + L+ ++LLDC C D Sbjct: 2 QGECG-----SCWAFSTTGSVEGQNAIHNKVKTP-LSEQQLLDCSASYGNGDCHD----- 50 Query: 615 CCAKHSCGGSVLEALAYIKQNGLSLEREY-------ESRQVSRKAM--IKDFGERITNAE 463 GG + +A YI NG+ E Y E + ++K + IK + + + + + Sbjct: 51 -------GGLMTKAFNYIIDNGIEAESSYPYVEQMTECQYDAKKTIVQIKGYKKLLADED 103 Query: 462 TLKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVNKTLHPTEIRHAMLLVGYAEKYG 283 LK L Y GG+ D + F ++ HA+L+VGY E G Sbjct: 104 ELKKAVGAVGPISVGMSSENLHMYGGGILDDQCYFDMD---------HAVLVVGYGEANG 154 Query: 282 KPVWIVQNSWGSDWGDEGYIYVKR-GSNILGIEEGC 178 K W V+NSWG+ WG++GY ++R N+ I C Sbjct: 155 KKFWRVKNSWGTTWGEDGYFRIERDADNLCDIASMC 190 >gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata] Length = 322 Score = 87.0 bits (214), Expect = 6e-15 Identities = 64/216 (29%), Positives = 93/216 (43%), Gaps = 10/216 (4%) Frame = -1 Query: 795 QGECNWWVDDCSWAYHATSMVESALAIKNKRQPDRLAVKELLDCCPKQSQSSCGDQDPID 616 QG C WA+ T +E AI NK + L+ ++LLDC C D Sbjct: 128 QGNCR-----SCWAFSTTGSLEGQNAIHNKVKTP-LSEQQLLDCSASYGNGDCDD----- 176 Query: 615 CCAKHSCGGSVLEALAYIKQNGLSLEREY-------ESRQVSRKAM--IKDFGERITNAE 463 GG + EA YI NG+ E Y E + ++K + IK + + + + + Sbjct: 177 -------GGLMTEAFDYIIDNGIEAESSYPYVEQMTECQYDAKKTIVQIKGYKKLLADED 229 Query: 462 TLKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVNKTLHPTEIRHAMLLVGYAEKYG 283 LK L Y GGV D + F ++ HA+L+VGY E G Sbjct: 230 ELKKAVGTVGPISVGMSSENLHMYGGGVLDDQCYFGMD---------HAVLVVGYGEANG 280 Query: 282 KPVWIVQNSWGSDWGDEGYIYVKR-GSNILGIEEGC 178 K W V+NSWG+ WG++GY ++R N+ I C Sbjct: 281 KKFWKVKNSWGTTWGEDGYFRIERDADNLCDIASMC 316 >ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus] Length = 333 Score = 85.5 bits (210), Expect = 2e-14 Identities = 65/232 (28%), Positives = 96/232 (41%), Gaps = 14/232 (6%) Frame = -1 Query: 843 PTQGEFQARETCTSPP--QGECNWWVDDCSWAYHATSMVESALAIKNKRQPDRLAVKELL 670 P+ +++ + SP QG C W + T +ESA+AI + LA ++L+ Sbjct: 115 PSSVDWRKKGNFVSPVKNQGACG-----SCWTFSTTGALESAVAIAGGKMLS-LAEQQLV 168 Query: 669 DCCPKQSQSSCGDQDPIDCCAKHSCGGSVLEALAYIKQN-GLSLEREYESRQVSRK---- 505 DC + C GG +A YI N G+ E Y R + + Sbjct: 169 DCAQNFNNHGCE-------------GGLPSQAFEYILYNKGIMGEDSYPYRAMEGRCKFQ 215 Query: 504 -----AMIKDFGERITNAET--LKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVNK 346 A +KD N E ++ YR G++ K Sbjct: 216 PQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTEDFMQYRKGIYSSTSCHKT-- 273 Query: 345 TLHPTEIRHAMLLVGYAEKYGKPVWIVQNSWGSDWGDEGYIYVKRGSNILGI 190 P ++ HA+L VGY E+ G P WIV+NSWGS WG GY Y++RG N+ G+ Sbjct: 274 ---PDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNGYFYIERGKNMCGL 322 >gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CAL1 [Diabrotica virgifera virgifera] Length = 322 Score = 85.5 bits (210), Expect = 2e-14 Identities = 58/213 (27%), Positives = 96/213 (45%), Gaps = 9/213 (4%) Frame = -1 Query: 795 QGECNWWVDDCSWAYHATSMVESALAIKNKRQPDRLAVKELLDCCPKQSQSSCGDQDPID 616 QG+C WA+ AT +E I N + + L+ +ELLDC + C + Sbjct: 126 QGQCG-----SCWAFSATGSLEGQNYIVNGKS-EPLSEQELLDCSVEYGNGDCDE----- 174 Query: 615 CCAKHSCGGSVLEALAYIKQNGLSLEREY-------ESRQVSRKAM--IKDFGERITNAE 463 GG + A ++++NG+ E Y + R + KA+ I+ + E + E Sbjct: 175 -------GGLMTLAFEFVEENGIVSEASYPYEAIQGDCRTTNDKAVLHIQGYNEVYPSEE 227 Query: 462 TLKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVNKTLHPTEIRHAMLLVGYAEKYG 283 L+ P+ + G++D + + L H +L+VGY E+ G Sbjct: 228 ALRQAVGTVGPISAAIWAEPIQFFSSGIYDDPNCLNYVEYLD-----HGILVVGYGEENG 282 Query: 282 KPVWIVQNSWGSDWGDEGYIYVKRGSNILGIEE 184 P WIV+NSWG+ WG+EGY +KR + G+ + Sbjct: 283 TPYWIVKNSWGATWGEEGYFRLKRNIALCGLAQ 315