BLASTX nr result

ID: Coptis23_contig00005003

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00005003
         (1071 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinot...    92   2e-16
gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinot...    91   4e-16
gb|AAN77406.1| digestive cysteine protease intestain [Leptinotar...    91   6e-16
gb|AAS20588.1| digestive cysteine proteinase intestain [Leptinot...    89   2e-15
gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]            88   4e-15

>gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 322

 Score = 92.0 bits (227), Expect = 2e-16
 Identities = 78/281 (27%), Positives = 119/281 (42%), Gaps = 10/281 (3%)
 Frame = -3

Query: 1057 FSVVVVWY*RMTKTRNKKKDNXXXXXXXXXXADISVAPVVPDTSLPTQGEFQARDTCTSP 878
            + + V  +  MT+   +KK            A + V P   D  LP Q ++  +     P
Sbjct: 68   YYMAVTQFADMTRDEFRKKLGLQNNRRPNLNATLRVFP--EDLELPEQIDWTEKGAVL-P 124

Query: 877  PQGESNWWVDDCSWAYHATSMVESALAIKNKRRLDRLAVKELLDCCPKQSQSSCGDQDPI 698
             + + N     C WA+  T  +E   AI NK +   L+ ++LLDC        C D    
Sbjct: 125  AKNQGN--CRSC-WAFSTTGSLEGQNAIHNKVKTP-LSEQQLLDCSASYGNGDCDD---- 176

Query: 697  DCCAKHSCGGSVLEALAYIKQNGLSLEREY-------ESRQVSRKAM--IKDFGERITNA 545
                    GG + EA  YI  NG+  E  Y       E +  ++K +  IK + + + + 
Sbjct: 177  --------GGLMTEAFDYIIDNGIEAESSYPYVEQMTECQYDAKKTIVQIKGYKKLLADE 228

Query: 544  ETLKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVDKTLHPTEIRHAMLLVGYAEKY 365
            + LK                 L  Y GGV D +  F +D         HA+L+VGY E  
Sbjct: 229  DELKKAVGTVGPISVGMSSENLHMYGGGVLDDQCYFGMD---------HAVLVVGYGEAN 279

Query: 364  GKPVWIVQNSWGSNWGDEGYIYVKR-GSNILGIEEGCVYVV 245
            GK  W V+NSWG+ WG++GY  ++R  +N+  I   C Y +
Sbjct: 280  GKKFWKVKNSWGTTWGEDGYFRIERDANNLCDIASMCSYPI 320


>gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 322

 Score = 91.3 bits (225), Expect = 4e-16
 Identities = 63/208 (30%), Positives = 92/208 (44%), Gaps = 10/208 (4%)
 Frame = -3

Query: 838 WAYHATSMVESALAIKNKRRLDRLAVKELLDCCPKQSQSSCGDQDPIDCCAKHSCGGSVL 659
           WA+  T  +E   AI NK +   L+ ++LLDC        C D            GG + 
Sbjct: 135 WAFSTTGSLEGQNAIHNKVKTP-LSEQQLLDCSASYGNGDCDD------------GGLMT 181

Query: 658 EALAYIKQNGLSLEREY-------ESRQVSRKAM--IKDFGERITNAETLKXXXXXXXXX 506
           EA  YI  NG+  E  Y       E +  ++K +  IK + + + + + LK         
Sbjct: 182 EAFDYIIDNGIEAESSYPYVEQMTECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPI 241

Query: 505 XXVNDCRPLFSYRGGVFDVEHTFKVDKTLHPTEIRHAMLLVGYAEKYGKPVWIVQNSWGS 326
                   L  Y GGV D +  F +D         HA+L+VGY E  GK  W V+NSWG+
Sbjct: 242 SVGMSSENLHMYGGGVLDDQCYFGMD---------HAVLVVGYGEANGKKFWKVKNSWGT 292

Query: 325 NWGDEGYIYVKR-GSNILGIEEGCVYVV 245
            WG++GY  ++R   N+  I   C Y +
Sbjct: 293 TWGEDGYFRIERDADNLCDIASMCSYPI 320


>gb|AAN77406.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 196

 Score = 90.5 bits (223), Expect = 6e-16
 Identities = 62/208 (29%), Positives = 92/208 (44%), Gaps = 10/208 (4%)
 Frame = -3

Query: 838 WAYHATSMVESALAIKNKRRLDRLAVKELLDCCPKQSQSSCGDQDPIDCCAKHSCGGSVL 659
           WA+  T  VE   AI NK +   L+ ++LLDC        C D            GG + 
Sbjct: 9   WAFSTTGSVEGQNAIHNKVKTP-LSEQQLLDCSASYGNGDCHD------------GGLMT 55

Query: 658 EALAYIKQNGLSLEREY-------ESRQVSRKAM--IKDFGERITNAETLKXXXXXXXXX 506
           +A  YI  NG+  E  Y       E +  ++K +  IK + + + + + LK         
Sbjct: 56  KAFNYIIDNGIEAESSYPYVEQMTECQYDAKKTIVQIKGYKKLLADEDELKKAVGAVGPI 115

Query: 505 XXVNDCRPLFSYRGGVFDVEHTFKVDKTLHPTEIRHAMLLVGYAEKYGKPVWIVQNSWGS 326
                   L  Y GG+ D +  F +D         HA+L+VGY E  GK  W V+NSWG+
Sbjct: 116 SVGMSSENLHMYGGGILDDQCYFDMD---------HAVLVVGYGEANGKKFWRVKNSWGT 166

Query: 325 NWGDEGYIYVKR-GSNILGIEEGCVYVV 245
            WG++GY  ++R   N+  I   C Y +
Sbjct: 167 TWGEDGYFRIERDADNLCDIASMCSYPI 194


>gb|AAS20588.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 322

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 77/281 (27%), Positives = 117/281 (41%), Gaps = 10/281 (3%)
 Frame = -3

Query: 1057 FSVVVVWY*RMTKTRNKKKDNXXXXXXXXXXADISVAPVVPDTSLPTQGEFQARDTCTSP 878
            + + V  +  MT+   +KK            A + V P   D  LP Q ++  +     P
Sbjct: 68   YYMAVTQFADMTRDEFRKKLGLQNNRRPNLNATLQVFP--EDLELPEQIDWTEKGAVL-P 124

Query: 877  PQGESNWWVDDCSWAYHATSMVESALAIKNKRRLDRLAVKELLDCCPKQSQSSCGDQDPI 698
             + + N     C WA+  T  +E   AI NK +   L+ ++LLDC        C D    
Sbjct: 125  VKNQGN--CRSC-WAFSTTGSLEGQNAIHNKVKTP-LSEQQLLDCSASYGNGDCDD---- 176

Query: 697  DCCAKHSCGGSVLEALAYIKQNGLSLEREY-------ESRQVSRKAM--IKDFGERITNA 545
                    GG + EA  YI  NG+  E  Y       E +  ++K +  IK + + + + 
Sbjct: 177  --------GGLMTEAFDYIIDNGIEAESSYPYVEQMTECQYDAKKTIVQIKGYKKLLADE 228

Query: 544  ETLKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVDKTLHPTEIRHAMLLVGYAEKY 365
            + LK                 L  Y GGV   +  F +D         HA+L+VGY E  
Sbjct: 229  DELKKAVGTVGPISVGMSSENLHMYGGGVLGDQCYFGMD---------HAVLVVGYGEAN 279

Query: 364  GKPVWIVQNSWGSNWGDEGYIYVKR-GSNILGIEEGCVYVV 245
            GK  W V+NSWG+ WG++GY  ++R   N+  I   C Y +
Sbjct: 280  GKKFWKVKNSWGATWGEDGYFRIERDADNLCDIASMCSYPI 320


>gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]
          Length = 336

 Score = 87.8 bits (216), Expect = 4e-15
 Identities = 64/238 (26%), Positives = 102/238 (42%), Gaps = 12/238 (5%)
 Frame = -3

Query: 922 PTQGEFQARDTCTSPPQGESNWWVDDCSWAYHATSMVESALAIKNKRRLDRLAVKELLDC 743
           PT  +++ +    SP + +       C W +  T  +ESA+AIK  + L  L+ ++L+DC
Sbjct: 118 PTSVDWRKKGRFVSPVKNQGG--CGSC-WTFSTTGALESAIAIKTGKMLS-LSEQQLVDC 173

Query: 742 CPKQSQSSCGDQDPIDCCAKHSCGGSVLEALAYIKQNGLSLERE---YESRQVSRK---- 584
               +   C              GG   +A  YI+ N   +E +   YE +  + +    
Sbjct: 174 AQNFNNHGCQ-------------GGLPSQAFEYIRYNKGIMEEDSYPYEGKDSNCRFQPE 220

Query: 583 ---AMIKDFGERITNAET--LKXXXXXXXXXXXVNDCRPLFSYRGGVFDVEHTFKVDKTL 419
              A +KD      N E   ++                    YR G++      K     
Sbjct: 221 KAIAFVKDVANITLNDEAAMVEAVALYNPVSFAFEVTSDFMLYRKGIYSSTSCHKT---- 276

Query: 418 HPTEIRHAMLLVGYAEKYGKPVWIVQNSWGSNWGDEGYIYVKRGSNILGIEEGCVYVV 245
            P ++ HA+L VGY E+ GKP WIV+NSWG  WG  GY  ++RG+N+ G+     Y +
Sbjct: 277 -PDKVNHAVLAVGYGEQNGKPYWIVKNSWGPYWGMNGYFLIERGTNMCGLAACASYPI 333

