BLASTX nr result

ID: Coptis25_contig00002632 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00002632
         (1574 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa ...   515   e-143
ref|XP_002301457.1| predicted protein [Populus trichocarpa] gi|2...   506   e-141
ref|NP_001150152.1| LOC100283781 precursor [Zea mays] gi|1956371...   503   e-140
tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea...   501   e-139
ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]     501   e-139

>gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
            gi|125551767|gb|EAY97476.1| hypothetical protein
            OsI_19406 [Oryza sativa Indica Group]
            gi|215694023|dbj|BAG89222.1| unnamed protein product
            [Oryza sativa Japonica Group]
            gi|215712372|dbj|BAG94499.1| unnamed protein product
            [Oryza sativa Japonica Group]
            gi|215765382|dbj|BAG87079.1| unnamed protein product
            [Oryza sativa Japonica Group] gi|222631058|gb|EEE63190.1|
            hypothetical protein OsJ_17999 [Oryza sativa Japonica
            Group]
          Length = 358

 Score =  515 bits (1327), Expect = e-143
 Identities = 232/309 (75%), Positives = 264/309 (85%)
 Frame = +2

Query: 317  SIILQDSIIAQINSNPHAGWKAARNPRFSNYTVAQFKHLLGVKPTPQNVLDNVPVISHPR 496
            S I+QD II  IN +P+AGW AARNP F+NYT AQFKH+LGVKPTP +VL++VPV ++PR
Sbjct: 39   SRIIQDDIIKAINKHPNAGWTAARNPYFANYTTAQFKHILGVKPTPHSVLNDVPVKTYPR 98

Query: 497  FLTLPKQFDARTTWSKCGTIGVILDQGHCGSCWAFGAVEALSDRFCIHFGMNISLSVNDL 676
             L LPK+FDAR+ WS+C TIG ILDQGHCGSCWAFGAVE L DRFCIHF MNISLSVNDL
Sbjct: 99   SLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDL 158

Query: 677  LAXXXXXXXXXXXXXYPINAWQYFVQNGVVTEECDPYFDNVGCSHPGCEPLYPTPTCEKK 856
            +A             YPI AW+YFV+NGVVT+ECDPYFD VGC HPGCEP YPTP CEKK
Sbjct: 159  VACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKK 218

Query: 857  CQQKNQLWAESKHFSVSAYRISSAVYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHIT 1036
            C+ +NQ+W E KHFSV+AYR++S  +DIMAEVY+NGPVEVAFTVYEDFAHYKSGVYKHIT
Sbjct: 219  CKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHIT 278

Query: 1037 GGVMGGHAVKLIGWGTTDDGEDYWLLANQWNKSWGDDGFFKIRRGTNECGIEEDVVAGLP 1216
            GG+MGGHAVKLIGWGTTD GEDYWLLANQWN+ WGDDG+FKI RGTNECGIEEDVVAG+P
Sbjct: 279  GGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 338

Query: 1217 STKNLLRRF 1243
            STKN++R +
Sbjct: 339  STKNMVRNY 347


>ref|XP_002301457.1| predicted protein [Populus trichocarpa] gi|222843183|gb|EEE80730.1|
            predicted protein [Populus trichocarpa]
          Length = 357

 Score =  506 bits (1303), Expect = e-141
 Identities = 235/323 (72%), Positives = 265/323 (82%)
 Frame = +2

Query: 269  QVHAVRLTASLIRSSESIILQDSIIAQINSNPHAGWKAARNPRFSNYTVAQFKHLLGVKP 448
            QV AV   + L  +S   ILQDSI+ ++N NP AGWKA  N  FSNYTVAQFK+LLGVKP
Sbjct: 24   QVIAVEPVSDLKLNSR--ILQDSILKKVNGNPKAGWKATMNHHFSNYTVAQFKYLLGVKP 81

Query: 449  TPQNVLDNVPVISHPRFLTLPKQFDARTTWSKCGTIGVILDQGHCGSCWAFGAVEALSDR 628
            TP+  L  +PVISHP+ L LP++FDART W +C TIG ILDQGHCGSCWAFGAVE+LSDR
Sbjct: 82   TPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDR 141

Query: 629  FCIHFGMNISLSVNDLLAXXXXXXXXXXXXXYPINAWQYFVQNGVVTEECDPYFDNVGCS 808
            FCIH+GMNISLSVNDLLA             YPI+AW+YFV +GVVTEECDPYFD++GCS
Sbjct: 142  FCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCS 201

Query: 809  HPGCEPLYPTPTCEKKCQQKNQLWAESKHFSVSAYRISSAVYDIMAEVYKNGPVEVAFTV 988
            HPGCEP YPTP C +KC  KNQLW +SKH+ V  YRI S    IMAE+YKNGPVEVAFTV
Sbjct: 202  HPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPESIMAEIYKNGPVEVAFTV 261

Query: 989  YEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTTDDGEDYWLLANQWNKSWGDDGFFKIRR 1168
            YEDFAHYKSGVYKHITGG+MGGHAVKLIGWGT++DGE YWLLANQWN+ WGDDG+FKIRR
Sbjct: 262  YEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIRR 321

Query: 1169 GTNECGIEEDVVAGLPSTKNLLR 1237
            GTNECGIE DVVAGLPST+NL+R
Sbjct: 322  GTNECGIEGDVVAGLPSTRNLVR 344


>ref|NP_001150152.1| LOC100283781 precursor [Zea mays] gi|195637168|gb|ACG38052.1|
            cathepsin B-like cysteine proteinase 3 precursor [Zea
            mays]
          Length = 347

 Score =  503 bits (1295), Expect = e-140
 Identities = 226/307 (73%), Positives = 256/307 (83%)
 Frame = +2

Query: 323  ILQDSIIAQINSNPHAGWKAARNPRFSNYTVAQFKHLLGVKPTPQNVLDNVPVISHPRFL 502
            I+Q+ II  +N++P AGW A+RNP FSNYT+AQFKH+LGVKP PQN L NVPV ++ R L
Sbjct: 32   IIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALSNVPVKTYSRSL 91

Query: 503  TLPKQFDARTTWSKCGTIGVILDQGHCGSCWAFGAVEALSDRFCIHFGMNISLSVNDLLA 682
             LPK+FDAR+ WS+C TIG ILDQGHCGSCWAFGAVE L DRFCIH  M+I LSVNDLLA
Sbjct: 92   ELPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLA 151

Query: 683  XXXXXXXXXXXXXYPINAWQYFVQNGVVTEECDPYFDNVGCSHPGCEPLYPTPTCEKKCQ 862
                         YPI AW+YFVQNGVVT+ECDPYFD VGC HPGCEP YPTP CEKKC+
Sbjct: 152  CCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCK 211

Query: 863  QKNQLWAESKHFSVSAYRISSAVYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 1042
            ++NQ+W E KHFS+ AYRI+S  +DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG
Sbjct: 212  EQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 271

Query: 1043 VMGGHAVKLIGWGTTDDGEDYWLLANQWNKSWGDDGFFKIRRGTNECGIEEDVVAGLPST 1222
            +MGGHAVKLIGWGT+D GEDYWLLANQWN+ WGDDG+FKI RG NECGIEE VVAG+PST
Sbjct: 272  IMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMPST 331

Query: 1223 KNLLRRF 1243
            KN++  F
Sbjct: 332  KNMVPNF 338


>tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score =  501 bits (1291), Expect = e-139
 Identities = 225/307 (73%), Positives = 256/307 (83%)
 Frame = +2

Query: 323  ILQDSIIAQINSNPHAGWKAARNPRFSNYTVAQFKHLLGVKPTPQNVLDNVPVISHPRFL 502
            I+Q+ II  +N++P AGW A+RNP FSNYT+AQFKH+LGVKP PQN L NVPV ++ R L
Sbjct: 32   IIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALSNVPVKTYSRSL 91

Query: 503  TLPKQFDARTTWSKCGTIGVILDQGHCGSCWAFGAVEALSDRFCIHFGMNISLSVNDLLA 682
             LPK+FDAR+ WS+C TIG IL+QGHCGSCWAFGAVE L DRFCIH  M+I LSVNDLLA
Sbjct: 92   ELPKEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLA 151

Query: 683  XXXXXXXXXXXXXYPINAWQYFVQNGVVTEECDPYFDNVGCSHPGCEPLYPTPTCEKKCQ 862
                         YPI AW+YFVQNGVVT+ECDPYFD VGC HPGCEP YPTP CEKKC+
Sbjct: 152  CCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCK 211

Query: 863  QKNQLWAESKHFSVSAYRISSAVYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 1042
            ++NQ+W E KHFS+ AYRI+S  +DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG
Sbjct: 212  EQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 271

Query: 1043 VMGGHAVKLIGWGTTDDGEDYWLLANQWNKSWGDDGFFKIRRGTNECGIEEDVVAGLPST 1222
            +MGGHAVKLIGWGT+D GEDYWLLANQWN+ WGDDG+FKI RG NECGIEE VVAG+PST
Sbjct: 272  IMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMPST 331

Query: 1223 KNLLRRF 1243
            KN++  F
Sbjct: 332  KNMVPNF 338


>ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score =  501 bits (1289), Expect = e-139
 Identities = 234/351 (66%), Positives = 269/351 (76%)
 Frame = +2

Query: 194  MVSSTFMYLVTXXXXXGVFPQHHLLQVHAVRLTASLIRSSESIILQDSIIAQINSNPHAG 373
            M SS F   ++      V   HH +      L   L    ++ ILQ+SI+  +N +P AG
Sbjct: 1    MASSHFYLSLSLLFLAAVCTFHHQVYAEEQVLKFKL----DADILQESIVRHVNEHPQAG 56

Query: 374  WKAARNPRFSNYTVAQFKHLLGVKPTPQNVLDNVPVISHPRFLTLPKQFDARTTWSKCGT 553
            WKA  NPRFSNY+V+QFK+LLGVK TP+  L + PV+SHP+ L LPK FDAR  W +C +
Sbjct: 57   WKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCIS 116

Query: 554  IGVILDQGHCGSCWAFGAVEALSDRFCIHFGMNISLSVNDLLAXXXXXXXXXXXXXYPIN 733
            IG ILDQGHCGSCWAFGAVE+LSDRFCIHF MNI+LSVNDLLA             YPI+
Sbjct: 117  IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPIS 176

Query: 734  AWQYFVQNGVVTEECDPYFDNVGCSHPGCEPLYPTPTCEKKCQQKNQLWAESKHFSVSAY 913
            AW+YFV++GVVTE+CDPYFD  GCSHPGCEP YPTP C + C  KNQ+W ++KH+ VSAY
Sbjct: 177  AWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAY 236

Query: 914  RISSAVYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTTDD 1093
            R+     DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTTDD
Sbjct: 237  RVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDD 296

Query: 1094 GEDYWLLANQWNKSWGDDGFFKIRRGTNECGIEEDVVAGLPSTKNLLRRFA 1246
            GEDYWLLANQWN+ WGDDG+FKIRRGTNECGIEEDVVAGLPSTKN+ R  A
Sbjct: 297  GEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLPSTKNIAREAA 347


Top