BLASTX nr result

ID: Bupleurum21_contig00007868

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00007868
         (1653 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]            570   e-160
gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]             553   e-155
ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis ...   553   e-155
ref|XP_002512963.1| cysteine protease, putative [Ricinus communi...   552   e-155
gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform...   552   e-154

>gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
          Length = 365

 Score =  570 bits (1469), Expect = e-160
 Identities = 267/359 (74%), Positives = 306/359 (85%), Gaps = 3/359 (0%)
 Frame = -3

Query: 1549 LFLILSLLAATSFAKIA---IDDDDSLIFQVVGEDRPLKARQHFEMFKHKFGKLYESDEE 1379
            L  + SLL  TS A  +    D +D +I Q+V  D PL A  HF +FK +FGK Y + E+
Sbjct: 7    LLFVFSLLLVTSLAAASGKSSDGEDLVIQQIVDGDHPLSADHHFRLFKRRFGKSYATQED 66

Query: 1378 HAFRYAVFKHNLRQAERNQKIDPTAVHGVTQFSDMTEEEFREKHLGLKPIKFPKDANKAP 1199
            H +R++VFK NLR+A  +Q++DP+AVHGVTQFSD+T  EFR  HLGLK ++FP DANKAP
Sbjct: 67   HDYRFSVFKTNLRRARHHQRLDPSAVHGVTQFSDLTPAEFRRNHLGLKRLRFPADANKAP 126

Query: 1198 ILPTHDLPTDFDWRDHGAVAAVKNQGSCGSCWSFSTTGALEGANYLATGKLESLSEQQLI 1019
            ILPT DLP DFDWRDHGAVA+VKNQGSCGSCWSFSTTGALEGAN+LATGKL SLSEQQL+
Sbjct: 127  ILPTEDLPADFDWRDHGAVASVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 186

Query: 1018 DCDHECDPEEAQSCDAGCNGGLMNTAFEYTLKTGGLMREKDYPYTGTDKGACKLDKSKIV 839
            DCDHECDPEE  SCD+GCNGGLMN+A EYTLK GGLMRE+DYPY+GTD+G CK D++KI 
Sbjct: 187  DCDHECDPEEPGSCDSGCNGGLMNSALEYTLKAGGLMREEDYPYSGTDRGTCKFDETKIA 246

Query: 838  ASVHNFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICSKRLDHGVLLVG 659
            ASV NFSVVSLDE+QIAANLVKNGPLAVAINAV+MQTYVGGVSCPYICSKRLDHGVLLVG
Sbjct: 247  ASVANFSVVSLDENQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLLVG 306

Query: 658  YGESGFSPIRMKAKPYWIIKNSWGETWGEQGYYKICKGPNVCGVDSMVSTVVAAHTTSH 482
            YG +G++PIRMK KPYWIIKNSWGE+WGE G+YKIC+G NVCGVDSMVSTV A HTTS+
Sbjct: 307  YGSAGYAPIRMKEKPYWIIKNSWGESWGENGFYKICQGRNVCGVDSMVSTVAAVHTTSN 365


>gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
          Length = 368

 Score =  553 bits (1426), Expect = e-155
 Identities = 271/369 (73%), Positives = 309/369 (83%), Gaps = 10/369 (2%)
 Frame = -3

Query: 1561 MSHK--LFLILSLLAATSF-----AKIAIDDDDSLIFQVVGED--RPLKARQHFEMFKHK 1409
            M+H+  L  +LS+L  TSF      +I   DDD LI QVVG++    L A  HF +FK +
Sbjct: 1    MAHRFSLVFVLSILLTTSFLLAVNGEIKGGDDDILIRQVVGDEDHHMLNAEHHFTLFKKR 60

Query: 1408 FGKLYESDEEHAFRYAVFKHNLRQAERNQKIDPTAVHGVTQFSDMTEEEFREKHLGL-KP 1232
            FGK Y SDEEH +R++VFK NLR+A R+QK+DP+AVHGVTQFSDMT +EF +K LG+ + 
Sbjct: 61   FGKTYASDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDMTPDEFSQKFLGVNRR 120

Query: 1231 IKFPKDANKAPILPTHDLPTDFDWRDHGAVAAVKNQGSCGSCWSFSTTGALEGANYLATG 1052
            ++FP DANKAPILPT DLP+DFDWR+HGAV  VKNQGSCGSCWSFSTTGALEGAN+LATG
Sbjct: 121  LRFPSDANKAPILPTEDLPSDFDWREHGAVTPVKNQGSCGSCWSFSTTGALEGANFLATG 180

Query: 1051 KLESLSEQQLIDCDHECDPEEAQSCDAGCNGGLMNTAFEYTLKTGGLMREKDYPYTGTDK 872
            KL SLSEQQL+DCDHECDPEE  SCD+GC+GGLMN+AFEYTLK GGLMRE+DYPYTGTDK
Sbjct: 181  KLVSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDK 240

Query: 871  GACKLDKSKIVASVHNFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICS 692
              CK D +K+ A V NFSVVSLDE+QIAANLVKNGPLAVAINAV+MQTYVGGVSCPYICS
Sbjct: 241  ATCKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICS 300

Query: 691  KRLDHGVLLVGYGESGFSPIRMKAKPYWIIKNSWGETWGEQGYYKICKGPNVCGVDSMVS 512
            K+LDHGVLLVGYG +GFSPIRMK KPYWIIKNSWGE WGE GYYKI +G NVCGVDSMVS
Sbjct: 301  KQLDHGVLLVGYG-TGFSPIRMKEKPYWIIKNSWGEKWGESGYYKIRRGRNVCGVDSMVS 359

Query: 511  TVVAAHTTS 485
            TV A  T+S
Sbjct: 360  TVAAVSTSS 368


>ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
          Length = 377

 Score =  553 bits (1426), Expect = e-155
 Identities = 263/366 (71%), Positives = 301/366 (82%), Gaps = 10/366 (2%)
 Frame = -3

Query: 1549 LFLILSLLAATSFAKIAIDDDDSLIFQVVGEDRPLKARQ----------HFEMFKHKFGK 1400
            LF +  +   +S       DDD +I QVV E   ++  +          HF +FK +FGK
Sbjct: 12   LFSLFFVALTSSELHSGGSDDDIIIRQVVPELGDVEGSEEENLLTADHHHFSIFKRRFGK 71

Query: 1399 LYESDEEHAFRYAVFKHNLRQAERNQKIDPTAVHGVTQFSDMTEEEFREKHLGLKPIKFP 1220
             Y S EEH +R+ VFK NLR+A R+Q++DP+A HGVTQFSD+T  EFR  +LGL+P+K P
Sbjct: 72   SYASQEEHDYRFKVFKANLRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGLRPLKLP 131

Query: 1219 KDANKAPILPTHDLPTDFDWRDHGAVAAVKNQGSCGSCWSFSTTGALEGANYLATGKLES 1040
             DA KAPILPT+DLP DFDWRDHGAV AVKNQGSCGSCWSFSTTGALEGAN+LATG L S
Sbjct: 132  HDAQKAPILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVS 191

Query: 1039 LSEQQLIDCDHECDPEEAQSCDAGCNGGLMNTAFEYTLKTGGLMREKDYPYTGTDKGACK 860
            LSEQQL++CDHECDPEE  SCD+GCNGGLMNTAFEYTLK GGLM+E+DYPYTGTD+G+CK
Sbjct: 192  LSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCK 251

Query: 859  LDKSKIVASVHNFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICSKRLD 680
             DK+KI ASV NFSV+SLDEDQIAANLVKNGPLAVAINAV+MQTYVGGVSCPYICSKRLD
Sbjct: 252  FDKTKIAASVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLD 311

Query: 679  HGVLLVGYGESGFSPIRMKAKPYWIIKNSWGETWGEQGYYKICKGPNVCGVDSMVSTVVA 500
            HGVLLVGYG +G++PIRMK KPYWIIKNSWGE WGE G+YKIC+G NVCGVDSMVSTV A
Sbjct: 312  HGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYKICRGRNVCGVDSMVSTVAA 371

Query: 499  AHTTSH 482
             HTTS+
Sbjct: 372  VHTTSN 377


>ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
            gi|223547974|gb|EEF49466.1| cysteine protease, putative
            [Ricinus communis]
          Length = 373

 Score =  552 bits (1423), Expect = e-155
 Identities = 264/369 (71%), Positives = 303/369 (82%), Gaps = 9/369 (2%)
 Frame = -3

Query: 1564 AMSHKLFLILSLL--AATSFAKIAIDDDDSLIFQVV-GEDRP------LKARQHFEMFKH 1412
            A+    F+I S+L  +A +   +  D +D LI QV  G+D        L A  HF +FK 
Sbjct: 4    AVRFSFFVISSILFVSAVTAETLTTDGEDPLIRQVTDGQDESSANPNLLGAEHHFSLFKK 63

Query: 1411 KFGKLYESDEEHAFRYAVFKHNLRQAERNQKIDPTAVHGVTQFSDMTEEEFREKHLGLKP 1232
            KF K Y S EEH +R+ +FK NLR+AER+QK+DPTA HGVTQFSD+T  EFR + LGL+ 
Sbjct: 64   KFKKTYASQEEHDYRFKIFKSNLRRAERHQKLDPTATHGVTQFSDLTHSEFRRQFLGLRR 123

Query: 1231 IKFPKDANKAPILPTHDLPTDFDWRDHGAVAAVKNQGSCGSCWSFSTTGALEGANYLATG 1052
            ++ PKDAN+AP+LPT+DLP DFDWR+ GAV AVKNQGSCGSCWSFSTTGALEGANYLATG
Sbjct: 124  LRLPKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATG 183

Query: 1051 KLESLSEQQLIDCDHECDPEEAQSCDAGCNGGLMNTAFEYTLKTGGLMREKDYPYTGTDK 872
            KL SLSEQQL+DCDHECDP E  +CD+GCNGGLMN+AFEYTLK GGLMRE+DYPYTGTD+
Sbjct: 184  KLVSLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 243

Query: 871  GACKLDKSKIVASVHNFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICS 692
            GAC+ DK+KI A V NFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTY+GGVSCPYICS
Sbjct: 244  GACQFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICS 303

Query: 691  KRLDHGVLLVGYGESGFSPIRMKAKPYWIIKNSWGETWGEQGYYKICKGPNVCGVDSMVS 512
            KRLDHGVLLVGYG +G++PIRMK KPYWIIKNSWGE WGE GYYKIC+G N+CGVDSMVS
Sbjct: 304  KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWGESGYYKICRGRNICGVDSMVS 363

Query: 511  TVVAAHTTS 485
            TV A  T S
Sbjct: 364  TVAAVQTAS 372


>gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  552 bits (1422), Expect = e-154
 Identities = 268/365 (73%), Positives = 302/365 (82%), Gaps = 5/365 (1%)
 Frame = -3

Query: 1564 AMSHKLFLILSLLAATSFAKIAIDDD--DSLIFQVVGEDRP--LKARQHFEMFKHKFGKL 1397
            A    L  + +LLA TS    A DDD  D LI QVVG+     L A  HF +FK +FGK 
Sbjct: 2    AFRFSLLFLCTLLATTSLVFAAEDDDGDDVLIRQVVGDGDGDLLNADHHFTVFKRRFGKA 61

Query: 1396 YESDEEHAFRYAVFKHNLRQAERNQKIDPTAVHGVTQFSDMTEEEFREKHLGL-KPIKFP 1220
            Y SDEEH +R +VFK N+R+A+R+Q++DP AVHGVTQFSD+T  EFR K LGL + +KFP
Sbjct: 62   YASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKFP 121

Query: 1219 KDANKAPILPTHDLPTDFDWRDHGAVAAVKNQGSCGSCWSFSTTGALEGANYLATGKLES 1040
             DA  APILPT +LP+DFDWRDHGAV  VKNQG+CGSCWSFSTTGALEGAN+LATGKL S
Sbjct: 122  ADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVS 181

Query: 1039 LSEQQLIDCDHECDPEEAQSCDAGCNGGLMNTAFEYTLKTGGLMREKDYPYTGTDKGACK 860
            LSEQQL+DCDHECDPEEA SCD+GCNGGLMN+AFEYTLK GGLMRE+DYPYTG D   C+
Sbjct: 182  LSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQVCR 241

Query: 859  LDKSKIVASVHNFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICSKRLD 680
             DK+KI A V NFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTY+GGVSCPYICSKRLD
Sbjct: 242  FDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRLD 301

Query: 679  HGVLLVGYGESGFSPIRMKAKPYWIIKNSWGETWGEQGYYKICKGPNVCGVDSMVSTVVA 500
            HGVLLVGYG +G++PIRMK KPYWIIKNSWGE+WGE GYYKIC+G NVCGVDSMVSTV A
Sbjct: 302  HGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 361

Query: 499  AHTTS 485
              TT+
Sbjct: 362  VSTTT 366