BLASTX nr result

ID: Coptis24_contig00008769 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00008769
         (1714 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          472   e-130
ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   471   e-130
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   462   e-127
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   460   e-127
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   447   e-123

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  472 bits (1215), Expect = e-130
 Identities = 228/398 (57%), Positives = 285/398 (71%), Gaps = 3/398 (0%)
 Frame = -3

Query: 1532 LFNTWCDQHGKRYSSEKEKLFRQSVFEDNLAYVIQHNSLKFFTYTLALNDFADLTHQEFK 1353
            LF TWC QHGK Y+S++EKLFR  VF+DN  +V +HNS    +YTL+LN FADLTH EFK
Sbjct: 29   LFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFK 88

Query: 1352 DSWLGLKIPALDGKSLHKPAFSLEGSRKIGKIPRSIDWRKNGAVSSVRNQASCGASWAFS 1173
             S LGL   A    SL+    + +    +  +P S+DWRKNGAV+ V++Q +CGA W+FS
Sbjct: 89   ASRLGLSSAA--SASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFS 146

Query: 1172 AVGAIEGIHKIVTGSLVSLSEQELLDCEKSFNSGCSGGLTDTAFQWTIESQGIGTKEDYP 993
            A GAIEGI+KIVTGSLVSLSEQEL+DC+KS+N+GC GG+ D AFQ+ I++ GI T+EDYP
Sbjct: 147  ATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYP 206

Query: 992  YQAGERACNK-KSKRNVVTIDGYRSVPRNDEEQLLEAVASQPVSAGICASERSHQFYSKG 816
            YQ  +R+CNK K KR+VVTIDGY  VP+N+E++LL+AVA+QPVS GIC SER+ Q YSKG
Sbjct: 207  YQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKG 266

Query: 815  IFSGPCSTSLNHAVLIVGYGSVNGVDYWIVKNSWGTNWGINGYMQIQRDSGYPEGVCGIN 636
            IF+GPCSTSL+HAVLIVGYGS NGVDYWIVKNSWG+ WG++GYM +QR+SG   G+CGIN
Sbjct: 267  IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGIN 326

Query: 635  ILASYPIKTGSEETPSPDRLVQRCGLLDHCNADQICCCAKRLI-VCLVWSCCNQKNGVCC 459
            +LASYP KT     P       RC L  HC   + CCC   +  +CL W CC   + VCC
Sbjct: 327  MLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVCC 386

Query: 458  NDNKHCCPEGNH-CLPNNRICHQDLQNNTTMTKLTQSS 348
             D +HCCP     C     IC +   N T + K  ++S
Sbjct: 387  KDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNS 424


>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  471 bits (1211), Expect = e-130
 Identities = 230/404 (56%), Positives = 286/404 (70%), Gaps = 3/404 (0%)
 Frame = -3

Query: 1535 QLFNTWCDQHGKRYSSEKEKLFRQSVFEDNLAYVIQHNSLKFFTYTLALNDFADLTHQEF 1356
            QLF TWC +HGK Y+S++E+  R  VFEDN  +V +HNS    +Y+LALN FADLTH EF
Sbjct: 27   QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86

Query: 1355 KDSWLGLKIPALDGKSLHKPAFSLEGSRKIGKIPRSIDWRKNGAVSSVRNQASCGASWAF 1176
            K S LGL    L+    H+   +LE +  +G IP SIDWR  G V++V++Q SCGA W+F
Sbjct: 87   KTSRLGLSAAPLN--LAHR---NLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSF 141

Query: 1175 SAVGAIEGIHKIVTGSLVSLSEQELLDCEKSFNSGCSGGLTDTAFQWTIESQGIGTKEDY 996
            SA GAIEGI+KIVTGSLVSLSEQEL++C+KS+N GC GGL D AFQ+ I + GI T+EDY
Sbjct: 142  SATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDY 201

Query: 995  PYQAGERACNK-KSKRNVVTIDGYRSVPRNDEEQLLEAVASQPVSAGICASERSHQFYSK 819
            PY+A +  CNK + KR VVTID Y  VP N+E+QLL+AVA+QPVS GIC SER+ Q YSK
Sbjct: 202  PYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSK 261

Query: 818  GIFSGPCSTSLNHAVLIVGYGSVNGVDYWIVKNSWGTNWGINGYMQIQRDSGYPEGVCGI 639
            GIF+GPCSTSL+HAVLIVGYGS NGVDYWIVKNSWGT WG+ GYM +QR+SG  +GVCGI
Sbjct: 262  GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGI 321

Query: 638  NILASYPIKTGSEETPSPDRLVQRCGLLDHCNADQICCCAKRLI-VCLVWSCCNQKNGVC 462
            N+LASYP+KT     P P     +C LL +C A + CCCA++   +C+ W CC   + VC
Sbjct: 322  NMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVC 381

Query: 461  CNDNKHCCPEGNH-CLPNNRICHQDLQNNTTMTKLTQSSFWKFG 333
            C D  HCCP     C  +  +C +   N T M  +   +  KFG
Sbjct: 382  CKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKTSGKFG 425


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  462 bits (1188), Expect = e-127
 Identities = 224/390 (57%), Positives = 280/390 (71%), Gaps = 3/390 (0%)
 Frame = -3

Query: 1535 QLFNTWCDQHGKRYSSEKEKLFRQSVFEDNLAYVIQHNSLKFFTYTLALNDFADLTHQEF 1356
            +LF  WC +HGK YSS +EKL+R  VF DN  +V  HN+L   +YTL+LN +ADLTH EF
Sbjct: 27   ELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEF 86

Query: 1355 KDSWLGLKIPALDGKSLHKPAFSLEGSRKIGKIPRSIDWRKNGAVSSVRNQASCGASWAF 1176
            K S LG   PAL      +P    E S     +P S+DWRK GAV++V++Q SCGA W+F
Sbjct: 87   KVSRLGFS-PALRN---FRPVLPQEPSLP-RDVPDSLDWRKKGAVTAVKDQGSCGACWSF 141

Query: 1175 SAVGAIEGIHKIVTGSLVSLSEQELLDCEKSFNSGCSGGLTDTAFQWTIESQGIGTKEDY 996
            SA GA+EGI++I+TGSL+SLSEQEL+DC++S+NSGC GGL D A+Q+ I + GI T+ DY
Sbjct: 142  SATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDY 201

Query: 995  PYQAGERACNK-KSKRNVVTIDGYRSVPRNDEEQLLEAVASQPVSAGICASERSHQFYSK 819
            PYQA + +C K K +RNVVTIDGY  +P NDE +LL+AVA+QPVS GIC SER+ Q YSK
Sbjct: 202  PYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSK 261

Query: 818  GIFSGPCSTSLNHAVLIVGYGSVNGVDYWIVKNSWGTNWGINGYMQIQRDSGYPEGVCGI 639
            GIFSGPCSTSL+HAVLIVGYGS NGVDYWIVKNSWG +WG++GYM +QR+SG  EGVCGI
Sbjct: 262  GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGI 321

Query: 638  NILASYPIKTGSEETPSPDRLVQRCGLLDHCNADQICCCAKRLI-VCLVWSCCNQKNGVC 462
            N LASYP KT     PSP     +C +L  C A + CCCAK+ + +CL W CC   + VC
Sbjct: 322  NKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSAVC 381

Query: 461  CNDNKHCCP-EGNHCLPNNRICHQDLQNNT 375
            C D +HCCP +   C  +  +C +   N T
Sbjct: 382  CKDGRHCCPFDYPICDTDRNLCLKQTMNGT 411


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  460 bits (1184), Expect = e-127
 Identities = 219/369 (59%), Positives = 276/369 (74%), Gaps = 2/369 (0%)
 Frame = -3

Query: 1535 QLFNTWCDQHGKRYSSEKEKLFRQSVFEDNLAYVIQHNSLKFFTYTLALNDFADLTHQEF 1356
            +LF +W  +HGK Y+S+++KL+R  +FE+N  +V +HNS    +YTL+LN FADLTH EF
Sbjct: 30   KLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEF 89

Query: 1355 KDSWLGLKIPALDGKSLHKPAFSLEGSRKIGKIPRSIDWRKNGAVSSVRNQASCGASWAF 1176
            K S LGL   +  GK L +  F L     +G +P SIDWRK GAVS V++Q +CGA W+F
Sbjct: 90   KASRLGLSAFSTSGK-LSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGACWSF 146

Query: 1175 SAVGAIEGIHKIVTGSLVSLSEQELLDCEKSFNSGCSGGLTDTAFQWTIESQGIGTKEDY 996
            SA GAIEGI+KIVTGSLVSLSEQEL+DC++S+N+GC GGL D A+Q+ IE+ GI T+EDY
Sbjct: 147  SATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDY 206

Query: 995  PYQAGERACNK-KSKRNVVTIDGYRSVPRNDEEQLLEAVASQPVSAGICASERSHQFYSK 819
            PYQA E+ CNK K KR+VVTIDGY  VP+N+E++LL+AVA+QPVS GIC SER+ Q YSK
Sbjct: 207  PYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSK 266

Query: 818  GIFSGPCSTSLNHAVLIVGYGSVNGVDYWIVKNSWGTNWGINGYMQIQRDSGYPEGVCGI 639
            GIF+GPCSTSL+HAVLIVGYGS NGVDYWIVKNSWGT+WGINGYM + R+SG  +G+CGI
Sbjct: 267  GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGI 326

Query: 638  NILASYPIKTGSEETPSPDRLVQRCGLLDHCNADQICCCAKRLI-VCLVWSCCNQKNGVC 462
            N+LAS+P+KT     P       +C L   C   + CCC +R+  +C  W CC   + VC
Sbjct: 327  NMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVC 386

Query: 461  CNDNKHCCP 435
            C D  HCCP
Sbjct: 387  CKDGLHCCP 395


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  447 bits (1149), Expect = e-123
 Identities = 224/398 (56%), Positives = 277/398 (69%), Gaps = 8/398 (2%)
 Frame = -3

Query: 1541 TDQLFNTWCDQHGKRYSSEKEKLFRQSVFEDNLAYVIQHN-----SLKFFTYTLALNDFA 1377
            T +LF  WC +H K YSSE+EKL+R  VFEDN A+V QHN     +    +YTL+LN FA
Sbjct: 29   TSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFA 88

Query: 1376 DLTHQEFKDSWLGLKIPALDGKSLHKPAFSLEGSRKIGKIPRSIDWRKNGAVSSVRNQAS 1197
            DLTH EFK + LGL +  L  K         + SR +  IP  IDWR++GAV+ V++QAS
Sbjct: 89   DLTHHEFKTTRLGLPLTLLRFKRPQN-----QQSRDLLHIPSQIDWRQSGAVTPVKDQAS 143

Query: 1196 CGASWAFSAVGAIEGIHKIVTGSLVSLSEQELLDCEKSFNSGCSGGLTDTAFQWTIESQG 1017
            CGA WAFSA GAIEGI+KIVTGSLVSLSEQEL+DC+ S+NSGC GGL D A+Q+ I+++G
Sbjct: 144  CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKG 203

Query: 1016 IGTKEDYPYQAGERACNK-KSKRNVVTIDGYRSVPRNDEEQLLEAVASQPVSAGICASER 840
            I T++DYPYQA +R+C+K K KR  VTI+ Y  VP ++EE +L+AVASQPVS GIC SER
Sbjct: 204  IDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSER 262

Query: 839  SHQFYSKGIFSGPCSTSLNHAVLIVGYGSVNGVDYWIVKNSWGTNWGINGYMQIQRDSGY 660
              Q YSKGIF+GPCST L+HAVLIVGYGS NGVDYWIVKNSWG  WG+NGY+ + R+SG 
Sbjct: 263  EFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGN 322

Query: 659  PEGVCGINILASYPIKTGSEETPSPDRLVQRCGLLDHCNADQICCCAKRLI-VCLVWSCC 483
             +G+CGIN LASYP+KT       P     RC L  HC+  + CCCAK  + +C  W CC
Sbjct: 323  SKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWKCC 382

Query: 482  NQKNGVCCNDNKHCCPEGNH-CLPNNRICHQDLQNNTT 372
               + VCC D +HCCP+    C      C +   N TT
Sbjct: 383  GLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTT 420

