BLASTX nr result

ID: Coptis25_contig00000221 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00000221
         (1574 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]                         541   e-151
gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra term...   541   e-151
ref|XP_002302004.1| predicted protein [Populus trichocarpa] gi|2...   538   e-150
gb|ABK94801.1| unknown [Populus trichocarpa]                          536   e-150
ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis ...   536   e-150

>dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
          Length = 368

 Score =  541 bits (1395), Expect = e-151
 Identities = 254/346 (73%), Positives = 300/346 (86%), Gaps = 1/346 (0%)
 Frame = -3

Query: 1419 IAAFASLDESNDNLIRQVVSNDDENDLVLNAHHHFSNFITKFGKKYADEIEHAYRFSVFK 1240
            IA+  S DE +D LIRQVV + D++ L LNA HHF+ F  KFGK YA + EH YRF +FK
Sbjct: 19   IASTTSPDELDDPLIRQVVPDGDQDHL-LNAEHHFTTFKAKFGKTYATQEEHDYRFKLFK 77

Query: 1239 SNLQRAKLHQIIDPTAEHGVTKFSDLTPSEFGEKYLGLRSLKLPADAHKAPILPTNDLPT 1060
            +NL+RA+ HQ++DPTA HGVT FSDLTP EF  +YLGLR L+LPADAH+APILPTNDLPT
Sbjct: 78   ANLRRARKHQMMDPTAVHGVTMFSDLTPREFRRQYLGLRRLRLPADAHEAPILPTNDLPT 137

Query: 1059 EFDWRDHGAVTPIKDQGQCGSCWSFSTTGALEGAHFLATGDLVSLSEQQLVDCDHECDTE 880
            +FDWRDHGAVT +K+QG CGSCWSFS  GALEGAHFLATG+LVSLSEQQLVDCDHECD E
Sbjct: 138  DFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPE 197

Query: 879  EASSCDNGCNGGLMNSALEYTLKAGGLQREEDYPYTGKD-SKCKFDKTKIAASVSNFSVI 703
            E  +CD+GCNGGLM +A EYTLKAGGL+REEDYPYTG D   CKFD+ KI ASVSNFSV+
Sbjct: 198  EYGACDSGCNGGLMTTAFEYTLKAGGLEREEDYPYTGNDRGPCKFDRNKIVASVSNFSVV 257

Query: 702  SIDEEQIAANLVKNGPLAVGINAAYMQTYMAGVSCPYICSKRRLDHAILLVGFGSDGFAP 523
            SIDE+QIAANLVK+GPLAVGINA +MQTYM GVSCPYICSKR+ DH +LLVG+GS G+AP
Sbjct: 258  SIDEDQIAANLVKHGPLAVGINAVFMQTYMGGVSCPYICSKRQ-DHGVLLVGYGSAGYAP 316

Query: 522  IRLKEKPYWILKNSWGQNWGEDGFYKICRGPNVCGVDSMVSTVASV 385
            IRLK+KP+WI+KNSWG++WGE+G+Y+ICRG N+CGVD+MVS+VA++
Sbjct: 317  IRLKDKPFWIIKNSWGESWGENGYYRICRGRNICGVDAMVSSVAAI 362


>gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
          Length = 374

 Score =  541 bits (1393), Expect = e-151
 Identities = 256/347 (73%), Positives = 296/347 (85%), Gaps = 4/347 (1%)
 Frame = -3

Query: 1416 AAFASLDESNDNLIRQVVSNDDEND---LVLNAHHHFSNFITKFGKKYADEIEHAYRFSV 1246
            A+  S DES+D LIRQVV+  D++D   L+LNA HHFS+F  +FGK Y    EH  RF V
Sbjct: 22   ASTVSSDESDDLLIRQVVAGADDHDNDDLLLNAEHHFSSFKKRFGKAYTSCDEHDRRFGV 81

Query: 1245 FKSNLQRAKLHQIIDPTAEHGVTKFSDLTPSEFGEKYLGLRSLKLPADAHKAPILPTNDL 1066
            FK+NL+RAK +QI+DP+A HGVT+F DLTP+EF   YLGL+ L+LPAD H+APILPTNDL
Sbjct: 82   FKANLRRAKRNQILDPSAVHGVTQFFDLTPAEFRRTYLGLKRLRLPADTHEAPILPTNDL 141

Query: 1065 PTEFDWRDHGAVTPIKDQGQCGSCWSFSTTGALEGAHFLATGDLVSLSEQQLVDCDHECD 886
            P +FDWRDHGAVTP+K+QG CGSCWSFS TGALEGA+FLATG LVSLSEQQLVDCDH CD
Sbjct: 142  PADFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHVCD 201

Query: 885  TEEASSCDNGCNGGLMNSALEYTLKAGGLQREEDYPYTGKD-SKCKFDKTKIAASVSNFS 709
            +E+ SSCD+GCNGGLM SA EYTLKAGGL+REEDYPYTG D SKCKFDKTKIA S SNFS
Sbjct: 202  SEDPSSCDSGCNGGLMTSAFEYTLKAGGLEREEDYPYTGTDHSKCKFDKTKIAVSASNFS 261

Query: 708  VISIDEEQIAANLVKNGPLAVGINAAYMQTYMAGVSCPYICSKRRLDHAILLVGFGSDGF 529
            V+S+DE QIAANLV NGPLA+GINA +MQTY+ GVSCPYICSKR LDH +LLVG+GS GF
Sbjct: 262  VVSLDENQIAANLVTNGPLAIGINAMFMQTYIGGVSCPYICSKRLLDHGVLLVGYGSAGF 321

Query: 528  APIRLKEKPYWILKNSWGQNWGEDGFYKICRGPNVCGVDSMVSTVAS 388
            APIR KEKPYWI+KNSWG++WGE G+YKICRG N+CG+DSMVS VA+
Sbjct: 322  APIRFKEKPYWIIKNSWGESWGEKGYYKICRGRNICGMDSMVSAVAA 368


>ref|XP_002302004.1| predicted protein [Populus trichocarpa] gi|222843730|gb|EEE81277.1|
            predicted protein [Populus trichocarpa]
          Length = 367

 Score =  538 bits (1387), Expect = e-150
 Identities = 256/350 (73%), Positives = 303/350 (86%), Gaps = 2/350 (0%)
 Frame = -3

Query: 1419 IAAFASLDESNDNLIRQVVSNDDENDLVLNAHHHFSNFITKFGKKYADEIEHAYRFSVFK 1240
            +A+  S ++ +D LIRQVVS D E+DL LNA HHF++F +KFGK YA + EH YRF VFK
Sbjct: 19   VASTVSSNDLDDPLIRQVVS-DGEDDL-LNAEHHFTSFKSKFGKTYATQEEHDYRFGVFK 76

Query: 1239 SNLQRAKLHQIIDPTAEHGVTKFSDLTPSEFGEKYLGL-RSLKLPADAHKAPILPTNDLP 1063
            +NL+RAK HQ+IDPTA HG+TKFSDLTP EF  ++LGL R L+LP DA+KAPILPT DLP
Sbjct: 77   ANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANKAPILPTTDLP 136

Query: 1062 TEFDWRDHGAVTPIKDQGQCGSCWSFSTTGALEGAHFLATGDLVSLSEQQLVDCDHECDT 883
            T++DWRDHGAVT +KDQG CGSCWSFS TGALEGAH+LATG+L SLSEQQLVDCDHECD 
Sbjct: 137  TDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDP 196

Query: 882  EEASSCDNGCNGGLMNSALEYTLKAGGLQREEDYPYTGKD-SKCKFDKTKIAASVSNFSV 706
            EE  +CD+GC+GGLMN+A EY LKAGGL+REEDYPYTG D   CKFDK+K+ ASVSNFSV
Sbjct: 197  EEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSV 256

Query: 705  ISIDEEQIAANLVKNGPLAVGINAAYMQTYMAGVSCPYICSKRRLDHAILLVGFGSDGFA 526
            +SIDE+QIAANLVK+GPL+V INAA+MQTY+ GVSCPYICSKR+ DH +LLVG+GS G+A
Sbjct: 257  VSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQ-DHGVLLVGYGSAGYA 315

Query: 525  PIRLKEKPYWILKNSWGQNWGEDGFYKICRGPNVCGVDSMVSTVASVQLT 376
            PIR KEKP+WI+KNSWGQNWGE+G+YKICRG N+CGVDSMVSTVA++  T
Sbjct: 316  PIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAAIHTT 365


>gb|ABK94801.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  536 bits (1382), Expect = e-150
 Identities = 256/350 (73%), Positives = 302/350 (86%), Gaps = 2/350 (0%)
 Frame = -3

Query: 1419 IAAFASLDESNDNLIRQVVSNDDENDLVLNAHHHFSNFITKFGKKYADEIEHAYRFSVFK 1240
            +A+  S ++ +D LI QVVS D E+DL LNA HHF++F +KFGK YA + EH YRF VFK
Sbjct: 19   VASTVSSNDLDDPLIIQVVS-DGEDDL-LNAEHHFTSFKSKFGKTYATQEEHDYRFGVFK 76

Query: 1239 SNLQRAKLHQIIDPTAEHGVTKFSDLTPSEFGEKYLGL-RSLKLPADAHKAPILPTNDLP 1063
            +NL+RAK HQ+IDPTA HGVTKFSDLTP EF  ++LGL R L+LP DA+KAPILPT DLP
Sbjct: 77   ANLRRAKKHQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPTTDLP 136

Query: 1062 TEFDWRDHGAVTPIKDQGQCGSCWSFSTTGALEGAHFLATGDLVSLSEQQLVDCDHECDT 883
            T++DWRDHGAVT +KDQG CGSCWSFS TGALEGAH+LATG+L SLSEQQLVDCDHECD 
Sbjct: 137  TDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDP 196

Query: 882  EEASSCDNGCNGGLMNSALEYTLKAGGLQREEDYPYTGKD-SKCKFDKTKIAASVSNFSV 706
            EE  +CD+GC+GGLMN+A EY LKAGGL+REEDYPYTG D   CKFDK+K+ ASVSNFSV
Sbjct: 197  EEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSV 256

Query: 705  ISIDEEQIAANLVKNGPLAVGINAAYMQTYMAGVSCPYICSKRRLDHAILLVGFGSDGFA 526
            +SIDE+QIAANLVK+GPL+V INAA+MQTY+ GVSCPYICSKR+ DH +LLVG+GS G+A
Sbjct: 257  VSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQ-DHGVLLVGYGSAGYA 315

Query: 525  PIRLKEKPYWILKNSWGQNWGEDGFYKICRGPNVCGVDSMVSTVASVQLT 376
            PIR KEKP+WI+KNSWGQNWGE+G+YKICRG N+CGVDSMVSTVA++  T
Sbjct: 316  PIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAAIHTT 365


>ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
          Length = 377

 Score =  536 bits (1382), Expect = e-150
 Identities = 257/345 (74%), Positives = 293/345 (84%), Gaps = 7/345 (2%)
 Frame = -3

Query: 1389 NDNLIRQVV------SNDDENDLVLNAHHHFSNFITKFGKKYADEIEHAYRFSVFKSNLQ 1228
            +D +IRQVV         +E +L+   HHHFS F  +FGK YA + EH YRF VFK+NL+
Sbjct: 32   DDIIIRQVVPELGDVEGSEEENLLTADHHHFSIFKRRFGKSYASQEEHDYRFKVFKANLR 91

Query: 1227 RAKLHQIIDPTAEHGVTKFSDLTPSEFGEKYLGLRSLKLPADAHKAPILPTNDLPTEFDW 1048
            RA+ HQ +DP+A HGVT+FSDLTP+EF   YLGLR LKLP DA KAPILPTNDLP +FDW
Sbjct: 92   RARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGLRPLKLPHDAQKAPILPTNDLPEDFDW 151

Query: 1047 RDHGAVTPIKDQGQCGSCWSFSTTGALEGAHFLATGDLVSLSEQQLVDCDHECDTEEASS 868
            RDHGAVT +K+QG CGSCWSFSTTGALEGA+FLATG+LVSLSEQQLV+CDHECD EE  S
Sbjct: 152  RDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEEMGS 211

Query: 867  CDNGCNGGLMNSALEYTLKAGGLQREEDYPYTGKD-SKCKFDKTKIAASVSNFSVISIDE 691
            CD+GCNGGLMN+A EYTLKAGGL +EEDYPYTG D   CKFDKTKIAASVSNFSVIS+DE
Sbjct: 212  CDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAASVSNFSVISLDE 271

Query: 690  EQIAANLVKNGPLAVGINAAYMQTYMAGVSCPYICSKRRLDHAILLVGFGSDGFAPIRLK 511
            +QIAANLVKNGPLAV INA +MQTY+ GVSCPYICSK RLDH +LLVG+GS G+APIR+K
Sbjct: 272  DQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSK-RLDHGVLLVGYGSAGYAPIRMK 330

Query: 510  EKPYWILKNSWGQNWGEDGFYKICRGPNVCGVDSMVSTVASVQLT 376
            +KPYWI+KNSWG+NWGE+GFYKICRG NVCGVDSMVSTVA+V  T
Sbjct: 331  DKPYWIIKNSWGENWGENGFYKICRGRNVCGVDSMVSTVAAVHTT 375

