BLASTX nr result

ID: Angelica23_contig00004208 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00004208
         (1799 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAD29957.1| cysteine protease [Daucus carota]                     686   0.0  
dbj|BAD29960.1| cysteine protease [Daucus carota]                     628   e-177
dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]         615   e-173
ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum ...   611   e-172
dbj|BAD29958.1| cysteine protease [Daucus carota]                     605   e-171

>dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  686 bits (1771), Expect = 0.0
 Identities = 329/411 (80%), Positives = 353/411 (85%), Gaps = 3/411 (0%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSII YD+TH    +NSLIRTDDEVMT+YNSWLVKHGKSYNALGEKETRFQIFKDNLRY
Sbjct: 24   DMSIINYDQTH----TNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 79

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            IDNHNADP++S+ELGLN+FADLTNEEYR+KY+GTKSR+SRPKLSKG SDRYAPV GE LP
Sbjct: 80   IDNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELP 139

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            DSIDWREKGAVAAVKDQG CGSCWAFSAI +VEGINQI+TGELITLSEQELVDCDRSYNE
Sbjct: 140  DSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNE 199

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GCEGGLMDYAF FIIKNGGIDSD DYPY+GRDG C++NK+NAKVVTIDSYEDVPVYDEKA
Sbjct: 200  GCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKA 259

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            LQKA ANQPISVAIEAGG+DFQLYVSGIFTGKCGT VDH           G DYWIVRNS
Sbjct: 260  LQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNS 319

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSY---XXXXXXXXXXXXXXXXXXXXXXNV 576
            WGAAWGEAGYL+M+RNV  K SGLCG+TIEPSY                         NV
Sbjct: 320  WGAAWGEAGYLKMQRNV-GKSSGLCGITIEPSYPVKNGDNPPNPGPTPPSPPSPSLPDNV 378

Query: 575  CDKYSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVC 423
            CD Y+SC AHTTCCCLYT+GK+C+ WGCCPLEAASCCDDGYSCCPHDYPVC
Sbjct: 379  CDAYTSCPAHTTCCCLYTFGKQCFYWGCCPLEAASCCDDGYSCCPHDYPVC 429


>dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  628 bits (1619), Expect = e-177
 Identities = 301/435 (69%), Positives = 342/435 (78%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSIITYD+TH   S      TDD +M  Y SWLVKHGKSYNALGEKE RFQIFKDN  Y
Sbjct: 21   DMSIITYDQTHAVGS------TDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLY 74

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID  NA  ++SF+LGLN+FADLTNEEYRSKY G +++DSR K+S G+S RYA +AGESLP
Sbjct: 75   IDEQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVS-GKSQRYASLAGESLP 133

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +S+DWRE GAVA+VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSYNE
Sbjct: 134  ESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNE 193

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC GGLMD AF+FII NGGIDSD DYPY+GRDG CD+ +KNAKVVTIDSYEDVP YDEKA
Sbjct: 194  GCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKA 253

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            LQKA ANQPISVAIEA G DFQ Y SGIFTGKCGT++DH           GKDYWIVRNS
Sbjct: 254  LQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNS 313

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WGA WGE GYLRMER +++K +G+CG+T EPSY                      +VCD+
Sbjct: 314  WGADWGEKGYLRMERGISSK-AGICGITSEPSYPVKSGVNPPNPGPSPPSPKSPESVCDE 372

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            Y +C   TTCCC+Y Y   C+ WGCCPLE ASCCDDGYSCCPHDYPVC+V +GTCSMS N
Sbjct: 373  YYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMSNN 432

Query: 386  SPLGVKAMKRTRATP 342
            +PLGVKA++R  ATP
Sbjct: 433  NPLGVKAIQRILATP 447


>dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  615 bits (1585), Expect = e-173
 Identities = 291/435 (66%), Positives = 336/435 (77%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSII YD TH   +S S  RTDDEVM +Y SWLVKHGKSYNALGEKE RFQIFKDNLR+
Sbjct: 24   DMSIINYDATH---ASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRF 80

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID HNA+   S+++GLN+FADLTNEEYRS Y+G KS+   PKLSK +SDRYAP  G+SLP
Sbjct: 81   IDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSK---PKLSKVKSDRYAPRVGDSLP 137

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +S+DWR KGAVA +KDQG CGSCWAFS + +VEGINQI TGELITLSEQELVDCD+SYNE
Sbjct: 138  ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC+GGLMDY FEFII NGGID+DKDYPY GRD  CD+ +KNAKVVTIDSYEDVPV +E+A
Sbjct: 198  GCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEA 257

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            L+KAVA+QP+SV IE GG  FQ Y SGIFTGKCGT +DH           GKDYWIVRNS
Sbjct: 258  LKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNS 317

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WG++WGEAGY+RMERN+A    G CG+ +EPSY                       VCD 
Sbjct: 318  WGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDD 377

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            Y +C   +TCCC+Y Y   C+ WGCCPL+ A+CCDD YSCCPHDYPVC+V +GTCSMS N
Sbjct: 378  YYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSKN 437

Query: 386  SPLGVKAMKRTRATP 342
            +PLGVKA++R  ATP
Sbjct: 438  NPLGVKAIQRILATP 452


>ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
            gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease
            TDI-65 [Solanum lycopersicum] gi|2828252|emb|CAA05894.1|
            CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  611 bits (1575), Expect = e-172
 Identities = 290/437 (66%), Positives = 334/437 (76%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSII+YDETH         RTDDEV  +Y SWL++HGKSYNALGEK+ RFQIFKDNLRY
Sbjct: 26   DMSIISYDETHIHR------RTDDEVSALYESWLIEHGKSYNALGEKDKRFQIFKDNLRY 79

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID  N+ P +S++LGL KFADLTNEEYRS Y+GTKS   R KLSK +SDRY P  G+SLP
Sbjct: 80   IDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDRYLPKVGDSLP 139

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +SIDWREKG +  VKDQG CGSCWAFSA+A++E IN I TG LI+LSEQELVDCDRSYNE
Sbjct: 140  ESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNE 199

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC+GGLMDYAFEF+IKNGGID+++DYPY  R+G CD+ +KNAKVV IDSYEDVPV +EKA
Sbjct: 200  GCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNEKA 259

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            LQKAVA+QP+S+A+EAGG DFQ Y SGIFTGKCGT VDH           G DYWIVRNS
Sbjct: 260  LQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVRNS 319

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WGA WGE GYLR++RNVA+  SGLCGL IEPSY                        CD+
Sbjct: 320  WGANWGENGYLRVQRNVASS-SGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECDE 378

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            YS CA  TTCCC+  + + C+ WGCCPLE A+CC+D YSCCPHDYP+C+V  GTCSMS  
Sbjct: 379  YSQCAVGTTCCCILQFRRSCFSWGCCPLEGATCCEDHYSCCPHDYPICNVRQGTCSMSKG 438

Query: 386  SPLGVKAMKRTRATPIG 336
            +PLGVKAMKR  A PIG
Sbjct: 439  NPLGVKAMKRILAQPIG 455


>dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  605 bits (1561), Expect = e-171
 Identities = 289/426 (67%), Positives = 329/426 (77%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSIITYDETH         +TDDE  T++ SWLV HGKSYNALGE+E RFQIFK+NLRY
Sbjct: 21   DMSIITYDETHAVG-----FKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRY 75

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID  N   ++ F+LGLNKFADLTNEEYRSKY G KS+D R K+S  +S RYA ++GESLP
Sbjct: 76   IDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVS-AKSGRYATLSGESLP 134

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +S+DWRE GAVA VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSYNE
Sbjct: 135  ESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNE 194

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC GGLMDYAFEFII NGGID+D DYPY+GRDG CD+ +KNAKVVTIDSYEDVP YDE A
Sbjct: 195  GCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELA 254

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            L+KA ANQPISVAIEA G DFQ Y SGIFTGKCG  +DH           GKDYWIVRNS
Sbjct: 255  LKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRNS 314

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WGA WGE GYLRMER +++K +G+CG+ IEPSY                      +VCD+
Sbjct: 315  WGADWGENGYLRMERGISSK-TGICGIAIEPSYPVKTGVNPPNPGPSPPTPKTPESVCDE 373

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            Y +C   TTCCC+Y Y   C+ WGCCPLE ASCCDDGYSCCPHDYPVC+V +GTCSM  N
Sbjct: 374  YYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMKYN 433

Query: 386  SPLGVK 369
            +PLGV+
Sbjct: 434  NPLGVR 439

