BLASTX nr result
ID: Angelica23_contig00004208
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00004208
         (1799 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

dbj|BAD29957.1| cysteine protease [Daucus carota]                     686   0.0
dbj|BAD29960.1| cysteine protease [Daucus carota]                     628   e-177
dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]         615   e-173
ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum ...   611   e-172
dbj|BAD29958.1| cysteine protease [Daucus carota]                     605   e-171

>dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score = 686 bits (1771), Expect = 0.0
 Identities = 329/411 (80%), Positives = 353/411 (85%), Gaps = 3/411 (0%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSII YD+TH    +NSLIRTDDEVMT+YNSWLVKHGKSYNALGEKETRFQIFKDNLRY
Sbjct: 24   DMSIINYDQTH----TNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 79

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            IDNHNADP++S+ELGLN+FADLTNEEYR+KY+GTKSR+SRPKLSKG SDRYAPV GE LP
Sbjct: 80   IDNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELP 139

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            DSIDWREKGAVAAVKDQG CGSCWAFSAI +VEGINQI+TGELITLSEQELVDCDRSYNE
Sbjct: 140  DSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNE 199

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GCEGGLMDYAF FIIKNGGIDSD DYPY+GRDG C++NK+NAKVVTIDSYEDVPVYDEKA
Sbjct: 200  GCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKA 259

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            LQKA ANQPISVAIEAGG+DFQLYVSGIFTGKCGT VDH           G DYWIVRNS
Sbjct: 260  LQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNS 319

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSY---XXXXXXXXXXXXXXXXXXXXXXNV 576
            WGAAWGEAGYL+M+RNV  K SGLCG+TIEPSY                         NV
Sbjct: 320  WGAAWGEAGYLKMQRNV-GKSSGLCGITIEPSYPVKNGDNPPNPGPTPPSPPSPSLPDNV 378

Query: 575  CDKYSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVC 423
            CD Y+SC AHTTCCCLYT+GK+C+ WGCCPLEAASCCDDGYSCCPHDYPVC
Sbjct: 379  CDAYTSCPAHTTCCCLYTFGKQCFYWGCCPLEAASCCDDGYSCCPHDYPVC 429

>dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score = 628 bits (1619), Expect = e-177
 Identities = 301/435 (69%), Positives = 342/435 (78%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSIITYD+TH   S      TDD +M  Y SWLVKHGKSYNALGEKE RFQIFKDN  Y
Sbjct: 21   DMSIITYDQTHAVGS------TDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLY 74

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID  NA  ++SF+LGLN+FADLTNEEYRSKY G +++DSR K+S G+S RYA +AGESLP
Sbjct: 75   IDEQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVS-GKSQRYASLAGESLP 133

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +S+DWRE GAVA+VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSYNE
Sbjct: 134  ESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNE 193

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC GGLMD AF+FII NGGIDSD DYPY+GRDG CD+ +KNAKVVTIDSYEDVP YDEKA
Sbjct: 194  GCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKA 253

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            LQKA ANQPISVAIEA G DFQ Y SGIFTGKCGT++DH           GKDYWIVRNS
Sbjct: 254  LQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNS 313

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WGA WGE GYLRMER +++K +G+CG+T EPSY                      +VCD+
Sbjct: 314  WGADWGEKGYLRMERGISSK-AGICGITSEPSYPVKSGVNPPNPGPSPPSPKSPESVCDE 372

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            Y +C   TTCCC+Y Y   C+ WGCCPLE ASCCDDGYSCCPHDYPVC+V +GTCSMS N
Sbjct: 373  YYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMSNN 432

Query: 386  SPLGVKAMKRTRATP 342
            +PLGVKA++R  ATP
Sbjct: 433  NPLGVKAIQRILATP 447

>dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score = 615 bits (1585), Expect = e-173
 Identities = 291/435 (66%), Positives = 336/435 (77%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSII YD TH   +S S  RTDDEVM +Y SWLVKHGKSYNALGEKE RFQIFKDNLR+
Sbjct: 24   DMSIINYDATH---ASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRF 80

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID HNA+   S+++GLN+FADLTNEEYRS Y+G KS+   PKLSK +SDRYAP  G+SLP
Sbjct: 81   IDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSK---PKLSKVKSDRYAPRVGDSLP 137

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +S+DWR KGAVA +KDQG CGSCWAFS + +VEGINQI TGELITLSEQELVDCD+SYNE
Sbjct: 138  ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC+GGLMDY FEFII NGGID+DKDYPY GRD  CD+ +KNAKVVTIDSYEDVPV +E+A
Sbjct: 198  GCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEA 257

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            L+KAVA+QP+SV IE GG  FQ Y SGIFTGKCGT +DH           GKDYWIVRNS
Sbjct: 258  LKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNS 317

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WG++WGEAGY+RMERN+A    G CG+ +EPSY                       VCD
Sbjct: 318  WGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDD 377

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            Y +C   +TCCC+Y Y   C+ WGCCPL+ A+CCDD YSCCPHDYPVC+V +GTCSMS N
Sbjct: 378  YYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSKN 437

Query: 386  SPLGVKAMKRTRATP 342
            +PLGVKA++R  ATP
Sbjct: 438  NPLGVKAIQRILATP 452

>ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score = 611 bits (1575), Expect = e-172
 Identities = 290/437 (66%), Positives = 334/437 (76%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSII+YDETH         RTDDEV  +Y SWL++HGKSYNALGEK+ RFQIFKDNLRY
Sbjct: 26   DMSIISYDETHIHR------RTDDEVSALYESWLIEHGKSYNALGEKDKRFQIFKDNLRY 79

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID  N+ P +S++LGL KFADLTNEEYRS Y+GTKS   R KLSK +SDRY P  G+SLP
Sbjct: 80   IDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDRYLPKVGDSLP 139

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +SIDWREKG +  VKDQG CGSCWAFSA+A+++ IN I TG LI+LSEQELVDCDRSYNE
Sbjct: 140  ESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNE 199

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC+GGLMDYAFEF+IKNGGID++ DYPY  R+G CD+ +KNAKVV IDSYEDVPV +EKA
Sbjct: 200  GCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNEKA 259

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            LQKAVA+QP+S+A+EAGG DFQ Y SGIFTGKCGT VDH           G DYWIVRNS
Sbjct: 260  LQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVRNS 319

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WGA WGE GYLR+ +RNVA+ SGLCGL IEPSY                        CD+
Sbjct: 320  WGANWGENGYLRVQRNVASS-SGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECDE 378

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            YS CA  TTCCC+  + + C+ WGCCPLE A+CC+D YSCCPHDYP+C+V  GTCSMS
Sbjct: 379  YSQCAVGTTCCCILQFRRSCFSWGCCPLEGATCCEDHYSCCPHDYPICNVRQGTCSMSKG 438

Query: 386  SPLGVKAMKRTRATPIG 336
            +PLGVKAMKR  A PIG
Sbjct: 439  NPLGVKAMKRILAQPIG 455

>dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score = 605 bits (1561), Expect = e-171
 Identities = 289/426 (67%), Positives = 329/426 (77%)
 Frame = -1

Query: 1646 DMSIITYDETHKPSSSNSLIRTDDEVMTIYNSWLVKHGKSYNALGEKETRFQIFKDNLRY 1467
            DMSIITYDETH         +TDDE  T++ SWLV HGKSYNALGE+E RFQIFK+NLRY
Sbjct: 21   DMSIITYDETHAVG-----FKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRY 75

Query: 1466 IDNHNADPEKSFELGLNKFADLTNEEYRSKYMGTKSRDSRPKLSKGRSDRYAPVAGESLP 1287
            ID  N   ++ F+LGLNKFADLTNEEYRSKY G KS+D R K+S  +S RYA ++GESLP
Sbjct: 76   IDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVS-AKSGRYATLSGESLP 134

Query: 1286 DSIDWREKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 1107
            +S+DWRE GAVA VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSYNE
Sbjct: 135  ESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNE 194

Query: 1106 GCEGGLMDYAFEFIIKNGGIDSDKDYPYSGRDGYCDKNKKNAKVVTIDSYEDVPVYDEKA 927
            GC GGLMDYAFEFII NGGID+D DYPY+GRDG CD+ +KNAKVVTIDSYEDVP YDE A
Sbjct: 195  GCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELA 254

Query: 926  LQKAVANQPISVAIEAGGIDFQLYVSGIFTGKCGTNVDHXXXXXXXXXXXGKDYWIVRNS 747
            L+KA ANQPISVAIEA G DFQ Y SGIFTGKCG  +DH           GKDYWIVRNS
Sbjct: 255  LKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRNS 314

Query: 746  WGAAWGEAGYLRMERNVANKPSGLCGLTIEPSYXXXXXXXXXXXXXXXXXXXXXXNVCDK 567
            WGA WGE GYLRMER +++K +G+CG+ IEPSY                      +VCD+
Sbjct: 315  WGADWGENGYLRMERGISSK-TGICGIAIEPSYPVKTGVNPPNPGPSPPTPKTPESVCDE 373

Query: 566  YSSCAAHTTCCCLYTYGKECYIWGCCPLEAASCCDDGYSCCPHDYPVCHVYSGTCSMSTN 387
            Y +C   TTCCC+Y Y   C+ WGCCPLE ASCCDDGYSCCPHDYPVC+V +GTCSM  N
Sbjct: 374  YYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMKYN 433

Query: 386  SPLGVK 369
            +PLGV+
Sbjct: 434  NPLGVR 439