BLASTX nr result
ID: Angelica23_contig00001031
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00001031 (1706 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD29960.1| cysteine protease [Daucus carota] 599 0.0 dbj|BAD29958.1| cysteine protease [Daucus carota] 552 e-163 dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] 470 e-138 ref|XP_002510170.1| cysteine protease, putative [Ricinus communi... 456 e-133 dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] 456 e-132 >dbj|BAD29960.1| cysteine protease [Daucus carota] Length = 460 Score = 599 bits (1544), Expect(2) = 0.0 Identities = 290/369 (78%), Positives = 298/369 (80%), Gaps = 7/369 (1%) Frame = -2 Query: 1411 RFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLPERVDWREQGAVASVKDQG 1232 RFADLTNEEYRSKYTGI+TKDSRKK SGKS+RYA+LAGESLPE VDWRE GAVASVKDQG Sbjct: 92 RFADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQG 151 Query: 1231 SCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNG 1052 CGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNG Sbjct: 152 QCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNG 211 Query: 1051 GID-------TXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXISVAIEASG 893 GID T ISVAIEASG Sbjct: 212 GIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASG 271 Query: 892 RDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGIS 713 RDFQFYDSGIFTGKCGT+LDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGIS Sbjct: 272 RDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGIS 331 Query: 712 SRAGICGILSEPSYXXXXXXXXXXXXXXXXXXKTPESVCDEYYTCPMSTTCCCMYEYYGY 533 S+AGICGI SEPSY K+PESVCDEYYTCPMSTTCCCMYEYYGY Sbjct: 332 
SKAGICGITSEPSYPVKSGVNPPNPGPSPPSPKSPESVCDEYYTCPMSTTCCCMYEYYGY 391 Query: 532 CFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQRLLATPKWQH 353 CFAWGCCPL+GASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQR+LATP WQH Sbjct: 392 CFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQRILATPNWQH 451 Query: 352 GSKGKKVTA 326 GSKGKKVTA Sbjct: 452 GSKGKKVTA 460 Score = 74.7 bits (182), Expect(2) = 0.0 Identities = 38/54 (70%), Positives = 40/54 (74%) Frame = -3 Query: 1683 MKMIXXXXXXXXXXAVSTAADMSIITYDQAHAVGTTDDVIMAAYETWLAKHGKS 1522 MKMI A TAADMSIITYDQ HAVG+TDDVIMAAYE+WL KHGKS Sbjct: 1 MKMILSLLSLSLLAAAVTAADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKS 54 >dbj|BAD29958.1| cysteine protease [Daucus carota] Length = 496 Score = 552 bits (1423), Expect(2) = e-163 Identities = 275/404 (68%), Positives = 293/404 (72%), Gaps = 7/404 (1%) Frame = -2 Query: 1411 RFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLPERVDWREQGAVASVKDQG 1232 +FADLTNEEYRSKYTGIK+KD RKK S KS RYATL+GESLPE VDWRE GAVA+VKDQG Sbjct: 93 KFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQG 152 Query: 1231 SCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNG 1052 SCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMD AF+FIINNG Sbjct: 153 SCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNG 212 Query: 1051 GIDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------ISVAIEASG 893 GIDT ISVAIEASG Sbjct: 213 GIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASG 272 Query: 892 RDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGIS 713 RDFQFYDSGIFTGKCG LDHGVVVVGYGTENGKDYWIVRNSWGADWGE GYLRMERGIS Sbjct: 273 RDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRNSWGADWGENGYLRMERGIS 332 Query: 712 SRAGICGILSEPSYXXXXXXXXXXXXXXXXXXKTPESVCDEYYTCPMSTTCCCMYEYYGY 533 S+ GICGI EPSY KTPESVCDEYYTCPMSTTCCCMYEYYGY Sbjct: 333 SKTGICGIAIEPSYPVKTGVNPPNPGPSPPTPKTPESVCDEYYTCPMSTTCCCMYEYYGY 392 Query: 532 CFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQRLLATPKWQH 353 CFAWGCCPL+GASCCDDGYSCCPHDYPVCNVRAGTCSM NNPLGV+ L Sbjct: 393 
CFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMKYNNPLGVRQSSAFLQLQTGNT 452 Query: 352 GSKGKKVTA*EDSRKDP*VTFYGKTGAHRSDKFTTARFKLLKFQ 221 +K +++ + + K P V F GKTGA+ DK TTA +++ Sbjct: 453 EAKERRLLL-KKNPKGPRVMFSGKTGAYSRDKITTAELVRFRYE 495 Score = 50.4 bits (119), Expect(2) = e-163 Identities = 24/40 (60%), Positives = 28/40 (70%), Gaps = 1/40 (2%) Frame = -3 Query: 1638 VSTAADMSIITYDQAHAVG-TTDDVIMAAYETWLAKHGKS 1522 V+ A DMSIITYD+ HAVG TDD +E+WL HGKS Sbjct: 16 VAAATDMSIITYDETHAVGFKTDDEATTLFESWLVTHGKS 55 >dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] Length = 462 Score = 470 bits (1209), Expect(2) = e-138 Identities = 227/362 (62%), Positives = 253/362 (69%), Gaps = 8/362 (2%) Frame = -2 Query: 1411 RFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLPERVDWREQGAVASVKDQG 1232 RFADLTNEEYRS Y G K+K K KS+RYA G+SLPE VDWR +GAVA +KDQG Sbjct: 98 RFADLTNEEYRSTYLGAKSKPKLSKV--KSDRYAPRVGDSLPESVDWRAKGAVAPIKDQG 155 Query: 1231 SCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNG 1052 SCGSCWAFST++AVEGINQI TG+LITLSEQELVDCD+SYNEGC+GGLMD F+FIINNG Sbjct: 156 SCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNG 215 Query: 1051 GIDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------ISVAIEASG 893 GIDT +SV IE G Sbjct: 216 GIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGG 275 Query: 892 RDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGIS 713 R FQFYDSGIFTGKCGT LDHGV VVGYGTE GKDYWIVRNSWG+ WGE GY+RMER ++ Sbjct: 276 RAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNSWGSSWGEAGYIRMERNLA 335 Query: 712 SRA-GICGILSEPSYXXXXXXXXXXXXXXXXXXKTPESVCDEYYTCPMSTTCCCMYEYYG 536 + G CGI EPSY P +VCD+YYTCP S+TCCC+YEYYG Sbjct: 336 GTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDDYYTCPESSTCCCVYEYYG 395 Query: 535 YCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQRLLATPKWQ 356 YCF+WGCCPLDGA+CCDD YSCCPHDYPVCNV+AGTCSMS NNPLGVKAIQR+LATP + Sbjct: 396 YCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSKNNPLGVKAIQRILATPNRE 455 Query: 355 HG 350 G Sbjct: 456 TG 457 Score = 50.8 bits (120), Expect(2) = e-138 Identities = 25/41 
(60%), Positives = 29/41 (70%), Gaps = 3/41 (7%) Frame = -3 Query: 1635 STAADMSIITYDQAHAVGT---TDDVIMAAYETWLAKHGKS 1522 S+A DMSII YD HA + TDD +MA YE+WL KHGKS Sbjct: 20 SSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKS 60 >ref|XP_002510170.1| cysteine protease, putative [Ricinus communis] gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis] Length = 469 Score = 456 bits (1173), Expect(2) = e-133 Identities = 216/369 (58%), Positives = 248/369 (67%), Gaps = 7/369 (1%) Frame = -2 Query: 1411 RFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLPERVDWREQGAVASVKDQG 1232 RFADLTNEEYRS Y G ++ R + S S RY G+SLP+ VDWR++GAVA VKDQG Sbjct: 101 RFADLTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQG 160 Query: 1231 SCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNG 1052 SCGSCWAFSTI+AVEGIN+I TG LI+LSEQELVDCDRSYNEGCNGGLMD AFQFIINNG Sbjct: 161 SCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNG 220 Query: 1051 GIDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------ISVAIEASG 893 GID+ +SVAIEA G Sbjct: 221 GIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGG 280 Query: 892 RDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGIS 713 R+FQFY SGIFTG+CGT LDHGV VGYGTENGKDYWIVRNSWG WGE GY+RMER I+ Sbjct: 281 REFQFYQSGIFTGRCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYIRMERNIA 340 Query: 712 SRAGICGILSEPSYXXXXXXXXXXXXXXXXXXKTPESVCDEYYTCPMSTTCCCMYEYYGY 533 + G CGI EPSY P SVCD Y++CP STTCCC++EY Y Sbjct: 341 TATGKCGIAIEPSYPIKKGQNPPNPGPSPPSPIKPPSVCDSYFSCPESTTCCCIFEYAKY 400 Query: 532 CFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQRLLATPKWQH 353 CF WGCCPL+GA+CCDD YSCCPHDYPVCN+ GTC + +NP GVKA++R A P W + Sbjct: 401 CFEWGCCPLEGATCCDDHYSCCPHDYPVCNINEGTCLIGKDNPFGVKAMRRTPAKPHWAY 460 Query: 352 GSKGKKVTA 326 G +G+K +A Sbjct: 461 GLEGRKNSA 469 Score = 47.8 bits (112), Expect(2) = e-133 Identities = 22/42 (52%), Positives = 30/42 (71%), Gaps = 3/42 (7%) Frame = -3 Query: 1638 VSTAADMSIITYDQAHAVGT---TDDVIMAAYETWLAKHGKS 1522 +S+A DMSI++YDQ H + TDD +MA YE WL K+GK+ Sbjct: 20 
LSSALDMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKA 61 >dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] Length = 461 Score = 456 bits (1173), Expect(2) = e-132 Identities = 217/357 (60%), Positives = 248/357 (69%), Gaps = 8/357 (2%) Frame = -2 Query: 1411 RFADLTNEEYRSKYTGIKTKDSRKKDSG-KSERYATLAGESLPERVDWREQGAVASVKDQ 1235 +FADLTNEEYR YTGIKT D +KK S KS+RYA +G+SLPE VDWREQGAV VKDQ Sbjct: 99 KFADLTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQ 158 Query: 1234 GSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINN 1055 GSCGSCWAFST +VEG+N+I TG LI++SEQELV+CD SYN+GCNGGLMD AF+FII N Sbjct: 159 GSCGSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKN 218 Query: 1054 GGIDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------ISVAIEAS 896 GGIDT ++VAIEA Sbjct: 219 GGIDTEEDYPYTGKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAG 278 Query: 895 GRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERGI 716 GRDFQFY SGIFTG CGT LDHGV+ GYGTE+GKDYW+V+NSWGA+WGE GYL+MER I Sbjct: 279 GRDFQFYTSGIFTGSCGTALDHGVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNI 338 Query: 715 SSRAGICGILSEPSYXXXXXXXXXXXXXXXXXXKTPESVCDEYYTCPMSTTCCCMYEYYG 536 + ++G CGI E SY PE VCDEY TCP STTCCC+YEYYG Sbjct: 339 ADKSGKCGIAMEASYPIKNGDNPPNPGPTPPSPAAPEVVCDEYSTCPESTTCCCIYEYYG 398 Query: 535 YCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNPLGVKAIQRLLATP 365 YCFAWGCCPL+GASCCDD YSCCPHDYP+CNVR GTCS S N+PL + A +R+LATP Sbjct: 399 YCFAWGCCPLEGASCCDDHYSCCPHDYPICNVRRGTCSKSRNSPLEISATKRILATP 455 Score = 43.1 bits (100), Expect(2) = e-132 Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 7/46 (15%) Frame = -3 Query: 1638 VSTAADMSIITYDQAHAVGT-------TDDVIMAAYETWLAKHGKS 1522 + +A DMSII YD H + TDD + A YE+WL KHGK+ Sbjct: 17 IISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKT 62
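
The hit summary near the top of this report lists each subject sequence with its bit score and E-value. As an illustrative sketch only (it is not part of the BLAST output), the Python below parses those five summary lines as they appear in the flattened text. It assumes that a bare exponent such as e-163 is shorthand for 1e-163, which is how legacy BLAST text reports are usually read. The regular expression and the helper names parse_evalue and parse_summary_line are hypothetical, introduced here for illustration.

```python
import re

# Hit-summary lines copied from the report above: identifier, description,
# bit score, E-value, separated by single spaces in the flattened text.
SUMMARY_LINES = [
    "dbj|BAD29960.1| cysteine protease [Daucus carota] 599 0.0",
    "dbj|BAD29958.1| cysteine protease [Daucus carota] 552 e-163",
    "dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] 470 e-138",
    "ref|XP_002510170.1| cysteine protease, putative [Ricinus communi... 456 e-133",
    "dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] 456 e-132",
]

# Assumed layout: <id> <free-text description> <bits> <evalue>.
# The description is matched lazily so the last two fields stay separate.
LINE_RE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert an E-value string to float, reading 'e-163' as 1e-163 (assumption)."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_summary_line(line):
    """Split one hit-summary line into id, description, bit score and E-value."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised summary line: {line!r}")
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "bits": float(match.group("bits")),
        "evalue": parse_evalue(match.group("evalue")),
    }


hits = [parse_summary_line(line) for line in SUMMARY_LINES]

for hit in hits:
    print(hit["id"], hit["bits"], hit["evalue"])
```

Keeping the E-value as a float makes it straightforward to filter or sort hits by significance. The truncated description for the Ricinus communis hit is kept exactly as the report printed it.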