BLASTX nr result
ID: Angelica22_contig00003920
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00003920 (1727 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD29960.1| cysteine protease [Daucus carota] 854 0.0 dbj|BAD29958.1| cysteine protease [Daucus carota] 766 0.0 dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] 665 0.0 gb|ABR19827.1| cysteine proteinase [Elaeis guineensis] 645 0.0 ref|XP_002510170.1| cysteine protease, putative [Ricinus communi... 639 0.0 >dbj|BAD29960.1| cysteine protease [Daucus carota] Length = 460 Score = 854 bits (2206), Expect = 0.0 Identities = 405/460 (88%), Positives = 417/460 (90%) Frame = +3 Query: 24 MKMIXXXXXXXXXXXVSTAADMSIITYDQAHAVGTTDDVIMAAYETWLAKHGKSYNALGE 203 MKMI TAADMSIITYDQ HAVG+TDDVIMAAYE+WL KHGKSYNALGE Sbjct: 1 MKMILSLLSLSLLAAAVTAADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGE 60 Query: 204 TEQRFQIFKDNFLYIDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGK 383 EQRFQIFKDNFLYIDEQNA KDRSFKLGLNRFADLTNEEYRSKYTGI+TKDSRKK SGK Sbjct: 61 KEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGK 120 Query: 384 SERYATLAGESLPERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLS 563 S+RYA+LAGESLPE VDWRE GAVASVKDQG CGSCWAFSTISAVEGINQIATGKLITLS Sbjct: 121 SQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLS 180 Query: 564 EQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTI 743 EQELVDCDRSYNEGCNGGLMDDAFQFIINNGGID+DADYPYTGRDGQCDQYRKNAKVVTI Sbjct: 181 EQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTI 240 Query: 744 DSYEDVPAYDDKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYG 923 DSYEDVP YD+KALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGT+LDHGVVVVGYG Sbjct: 241 DSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYG 300 Query: 924 TENGKDYWIVRNSWGADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXX 1103 TENGKDYWIVRNSWGADWGEKGYLRMERGISS+AGICGI SEPSY Sbjct: 301 TENGKDYWIVRNSWGADWGEKGYLRMERGISSKAGICGITSEPSYPVKSGVNPPNPGPSP 360 Query: 1104 XXXXTPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVC 1283 +PESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPL+GASCCDDGYSCCPHDYPVC Sbjct: 361 PSPKSPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVC 420 Query: 1284 NVRAGTCSMSNNNPLGVKAIQRLLATPKWQHGSKGKKVTA 1403 NVRAGTCSMSNNNPLGVKAIQR+LATP WQHGSKGKKVTA Sbjct: 421 NVRAGTCSMSNNNPLGVKAIQRILATPNWQHGSKGKKVTA 460 >dbj|BAD29958.1| cysteine protease [Daucus carota] Length = 496 Score = 766 bits (1977), Expect = 0.0 Identities = 368/481 (76%), Positives = 396/481 (82%), Gaps = 1/481 (0%) Frame = +3 Query: 69 VSTAADMSIITYDQAHAVG-TTDDVIMAAYETWLAKHGKSYNALGETEQRFQIFKDNFLY 245 V+ A DMSIITYD+ HAVG TDD +E+WL HGKSYNALGE E+RFQIFK+N Y Sbjct: 16 VAAATDMSIITYDETHAVGFKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRY 75 Query: 246 IDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLPE 425 IDEQN +DR FKLGLN+FADLTNEEYRSKYTGIK+KD RKK S KS RYATL+GESLPE Sbjct: 76 IDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPE 135 Query: 426 RVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEG 605 VDWRE GAVA+VKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEG Sbjct: 136 SVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEG 195 Query: 606 CNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAYDDKAL 785 CNGGLMD AF+FIINNGGIDTD DYPYTGRDG+CDQYRKNAKVVTIDSYEDVPAYD+ AL Sbjct: 196 CNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELAL 255 Query: 786 QKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNSW 965 +KAAANQPISVAIEASGRDFQFYDSGIFTGKCG LDHGVVVVGYGTENGKDYWIVRNSW Sbjct: 256 KKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRNSW 315 Query: 966 GADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESVCDEYY 1145 GADWGE GYLRMERGISS+ GICGI EPSY TPESVCDEYY Sbjct: 316 GADWGENGYLRMERGISSKTGICGIAIEPSYPVKTGVNPPNPGPSPPTPKTPESVCDEYY 375 Query: 1146 TCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNP 1325 TCPMSTTCCCMYEYYGYCFAWGCCPL+GASCCDDGYSCCPHDYPVCNVRAGTCSM NNP Sbjct: 376 TCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMKYNNP 435 Query: 1326 LGVKAIQRLLATPKWQHGSKGKKVTA*EDSRKDP*VTFYGKTGAHRSDKFTTARFKLLKF 1505 LGV+ L +K +++ + + K P V F GKTGA+ DK TTA ++ Sbjct: 436 LGVRQSSAFLQLQTGNTEAKERRLLL-KKNPKGPRVMFSGKTGAYSRDKITTAELVRFRY 494 Query: 1506 Q 1508 + Sbjct: 495 E 495 >dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] Length = 462 Score = 665 bits (1717), Expect = 0.0 Identities = 312/440 (70%), Positives = 355/440 (80%), Gaps = 4/440 (0%) Frame = +3 Query: 72 STAADMSIITYDQAHAVGT---TDDVIMAAYETWLAKHGKSYNALGETEQRFQIFKDNFL 242 S+A DMSII YD HA + TDD +MA YE+WL KHGKSYNALGE E+RFQIFKDN Sbjct: 20 SSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLR 79 Query: 243 YIDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLP 422 +IDE NA+++ S+K+GLNRFADLTNEEYRS Y G K+K K KS+RYA G+SLP Sbjct: 80 FIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKV--KSDRYAPRVGDSLP 137 Query: 423 ERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNE 602 E VDWR +GAVA +KDQGSCGSCWAFST++AVEGINQI TG+LITLSEQELVDCD+SYNE Sbjct: 138 ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197 Query: 603 GCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAYDDKA 782 GC+GGLMD F+FIINNGGIDTD DYPY GRD +CDQYRKNAKVVTIDSYEDVP +++A Sbjct: 198 GCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEA 257 Query: 783 LQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNS 962 L+KA A+QP+SV IE GR FQFYDSGIFTGKCGT LDHGV VVGYGTE GKDYWIVRNS Sbjct: 258 LKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNS 317 Query: 963 WGADWGEKGYLRMERGIS-SRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESVCDE 1139 WG+ WGE GY+RMER ++ + G CGI EPSY P +VCD+ Sbjct: 318 WGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDD 377 Query: 1140 YYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNN 1319 YYTCP S+TCCC+YEYYGYCF+WGCCPLDGA+CCDD YSCCPHDYPVCNV+AGTCSMS N Sbjct: 378 YYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSKN 437 Query: 1320 NPLGVKAIQRLLATPKWQHG 1379 NPLGVKAIQR+LATP + G Sbjct: 438 NPLGVKAIQRILATPNRETG 457 >gb|ABR19827.1| cysteine proteinase [Elaeis guineensis] Length = 470 Score = 645 bits (1665), Expect = 0.0 Identities = 300/451 (66%), Positives = 351/451 (77%), Gaps = 7/451 (1%) Frame = +3 Query: 72 STAADMSIITYDQAHAVG---TTDDVIMAAYETWLAKHGKSYNALGETEQRFQIFKDNFL 242 S A DMSII+YD+AH V +++ + YE WLAKHG++YNALGE E+RF+IFKDN L Sbjct: 20 SAAPDMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVL 79 Query: 243 YIDEQNAKKD---RSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGE 413 +ID NA D RSF+LGLNRFAD+TNEEYR+ Y G + R++ S+RY AGE Sbjct: 80 FIDAHNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRRRARVGSDRYRYNAGE 139 Query: 414 SLPERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRS 593 LPE VDWR +GAVA+VKDQGSCGSCWAFST++AVEGIN+I TG LI+LSEQELVDCD Sbjct: 140 DLPESVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNG 199 Query: 594 YNEGCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAYD 773 YN+GCNGGLMD F+FIINNGGIDT+ DYPYT RDG+CDQYRKNAKVV+ID YEDVP D Sbjct: 200 YNQGCNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVND 259 Query: 774 DKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIV 953 +KALQKA ANQP+SVAIEA GR+FQ Y SGIFTG+CGT+LDHGVV VGYGTENGKDYWIV Sbjct: 260 EKALQKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIV 319 Query: 954 RNSWGADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESVC 1133 RNSWG DWGE GY+RMER +++ G CGI EPSY +P +VC Sbjct: 320 RNSWGGDWGESGYIRMERNVNTSTGKCGIAIEPSYPTKKGQNPPKPAPSPPSPVSPPTVC 379 Query: 1134 DEYYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMS 1313 D YY+CP STTCCC+YEY YCFAWGCCPL+GA+CC+D YSCCPHDYPVCNV+AGTC +S Sbjct: 380 DNYYSCPSSTTCCCVYEYGRYCFAWGCCPLEGATCCEDHYSCCPHDYPVCNVKAGTCQLS 439 Query: 1314 NNNPLGVKAIQRLLATPKWQH-GSKGKKVTA 1403 +NPLGVKA+ R A P W G+ GKK+ A Sbjct: 440 KDNPLGVKALARTPAKPHWAFLGAGGKKINA 470 >ref|XP_002510170.1| cysteine protease, putative [Ricinus communis] gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis] Length = 469 Score = 639 bits (1647), Expect = 0.0 Identities = 297/451 (65%), Positives = 349/451 (77%), Gaps = 6/451 (1%) Frame = +3 Query: 69 VSTAADMSIITYDQAHAVGT---TDDVIMAAYETWLAKHGKSY---NALGETEQRFQIFK 230 +S+A DMSI++YDQ H + TDD +MA YE WL K+GK++ NALGE E+RFQ+FK Sbjct: 20 LSSALDMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFK 79 Query: 231 DNFLYIDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAG 410 DN +IDE N++ +RS+K+GLNRFADLTNEEYRS Y G ++ R + S S RY G Sbjct: 80 DNLRFIDEHNSE-NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVG 138 Query: 411 ESLPERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDR 590 +SLP+ VDWR++GAVA VKDQGSCGSCWAFSTI+AVEGIN+I TG LI+LSEQELVDCDR Sbjct: 139 DSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDR 198 Query: 591 SYNEGCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAY 770 SYNEGCNGGLMD AFQFIINNGGID++ DYPY RDG CD YRKNAKVVTID+YEDVP Sbjct: 199 SYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVN 258 Query: 771 DDKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWI 950 D+KALQKA ANQP+SVAIEA GR+FQFY SGIFTG+CGT LDHGV VGYGTENGKDYWI Sbjct: 259 DEKALQKAVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGTENGKDYWI 318 Query: 951 VRNSWGADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESV 1130 VRNSWG WGE GY+RMER I++ G CGI EPSY P SV Sbjct: 319 VRNSWGKSWGESGYIRMERNIATATGKCGIAIEPSYPIKKGQNPPNPGPSPPSPIKPPSV 378 Query: 1131 CDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSM 1310 CD Y++CP STTCCC++EY YCF WGCCPL+GA+CCDD YSCCPHDYPVCN+ GTC + Sbjct: 379 CDSYFSCPESTTCCCIFEYAKYCFEWGCCPLEGATCCDDHYSCCPHDYPVCNINEGTCLI 438 Query: 1311 SNNNPLGVKAIQRLLATPKWQHGSKGKKVTA 1403 +NP GVKA++R A P W +G +G+K +A Sbjct: 439 GKDNPFGVKAMRRTPAKPHWAYGLEGRKNSA 469