BLASTX nr result

ID: Angelica22_contig00003920 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00003920
         (1727 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAD29960.1| cysteine protease [Daucus carota]                     854   0.0  
dbj|BAD29958.1| cysteine protease [Daucus carota]                     766   0.0  
dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]         665   0.0  
gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]                645   0.0  
ref|XP_002510170.1| cysteine protease, putative [Ricinus communi...   639   0.0  

>dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  854 bits (2206), Expect = 0.0
 Identities = 405/460 (88%), Positives = 417/460 (90%)
 Frame = +3

Query: 24   MKMIXXXXXXXXXXXVSTAADMSIITYDQAHAVGTTDDVIMAAYETWLAKHGKSYNALGE 203
            MKMI             TAADMSIITYDQ HAVG+TDDVIMAAYE+WL KHGKSYNALGE
Sbjct: 1    MKMILSLLSLSLLAAAVTAADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGE 60

Query: 204  TEQRFQIFKDNFLYIDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGK 383
             EQRFQIFKDNFLYIDEQNA KDRSFKLGLNRFADLTNEEYRSKYTGI+TKDSRKK SGK
Sbjct: 61   KEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGK 120

Query: 384  SERYATLAGESLPERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLS 563
            S+RYA+LAGESLPE VDWRE GAVASVKDQG CGSCWAFSTISAVEGINQIATGKLITLS
Sbjct: 121  SQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLS 180

Query: 564  EQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTI 743
            EQELVDCDRSYNEGCNGGLMDDAFQFIINNGGID+DADYPYTGRDGQCDQYRKNAKVVTI
Sbjct: 181  EQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTI 240

Query: 744  DSYEDVPAYDDKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYG 923
            DSYEDVP YD+KALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGT+LDHGVVVVGYG
Sbjct: 241  DSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYG 300

Query: 924  TENGKDYWIVRNSWGADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXX 1103
            TENGKDYWIVRNSWGADWGEKGYLRMERGISS+AGICGI SEPSY               
Sbjct: 301  TENGKDYWIVRNSWGADWGEKGYLRMERGISSKAGICGITSEPSYPVKSGVNPPNPGPSP 360

Query: 1104 XXXXTPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVC 1283
                +PESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPL+GASCCDDGYSCCPHDYPVC
Sbjct: 361  PSPKSPESVCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVC 420

Query: 1284 NVRAGTCSMSNNNPLGVKAIQRLLATPKWQHGSKGKKVTA 1403
            NVRAGTCSMSNNNPLGVKAIQR+LATP WQHGSKGKKVTA
Sbjct: 421  NVRAGTCSMSNNNPLGVKAIQRILATPNWQHGSKGKKVTA 460


>dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  766 bits (1977), Expect = 0.0
 Identities = 368/481 (76%), Positives = 396/481 (82%), Gaps = 1/481 (0%)
 Frame = +3

Query: 69   VSTAADMSIITYDQAHAVG-TTDDVIMAAYETWLAKHGKSYNALGETEQRFQIFKDNFLY 245
            V+ A DMSIITYD+ HAVG  TDD     +E+WL  HGKSYNALGE E+RFQIFK+N  Y
Sbjct: 16   VAAATDMSIITYDETHAVGFKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRY 75

Query: 246  IDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLPE 425
            IDEQN  +DR FKLGLN+FADLTNEEYRSKYTGIK+KD RKK S KS RYATL+GESLPE
Sbjct: 76   IDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPE 135

Query: 426  RVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEG 605
             VDWRE GAVA+VKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEG
Sbjct: 136  SVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEG 195

Query: 606  CNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAYDDKAL 785
            CNGGLMD AF+FIINNGGIDTD DYPYTGRDG+CDQYRKNAKVVTIDSYEDVPAYD+ AL
Sbjct: 196  CNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELAL 255

Query: 786  QKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNSW 965
            +KAAANQPISVAIEASGRDFQFYDSGIFTGKCG  LDHGVVVVGYGTENGKDYWIVRNSW
Sbjct: 256  KKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVRNSW 315

Query: 966  GADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESVCDEYY 1145
            GADWGE GYLRMERGISS+ GICGI  EPSY                   TPESVCDEYY
Sbjct: 316  GADWGENGYLRMERGISSKTGICGIAIEPSYPVKTGVNPPNPGPSPPTPKTPESVCDEYY 375

Query: 1146 TCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNNP 1325
            TCPMSTTCCCMYEYYGYCFAWGCCPL+GASCCDDGYSCCPHDYPVCNVRAGTCSM  NNP
Sbjct: 376  TCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMKYNNP 435

Query: 1326 LGVKAIQRLLATPKWQHGSKGKKVTA*EDSRKDP*VTFYGKTGAHRSDKFTTARFKLLKF 1505
            LGV+     L        +K +++   + + K P V F GKTGA+  DK TTA     ++
Sbjct: 436  LGVRQSSAFLQLQTGNTEAKERRLLL-KKNPKGPRVMFSGKTGAYSRDKITTAELVRFRY 494

Query: 1506 Q 1508
            +
Sbjct: 495  E 495


>dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  665 bits (1717), Expect = 0.0
 Identities = 312/440 (70%), Positives = 355/440 (80%), Gaps = 4/440 (0%)
 Frame = +3

Query: 72   STAADMSIITYDQAHAVGT---TDDVIMAAYETWLAKHGKSYNALGETEQRFQIFKDNFL 242
            S+A DMSII YD  HA  +   TDD +MA YE+WL KHGKSYNALGE E+RFQIFKDN  
Sbjct: 20   SSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLR 79

Query: 243  YIDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGESLP 422
            +IDE NA+++ S+K+GLNRFADLTNEEYRS Y G K+K    K   KS+RYA   G+SLP
Sbjct: 80   FIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKV--KSDRYAPRVGDSLP 137

Query: 423  ERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNE 602
            E VDWR +GAVA +KDQGSCGSCWAFST++AVEGINQI TG+LITLSEQELVDCD+SYNE
Sbjct: 138  ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197

Query: 603  GCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAYDDKA 782
            GC+GGLMD  F+FIINNGGIDTD DYPY GRD +CDQYRKNAKVVTIDSYEDVP  +++A
Sbjct: 198  GCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEA 257

Query: 783  LQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIVRNS 962
            L+KA A+QP+SV IE  GR FQFYDSGIFTGKCGT LDHGV VVGYGTE GKDYWIVRNS
Sbjct: 258  LKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNS 317

Query: 963  WGADWGEKGYLRMERGIS-SRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESVCDE 1139
            WG+ WGE GY+RMER ++ +  G CGI  EPSY                    P +VCD+
Sbjct: 318  WGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQNPPNPGPSPPTPVRPPTVCDD 377

Query: 1140 YYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMSNN 1319
            YYTCP S+TCCC+YEYYGYCF+WGCCPLDGA+CCDD YSCCPHDYPVCNV+AGTCSMS N
Sbjct: 378  YYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSMSKN 437

Query: 1320 NPLGVKAIQRLLATPKWQHG 1379
            NPLGVKAIQR+LATP  + G
Sbjct: 438  NPLGVKAIQRILATPNRETG 457


>gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  645 bits (1665), Expect = 0.0
 Identities = 300/451 (66%), Positives = 351/451 (77%), Gaps = 7/451 (1%)
 Frame = +3

Query: 72   STAADMSIITYDQAHAVG---TTDDVIMAAYETWLAKHGKSYNALGETEQRFQIFKDNFL 242
            S A DMSII+YD+AH V     +++ +   YE WLAKHG++YNALGE E+RF+IFKDN L
Sbjct: 20   SAAPDMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVL 79

Query: 243  YIDEQNAKKD---RSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAGE 413
            +ID  NA  D   RSF+LGLNRFAD+TNEEYR+ Y G +    R++    S+RY   AGE
Sbjct: 80   FIDAHNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRRRARVGSDRYRYNAGE 139

Query: 414  SLPERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRS 593
             LPE VDWR +GAVA+VKDQGSCGSCWAFST++AVEGIN+I TG LI+LSEQELVDCD  
Sbjct: 140  DLPESVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNG 199

Query: 594  YNEGCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAYD 773
            YN+GCNGGLMD  F+FIINNGGIDT+ DYPYT RDG+CDQYRKNAKVV+ID YEDVP  D
Sbjct: 200  YNQGCNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVND 259

Query: 774  DKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWIV 953
            +KALQKA ANQP+SVAIEA GR+FQ Y SGIFTG+CGT+LDHGVV VGYGTENGKDYWIV
Sbjct: 260  EKALQKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIV 319

Query: 954  RNSWGADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESVC 1133
            RNSWG DWGE GY+RMER +++  G CGI  EPSY                   +P +VC
Sbjct: 320  RNSWGGDWGESGYIRMERNVNTSTGKCGIAIEPSYPTKKGQNPPKPAPSPPSPVSPPTVC 379

Query: 1134 DEYYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSMS 1313
            D YY+CP STTCCC+YEY  YCFAWGCCPL+GA+CC+D YSCCPHDYPVCNV+AGTC +S
Sbjct: 380  DNYYSCPSSTTCCCVYEYGRYCFAWGCCPLEGATCCEDHYSCCPHDYPVCNVKAGTCQLS 439

Query: 1314 NNNPLGVKAIQRLLATPKWQH-GSKGKKVTA 1403
             +NPLGVKA+ R  A P W   G+ GKK+ A
Sbjct: 440  KDNPLGVKALARTPAKPHWAFLGAGGKKINA 470


>ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
            gi|223550871|gb|EEF52357.1| cysteine protease, putative
            [Ricinus communis]
          Length = 469

 Score =  639 bits (1647), Expect = 0.0
 Identities = 297/451 (65%), Positives = 349/451 (77%), Gaps = 6/451 (1%)
 Frame = +3

Query: 69   VSTAADMSIITYDQAHAVGT---TDDVIMAAYETWLAKHGKSY---NALGETEQRFQIFK 230
            +S+A DMSI++YDQ H   +   TDD +MA YE WL K+GK++   NALGE E+RFQ+FK
Sbjct: 20   LSSALDMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFK 79

Query: 231  DNFLYIDEQNAKKDRSFKLGLNRFADLTNEEYRSKYTGIKTKDSRKKDSGKSERYATLAG 410
            DN  +IDE N++ +RS+K+GLNRFADLTNEEYRS Y G ++   R + S  S RY    G
Sbjct: 80   DNLRFIDEHNSE-NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVG 138

Query: 411  ESLPERVDWREQGAVASVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDR 590
            +SLP+ VDWR++GAVA VKDQGSCGSCWAFSTI+AVEGIN+I TG LI+LSEQELVDCDR
Sbjct: 139  DSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDR 198

Query: 591  SYNEGCNGGLMDDAFQFIINNGGIDTDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPAY 770
            SYNEGCNGGLMD AFQFIINNGGID++ DYPY  RDG CD YRKNAKVVTID+YEDVP  
Sbjct: 199  SYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVN 258

Query: 771  DDKALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTELDHGVVVVGYGTENGKDYWI 950
            D+KALQKA ANQP+SVAIEA GR+FQFY SGIFTG+CGT LDHGV  VGYGTENGKDYWI
Sbjct: 259  DEKALQKAVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGTENGKDYWI 318

Query: 951  VRNSWGADWGEKGYLRMERGISSRAGICGILSEPSYXXXXXXXXXXXXXXXXXXXTPESV 1130
            VRNSWG  WGE GY+RMER I++  G CGI  EPSY                    P SV
Sbjct: 319  VRNSWGKSWGESGYIRMERNIATATGKCGIAIEPSYPIKKGQNPPNPGPSPPSPIKPPSV 378

Query: 1131 CDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLDGASCCDDGYSCCPHDYPVCNVRAGTCSM 1310
            CD Y++CP STTCCC++EY  YCF WGCCPL+GA+CCDD YSCCPHDYPVCN+  GTC +
Sbjct: 379  CDSYFSCPESTTCCCIFEYAKYCFEWGCCPLEGATCCDDHYSCCPHDYPVCNINEGTCLI 438

Query: 1311 SNNNPLGVKAIQRLLATPKWQHGSKGKKVTA 1403
              +NP GVKA++R  A P W +G +G+K +A
Sbjct: 439  GKDNPFGVKAMRRTPAKPHWAYGLEGRKNSA 469


Top