BLASTX nr result

ID: Angelica23_contig00001270 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00001270
         (2100 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   577   e-162
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   543   e-152
ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha...   541   e-151
dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]     529   e-147
ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   507   e-141

>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  577 bits (1486), Expect = e-162
 Identities = 281/458 (61%), Positives = 347/458 (75%), Gaps = 5/458 (1%)
 Frame = +1

Query: 238  LMIIICVILSCKSVVAFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDSLS 417
            L++++ ++ S      FGT+G+D+HHRYSD +KG+L  D LPEKG+ HYYA+MA RD L 
Sbjct: 22   LLLLVLMLSSSSFSYGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRDILI 81

Query: 418  RRRHL-ADNTDL-LSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLPC 591
              R L +DNT   L+F  GN+TYR SSLGFLHYANVS+GTPS+ +LVALDTGSDLFWLPC
Sbjct: 82   HGRKLVSDNTSTPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPC 141

Query: 592  DCVS--CVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVR 765
            DC +  CV+ L   SG QI+FNIY PN SSTS ++ C+N LC +++RC +A  TCPY+V+
Sbjct: 142  DCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQ 201

Query: 766  YLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGME 945
            YLS+ TSSTG LV D+LHL  D+ Q +A++A I FGCG VQTGSFLDGAAPNGLFGLGM 
Sbjct: 202  YLSNGTSSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMT 261

Query: 946  KVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITH 1125
             +SVPS LA EG  ++SFSMCFG DGIGRI FGD GSS Q ETPFN+ QL+PTYN+SIT 
Sbjct: 262  NISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITK 321

Query: 1126 VSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYAL 1305
            ++V     D+  SAIFDSGTSFTYL DPA+T+ISE+FN  A+EKR S  SD+PFEYCY +
Sbjct: 322  INVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEM 381

Query: 1306 SPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTG 1485
            S  Q +  +  VNL  +GG Q  V DP+V +  Q GA IYCL +VKS D++IIGQNFMTG
Sbjct: 382  SSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMTG 441

Query: 1486 YNIVFDREKKVLGWKASDCYDGNNSAALPINP-SRGAP 1596
            Y IVF+RE+ VLGWKASDCYD  ++   P++P S G P
Sbjct: 442  YRIVFNRERNVLGWKASDCYDDMDTTTFPVDPISPGIP 479


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  543 bits (1398), Expect = e-152
 Identities = 268/460 (58%), Positives = 337/460 (73%), Gaps = 7/460 (1%)
 Frame = +1

Query: 226  SNHVLMIIICVILSCKSVV----AFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAA 393
            S  +L + + ++L+   V+     FG +G++ HHR+SD + G+L  DGLP + +  YY  
Sbjct: 6    SCRILFLGLIILLASSWVLERCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRV 65

Query: 394  MARRDSLSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGS 570
            MA RD L R R LA ++  L++F DGN+T R+ +LGFLHYANV+VGTPS WFLVALDTGS
Sbjct: 66   MAHRDRLIRGRRLANEDQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGS 125

Query: 571  DLFWLPCDCVSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTC 750
            DLFWLPCDC +CVR+L    G  ++ NIYSPN SSTST V C++ LC + +RC++    C
Sbjct: 126  DLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNC 185

Query: 751  PYRVRYLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLF 930
            PY++RYLS+ TSSTG LV DVLHL  ++   KA+ A +  GCG VQTG F DGAAPNGLF
Sbjct: 186  PYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLF 245

Query: 931  GLGMEKVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYN 1110
            GLG+E +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN
Sbjct: 246  GLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYN 305

Query: 1111 ISITHVSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRV-SPNSDLPF 1287
            I++T +SVE N  D+   A+FDSGTSFTYLTD A+T+ISE+FNS+A +KR  + +S+LPF
Sbjct: 306  ITVTKISVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPF 365

Query: 1288 EYCYALSPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIG 1467
            EYCYALSP + SF+   VNLT KGG    V  P+V I  ++   +YCL ++K EDI IIG
Sbjct: 366  EYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAILKIEDISIIG 424

Query: 1468 QNFMTGYNIVFDREKKVLGWKASDCYDGNNSA-ALPINPS 1584
            QNFMTGY +VFDREK +LGWK SDCY G  SA  LP N S
Sbjct: 425  QNFMTGYRVVFDREKLILGWKESDCYTGETSARTLPSNRS 464


>ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  541 bits (1394), Expect = e-151
 Identities = 267/460 (58%), Positives = 337/460 (73%), Gaps = 7/460 (1%)
 Frame = +1

Query: 226  SNHVLMIIICVILSCKSVV----AFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAA 393
            S  +L + + ++L+   V+     FG +G++ HHR+SD + G+L  DGLP + +  YY  
Sbjct: 6    SCRILFLGLLILLASSWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRV 65

Query: 394  MARRDSLSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGS 570
            MA RD L R R LA ++  L++F DGN+T R+ +LGFLHYANV+VGTPS WF+VALDTGS
Sbjct: 66   MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGS 125

Query: 571  DLFWLPCDCVSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTC 750
            DLFWLPCDC +CVR+L    G  ++ NIYSPN SSTST V C++ LC + +RC++    C
Sbjct: 126  DLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDC 185

Query: 751  PYRVRYLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLF 930
            PY++RYLS+ TSSTG LV DVLHL  ++   KA+ A + FGCG VQTG F DGAAPNGLF
Sbjct: 186  PYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLF 245

Query: 931  GLGMEKVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYN 1110
            GLG+E +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN
Sbjct: 246  GLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYN 305

Query: 1111 ISITHVSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRV-SPNSDLPF 1287
            I++T +SV  N  D+   A+FDSGTSFTYLTD A+T+ISE+FNS+A +KR  + +S+LPF
Sbjct: 306  ITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPF 365

Query: 1288 EYCYALSPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIG 1467
            EYCYALSP + SF+   VNLT KGG    V  P+V I  ++   +YCL ++K EDI IIG
Sbjct: 366  EYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAIMKIEDISIIG 424

Query: 1468 QNFMTGYNIVFDREKKVLGWKASDCYDGNNSA-ALPINPS 1584
            QNFMTGY +VFDREK +LGWK SDCY G  SA  LP N S
Sbjct: 425  QNFMTGYRVVFDREKLILGWKESDCYTGETSARTLPSNRS 464


>dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  529 bits (1363), Expect = e-147
 Identities = 257/446 (57%), Positives = 326/446 (73%), Gaps = 3/446 (0%)
 Frame = +1

Query: 235  VLMIIICVILS-CKSVVAFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDS 411
            +LM++   +L  C+ +   G +G++ HHR+SD + G+L  DGLP + +  YY  MA RD 
Sbjct: 15   ILMLVSSWVLDRCEGL---GEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDR 71

Query: 412  LSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLP 588
            L R R LA ++  L++F DGN+T R+++LGFLHYANV+VGTPS WFLVALDTGSDLFWLP
Sbjct: 72   LIRGRRLASEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLP 131

Query: 589  CDC-VSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVR 765
            CDC  +CVR+L    G  ++ NIYSPN SSTS+ V C++ LC + +RC++    CPY++R
Sbjct: 132  CDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIR 191

Query: 766  YLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGME 945
            YLS+ TSSTG LV DVLHL       K + A I  GCG VQTG F DGAAPNGLFGLG+E
Sbjct: 192  YLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLE 251

Query: 946  KVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITH 1125
             +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN+++T 
Sbjct: 252  DISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQ 311

Query: 1126 VSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYAL 1305
            +SV  N  D+   A+FD+GTSFTYLTD  +T+ISE+FNS+A +KR   +S+LPFEYCYA+
Sbjct: 312  ISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAV 371

Query: 1306 SPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTG 1485
            SP + SF   +VNLT KGG    V  P++ +   E  V+YCL ++KSEDI IIGQNFMTG
Sbjct: 372  SPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPI-EDTVVYCLAIMKSEDISIIGQNFMTG 430

Query: 1486 YNIVFDREKKVLGWKASDCYDGNNSA 1563
            Y +VFDREK +LGWK SDC  G  SA
Sbjct: 431  YRVVFDREKLILGWKESDCSTGETSA 456


>ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
            1-like [Cucumis sativus]
          Length = 524

 Score =  507 bits (1305), Expect = e-141
 Identities = 245/437 (56%), Positives = 323/437 (73%), Gaps = 4/437 (0%)
 Frame = +1

Query: 286  FGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDSLSRRRHLADNTDL--LSF 459
            FG++ ++IHH YS +++ IL F   P++GT  YYAAM R D     R L    D   L+F
Sbjct: 32   FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPLTF 91

Query: 460  VDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLPCDCVSCVRDLNTSSGRQ 639
            + GN+T R+S LGFL+YA V+VGTP V +LVALDTGSDLFWLPCDCV+C+  LNT+ G  
Sbjct: 92   LSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQG-P 150

Query: 640  IEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVRYLSSNTSSTGFLVGDVLH 819
            + FNIYSPN SSTS  V+C + LC   ++CS+ SDTCPY+V YLS NTSSTG+LV D+LH
Sbjct: 151  VNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILH 210

Query: 820  LNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGMEKVSVPSILASEGLAADSF 999
            L  ++ Q K V A I  GCG  Q+G+FL  AAPNGLFGLG+E VSVPSILA+ GL ++SF
Sbjct: 211  LTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSF 270

Query: 1000 SMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITHVSVEENVTDINLSAIFDS 1179
            S+CFGP  +GRI+FGDKGS  Q+ETPFN+ + +PTYN+SIT + V  +++D++++ IFDS
Sbjct: 271  SLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVIFDS 330

Query: 1180 GTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYALSPKQYSFRVANVNLTTKG 1359
            GTSFTYL DPA+++ ++ F S+ +EK+ + NSD+PFE CY LSP Q +F    +NLT KG
Sbjct: 331  GTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKG 390

Query: 1360 GKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTGYNIVFDREKKVLGWKASD 1539
            G    +N P+V IST E   ++CL + +S+ I+IIGQNFMTGY+IVFDREK VLGWK S+
Sbjct: 391  GGHFVINHPIVLIST-ESKRLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESN 449

Query: 1540 C--YDGNNSAALPINPS 1584
            C  Y+  N+  LP+ P+
Sbjct: 450  CTGYEDENTNNLPVGPT 466


Top