BLASTX nr result

ID: Angelica22_contig00002689 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00002689
         (1918 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   583   e-164
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   549   e-153
ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha...   547   e-153
dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]     537   e-150
ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   513   e-143

>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  583 bits (1504), Expect = e-164
 Identities = 298/520 (57%), Positives = 373/520 (71%), Gaps = 10/520 (1%)
 Frame = -2

Query: 1746 LMIIICVILSCKSVVAFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDSLS 1567
            L++++ ++ S      FGT+G+D+HHRYSD +KG+L  D LPEKG+ HYYA+MA RD L 
Sbjct: 22   LLLLVLMLSSSSFSYGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRDILI 81

Query: 1566 RRRHL-ADNTDL-LSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLPC 1393
              R L +DNT   L+F  GN+TYR SSLGFLHYANVS+GTPS+ +LVALDTGSDLFWLPC
Sbjct: 82   HGRKLVSDNTSTPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPC 141

Query: 1392 DCVS--CVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVR 1219
            DC +  CV+ L   SG QI+FNIY PN SSTS ++ C+N LC +++RC +A  TCPY+V+
Sbjct: 142  DCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQ 201

Query: 1218 YLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGME 1039
            YLS+ TSSTG LV D+LHL  D+ Q +A++A I FGCG VQTGSFLDGAAPNGLFGLGM 
Sbjct: 202  YLSNGTSSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMT 261

Query: 1038 KVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITH 859
             +SVPS LA EG  ++SFSMCFG DGIGRI FGD GSS Q ETPFN+ QL+PTYN+SIT 
Sbjct: 262  NISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITK 321

Query: 858  VSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYAL 679
            ++V     D+  SAIFDSGTSFTYL DPA+T+ISE+FN  A+EKR S  SD+PFEYCY +
Sbjct: 322  INVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEM 381

Query: 678  SPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTG 499
            S  Q +  +  VNL  +GG Q  V DP+V +  Q GA IYCL +VKS D++IIGQNFMTG
Sbjct: 382  SSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMTG 441

Query: 498  YNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEPEAR--S 325
            Y IVF+RE+ VLGWKASDCYD  ++   P++P          +P   P T+V P+A   S
Sbjct: 442  YRIVFNRERNVLGWKASDCYDDMDTTTFPVDP---------ISPGIPPATAVNPQATAGS 492

Query: 324  GNRT---GSPHASPI-QPQPTLPSNHSGRSSSFRHALLMV 217
            GN T   G+P   P+    P LP     + +S   A++MV
Sbjct: 493  GNTTEVSGTP--PPVGNNAPKLP-----KLNSLTFAIIMV 525


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  549 bits (1414), Expect = e-153
 Identities = 273/482 (56%), Positives = 345/482 (71%), Gaps = 6/482 (1%)
 Frame = -2

Query: 1758 SNHVLMIIICVILSCKSVV----AFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAA 1591
            S  +L + + ++L+   V+     FG +G++ HHR+SD + G+L  DGLP + +  YY  
Sbjct: 6    SCRILFLGLIILLASSWVLERCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRV 65

Query: 1590 MARRDSLSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGS 1414
            MA RD L R R LA ++  L++F DGN+T R+ +LGFLHYANV+VGTPS WFLVALDTGS
Sbjct: 66   MAHRDRLIRGRRLANEDQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGS 125

Query: 1413 DLFWLPCDCVSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTC 1234
            DLFWLPCDC +CVR+L    G  ++ NIYSPN SSTST V C++ LC + +RC++    C
Sbjct: 126  DLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNC 185

Query: 1233 PYRVRYLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLF 1054
            PY++RYLS+ TSSTG LV DVLHL  ++   KA+ A +  GCG VQTG F DGAAPNGLF
Sbjct: 186  PYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLF 245

Query: 1053 GLGMEKVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYN 874
            GLG+E +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN
Sbjct: 246  GLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYN 305

Query: 873  ISITHVSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRV-SPNSDLPF 697
            I++T +SVE N  D+   A+FDSGTSFTYLTD A+T+ISE+FNS+A +KR  + +S+LPF
Sbjct: 306  ITVTKISVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPF 365

Query: 696  EYCYALSPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIG 517
            EYCYALSP + SF+   VNLT KGG    V  P+V I  ++   +YCL ++K EDI IIG
Sbjct: 366  EYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAILKIEDISIIG 424

Query: 516  QNFMTGYNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEP 337
            QNFMTGY +VFDREK +LGWK SDCY G  SA       R  P    S+ A  P +S +P
Sbjct: 425  QNFMTGYRVVFDREKLILGWKESDCYTGETSA-------RTLPSNRSSSSARPPASSFDP 477

Query: 336  EA 331
            EA
Sbjct: 478  EA 479


>ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  547 bits (1410), Expect = e-153
 Identities = 272/482 (56%), Positives = 345/482 (71%), Gaps = 6/482 (1%)
 Frame = -2

Query: 1758 SNHVLMIIICVILSCKSVV----AFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAA 1591
            S  +L + + ++L+   V+     FG +G++ HHR+SD + G+L  DGLP + +  YY  
Sbjct: 6    SCRILFLGLLILLASSWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRV 65

Query: 1590 MARRDSLSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGS 1414
            MA RD L R R LA ++  L++F DGN+T R+ +LGFLHYANV+VGTPS WF+VALDTGS
Sbjct: 66   MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGS 125

Query: 1413 DLFWLPCDCVSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTC 1234
            DLFWLPCDC +CVR+L    G  ++ NIYSPN SSTST V C++ LC + +RC++    C
Sbjct: 126  DLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDC 185

Query: 1233 PYRVRYLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLF 1054
            PY++RYLS+ TSSTG LV DVLHL  ++   KA+ A + FGCG VQTG F DGAAPNGLF
Sbjct: 186  PYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLF 245

Query: 1053 GLGMEKVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYN 874
            GLG+E +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN
Sbjct: 246  GLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYN 305

Query: 873  ISITHVSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRV-SPNSDLPF 697
            I++T +SV  N  D+   A+FDSGTSFTYLTD A+T+ISE+FNS+A +KR  + +S+LPF
Sbjct: 306  ITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPF 365

Query: 696  EYCYALSPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIG 517
            EYCYALSP + SF+   VNLT KGG    V  P+V I  ++   +YCL ++K EDI IIG
Sbjct: 366  EYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAIMKIEDISIIG 424

Query: 516  QNFMTGYNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEP 337
            QNFMTGY +VFDREK +LGWK SDCY G  SA       R  P    S+ A  P +S +P
Sbjct: 425  QNFMTGYRVVFDREKLILGWKESDCYTGETSA-------RTLPSNRSSSSARPPASSFDP 477

Query: 336  EA 331
            EA
Sbjct: 478  EA 479


>dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  537 bits (1384), Expect = e-150
 Identities = 266/476 (55%), Positives = 338/476 (71%), Gaps = 3/476 (0%)
 Frame = -2

Query: 1749 VLMIIICVILS-CKSVVAFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDS 1573
            +LM++   +L  C+ +   G +G++ HHR+SD + G+L  DGLP + +  YY  MA RD 
Sbjct: 15   ILMLVSSWVLDRCEGL---GEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDR 71

Query: 1572 LSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLP 1396
            L R R LA ++  L++F DGN+T R+++LGFLHYANV+VGTPS WFLVALDTGSDLFWLP
Sbjct: 72   LIRGRRLASEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLP 131

Query: 1395 CDC-VSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVR 1219
            CDC  +CVR+L    G  ++ NIYSPN SSTS+ V C++ LC + +RC++    CPY++R
Sbjct: 132  CDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIR 191

Query: 1218 YLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGME 1039
            YLS+ TSSTG LV DVLHL       K + A I  GCG VQTG F DGAAPNGLFGLG+E
Sbjct: 192  YLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLE 251

Query: 1038 KVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITH 859
             +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN+++T 
Sbjct: 252  DISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQ 311

Query: 858  VSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYAL 679
            +SV  N  D+   A+FD+GTSFTYLTD  +T+ISE+FNS+A +KR   +S+LPFEYCYA+
Sbjct: 312  ISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAV 371

Query: 678  SPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTG 499
            SP + SF   +VNLT KGG    V  P++ +   E  V+YCL ++KSEDI IIGQNFMTG
Sbjct: 372  SPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPI-EDTVVYCLAIMKSEDISIIGQNFMTG 430

Query: 498  YNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEPEA 331
            Y +VFDREK +LGWK SDC  G  SA       R  P    S+ A  P +S +PEA
Sbjct: 431  YRVVFDREKLILGWKESDCSTGETSA-------RTQPSNRSSSSARPPASSFDPEA 479


>ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
            1-like [Cucumis sativus]
          Length = 524

 Score =  513 bits (1322), Expect = e-143
 Identities = 254/462 (54%), Positives = 337/462 (72%), Gaps = 4/462 (0%)
 Frame = -2

Query: 1698 FGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDSLSRRRHLADNTDL--LSF 1525
            FG++ ++IHH YS +++ IL F   P++GT  YYAAM R D     R L    D   L+F
Sbjct: 32   FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPLTF 91

Query: 1524 VDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLPCDCVSCVRDLNTSSGRQ 1345
            + GN+T R+S LGFL+YA V+VGTP V +LVALDTGSDLFWLPCDCV+C+  LNT+ G  
Sbjct: 92   LSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQG-P 150

Query: 1344 IEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVRYLSSNTSSTGFLVGDVLH 1165
            + FNIYSPN SSTS  V+C + LC   ++CS+ SDTCPY+V YLS NTSSTG+LV D+LH
Sbjct: 151  VNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILH 210

Query: 1164 LNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGMEKVSVPSILASEGLAADSF 985
            L  ++ Q K V A I  GCG  Q+G+FL  AAPNGLFGLG+E VSVPSILA+ GL ++SF
Sbjct: 211  LTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSF 270

Query: 984  SMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITHVSVEENVTDINLSAIFDS 805
            S+CFGP  +GRI+FGDKGS  Q+ETPFN+ + +PTYN+SIT + V  +++D++++ IFDS
Sbjct: 271  SLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVIFDS 330

Query: 804  GTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYALSPKQYSFRVANVNLTTKG 625
            GTSFTYL DPA+++ ++ F S+ +EK+ + NSD+PFE CY LSP Q +F    +NLT KG
Sbjct: 331  GTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKG 390

Query: 624  GKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTGYNIVFDREKKVLGWKASD 445
            G    +N P+V IST E   ++CL + +S+ I+IIGQNFMTGY+IVFDREK VLGWK S+
Sbjct: 391  GGHFVINHPIVLIST-ESKRLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESN 449

Query: 444  C--YDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEPEARS 325
            C  Y+  N+  LP+ P+   P PA + P T   T+++P+A S
Sbjct: 450  CTGYEDENTNNLPVGPT---PTPA-AAPGT---TAIKPQANS 484

