BLASTX nr result
ID: Angelica22_contig00002689
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00002689
         (1918 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   583   e-164
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   549   e-153
ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha...   547   e-153
dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]     537   e-150
ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   513   e-143

>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score = 583 bits (1504), Expect = e-164
 Identities = 298/520 (57%), Positives = 373/520 (71%), Gaps = 10/520 (1%)
 Frame = -2

Query: 1746 LMIIICVILSCKSVVAFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDSLS 1567
            L++++ ++ S FGT+G+D+HHRYSD +KG+L D LPEKG+ HYYA+MA RD L
Sbjct: 22   LLLLVLMLSSSSFSYGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRDILI 81

Query: 1566 RRRHL-ADNTDL-LSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLPC 1393
            R L +DNT L+F GN+TYR SSLGFLHYANVS+GTPS+ +LVALDTGSDLFWLPC
Sbjct: 82   HGRKLVSDNTSTPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPC 141

Query: 1392 DCVS--CVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVR 1219
            DC + CV+ L SG QI+FNIY PN SSTS ++ C+N LC +++RC +A TCPY+V+
Sbjct: 142  DCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQ 201

Query: 1218 YLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGME 1039
            YLS+ TSSTG LV D+LHL D+ Q +A++A I FGCG VQTGSFLDGAAPNGLFGLGM
Sbjct: 202  YLSNGTSSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMT 261

Query: 1038 EKVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITH 859
            +SVPS LA EG ++SFSMCFG DGIGRI FGD GSS Q ETPFN+ QL+PTYN+SIT
Sbjct: 262  NISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITK 321

Query: 858  VSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYAL 679
            ++V D+ SAIFDSGTSFTYL DPA+T+ISE+FN A+EKR S SD+PFEYCY +
Sbjct: 322  INVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEM 381

Query: 678  SPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTG 499
            S Q + + VNL +GG Q V DP+V + Q GA IYCL +VKS D++IIGQNFMTG
Sbjct: 382  SSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMTG 441

Query: 498  YNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEPEAR--S 325
            Y IVF+RE+ VLGWKASDCYD ++ P++P +P P T+V P+A S
Sbjct: 442  YRIVFNRERNVLGWKASDCYDDMDTTTFPVDP---------ISPGIPPATAVNPQATAGS 492

Query: 324  GNRT---GSPHASPI-QPQPTLPSNHSGRSSSFRHALLMV 217
            GN T G+P P+ P LP + +S A++MV
Sbjct: 493  GNTTEVSGTP--PPVGNNAPKLP-----KLNSLTFAIIMV 525

>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score = 549 bits (1414), Expect = e-153
 Identities = 273/482 (56%), Positives = 345/482 (71%), Gaps = 6/482 (1%)
 Frame = -2

Query: 1758 SNHVLMIIICVILSCKSVV----AFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAA 1591
            S +L + + ++L+ V+ FG +G++ HHR+SD + G+L DGLP + + YY
Sbjct: 6    SCRILFLGLIILLASSWVLERCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRV 65

Query: 1590 MARRDSLSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGS 1414
            MA RD L R R LA ++ L++F DGN+T R+ +LGFLHYANV+VGTPS WFLVALDTGS
Sbjct: 66   MAHRDRLIRGRRLANEDQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGS 125

Query: 1413 DLFWLPCDCVSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTC 1234
            DLFWLPCDC +CVR+L G ++ NIYSPN SSTST V C++ LC + +RC++ C
Sbjct: 126  DLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNC 185

Query: 1233 PYRVRYLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLF 1054
            PY++RYLS+ TSSTG LV DVLHL ++ KA+ A + GCG VQTG F DGAAPNGLF
Sbjct: 186  PYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLF 245

Query: 1053 GLGMEKVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYN 874
            GLG+E +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN
Sbjct: 246  GLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYN 305

Query: 873  ISITHVSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRV-SPNSDLPF 697
            I++T +SVE N D+ A+FDSGTSFTYLTD A+T+ISE+FNS+A +KR + +S+LPF
Sbjct: 306  ITVTKISVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPF 365

Query: 696  EYCYALSPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIG 517
            EYCYALSP + SF+ VNLT KGG V P+V I ++ +YCL ++K EDI IIG
Sbjct: 366  EYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAILKIEDISIIG 424

Query: 516  QNFMTGYNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEP 337
            QNFMTGY +VFDREK +LGWK SDCY G SA R P S+ A P +S +P
Sbjct: 425  QNFMTGYRVVFDREKLILGWKESDCYTGETSA-------RTLPSNRSSSSARPPASSFDP 477

Query: 336  EA 331
            EA
Sbjct: 478  EA 479

>ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score = 547 bits (1410), Expect = e-153
 Identities = 272/482 (56%), Positives = 345/482 (71%), Gaps = 6/482 (1%)
 Frame = -2

Query: 1758 SNHVLMIIICVILSCKSVV----AFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAA 1591
            S +L + + ++L+ V+ FG +G++ HHR+SD + G+L DGLP + + YY
Sbjct: 6    SCRILFLGLLILLASSWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRV 65

Query: 1590 MARRDSLSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGS 1414
            MA RD L R R LA ++ L++F DGN+T R+ +LGFLHYANV+VGTPS WF+VALDTGS
Sbjct: 66   MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGS 125

Query: 1413 DLFWLPCDCVSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTC 1234
            DLFWLPCDC +CVR+L G ++ NIYSPN SSTST V C++ LC + +RC++ C
Sbjct: 126  DLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDC 185

Query: 1233 PYRVRYLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLF 1054
            PY++RYLS+ TSSTG LV DVLHL ++ KA+ A + FGCG VQTG F DGAAPNGLF
Sbjct: 186  PYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLF 245

Query: 1053 GLGMEKVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYN 874
            GLG+E +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN
Sbjct: 246  GLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYN 305

Query: 873  ISITHVSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRV-SPNSDLPF 697
            I++T +SV N D+ A+FDSGTSFTYLTD A+T+ISE+FNS+A +KR + +S+LPF
Sbjct: 306  ITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPF 365

Query: 696  EYCYALSPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIG 517
            EYCYALSP + SF+ VNLT KGG V P+V I ++ +YCL ++K EDI IIG
Sbjct: 366  EYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKD-TDVYCLAIMKIEDISIIG 424

Query: 516  QNFMTGYNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEP 337
            QNFMTGY +VFDREK +LGWK SDCY G SA R P S+ A P +S +P
Sbjct: 425  QNFMTGYRVVFDREKLILGWKESDCYTGETSA-------RTLPSNRSSSSARPPASSFDP 477

Query: 336  EA 331
            EA
Sbjct: 478  EA 479

>dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score = 537 bits (1384), Expect = e-150
 Identities = 266/476 (55%), Positives = 338/476 (71%), Gaps = 3/476 (0%)
 Frame = -2

Query: 1749 VLMIIICVILS-CKSVVAFGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDS 1573
            +LM++ +L C+ + G +G++ HHR+SD + G+L DGLP + + YY MA RD
Sbjct: 15   ILMLVSSWVLDRCEGL---GEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDR 71

Query: 1572 LSRRRHLA-DNTDLLSFVDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLP 1396
            L R R LA ++ L++F DGN+T R+++LGFLHYANV+VGTPS WFLVALDTGSDLFWLP
Sbjct: 72   LIRGRRLASEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLP 131

Query: 1395 CDC-VSCVRDLNTSSGRQIEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVR 1219
            CDC +CVR+L G ++ NIYSPN SSTS+ V C++ LC + +RC++ CPY++R
Sbjct: 132  CDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIR 191

Query: 1218 YLSSNTSSTGFLVGDVLHLNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGME 1039
            YLS+ TSSTG LV DVLHL K + A I GCG VQTG F DGAAPNGLFGLG+E
Sbjct: 192  YLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLE 251

Query: 1038 KVSVPSILASEGLAADSFSMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITH 859
            +SVPS+LA EG+AA+SFSMCFG DG GRI FGDKGS DQ ETP NI Q +PTYN+++T
Sbjct: 252  DISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQ 311

Query: 858  VSVEENVTDINLSAIFDSGTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYAL 679
            +SV N D+ A+FD+GTSFTYLTD +T+ISE+FNS+A +KR +S+LPFEYCYA+
Sbjct: 312  ISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAV 371

Query: 678  SPKQYSFRVANVNLTTKGGKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTG 499
            SP + SF +VNLT KGG V P++ + E V+YCL ++KSEDI IIGQNFMTG
Sbjct: 372  SPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPI-EDTVVYCLAIMKSEDISIIGQNFMTG 430

Query: 498  YNIVFDREKKVLGWKASDCYDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEPEA 331
            Y +VFDREK +LGWK SDC G SA R P S+ A P +S +PEA
Sbjct: 431  YRVVFDREKLILGWKESDCSTGETSA-------RTQPSNRSSSSARPPASSFDPEA 479

>ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein 1-like [Cucumis sativus]
          Length = 524

 Score = 513 bits (1322), Expect = e-143
 Identities = 254/462 (54%), Positives = 337/462 (72%), Gaps = 4/462 (0%)
 Frame = -2

Query: 1698 FGTYGYDIHHRYSDSLKGILDFDGLPEKGTYHYYAAMARRDSLSRRRHLADNTDL--LSF 1525
            FG++ ++IHH YS +++ IL F P++GT YYAAM R D R L D L+F
Sbjct: 32   FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPLTF 91

Query: 1524 VDGNQTYRLSSLGFLHYANVSVGTPSVWFLVALDTGSDLFWLPCDCVSCVRDLNTSSGRQ 1345
            + GN+T R+S LGFL+YA V+VGTP V +LVALDTGSDLFWLPCDCV+C+ LNT+ G
Sbjct: 92   LSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQG-P 150

Query: 1344 IEFNIYSPNISSTSTSVRCDNDLCEKKNRCSAASDTCPYRVRYLSSNTSSTGFLVGDVLH 1165
            + FNIYSPN SSTS V+C + LC ++CS+ SDTCPY+V YLS NTSSTG+LV D+LH
Sbjct: 151  VNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILH 210

Query: 1164 LNMDNNQQKAVEANIKFGCGTVQTGSFLDGAAPNGLFGLGMEKVSVPSILASEGLAADSF 985
            L ++ Q K V A I GCG Q+G+FL AAPNGLFGLG+E VSVPSILA+ GL ++SF
Sbjct: 211  LTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSF 270

Query: 984  SMCFGPDGIGRIKFGDKGSSDQSETPFNIDQLNPTYNISITHVSVEENVTDINLSAIFDS 805
            S+CFGP +GRI+FGDKGS Q+ETPFN+ + +PTYN+SIT + V +++D++++ IFDS
Sbjct: 271  SLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVIFDS 330

Query: 804  GTSFTYLTDPAFTIISENFNSIAQEKRVSPNSDLPFEYCYALSPKQYSFRVANVNLTTKG 625
            GTSFTYL DPA+++ ++ F S+ +EK+ + NSD+PFE CY LSP Q +F +NLT KG
Sbjct: 331  GTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKG 390

Query: 624  GKQIFVNDPVVYISTQEGAVIYCLGLVKSEDIDIIGQNFMTGYNIVFDREKKVLGWKASD 445
            G +N P+V IST E ++CL + +S+ I+IIGQNFMTGY+IVFDREK VLGWK S+
Sbjct: 391  GGHFVINHPIVLIST-ESKRLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESN 449

Query: 444  C--YDGNNSAALPINPSRGAPQPAKSTPATNPPTSVEPEARS 325
            C Y+ N+ LP+ P+ P PA + P T T+++P+A S
Sbjct: 450  CTGYEDENTNNLPVGPT---PTPA-AAPGT---TAIKPQANS 484