BLASTX nr result
ID: Bupleurum21_contig00010829
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Bupleurum21_contig00010829 (1548 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase ... 761 0.0 ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|2... 748 0.0 ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Ara... 733 0.0 dbj|BAB10683.1| lysosomal Pro-X carboxypeptidase [Arabidopsis th... 721 0.0 ref|XP_002864979.1| serine carboxypeptidase S28 family protein [... 714 0.0 >ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera] gi|296085719|emb|CBI29519.3| unnamed protein product [Vitis vinifera] Length = 510 Score = 761 bits (1964), Expect = 0.0 Identities = 346/470 (73%), Positives = 412/470 (87%) Frame = +1 Query: 139 SKHLPRFLGRFSKPNKPLIKNLQKYRYDTRYFDQSLDHFSFADLPKFRQRYLISSEHWSG 318 SK +PRFLG+F+ PN+ + ++Y+TRYF+Q LDHFS ADLPKFRQRYLIS+ HW+G Sbjct: 37 SKSIPRFLGKFAYPNRG-----KPFQYETRYFEQRLDHFSIADLPKFRQRYLISTRHWTG 91 Query: 319 PDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGESVPYGSSGEAYK 498 PD+ GPIF YCGNEG I+WFA NTGFVW++AP+FGAMV+FPEHRYYGES+PYGS +AY Sbjct: 92 PDRMGPIFLYCGNEGDIEWFAANTGFVWDMAPRFGAMVLFPEHRYYGESMPYGSRDKAYA 151 Query: 499 NASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAWMRLKYPHLSVG 678 NA++LSYLTAEQALAD+A+L+T+LK+NLSAE CPVVLFGGSYGGMLAAWMRLKYPH+++G Sbjct: 152 NAASLSYLTAEQALADFAVLVTNLKRNLSAEGCPVVLFGGSYGGMLAAWMRLKYPHIAIG 211 Query: 679 ALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSEGNKEDGLLQLT 858 ALASSAP+L+FEDIVP E+FYDIVSN FK ES+SCF+TIK SWD L SEG K DGL QLT Sbjct: 212 
ALASSAPILQFEDIVPPETFYDIVSNNFKRESISCFDTIKKSWDVLISEGQKNDGLKQLT 271 Query: 859 KTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEVCRKIDSCLGGT 1038 K F LCR L TEDL++W DSAY+ LAMVNYPYPS+FLMPLPG+PIKEVCRK+DSC GT Sbjct: 272 KAFRLCRDLKRTEDLYDWLDSAYSFLAMVNYPYPSDFLMPLPGHPIKEVCRKMDSCPEGT 331 Query: 1039 SILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSSNRYSSMFPEFH 1218 S+L+ IFEG++VYYNYTG V+CF LDDDPHG DGWNWQACTEMVMPM+S+R SSMFP + Sbjct: 332 SVLERIFEGVSVYYNYTGKVECFQLDDDPHGMDGWNWQACTEMVMPMASSRESSMFPTYD 391 Query: 1219 YNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGLLDPWSGGSVLE 1398 YN + ++EECW+DF V PRPTWITTEFGGH+FK L+ FGSNIIFSNGLLDPWSGGSVL+ Sbjct: 392 YNYSSFQEECWKDFSVKPRPTWITTEFGGHEFKTTLKVFGSNIIFSNGLLDPWSGGSVLQ 451 Query: 1399 DISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAYN 1548 +ISE++VALVT++GAHH+DLR++T EDP WL+EQR E+KLI GWIE Y+ Sbjct: 452 NISETVVALVTEEGAHHIDLRSSTAEDPDWLVEQRAFEVKLIKGWIEDYH 501 >ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|222853228|gb|EEE90775.1| predicted protein [Populus trichocarpa] Length = 515 Score = 748 bits (1930), Expect = 0.0 Identities = 342/470 (72%), Positives = 405/470 (86%) Frame = +1 Query: 136 SSKHLPRFLGRFSKPNKPLIKNLQKYRYDTRYFDQSLDHFSFADLPKFRQRYLISSEHWS 315 SSK PRFL + S P K ++ Q+YRY+++YF Q LDHFSF +LPKF QRYLI+++HW+ Sbjct: 36 SSKRAPRFLSKHSYPIKTQLQEQQQYRYESKYFYQQLDHFSFLNLPKFPQRYLINTDHWA 95 Query: 316 GPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGESVPYGSSGEAY 495 GP++ GPIF YCGNEG I+WFA NTGFVWE+AP FGAMV+FPEHRYYGES+PYG+ EAY Sbjct: 96 GPERRGPIFLYCGNEGDIEWFAVNTGFVWEIAPLFGAMVLFPEHRYYGESMPYGNREEAY 155 Query: 496 KNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAWMRLKYPHLSV 675 KNASTLSYLTAEQALAD+A+L+TDLK+NLSA+ACPVVLFGGSYGGMLAAWMRLKYPH+++ Sbjct: 156 KNASTLSYLTAEQALADFAVLITDLKRNLSAQACPVVLFGGSYGGMLAAWMRLKYPHVAI 215 Query: 676 GALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSEGNKEDGLLQL 855 GALASSAP+L+FEDIVP E+FY+IVSN FK ES SCFNTIK SWD L SEG K++GL+QL Sbjct: 216 GALASSAPILQFEDIVPPETFYNIVSNDFKRESTSCFNTIKESWDALLSEGLKKNGLVQL 275 Query: 856 
TKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEVCRKIDSCLGG 1035 TKTFHLCR+L STEDL NW DSAY+ LAMV+YPYPS F+MPLPG PI EVC++ID C G Sbjct: 276 TKTFHLCRELKSTEDLANWLDSAYSYLAMVDYPYPSSFMMPLPGYPIGEVCKRIDGCPDG 335 Query: 1036 TSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSSNRYSSMFPEF 1215 TSIL+ IFEG+++YYNYTG + CF LDDDPHG DGWNWQACTEMVMPMSS+ +SMFP + Sbjct: 336 TSILERIFEGISIYYNYTGELHCFELDDDPHGLDGWNWQACTEMVMPMSSSHNASMFPTY 395 Query: 1216 HYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGLLDPWSGGSVL 1395 +N + Y+E CWE+F V PRP WITTEFGG D K LE FGSNIIFSNGLLDPWSGGSVL Sbjct: 396 DFNYSSYQEGCWEEFGVIPRPRWITTEFGGQDIKTALETFGSNIIFSNGLLDPWSGGSVL 455 Query: 1396 EDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAY 1545 ++ISE++VALVT++GAHH+DLR +T EDP WL+EQRE E+KLI GWI+ Y Sbjct: 456 QNISETVVALVTEEGAHHIDLRPSTPEDPDWLVEQRETEVKLIKGWIDGY 505 >ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana] gi|95147306|gb|ABF57288.1| At5g65760 [Arabidopsis thaliana] gi|110736177|dbj|BAF00060.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana] gi|332010719|gb|AED98102.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana] Length = 515 Score = 733 bits (1891), Expect = 0.0 Identities = 336/480 (70%), Positives = 402/480 (83%), Gaps = 4/480 (0%) Frame = +1 Query: 118 NGLSIRSSKHLPRFLGRFSKPNKPLIKNLQ----KYRYDTRYFDQSLDHFSFADLPKFRQ 285 NG S+ SSK LPRF + + I+ + +YRY+T++F Q LDHFSFADLPKF Q Sbjct: 21 NGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQ 80 Query: 286 RYLISSEHWSGPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGES 465 RYLI+S+HW G GPIF YCGNEG I+WFA N+GF+W++AP+FGA++VFPEHRYYGES Sbjct: 81 RYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGES 140 Query: 466 VPYGSSGEAYKNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAW 645 +PYGS EAYKNA+TLSYLT EQALAD+A+ +TDLK+NLSAEACPVVLFGGSYGGMLAAW Sbjct: 141 MPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGMLAAW 200 Query: 646 MRLKYPHLSVGALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSE 825 MRLKYPH+++GALASSAP+L+FED+VP 
E+FYDI SN FK ES SCFNTIK SWD + +E Sbjct: 201 MRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAE 260 Query: 826 GNKEDGLLQLTKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEV 1005 G KE+GLLQLTKTFH CR LNST+DL +W DSAY+ LAMV+YPYP++F+MPLPG+PI+EV Sbjct: 261 GQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREV 320 Query: 1006 CRKIDSCLGGTSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSS 1185 CRKID SILD I+ G++VYYNYTG+VDCF LDDDPHG DGWNWQACTEMVMPMSS Sbjct: 321 CRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQACTEMVMPMSS 380 Query: 1186 NRYSSMFPEFHYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGL 1365 N+ +SMFP + +N + YKEECW F+V PRP W+TTEFGGHD L+ FGSNIIFSNGL Sbjct: 381 NQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEFGGHDIATTLKSFGSNIIFSNGL 440 Query: 1366 LDPWSGGSVLEDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAY 1545 LDPWSGGSVL+++S++IVALVT +GAHHLDLR +T EDP WL++QRE EI+LI GWIE Y Sbjct: 441 LDPWSGGSVLKNLSDTIVALVTKEGAHHLDLRPSTPEDPKWLVDQREAEIRLIQGWIETY 500 >dbj|BAB10683.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana] Length = 529 Score = 721 bits (1862), Expect = 0.0 Identities = 335/494 (67%), Positives = 402/494 (81%), Gaps = 18/494 (3%) Frame = +1 Query: 118 NGLSIRSSKHLPRFLGRFSKPNKPLIKNLQ----KYRYDTRYFDQSLDHFSFADLPKFRQ 285 NG S+ SSK LPRF + + I+ + +YRY+T++F Q LDHFSFADLPKF Q Sbjct: 21 NGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQ 80 Query: 286 RYLISSEHWSGPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGES 465 RYLI+S+HW G GPIF YCGNEG I+WFA N+GF+W++AP+FGA++VFPEHRYYGES Sbjct: 81 RYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGES 140 Query: 466 VPYGSSGEAYKNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGG----- 630 +PYGS EAYKNA+TLSYLT EQALAD+A+ +TDLK+NLSAEACPVVLFGGSYGG Sbjct: 141 MPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGSNNCV 200 Query: 631 ---------MLAAWMRLKYPHLSVGALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSC 783 +LAAWMRLKYPH+++GALASSAP+L+FED+VP E+FYDI SN FK ES SC Sbjct: 201 FVFVVIDATVLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESSSC 260 Query: 784 
FNTIKASWDTLFSEGNKEDGLLQLTKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPS 963 FNTIK SWD + +EG KE+GLLQLTKTFH CR LNST+DL +W DSAY+ LAMV+YPYP+ Sbjct: 261 FNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPA 320 Query: 964 EFLMPLPGNPIKEVCRKIDSCLGGTSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGW 1143 +F+MPLPG+PI+EVCRKID SILD I+ G++VYYNYTG+VDCF LDDDPHG DGW Sbjct: 321 DFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGW 380 Query: 1144 NWQACTEMVMPMSSNRYSSMFPEFHYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKAN 1323 NWQACTEMVMPMSSN+ +SMFP + +N + YKEECW F+V PRP W+TTEFGGHD Sbjct: 381 NWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEFGGHDIATT 440 Query: 1324 LEHFGSNIIFSNGLLDPWSGGSVLEDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQR 1503 L+ FGSNIIFSNGLLDPWSGGSVL+++S++IVALVT +GAHHLDLR +T EDP WL++QR Sbjct: 441 LKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVALVTKEGAHHLDLRPSTPEDPKWLVDQR 500 Query: 1504 EKEIKLIAGWIEAY 1545 E EI+LI GWIE Y Sbjct: 501 EAEIRLIQGWIETY 514 >ref|XP_002864979.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata subsp. lyrata] gi|297310814|gb|EFH41238.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata subsp. 
lyrata] Length = 514 Score = 714 bits (1843), Expect = 0.0 Identities = 332/480 (69%), Positives = 398/480 (82%), Gaps = 4/480 (0%) Frame = +1 Query: 118 NGLSIRSSKHLPRFLGRFSKPNKPLIKNLQ----KYRYDTRYFDQSLDHFSFADLPKFRQ 285 NG S+ SSK LPRF R++ N+ I+ + +YRY+T++F Q LDHFSFADLPKF Q Sbjct: 21 NGSSLSSSKLLPRF-PRYTSRNRGRIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFPQ 79 Query: 286 RYLISSEHWSGPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGES 465 RYLI+S++W G GPIF YCGNEG I+WFA N+GF+W++AP+FGA++VFPE R Sbjct: 80 RYLINSDYWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEVRSCLFC 139 Query: 466 VPYGSSGEAYKNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAW 645 +PYGS EAYKNA+TLSYLT EQALAD+A+ +TDLK+NLSAEACPVVLFGGSYGGMLAAW Sbjct: 140 MPYGSMEEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGMLAAW 199 Query: 646 MRLKYPHLSVGALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSE 825 MRLKYPH+++GALASSAP+L+FEDIVP E+FYDI SN FK ES SCFNTIK SWD + +E Sbjct: 200 MRLKYPHIAIGALASSAPILQFEDIVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAE 259 Query: 826 GNKEDGLLQLTKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEV 1005 G KE+GLLQLTKTFH CR LNST+DL +W DSAY+ LAMV+YPYP++F+MPLPG+PI+EV Sbjct: 260 GQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREV 319 Query: 1006 CRKIDSCLGGTSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSS 1185 CRKID SILD IF G++VYYNYTG+VDCF LDDDPHG DGWNWQACTEMVMPMSS Sbjct: 320 CRKIDGAHSDASILDRIFAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQACTEMVMPMSS 379 Query: 1186 NRYSSMFPEFHYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGL 1365 N+ SMFP + +N + YKEECW F+V PRP W+TTEFGGHD + L+ FGSNIIFSNG+ Sbjct: 380 NQEKSMFPAYDFNYSSYKEECWNTFRVNPRPKWVTTEFGGHDIETTLKLFGSNIIFSNGM 439 Query: 1366 LDPWSGGSVLEDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAY 1545 LDPWSGGSVL+++S +IVALVT +GAHHLDLR +T EDP WL++QRE EI+LI GWIE Y Sbjct: 440 LDPWSGGSVLKNLSNTIVALVTKEGAHHLDLRPSTPEDPKWLVDQREAEIQLIQGWIETY 499
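
The per-hit statistics in the report above (bit score, E-value, identities, positives, gaps) can be pulled out of the flattened text with a regular expression. The following is a minimal sketch, not part of the BLAST distribution: the function name `hit_stats`, the dictionary keys, and the `sample` string are assumptions made for illustration, and the pattern is only known to fit the layout seen in this record.

```python
import re

# Pattern for the per-hit statistics as they appear in this flattened report.
# "Gaps" is optional because it is absent for alignments without gaps.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[^\s,]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
)


def hit_stats(report):
    """Return one dict per alignment found in the report text (hypothetical helper)."""
    hits = []
    for m in STATS.finditer(report):
        ident, length, pos = int(m["ident"]), int(m["length"]), int(m["pos"])
        evalue = m["evalue"]
        if evalue.startswith("e"):
            # Legacy BLAST may print values such as "e-100"; make them parseable.
            evalue = "1" + evalue
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "positives": pos,
            "align_len": length,
            "gaps": int(m["gaps"] or 0),
            # Recomputed from the counts rather than read from the "(73%)" field.
            "pct_identity": 100 * ident / length,
            "pct_positive": 100 * pos / length,
        })
    return hits


# Two statistics lines copied from the report above (first and third hit).
sample = (
    "Score = 761 bits (1964), Expect = 0.0 Identities = 346/470 (73%), "
    "Positives = 412/470 (87%) Frame = +1 "
    "Score = 733 bits (1891), Expect = 0.0 Identities = 336/480 (70%), "
    "Positives = 402/480 (83%), Gaps = 4/480 (0%) Frame = +1"
)

for hit in hit_stats(sample):
    print(hit["bits"], hit["evalue"], round(hit["pct_identity"], 1), hit["gaps"])
```

Recomputing the percentages from the counts keeps the fractional part that the report's whole-number percentages drop. For anything beyond a quick check, a dedicated BLAST parser run on the original, unflattened output is likely the safer choice.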