BLASTX nr result

ID: Bupleurum21_contig00010829 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00010829
         (1548 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase ...   761   0.0  
ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|2...   748   0.0  
ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Ara...   733   0.0  
dbj|BAB10683.1| lysosomal Pro-X carboxypeptidase [Arabidopsis th...   721   0.0  
ref|XP_002864979.1| serine carboxypeptidase S28 family protein [...   714   0.0  

>ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
            gi|296085719|emb|CBI29519.3| unnamed protein product
            [Vitis vinifera]
          Length = 510

 Score =  761 bits (1964), Expect = 0.0
 Identities = 346/470 (73%), Positives = 412/470 (87%)
 Frame = +1

Query: 139  SKHLPRFLGRFSKPNKPLIKNLQKYRYDTRYFDQSLDHFSFADLPKFRQRYLISSEHWSG 318
            SK +PRFLG+F+ PN+      + ++Y+TRYF+Q LDHFS ADLPKFRQRYLIS+ HW+G
Sbjct: 37   SKSIPRFLGKFAYPNRG-----KPFQYETRYFEQRLDHFSIADLPKFRQRYLISTRHWTG 91

Query: 319  PDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGESVPYGSSGEAYK 498
            PD+ GPIF YCGNEG I+WFA NTGFVW++AP+FGAMV+FPEHRYYGES+PYGS  +AY 
Sbjct: 92   PDRMGPIFLYCGNEGDIEWFAANTGFVWDMAPRFGAMVLFPEHRYYGESMPYGSRDKAYA 151

Query: 499  NASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAWMRLKYPHLSVG 678
            NA++LSYLTAEQALAD+A+L+T+LK+NLSAE CPVVLFGGSYGGMLAAWMRLKYPH+++G
Sbjct: 152  NAASLSYLTAEQALADFAVLVTNLKRNLSAEGCPVVLFGGSYGGMLAAWMRLKYPHIAIG 211

Query: 679  ALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSEGNKEDGLLQLT 858
            ALASSAP+L+FEDIVP E+FYDIVSN FK ES+SCF+TIK SWD L SEG K DGL QLT
Sbjct: 212  ALASSAPILQFEDIVPPETFYDIVSNNFKRESISCFDTIKKSWDVLISEGQKNDGLKQLT 271

Query: 859  KTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEVCRKIDSCLGGT 1038
            K F LCR L  TEDL++W DSAY+ LAMVNYPYPS+FLMPLPG+PIKEVCRK+DSC  GT
Sbjct: 272  KAFRLCRDLKRTEDLYDWLDSAYSFLAMVNYPYPSDFLMPLPGHPIKEVCRKMDSCPEGT 331

Query: 1039 SILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSSNRYSSMFPEFH 1218
            S+L+ IFEG++VYYNYTG V+CF LDDDPHG DGWNWQACTEMVMPM+S+R SSMFP + 
Sbjct: 332  SVLERIFEGVSVYYNYTGKVECFQLDDDPHGMDGWNWQACTEMVMPMASSRESSMFPTYD 391

Query: 1219 YNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGLLDPWSGGSVLE 1398
            YN + ++EECW+DF V PRPTWITTEFGGH+FK  L+ FGSNIIFSNGLLDPWSGGSVL+
Sbjct: 392  YNYSSFQEECWKDFSVKPRPTWITTEFGGHEFKTTLKVFGSNIIFSNGLLDPWSGGSVLQ 451

Query: 1399 DISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAYN 1548
            +ISE++VALVT++GAHH+DLR++T EDP WL+EQR  E+KLI GWIE Y+
Sbjct: 452  NISETVVALVTEEGAHHIDLRSSTAEDPDWLVEQRAFEVKLIKGWIEDYH 501


>ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|222853228|gb|EEE90775.1|
            predicted protein [Populus trichocarpa]
          Length = 515

 Score =  748 bits (1930), Expect = 0.0
 Identities = 342/470 (72%), Positives = 405/470 (86%)
 Frame = +1

Query: 136  SSKHLPRFLGRFSKPNKPLIKNLQKYRYDTRYFDQSLDHFSFADLPKFRQRYLISSEHWS 315
            SSK  PRFL + S P K  ++  Q+YRY+++YF Q LDHFSF +LPKF QRYLI+++HW+
Sbjct: 36   SSKRAPRFLSKHSYPIKTQLQEQQQYRYESKYFYQQLDHFSFLNLPKFPQRYLINTDHWA 95

Query: 316  GPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGESVPYGSSGEAY 495
            GP++ GPIF YCGNEG I+WFA NTGFVWE+AP FGAMV+FPEHRYYGES+PYG+  EAY
Sbjct: 96   GPERRGPIFLYCGNEGDIEWFAVNTGFVWEIAPLFGAMVLFPEHRYYGESMPYGNREEAY 155

Query: 496  KNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAWMRLKYPHLSV 675
            KNASTLSYLTAEQALAD+A+L+TDLK+NLSA+ACPVVLFGGSYGGMLAAWMRLKYPH+++
Sbjct: 156  KNASTLSYLTAEQALADFAVLITDLKRNLSAQACPVVLFGGSYGGMLAAWMRLKYPHVAI 215

Query: 676  GALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSEGNKEDGLLQL 855
            GALASSAP+L+FEDIVP E+FY+IVSN FK ES SCFNTIK SWD L SEG K++GL+QL
Sbjct: 216  GALASSAPILQFEDIVPPETFYNIVSNDFKRESTSCFNTIKESWDALLSEGLKKNGLVQL 275

Query: 856  TKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEVCRKIDSCLGG 1035
            TKTFHLCR+L STEDL NW DSAY+ LAMV+YPYPS F+MPLPG PI EVC++ID C  G
Sbjct: 276  TKTFHLCRELKSTEDLANWLDSAYSYLAMVDYPYPSSFMMPLPGYPIGEVCKRIDGCPDG 335

Query: 1036 TSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSSNRYSSMFPEF 1215
            TSIL+ IFEG+++YYNYTG + CF LDDDPHG DGWNWQACTEMVMPMSS+  +SMFP +
Sbjct: 336  TSILERIFEGISIYYNYTGELHCFELDDDPHGLDGWNWQACTEMVMPMSSSHNASMFPTY 395

Query: 1216 HYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGLLDPWSGGSVL 1395
             +N + Y+E CWE+F V PRP WITTEFGG D K  LE FGSNIIFSNGLLDPWSGGSVL
Sbjct: 396  DFNYSSYQEGCWEEFGVIPRPRWITTEFGGQDIKTALETFGSNIIFSNGLLDPWSGGSVL 455

Query: 1396 EDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAY 1545
            ++ISE++VALVT++GAHH+DLR +T EDP WL+EQRE E+KLI GWI+ Y
Sbjct: 456  QNISETVVALVTEEGAHHIDLRPSTPEDPDWLVEQRETEVKLIKGWIDGY 505


>ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
            gi|95147306|gb|ABF57288.1| At5g65760 [Arabidopsis
            thaliana] gi|110736177|dbj|BAF00060.1| lysosomal Pro-X
            carboxypeptidase [Arabidopsis thaliana]
            gi|332010719|gb|AED98102.1| Serine carboxypeptidase S28
            family protein [Arabidopsis thaliana]
          Length = 515

 Score =  733 bits (1891), Expect = 0.0
 Identities = 336/480 (70%), Positives = 402/480 (83%), Gaps = 4/480 (0%)
 Frame = +1

Query: 118  NGLSIRSSKHLPRFLGRFSKPNKPLIKNLQ----KYRYDTRYFDQSLDHFSFADLPKFRQ 285
            NG S+ SSK LPRF     +  +  I+  +    +YRY+T++F Q LDHFSFADLPKF Q
Sbjct: 21   NGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQ 80

Query: 286  RYLISSEHWSGPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGES 465
            RYLI+S+HW G    GPIF YCGNEG I+WFA N+GF+W++AP+FGA++VFPEHRYYGES
Sbjct: 81   RYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGES 140

Query: 466  VPYGSSGEAYKNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAW 645
            +PYGS  EAYKNA+TLSYLT EQALAD+A+ +TDLK+NLSAEACPVVLFGGSYGGMLAAW
Sbjct: 141  MPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGMLAAW 200

Query: 646  MRLKYPHLSVGALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSE 825
            MRLKYPH+++GALASSAP+L+FED+VP E+FYDI SN FK ES SCFNTIK SWD + +E
Sbjct: 201  MRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAE 260

Query: 826  GNKEDGLLQLTKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEV 1005
            G KE+GLLQLTKTFH CR LNST+DL +W DSAY+ LAMV+YPYP++F+MPLPG+PI+EV
Sbjct: 261  GQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREV 320

Query: 1006 CRKIDSCLGGTSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSS 1185
            CRKID      SILD I+ G++VYYNYTG+VDCF LDDDPHG DGWNWQACTEMVMPMSS
Sbjct: 321  CRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQACTEMVMPMSS 380

Query: 1186 NRYSSMFPEFHYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGL 1365
            N+ +SMFP + +N + YKEECW  F+V PRP W+TTEFGGHD    L+ FGSNIIFSNGL
Sbjct: 381  NQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEFGGHDIATTLKSFGSNIIFSNGL 440

Query: 1366 LDPWSGGSVLEDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAY 1545
            LDPWSGGSVL+++S++IVALVT +GAHHLDLR +T EDP WL++QRE EI+LI GWIE Y
Sbjct: 441  LDPWSGGSVLKNLSDTIVALVTKEGAHHLDLRPSTPEDPKWLVDQREAEIRLIQGWIETY 500


>dbj|BAB10683.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana]
          Length = 529

 Score =  721 bits (1862), Expect = 0.0
 Identities = 335/494 (67%), Positives = 402/494 (81%), Gaps = 18/494 (3%)
 Frame = +1

Query: 118  NGLSIRSSKHLPRFLGRFSKPNKPLIKNLQ----KYRYDTRYFDQSLDHFSFADLPKFRQ 285
            NG S+ SSK LPRF     +  +  I+  +    +YRY+T++F Q LDHFSFADLPKF Q
Sbjct: 21   NGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQ 80

Query: 286  RYLISSEHWSGPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGES 465
            RYLI+S+HW G    GPIF YCGNEG I+WFA N+GF+W++AP+FGA++VFPEHRYYGES
Sbjct: 81   RYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGES 140

Query: 466  VPYGSSGEAYKNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGG----- 630
            +PYGS  EAYKNA+TLSYLT EQALAD+A+ +TDLK+NLSAEACPVVLFGGSYGG     
Sbjct: 141  MPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGSNNCV 200

Query: 631  ---------MLAAWMRLKYPHLSVGALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSC 783
                     +LAAWMRLKYPH+++GALASSAP+L+FED+VP E+FYDI SN FK ES SC
Sbjct: 201  FVFVVIDATVLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESSSC 260

Query: 784  FNTIKASWDTLFSEGNKEDGLLQLTKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPS 963
            FNTIK SWD + +EG KE+GLLQLTKTFH CR LNST+DL +W DSAY+ LAMV+YPYP+
Sbjct: 261  FNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPA 320

Query: 964  EFLMPLPGNPIKEVCRKIDSCLGGTSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGW 1143
            +F+MPLPG+PI+EVCRKID      SILD I+ G++VYYNYTG+VDCF LDDDPHG DGW
Sbjct: 321  DFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGW 380

Query: 1144 NWQACTEMVMPMSSNRYSSMFPEFHYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKAN 1323
            NWQACTEMVMPMSSN+ +SMFP + +N + YKEECW  F+V PRP W+TTEFGGHD    
Sbjct: 381  NWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEFGGHDIATT 440

Query: 1324 LEHFGSNIIFSNGLLDPWSGGSVLEDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQR 1503
            L+ FGSNIIFSNGLLDPWSGGSVL+++S++IVALVT +GAHHLDLR +T EDP WL++QR
Sbjct: 441  LKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVALVTKEGAHHLDLRPSTPEDPKWLVDQR 500

Query: 1504 EKEIKLIAGWIEAY 1545
            E EI+LI GWIE Y
Sbjct: 501  EAEIRLIQGWIETY 514


>ref|XP_002864979.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata subsp.
            lyrata] gi|297310814|gb|EFH41238.1| serine
            carboxypeptidase S28 family protein [Arabidopsis lyrata
            subsp. lyrata]
          Length = 514

 Score =  714 bits (1843), Expect = 0.0
 Identities = 332/480 (69%), Positives = 398/480 (82%), Gaps = 4/480 (0%)
 Frame = +1

Query: 118  NGLSIRSSKHLPRFLGRFSKPNKPLIKNLQ----KYRYDTRYFDQSLDHFSFADLPKFRQ 285
            NG S+ SSK LPRF  R++  N+  I+  +    +YRY+T++F Q LDHFSFADLPKF Q
Sbjct: 21   NGSSLSSSKLLPRF-PRYTSRNRGRIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFPQ 79

Query: 286  RYLISSEHWSGPDKAGPIFFYCGNEGYIDWFADNTGFVWELAPQFGAMVVFPEHRYYGES 465
            RYLI+S++W G    GPIF YCGNEG I+WFA N+GF+W++AP+FGA++VFPE R     
Sbjct: 80   RYLINSDYWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEVRSCLFC 139

Query: 466  VPYGSSGEAYKNASTLSYLTAEQALADYAILLTDLKKNLSAEACPVVLFGGSYGGMLAAW 645
            +PYGS  EAYKNA+TLSYLT EQALAD+A+ +TDLK+NLSAEACPVVLFGGSYGGMLAAW
Sbjct: 140  MPYGSMEEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGMLAAW 199

Query: 646  MRLKYPHLSVGALASSAPVLEFEDIVPEESFYDIVSNVFKHESVSCFNTIKASWDTLFSE 825
            MRLKYPH+++GALASSAP+L+FEDIVP E+FYDI SN FK ES SCFNTIK SWD + +E
Sbjct: 200  MRLKYPHIAIGALASSAPILQFEDIVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAE 259

Query: 826  GNKEDGLLQLTKTFHLCRKLNSTEDLFNWWDSAYTSLAMVNYPYPSEFLMPLPGNPIKEV 1005
            G KE+GLLQLTKTFH CR LNST+DL +W DSAY+ LAMV+YPYP++F+MPLPG+PI+EV
Sbjct: 260  GQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREV 319

Query: 1006 CRKIDSCLGGTSILDCIFEGLNVYYNYTGSVDCFNLDDDPHGEDGWNWQACTEMVMPMSS 1185
            CRKID      SILD IF G++VYYNYTG+VDCF LDDDPHG DGWNWQACTEMVMPMSS
Sbjct: 320  CRKIDGAHSDASILDRIFAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQACTEMVMPMSS 379

Query: 1186 NRYSSMFPEFHYNDTQYKEECWEDFKVTPRPTWITTEFGGHDFKANLEHFGSNIIFSNGL 1365
            N+  SMFP + +N + YKEECW  F+V PRP W+TTEFGGHD +  L+ FGSNIIFSNG+
Sbjct: 380  NQEKSMFPAYDFNYSSYKEECWNTFRVNPRPKWVTTEFGGHDIETTLKLFGSNIIFSNGM 439

Query: 1366 LDPWSGGSVLEDISESIVALVTDKGAHHLDLRAATNEDPTWLLEQREKEIKLIAGWIEAY 1545
            LDPWSGGSVL+++S +IVALVT +GAHHLDLR +T EDP WL++QRE EI+LI GWIE Y
Sbjct: 440  LDPWSGGSVLKNLSNTIVALVTKEGAHHLDLRPSTPEDPKWLVDQREAEIQLIQGWIETY 499


Top