BLASTX nr result

ID: Glycyrrhiza24_contig00012961 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00012961
         (1764 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003549991.1| PREDICTED: lysosomal Pro-X carboxypeptidase-...   879   0.0  
ref|XP_003525880.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pr...   823   0.0  
ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|2...   774   0.0  
ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase ...   772   0.0  
ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Ara...   736   0.0  

>ref|XP_003549991.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
          Length = 513

 Score =  879 bits (2270), Expect = 0.0
 Identities = 419/506 (82%), Positives = 453/506 (89%), Gaps = 1/506 (0%)
 Frame = -2

Query: 1724 RFSTLVF-FTVAVIVVSCSQQPLAFKHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQ 1548
            R STLVF  ++ +IV+S   QPLA  H+P+FL KFA               F+YE R+FQ
Sbjct: 9    RMSTLVFTLSIIIIVLSYPAQPLALNHSPKFLGKFAATARTHSNSEPPPQ-FHYEKRYFQ 67

Query: 1547 QHLDHFSFSELPTFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQ 1368
            Q LDHFSFSELPTFPQRYLI+TEHWVGP R GPIFFYCGNEGDI WFAQNTGFVWEIAP+
Sbjct: 68   QRLDHFSFSELPTFPQRYLISTEHWVGPHRLGPIFFYCGNEGDIEWFAQNTGFVWEIAPR 127

Query: 1367 FGAMVVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSANHC 1188
            FGAMVVFPEHRYYGESVPYGS EEAYKNATTLSYLTAEQALADFSVL+T LK N+SA  C
Sbjct: 128  FGAMVVFPEHRYYGESVPYGSAEEAYKNATTLSYLTAEQALADFSVLITYLKHNYSAKDC 187

Query: 1187 PVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRESS 1008
            PVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYD+VSN F+RES 
Sbjct: 188  PVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDLVSNAFKRESF 247

Query: 1007 TCFNYIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPY 828
            TCFNYIKQSW++IA+ G+T+NGL  LTKTFNLC KLKRT+DL+ W E+AYSYLAMVNYPY
Sbjct: 248  TCFNYIKQSWNEIASTGQTNNGLELLTKTFNLCQKLKRTKDLYDWAEAAYSYLAMVNYPY 307

Query: 827  PSEFMMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLS 648
            P+EFMM LP HPIREVC+RIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHG+S
Sbjct: 308  PAEFMMTLPEHPIREVCRRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGMS 367

Query: 647  GWNWQACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIH 468
            GW WQACTEMVMPMSSSQESSMFPPYEYNY+S Q +CLK FGVKPRP+WITTEFGGH+IH
Sbjct: 368  GWEWQACTEMVMPMSSSQESSMFPPYEYNYTSIQAECLKKFGVKPRPRWITTEFGGHDIH 427

Query: 467  ATLKKFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWLVE 288
            ATLKKFGSNIIFSNGLLDPWSGG VLQNISES+V+LVTEEGAHHIDLR+ST NDPDWLVE
Sbjct: 428  ATLKKFGSNIIFSNGLLDPWSGGGVLQNISESVVSLVTEEGAHHIDLRSSTKNDPDWLVE 487

Query: 287  QRATEIKLIQGWISDYHQKNKAAFDM 210
            QR TEIKLI+GWISDYHQKN+A FDM
Sbjct: 488  QRETEIKLIEGWISDYHQKNEAMFDM 513


>ref|XP_003525880.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase-like
            [Glycine max]
          Length = 597

 Score =  823 bits (2126), Expect = 0.0
 Identities = 394/502 (78%), Positives = 433/502 (86%)
 Frame = -2

Query: 1715 TLVFFTVAVIVVSCSQQPLAFKHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQQHLD 1536
            TLVF    +IV+S S QPLA KH P+FL KFA               F+YETR  QQ LD
Sbjct: 83   TLVFTLSVIIVLSYSAQPLALKHWPKFLGKFAATARTHSEPPPQ---FHYETRCIQQSLD 139

Query: 1535 HFSFSELPTFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQFGAM 1356
            HFSFSELPTFPQRYLI+TEHWVGP+R GP+FFY GNE DI WFAQNTG VWEIAP+FGAM
Sbjct: 140  HFSFSELPTFPQRYLISTEHWVGPRRLGPVFFYSGNEDDIEWFAQNTGVVWEIAPRFGAM 199

Query: 1355 VVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSANHCPVVL 1176
            VVFPEH+YYGESVPYGS EEAYKN TTLSYLT+EQAL DFSV++ DLK NFS   CPV L
Sbjct: 200  VVFPEHQYYGESVPYGSAEEAYKNVTTLSYLTSEQALVDFSVVIADLKHNFSTKDCPVFL 259

Query: 1175 FGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRESSTCFN 996
            FGGSYGGMLAAWMRLKYPH+AVGALASSAPILQFEDIVPPETFYD+VSN F+RES  CFN
Sbjct: 260  FGGSYGGMLAAWMRLKYPHVAVGALASSAPILQFEDIVPPETFYDLVSNAFKRESFICFN 319

Query: 995  YIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPYPSEF 816
            YIKQSW+++A+ G+T+NGL  LTKTFNLC KL RT+DL+ W+E+AYSYLAMVNYPYP+EF
Sbjct: 320  YIKQSWNEMASAGQTNNGLELLTKTFNLCQKLNRTKDLYDWVEAAYSYLAMVNYPYPAEF 379

Query: 815  MMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLSGWNW 636
            MM LP HPIREV        + + ILERIYEGVNVYYNYTGEAKCFELDDDPHG+SGW+W
Sbjct: 380  MMTLPEHPIREVSM-----VSNSYILERIYEGVNVYYNYTGEAKCFELDDDPHGMSGWDW 434

Query: 635  QACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIHATLK 456
            QACTEM+MPMSSSQESSMF PYEY Y+S QE+CLK FGVKPRPKWITTEFGGH+IHATLK
Sbjct: 435  QACTEMIMPMSSSQESSMFLPYEYXYTSIQEECLKKFGVKPRPKWITTEFGGHDIHATLK 494

Query: 455  KFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWLVEQRAT 276
            KFGSNIIFSNGLLDPWSGGS+LQNISES+V+LVTEEGAHHIDLR+ST NDPDWLVEQR T
Sbjct: 495  KFGSNIIFSNGLLDPWSGGSILQNISESVVSLVTEEGAHHIDLRSSTKNDPDWLVEQRET 554

Query: 275  EIKLIQGWISDYHQKNKAAFDM 210
            EIKLI+GWISDYHQKNKA FDM
Sbjct: 555  EIKLIEGWISDYHQKNKAMFDM 576


>ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|222853228|gb|EEE90775.1|
            predicted protein [Populus trichocarpa]
          Length = 515

 Score =  774 bits (1999), Expect = 0.0
 Identities = 367/503 (72%), Positives = 418/503 (83%)
 Frame = -2

Query: 1718 STLVFFTVAVIVVSCSQQPLAFKHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQQHL 1539
            ST + FT   +  S     L+ K APRFL K +               + YE+++F Q L
Sbjct: 17   STTIIFTPPALA-SQPLNHLSSKRAPRFLSKHSYPIKTQLQEQQQ---YRYESKYFYQQL 72

Query: 1538 DHFSFSELPTFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQFGA 1359
            DHFSF  LP FPQRYLINT+HW GP+R GPIF YCGNEGDI WFA NTGFVWEIAP FGA
Sbjct: 73   DHFSFLNLPKFPQRYLINTDHWAGPERRGPIFLYCGNEGDIEWFAVNTGFVWEIAPLFGA 132

Query: 1358 MVVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSANHCPVV 1179
            MV+FPEHRYYGES+PYG+REEAYKNA+TLSYLTAEQALADF+VL+TDLK+N SA  CPVV
Sbjct: 133  MVLFPEHRYYGESMPYGNREEAYKNASTLSYLTAEQALADFAVLITDLKRNLSAQACPVV 192

Query: 1178 LFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRESSTCF 999
            LFGGSYGGMLAAWMRLKYPH+A+GALASSAPILQFEDIVPPETFY+IVSNDF+RES++CF
Sbjct: 193  LFGGSYGGMLAAWMRLKYPHVAIGALASSAPILQFEDIVPPETFYNIVSNDFKRESTSCF 252

Query: 998  NYIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPYPSE 819
            N IK+SWD + ++G   NGLVQLTKTF+LC +LK TEDL  WL+SAYSYLAMV+YPYPS 
Sbjct: 253  NTIKESWDALLSEGLKKNGLVQLTKTFHLCRELKSTEDLANWLDSAYSYLAMVDYPYPSS 312

Query: 818  FMMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLSGWN 639
            FMMPLPG+PI EVC+RIDG P GTSILERI+EG+++YYNYTGE  CFELDDDPHGL GWN
Sbjct: 313  FMMPLPGYPIGEVCKRIDGCPDGTSILERIFEGISIYYNYTGELHCFELDDDPHGLDGWN 372

Query: 638  WQACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIHATL 459
            WQACTEMVMPMSSS  +SMFP Y++NYSS+QE C + FGV PRP+WITTEFGG +I   L
Sbjct: 373  WQACTEMVMPMSSSHNASMFPTYDFNYSSYQEGCWEEFGVIPRPRWITTEFGGQDIKTAL 432

Query: 458  KKFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWLVEQRA 279
            + FGSNIIFSNGLLDPWSGGSVLQNISE++VALVTEEGAHHIDLR ST  DPDWLVEQR 
Sbjct: 433  ETFGSNIIFSNGLLDPWSGGSVLQNISETVVALVTEEGAHHIDLRPSTPEDPDWLVEQRE 492

Query: 278  TEIKLIQGWISDYHQKNKAAFDM 210
            TE+KLI+GWI  Y ++ K AF M
Sbjct: 493  TEVKLIKGWIDGYLKEKKTAFSM 515


>ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
            gi|296085719|emb|CBI29519.3| unnamed protein product
            [Vitis vinifera]
          Length = 510

 Score =  772 bits (1994), Expect = 0.0
 Identities = 356/481 (74%), Positives = 413/481 (85%)
 Frame = -2

Query: 1652 KHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQQHLDHFSFSELPTFPQRYLINTEHW 1473
            K  PRFL KFA               F YETR+F+Q LDHFS ++LP F QRYLI+T HW
Sbjct: 38   KSIPRFLGKFAYPNRGKP--------FQYETRYFEQRLDHFSIADLPKFRQRYLISTRHW 89

Query: 1472 VGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQFGAMVVFPEHRYYGESVPYGSREEA 1293
             GP R GPIF YCGNEGDI WFA NTGFVW++AP+FGAMV+FPEHRYYGES+PYGSR++A
Sbjct: 90   TGPDRMGPIFLYCGNEGDIEWFAANTGFVWDMAPRFGAMVLFPEHRYYGESMPYGSRDKA 149

Query: 1292 YKNATTLSYLTAEQALADFSVLLTDLKQNFSANHCPVVLFGGSYGGMLAAWMRLKYPHIA 1113
            Y NA +LSYLTAEQALADF+VL+T+LK+N SA  CPVVLFGGSYGGMLAAWMRLKYPHIA
Sbjct: 150  YANAASLSYLTAEQALADFAVLVTNLKRNLSAEGCPVVLFGGSYGGMLAAWMRLKYPHIA 209

Query: 1112 VGALASSAPILQFEDIVPPETFYDIVSNDFRRESSTCFNYIKQSWDDIATKGETSNGLVQ 933
            +GALASSAPILQFEDIVPPETFYDIVSN+F+RES +CF+ IK+SWD + ++G+ ++GL Q
Sbjct: 210  IGALASSAPILQFEDIVPPETFYDIVSNNFKRESISCFDTIKKSWDVLISEGQKNDGLKQ 269

Query: 932  LTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPYPSEFMMPLPGHPIREVCQRIDGGPA 753
            LTK F LC  LKRTEDL+ WL+SAYS+LAMVNYPYPS+F+MPLPGHPI+EVC+++D  P 
Sbjct: 270  LTKAFRLCRDLKRTEDLYDWLDSAYSFLAMVNYPYPSDFLMPLPGHPIKEVCRKMDSCPE 329

Query: 752  GTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLSGWNWQACTEMVMPMSSSQESSMFPP 573
            GTS+LERI+EGV+VYYNYTG+ +CF+LDDDPHG+ GWNWQACTEMVMPM+SS+ESSMFP 
Sbjct: 330  GTSVLERIFEGVSVYYNYTGKVECFQLDDDPHGMDGWNWQACTEMVMPMASSRESSMFPT 389

Query: 572  YEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIHATLKKFGSNIIFSNGLLDPWSGGSV 393
            Y+YNYSSFQE+C K+F VKPRP WITTEFGGH    TLK FGSNIIFSNGLLDPWSGGSV
Sbjct: 390  YDYNYSSFQEECWKDFSVKPRPTWITTEFGGHEFKTTLKVFGSNIIFSNGLLDPWSGGSV 449

Query: 392  LQNISESIVALVTEEGAHHIDLRASTGNDPDWLVEQRATEIKLIQGWISDYHQKNKAAFD 213
            LQNISE++VALVTEEGAHHIDLR+ST  DPDWLVEQRA E+KLI+GWI DYHQK  + F 
Sbjct: 450  LQNISETVVALVTEEGAHHIDLRSSTAEDPDWLVEQRAFEVKLIKGWIEDYHQKRNSVFS 509

Query: 212  M 210
            +
Sbjct: 510  I 510


>ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
            gi|95147306|gb|ABF57288.1| At5g65760 [Arabidopsis
            thaliana] gi|110736177|dbj|BAF00060.1| lysosomal Pro-X
            carboxypeptidase [Arabidopsis thaliana]
            gi|332010719|gb|AED98102.1| Serine carboxypeptidase S28
            family protein [Arabidopsis thaliana]
          Length = 515

 Score =  736 bits (1901), Expect = 0.0
 Identities = 345/504 (68%), Positives = 406/504 (80%), Gaps = 2/504 (0%)
 Frame = -2

Query: 1727 NRFSTLVFFTVAVIVVSCSQQPLAF-KHAPRFLR-KFAXXXXXXXXXXXXQLGFNYETRH 1554
            + F  L+ FT   +V   +   L+  K  PRF R  F             +  + YET+ 
Sbjct: 3    SHFCLLLIFTFFTLVFPSNGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKF 62

Query: 1553 FQQHLDHFSFSELPTFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIA 1374
            F Q LDHFSF++LP F QRYLIN++HW+G    GPIF YCGNEGDI WFA N+GF+W+IA
Sbjct: 63   FSQQLDHFSFADLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIA 122

Query: 1373 PQFGAMVVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSAN 1194
            P+FGA++VFPEHRYYGES+PYGSREEAYKNATTLSYLT EQALADF+V +TDLK+N SA 
Sbjct: 123  PKFGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAE 182

Query: 1193 HCPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRE 1014
             CPVVLFGGSYGGMLAAWMRLKYPHIA+GALASSAPILQFED+VPPETFYDI SNDF+RE
Sbjct: 183  ACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRE 242

Query: 1013 SSTCFNYIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNY 834
            SS+CFN IK SWD I  +G+  NGL+QLTKTF+ C  L  T+DL  WL+SAYSYLAMV+Y
Sbjct: 243  SSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDY 302

Query: 833  PYPSEFMMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHG 654
            PYP++FMMPLPGHPIREVC++IDG  +  SIL+RIY G++VYYNYTG   CF+LDDDPHG
Sbjct: 303  PYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHG 362

Query: 653  LSGWNWQACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHN 474
            L GWNWQACTEMVMPMSS+QE+SMFP Y +NYSS++E+C   F V PRPKW+TTEFGGH+
Sbjct: 363  LDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEFGGHD 422

Query: 473  IHATLKKFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWL 294
            I  TLK FGSNIIFSNGLLDPWSGGSVL+N+S++IVALVT+EGAHH+DLR ST  DP WL
Sbjct: 423  IATTLKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVALVTKEGAHHLDLRPSTPEDPKWL 482

Query: 293  VEQRATEIKLIQGWISDYHQKNKA 222
            V+QR  EI+LIQGWI  Y  + +A
Sbjct: 483  VDQREAEIRLIQGWIETYRVEKEA 506


Top