BLASTX nr result

ID: Glycyrrhiza23_contig00001105 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00001105
         (1683 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003549991.1| PREDICTED: lysosomal Pro-X carboxypeptidase-...   875   0.0  
ref|XP_003525880.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pr...   820   0.0  
ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|2...   773   0.0  
ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase ...   771   0.0  
ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Ara...   736   0.0  

>ref|XP_003549991.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
          Length = 513

 Score =  875 bits (2261), Expect = 0.0
 Identities = 417/506 (82%), Positives = 452/506 (89%), Gaps = 1/506 (0%)
 Frame = -2

Query: 1643 RFSTLVF-FTVAVIVVSCSQQPLAFKHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQ 1467
            R STLVF  ++ +IV+S   QPLA  H+P+FL KFA               F+YE R+FQ
Sbjct: 9    RMSTLVFTLSIIIIVLSYPAQPLALNHSPKFLGKFAATARTHSNSEPPPQ-FHYEKRYFQ 67

Query: 1466 QHLDHFSFSELPMFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQ 1287
            Q LDHFSFSELP FPQRYLI+TEHWVGP R GPIFFYCGNEGDI WFAQNTGFVWEIAP+
Sbjct: 68   QRLDHFSFSELPTFPQRYLISTEHWVGPHRLGPIFFYCGNEGDIEWFAQNTGFVWEIAPR 127

Query: 1286 FGAMVVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSANHC 1107
            FGAMVVFPEHRYYGESVPYGS EEAYKNATTLSYLTAEQALADFSVL+T LK N+SA  C
Sbjct: 128  FGAMVVFPEHRYYGESVPYGSAEEAYKNATTLSYLTAEQALADFSVLITYLKHNYSAKDC 187

Query: 1106 PVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRESS 927
            PVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYD+VSN F+RES 
Sbjct: 188  PVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDLVSNAFKRESF 247

Query: 926  TCFNYIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPY 747
            TCFNYIKQSW++IA+ G+T+NGL  LTKTFNLC KLKRT+DL+ W E+AYSYLAMVNYPY
Sbjct: 248  TCFNYIKQSWNEIASTGQTNNGLELLTKTFNLCQKLKRTKDLYDWAEAAYSYLAMVNYPY 307

Query: 746  PSEFMMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLS 567
            P+EFMM LP HPIREVC+RIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHG+S
Sbjct: 308  PAEFMMTLPEHPIREVCRRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGMS 367

Query: 566  GWNWQACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIH 387
            GW WQACTEMVMPMSSSQESSMFPPYEYNY+S Q +CLK FGVKPRP+WITTEFGGH+IH
Sbjct: 368  GWEWQACTEMVMPMSSSQESSMFPPYEYNYTSIQAECLKKFGVKPRPRWITTEFGGHDIH 427

Query: 386  ATLKKFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWLVE 207
            ATLKKFGSNIIFSNGLLDPWSGG VLQNISES+V+LVTEEGAHHIDLR+ST NDPDWLVE
Sbjct: 428  ATLKKFGSNIIFSNGLLDPWSGGGVLQNISESVVSLVTEEGAHHIDLRSSTKNDPDWLVE 487

Query: 206  QRATEIKLIQGWISDYHQKNKAAYDM 129
            QR TEIKLI+GWISDYHQKN+A +DM
Sbjct: 488  QRETEIKLIEGWISDYHQKNEAMFDM 513


>ref|XP_003525880.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase-like
            [Glycine max]
          Length = 597

 Score =  820 bits (2117), Expect = 0.0
 Identities = 392/502 (78%), Positives = 432/502 (86%)
 Frame = -2

Query: 1634 TLVFFTVAVIVVSCSQQPLAFKHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQQHLD 1455
            TLVF    +IV+S S QPLA KH P+FL KFA               F+YETR  QQ LD
Sbjct: 83   TLVFTLSVIIVLSYSAQPLALKHWPKFLGKFAATARTHSEPPPQ---FHYETRCIQQSLD 139

Query: 1454 HFSFSELPMFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQFGAM 1275
            HFSFSELP FPQRYLI+TEHWVGP+R GP+FFY GNE DI WFAQNTG VWEIAP+FGAM
Sbjct: 140  HFSFSELPTFPQRYLISTEHWVGPRRLGPVFFYSGNEDDIEWFAQNTGVVWEIAPRFGAM 199

Query: 1274 VVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSANHCPVVL 1095
            VVFPEH+YYGESVPYGS EEAYKN TTLSYLT+EQAL DFSV++ DLK NFS   CPV L
Sbjct: 200  VVFPEHQYYGESVPYGSAEEAYKNVTTLSYLTSEQALVDFSVVIADLKHNFSTKDCPVFL 259

Query: 1094 FGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRESSTCFN 915
            FGGSYGGMLAAWMRLKYPH+AVGALASSAPILQFEDIVPPETFYD+VSN F+RES  CFN
Sbjct: 260  FGGSYGGMLAAWMRLKYPHVAVGALASSAPILQFEDIVPPETFYDLVSNAFKRESFICFN 319

Query: 914  YIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPYPSEF 735
            YIKQSW+++A+ G+T+NGL  LTKTFNLC KL RT+DL+ W+E+AYSYLAMVNYPYP+EF
Sbjct: 320  YIKQSWNEMASAGQTNNGLELLTKTFNLCQKLNRTKDLYDWVEAAYSYLAMVNYPYPAEF 379

Query: 734  MMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLSGWNW 555
            MM LP HPIREV        + + ILERIYEGVNVYYNYTGEAKCFELDDDPHG+SGW+W
Sbjct: 380  MMTLPEHPIREVSM-----VSNSYILERIYEGVNVYYNYTGEAKCFELDDDPHGMSGWDW 434

Query: 554  QACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIHATLK 375
            QACTEM+MPMSSSQESSMF PYEY Y+S QE+CLK FGVKPRPKWITTEFGGH+IHATLK
Sbjct: 435  QACTEMIMPMSSSQESSMFLPYEYXYTSIQEECLKKFGVKPRPKWITTEFGGHDIHATLK 494

Query: 374  KFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWLVEQRAT 195
            KFGSNIIFSNGLLDPWSGGS+LQNISES+V+LVTEEGAHHIDLR+ST NDPDWLVEQR T
Sbjct: 495  KFGSNIIFSNGLLDPWSGGSILQNISESVVSLVTEEGAHHIDLRSSTKNDPDWLVEQRET 554

Query: 194  EIKLIQGWISDYHQKNKAAYDM 129
            EIKLI+GWISDYHQKNKA +DM
Sbjct: 555  EIKLIEGWISDYHQKNKAMFDM 576


>ref|XP_002310325.1| predicted protein [Populus trichocarpa] gi|222853228|gb|EEE90775.1|
            predicted protein [Populus trichocarpa]
          Length = 515

 Score =  773 bits (1996), Expect = 0.0
 Identities = 366/503 (72%), Positives = 418/503 (83%)
 Frame = -2

Query: 1637 STLVFFTVAVIVVSCSQQPLAFKHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQQHL 1458
            ST + FT   +  S     L+ K APRFL K +               + YE+++F Q L
Sbjct: 17   STTIIFTPPALA-SQPLNHLSSKRAPRFLSKHSYPIKTQLQEQQQ---YRYESKYFYQQL 72

Query: 1457 DHFSFSELPMFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQFGA 1278
            DHFSF  LP FPQRYLINT+HW GP+R GPIF YCGNEGDI WFA NTGFVWEIAP FGA
Sbjct: 73   DHFSFLNLPKFPQRYLINTDHWAGPERRGPIFLYCGNEGDIEWFAVNTGFVWEIAPLFGA 132

Query: 1277 MVVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSANHCPVV 1098
            MV+FPEHRYYGES+PYG+REEAYKNA+TLSYLTAEQALADF+VL+TDLK+N SA  CPVV
Sbjct: 133  MVLFPEHRYYGESMPYGNREEAYKNASTLSYLTAEQALADFAVLITDLKRNLSAQACPVV 192

Query: 1097 LFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRESSTCF 918
            LFGGSYGGMLAAWMRLKYPH+A+GALASSAPILQFEDIVPPETFY+IVSNDF+RES++CF
Sbjct: 193  LFGGSYGGMLAAWMRLKYPHVAIGALASSAPILQFEDIVPPETFYNIVSNDFKRESTSCF 252

Query: 917  NYIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPYPSE 738
            N IK+SWD + ++G   NGLVQLTKTF+LC +LK TEDL  WL+SAYSYLAMV+YPYPS 
Sbjct: 253  NTIKESWDALLSEGLKKNGLVQLTKTFHLCRELKSTEDLANWLDSAYSYLAMVDYPYPSS 312

Query: 737  FMMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLSGWN 558
            FMMPLPG+PI EVC+RIDG P GTSILERI+EG+++YYNYTGE  CFELDDDPHGL GWN
Sbjct: 313  FMMPLPGYPIGEVCKRIDGCPDGTSILERIFEGISIYYNYTGELHCFELDDDPHGLDGWN 372

Query: 557  WQACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIHATL 378
            WQACTEMVMPMSSS  +SMFP Y++NYSS+QE C + FGV PRP+WITTEFGG +I   L
Sbjct: 373  WQACTEMVMPMSSSHNASMFPTYDFNYSSYQEGCWEEFGVIPRPRWITTEFGGQDIKTAL 432

Query: 377  KKFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWLVEQRA 198
            + FGSNIIFSNGLLDPWSGGSVLQNISE++VALVTEEGAHHIDLR ST  DPDWLVEQR 
Sbjct: 433  ETFGSNIIFSNGLLDPWSGGSVLQNISETVVALVTEEGAHHIDLRPSTPEDPDWLVEQRE 492

Query: 197  TEIKLIQGWISDYHQKNKAAYDM 129
            TE+KLI+GWI  Y ++ K A+ M
Sbjct: 493  TEVKLIKGWIDGYLKEKKTAFSM 515


>ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
            gi|296085719|emb|CBI29519.3| unnamed protein product
            [Vitis vinifera]
          Length = 510

 Score =  771 bits (1991), Expect = 0.0
 Identities = 355/481 (73%), Positives = 413/481 (85%)
 Frame = -2

Query: 1571 KHAPRFLRKFAXXXXXXXXXXXXQLGFNYETRHFQQHLDHFSFSELPMFPQRYLINTEHW 1392
            K  PRFL KFA               F YETR+F+Q LDHFS ++LP F QRYLI+T HW
Sbjct: 38   KSIPRFLGKFAYPNRGKP--------FQYETRYFEQRLDHFSIADLPKFRQRYLISTRHW 89

Query: 1391 VGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIAPQFGAMVVFPEHRYYGESVPYGSREEA 1212
             GP R GPIF YCGNEGDI WFA NTGFVW++AP+FGAMV+FPEHRYYGES+PYGSR++A
Sbjct: 90   TGPDRMGPIFLYCGNEGDIEWFAANTGFVWDMAPRFGAMVLFPEHRYYGESMPYGSRDKA 149

Query: 1211 YKNATTLSYLTAEQALADFSVLLTDLKQNFSANHCPVVLFGGSYGGMLAAWMRLKYPHIA 1032
            Y NA +LSYLTAEQALADF+VL+T+LK+N SA  CPVVLFGGSYGGMLAAWMRLKYPHIA
Sbjct: 150  YANAASLSYLTAEQALADFAVLVTNLKRNLSAEGCPVVLFGGSYGGMLAAWMRLKYPHIA 209

Query: 1031 VGALASSAPILQFEDIVPPETFYDIVSNDFRRESSTCFNYIKQSWDDIATKGETSNGLVQ 852
            +GALASSAPILQFEDIVPPETFYDIVSN+F+RES +CF+ IK+SWD + ++G+ ++GL Q
Sbjct: 210  IGALASSAPILQFEDIVPPETFYDIVSNNFKRESISCFDTIKKSWDVLISEGQKNDGLKQ 269

Query: 851  LTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNYPYPSEFMMPLPGHPIREVCQRIDGGPA 672
            LTK F LC  LKRTEDL+ WL+SAYS+LAMVNYPYPS+F+MPLPGHPI+EVC+++D  P 
Sbjct: 270  LTKAFRLCRDLKRTEDLYDWLDSAYSFLAMVNYPYPSDFLMPLPGHPIKEVCRKMDSCPE 329

Query: 671  GTSILERIYEGVNVYYNYTGEAKCFELDDDPHGLSGWNWQACTEMVMPMSSSQESSMFPP 492
            GTS+LERI+EGV+VYYNYTG+ +CF+LDDDPHG+ GWNWQACTEMVMPM+SS+ESSMFP 
Sbjct: 330  GTSVLERIFEGVSVYYNYTGKVECFQLDDDPHGMDGWNWQACTEMVMPMASSRESSMFPT 389

Query: 491  YEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHNIHATLKKFGSNIIFSNGLLDPWSGGSV 312
            Y+YNYSSFQE+C K+F VKPRP WITTEFGGH    TLK FGSNIIFSNGLLDPWSGGSV
Sbjct: 390  YDYNYSSFQEECWKDFSVKPRPTWITTEFGGHEFKTTLKVFGSNIIFSNGLLDPWSGGSV 449

Query: 311  LQNISESIVALVTEEGAHHIDLRASTGNDPDWLVEQRATEIKLIQGWISDYHQKNKAAYD 132
            LQNISE++VALVTEEGAHHIDLR+ST  DPDWLVEQRA E+KLI+GWI DYHQK  + + 
Sbjct: 450  LQNISETVVALVTEEGAHHIDLRSSTAEDPDWLVEQRAFEVKLIKGWIEDYHQKRNSVFS 509

Query: 131  M 129
            +
Sbjct: 510  I 510


>ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
            gi|95147306|gb|ABF57288.1| At5g65760 [Arabidopsis
            thaliana] gi|110736177|dbj|BAF00060.1| lysosomal Pro-X
            carboxypeptidase [Arabidopsis thaliana]
            gi|332010719|gb|AED98102.1| Serine carboxypeptidase S28
            family protein [Arabidopsis thaliana]
          Length = 515

 Score =  736 bits (1901), Expect = 0.0
 Identities = 345/504 (68%), Positives = 406/504 (80%), Gaps = 2/504 (0%)
 Frame = -2

Query: 1646 NRFSTLVFFTVAVIVVSCSQQPLAF-KHAPRFLR-KFAXXXXXXXXXXXXQLGFNYETRH 1473
            + F  L+ FT   +V   +   L+  K  PRF R  F             +  + YET+ 
Sbjct: 3    SHFCLLLIFTFFTLVFPSNGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKF 62

Query: 1472 FQQHLDHFSFSELPMFPQRYLINTEHWVGPQRSGPIFFYCGNEGDIVWFAQNTGFVWEIA 1293
            F Q LDHFSF++LP F QRYLIN++HW+G    GPIF YCGNEGDI WFA N+GF+W+IA
Sbjct: 63   FSQQLDHFSFADLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIA 122

Query: 1292 PQFGAMVVFPEHRYYGESVPYGSREEAYKNATTLSYLTAEQALADFSVLLTDLKQNFSAN 1113
            P+FGA++VFPEHRYYGES+PYGSREEAYKNATTLSYLT EQALADF+V +TDLK+N SA 
Sbjct: 123  PKFGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAE 182

Query: 1112 HCPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDIVSNDFRRE 933
             CPVVLFGGSYGGMLAAWMRLKYPHIA+GALASSAPILQFED+VPPETFYDI SNDF+RE
Sbjct: 183  ACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRE 242

Query: 932  SSTCFNYIKQSWDDIATKGETSNGLVQLTKTFNLCGKLKRTEDLWGWLESAYSYLAMVNY 753
            SS+CFN IK SWD I  +G+  NGL+QLTKTF+ C  L  T+DL  WL+SAYSYLAMV+Y
Sbjct: 243  SSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDY 302

Query: 752  PYPSEFMMPLPGHPIREVCQRIDGGPAGTSILERIYEGVNVYYNYTGEAKCFELDDDPHG 573
            PYP++FMMPLPGHPIREVC++IDG  +  SIL+RIY G++VYYNYTG   CF+LDDDPHG
Sbjct: 303  PYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHG 362

Query: 572  LSGWNWQACTEMVMPMSSSQESSMFPPYEYNYSSFQEDCLKNFGVKPRPKWITTEFGGHN 393
            L GWNWQACTEMVMPMSS+QE+SMFP Y +NYSS++E+C   F V PRPKW+TTEFGGH+
Sbjct: 363  LDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEFGGHD 422

Query: 392  IHATLKKFGSNIIFSNGLLDPWSGGSVLQNISESIVALVTEEGAHHIDLRASTGNDPDWL 213
            I  TLK FGSNIIFSNGLLDPWSGGSVL+N+S++IVALVT+EGAHH+DLR ST  DP WL
Sbjct: 423  IATTLKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVALVTKEGAHHLDLRPSTPEDPKWL 482

Query: 212  VEQRATEIKLIQGWISDYHQKNKA 141
            V+QR  EI+LIQGWI  Y  + +A
Sbjct: 483  VDQREAEIRLIQGWIETYRVEKEA 506

