BLASTX nr result
ID: Glycyrrhiza24_contig00006245
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza24_contig00006245 (1575 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 615 e-174 dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 580 e-163 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 565 e-159 ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2... 563 e-158 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 555 e-155 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 615 bits (1587), Expect = e-174 Identities = 300/419 (71%), Positives = 334/419 (79%), Gaps = 7/419 (1%) Frame = -3 Query: 1417 STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRN----SSYTLSL 1250 S DTSELFE WCKEH K YSSEEEK YR KVFEDNY FVA+HNQ N SSYTLSL Sbjct: 25 SASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSL 84 Query: 1249 NAFADLTHHEFKASRLGLPPRSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQ 1070 NAFADLTHHEFK +RLGLP + LRF R Q+QQ D L +PS+IDWR +GAVTPVKDQ Sbjct: 85 NAFADLTHHEFKTTRLGLP--LTLLRFKRPQNQQSRDL-LHIPSQIDWRQSGAVTPVKDQ 141 Query: 1069 GSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 890 SCGACW+F+ATGAIEGINKIVTGSL+SLSEQEL+DCDT+YNSGC GGLMD+AYQFVIDN Sbjct: 142 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDN 201 Query: 889 NGIDTEEDYPYQGRQRLCNKDKLRRRVVIIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 710 GIDTE+DYPYQ RQR C+KDKL+RR V I+ Y+DVP ++E+ +LKAVA QPVSVGICGS Sbjct: 202 KGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGS 260 Query: 709 ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSXXXXXXXXXXXXMLRNT 530 ER FQLYSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNS M+RN+ Sbjct: 261 EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320 Query: 529 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWK 350 G+S+G+CGIN LASY +CNLFT+CS GETCCCA+S LGICF WK Sbjct: 321 GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380 Query: 349 CCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGSQ 182 CCGLTSAVCCKDKRHCCPQDYP+CDTRRGQCLKR +NGT T T E +D +RGW SQ Sbjct: 381 CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSHKSRGWKSQ 439 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 580 bits (1496), Expect = e-163 Identities = 282/416 (67%), Positives = 319/416 (76%), Gaps = 4/416 (0%) Frame = -3 Query: 1420 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1241 +S+ + + LFE WC++HGK Y+S+EEK +R KVF+DNYDFV HN +G NSSYTLSLNAF Sbjct: 21 SSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQG-NSSYTLSLNAF 79 Query: 1240 ADLTHHEFKASRLGLPPRSSF-LRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGS 1064 ADLTHHEFKASRLGL +S L +R Q P D DVP+ +DWR GAVT VKDQG+ Sbjct: 80 ADLTHHEFKASRLGLSSAASASLNVDRSNRQIP-DFVADVPASVDWRKNGAVTQVKDQGN 138 Query: 1063 CGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNG 884 CGACWSF+ATGAIEGINKIVTGSL+SLSEQELVDCD +YN+GCEGG+MDYA+QFVIDN+G Sbjct: 139 CGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHG 198 Query: 883 IDTEEDYPYQGRQRLCNKDKLRRRVVIIDGYIDVPRNDEKRLLKAVADQPVSVGICGSER 704 IDTEEDYPYQGR R CNK+KL+R VV IDGY+DVP+N+EK LLKAVA+QPVSVGICGSER Sbjct: 199 IDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSER 258 Query: 703 AFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSXXXXXXXXXXXXMLRNTGD 524 AFQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNS M RN+G Sbjct: 259 AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGS 318 Query: 523 SEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCC 344 S GLCGINMLASY T+C+LFT+C GETCCC + GIC WKCC Sbjct: 319 SRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCC 378 Query: 343 GLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDST---RGWGS 185 L SAVCCKD RHCCP+DYPVCDT R CLK N T+ F K S+ R W S Sbjct: 379 ELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSS 434 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 565 bits (1457), Expect = e-159 Identities = 273/390 (70%), Positives = 312/390 (80%) Frame = -3 Query: 1420 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1241 +S+ D S+LFE+W KEHGK Y+S+E+K YR K+FE+NY+FV +HN +G NSSYTLSLNAF Sbjct: 23 SSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQG-NSSYTLSLNAF 81 Query: 1240 ADLTHHEFKASRLGLPPRSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1061 ADLTHHEFKASRLGL S+ + +R ++ +D DVP IDWR GAV+ VKDQG+C Sbjct: 82 ADLTHHEFKASRLGLSAFSTSGKLSR-RNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNC 140 Query: 1060 GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 881 GACWSF+ATGAIEGINKIVTGSL+SLSEQELVDCD +YN+GCEGGLMDYAYQFVI+NNGI Sbjct: 141 GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGI 200 Query: 880 DTEEDYPYQGRQRLCNKDKLRRRVVIIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 701 DTEEDYPYQ R++ CNK+KL+R VV IDGY DVP+N+EK LLKAVA QPVSVGICGSERA Sbjct: 201 DTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERA 260 Query: 700 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSXXXXXXXXXXXXMLRNTGDS 521 FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNS MLRN+G+S Sbjct: 261 FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNS 320 Query: 520 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 341 +GLCGINMLAS+ TKC+LFT C GETCCC R + G+CF WKCC Sbjct: 321 QGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCE 380 Query: 340 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLK 251 L SAVCCKD HCCP DYPVCDT+R CLK Sbjct: 381 LDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa] Length = 436 Score = 563 bits (1451), Expect = e-158 Identities = 268/405 (66%), Positives = 308/405 (76%) Frame = -3 Query: 1417 STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFA 1238 S+ D S+LFE WCKEHGK+Y+S+EE+ +R KVFEDNYDFV +HN +G NSSY+L+LNAFA Sbjct: 21 SSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKG-NSSYSLALNAFA 79 Query: 1237 DLTHHEFKASRLGLPPRSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSCG 1058 DLTHHEFK SRLGL L + D+P+ IDWRN G VT VKDQGSCG Sbjct: 80 DLTHHEFKTSRLGLSAAPLNLAHRNLEITGVVG---DIPASIDWRNKGVVTNVKDQGSCG 136 Query: 1057 ACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGID 878 ACWSF+ATGAIEGINKIVTGSL+SLSEQEL++CD +YN GC GGLMDYA+QFVI+N+GID Sbjct: 137 ACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGID 196 Query: 877 TEEDYPYQGRQRLCNKDKLRRRVVIIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAF 698 TEEDYPY+ R CNKD+++RRVV ID Y+DVP N+EK+LL+AVA QPVSVGICGSERAF Sbjct: 197 TEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAF 256 Query: 697 QLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSXXXXXXXXXXXXMLRNTGDSE 518 Q+YSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNS M RN+G+S+ Sbjct: 257 QMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQ 316 Query: 517 GLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCGL 338 G+CGINMLASY TKCNL TYC+ GETCCCAR GIC WKCCGL Sbjct: 317 GVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGL 376 Query: 337 TSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDS 203 SAVCCKD+ HCCP DYPVCDT + C KR N T+ E + S Sbjct: 377 DSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKTS 421 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 555 bits (1429), Expect = e-155 Identities = 270/416 (64%), Positives = 309/416 (74%) Frame = -3 Query: 1420 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1241 ++T + SELFE WC EHGK+YSS EEK YR VF DNY+FV HN NSSYTLSLN++ Sbjct: 20 SATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLD-NSSYTLSLNSY 78 Query: 1240 ADLTHHEFKASRLGLPPRSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1061 ADLTHHEFK SRLG P + F Q+P+ DVP +DWR GAVT VKDQGSC Sbjct: 79 ADLTHHEFKVSRLGFSP--ALRNFRPVLPQEPSLPR-DVPDSLDWRKKGAVTAVKDQGSC 135 Query: 1060 GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 881 GACWSF+ATGA+EGIN+I+TGSL+SLSEQEL+DCD +YNSGC GGLMDYAYQFVI N+GI Sbjct: 136 GACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGI 195 Query: 880 DTEEDYPYQGRQRLCNKDKLRRRVVIIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 701 DTE DYPYQ R C KDKL+R VV IDGY D+P NDE +LL+AVA QPVSVGICGSERA Sbjct: 196 DTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERA 255 Query: 700 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSXXXXXXXXXXXXMLRNTGDS 521 FQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVDYWIVKNS M RN+G+S Sbjct: 256 FQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNS 315 Query: 520 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 341 EG+CGIN LASY TKC++ T C+ GETCCCA+ LG+C WKCCG Sbjct: 316 EGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCG 375 Query: 340 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGSQ*AF 173 L+SAVCCKD RHCCP DYP+CDT R CLK+ NGT+T E S+ G+ +F Sbjct: 376 LSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSSGSSGTWSSF 431