BLASTX nr result
ID: Glycyrrhiza24_contig00000382
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00000382
         (1969 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]                         97   1e-17
ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor,...  91   1e-15
dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]             90   3e-15
ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35...  88   1e-14
ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis tha...  86   4e-14

>dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 123/440 (27%), Positives = 187/440 (42%), Gaps = 18/440 (4%)
 Frame = -1

Query: 1762 SPISTDSHINQVIFD---SDHHNSSLPLLIQNNKNEDGGKTYEFNITAGKFFYLMTLQLV 1592
            SPIS + FD S H S + KT E++I G Y M + +
Sbjct: 42   SPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISI- 100

Query: 1591 LKDHKDDVEAYGTPDTGSNLIWLNLNCEXXXXXXXXXTDECIKIKEP------ETTFECH 1430
            +E DTGS+LIW+ C+ EC K K P +T+
Sbjct: 101  ---GTPPIEVLVIADTGSDLIWVQ--CQPC--------QECYKQKSPIFNPKQSSTYRRV 147

Query: 1429 SGAEDPCKKMWSDLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFGEGGF--RDS 1256
            C + SD+ + + CGY Y D S++ GY F +
Sbjct: 148  LCETRYCNALNSDM--------RACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGST 199

Query: 1255 HDQEFKVKYGVSTDTGPK-EKNSIGVVGLGRGDLSLFQQR-KNVDFKFSYCLPQYEEKDQ 1082
            ++ ++ +G G ++ G+VGLG G LSL Q +D KFSYCL EK
Sbjct: 200  NNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSN 259

Query: 1081 SNENALATSKLVFGSQVNTNPETSIKFLDKYEATD--KKECATHLYCISLTSIYVKGHDK 908
            + K+VFG + I D Y +T KE T Y ++L +I V G+++
Sbjct: 260  -----FSLGKIVFGDN------SFISGSDTYVSTPLVSKEPETFYY-LTLEAISV-GNER 306

Query: 907  --EEPEKKITVKEKGTTEVMIIDSGTTFTYLRGDVFDRFLDHVKQQIGDWENLENPYG-Y 737
            E + EKG +IIDSGTT T+L ++++ L+ V ++ + E + +P G +
Sbjct: 307  LAYENSRNDGNVEKGN---IIIDSGTTLTFLDSKLYNK-LELVLEKAVEGERVSDPNGIF 362

Query: 736  EHCFLKGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGKDYRCLTVKKTDD 557
            CF +K+ + TV D VELK N F K +D C T+ ++
Sbjct: 363  SICFR-------DKIGIELPIITVHFTDA-DVELKPINTFA---KAEEDLLCFTMIPSNG 411

Query: 556  VHILGSRAQVDFEVKFDLSK 497
            + I G+ AQ++F V +DL K
Sbjct: 412  IAIFGNLAQMNFLVGYDLDK 431

>ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 449

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 112/399 (28%), Positives = 166/399 (41%), Gaps = 17/399 (4%)
 Frame = -1

Query: 1639 NITAGKFFYLMTLQLVLKDHKDDVEAYGTPDTGSNLIWLNLN-CEXXXXXXXXXTDECIK 1463
            +I G YLM + + VE DTGS+LIW+ CE D
Sbjct: 85   DIVPGGGEYLMRISI----GNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDP--- 137

Query: 1462 IKEPETTFECHSGAEDPCKKMWSDLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGY 1283
            +++ + C K+ D D +K+ CGY Y D S+S G+
Sbjct: 138  --RRSSSYRNVLCGNEFCNKL--DGEARSCDARGFVKT------CGYTYSYGDQSFSDGH 187

Query: 1282 -----FGEGGFRDSHDQEF----KVKYGVSTDTGPK-EKNSIGVVGLGRGDLSLFQQR-K 1136
            FG G + +V +G T G ++ G++GLG G +SL Q
Sbjct: 188  LAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGP 247

Query: 1135 NVDFKFSYCLPQYEEKDQSNENALATSKLVFGSQVNTNPET----SIKFLDKYEATDKKE 968
            + KFSYCL E QSN TSK+ FG+ +N + S L K T
Sbjct: 248  KLSGKFSYCLVPTSE--QSNY----TSKINFGNDINISGSNYNVVSTPLLPKKPET---- 297

Query: 967  CATHLYCISLTSIYVKGHDKEEPEKKITVKEKGTTEVMIIDSGTTFTYLRGDVFDRFLDH 788
            Y ++L +I V+ EKG +IIDSGTT T+L + F+ LD
Sbjct: 298  ----YYYLTLEAISVENKRLPYTNLWNGEVEKGN---IIIDSGTTLTFLDSEFFNN-LDS 349

Query: 787  VKQQIGDWENLENPYG-YEHCFLKGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDL 611
            ++ E + +P+G + CF A +L ++ F VE L+ N F
Sbjct: 350  AVEEAVKGERVSDPHGLFNICFKDEKAIELPIITAHFTGADVE--------LQPVNTFA- 400

Query: 610  MNKNGKDYRCLTVKKTDDVHILGSRAQVDFEVKFDLSKK 494
            K +D C T+ ++D+ I G+ AQ++F V +DL KK
Sbjct: 401  --KVEEDLLCFTMIPSNDIAIFGNLAQMNFLVGYDLEKK 437

>dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 89.7 bits (221), Expect = 3e-15
 Identities = 104/376 (27%), Positives = 158/376 (42%), Gaps = 24/376 (6%)
 Frame = -1

Query: 1549 DTGSNLIWLNLNCEXXXXXXXXXTDECIKIKEP------ETTFECHSGAEDPCKKMWSDL 1388
            DTGS+L WL D+C K P TTF PC +
Sbjct: 98   DTGSDLTWLQSK----------PCDQCYPQKGPIFDPSNSTTFHKLPCTTAPCNAL---- 143

Query: 1387 GMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFGEGGFR--DSHDQEFKVKYGVSTD 1214
            D+ +S CGY Y D SY+ GY ++ Q V +G T
Sbjct: 144  -------DESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRNVAFGCGTR 196

Query: 1213 TGPK-EKNSIGVVGLGRGDLSLFQQRKN-VDFKFSYCL-PQYEEKDQSNENALATSKLVF 1043
            G ++ G+VGLG G+LS Q + + KFSYCL P E ++ ATS++VF
Sbjct: 197  NGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVF 256

Query: 1042 GSQVNTNPETSIKFLDKYEATDKKECATHLYCISLTSIYVKGHDK---EEPEKKITVKEK 872
            G + ++ + KE +T+ Y +++ +I V G K K +
Sbjct: 257  GDNPVFSSSSTNGVVFATTPLVNKEPSTYYY-LTIEAITV-GRKKLLYSSSSSKTASYDS 314

Query: 871  GTTEVM-----IIDSGTTFTYLRGDVFDRF----LDHVK-QQIGDWENLENPYGYEHCFL 722
            G+ + IIDSGTT T+L + + ++ +K +++ D +N + CF
Sbjct: 315  GSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSM----FSLCFK 370

Query: 721  KGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGKDYRCLTVKKTDDVHILG 542
            G E+V L + V + VELK N F + C T+ T+DV I G
Sbjct: 371  SGK----EEVELPLMK--VHFRGGADVELKPVNTFVRAEEG---LVCFTMLPTNDVGIYG 421

Query: 541  SRAQVDFEVKFDLSKK 494
            + AQ++F V +DL K+
Sbjct: 422  NLAQMNFVVGYDLGKR 437

>ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 95/370 (25%), Positives = 151/370 (40%), Gaps = 14/370 (3%)
 Frame = -1

Query: 1570 VEAYGTPDTGSNLIWLNLN-CEXXXXXXXXXTDECIKIKEPETTFECHSGAEDPCKKMWS 1394
            VE + DTGS+LIW+ CE D +TF+ PC +
Sbjct: 103  VERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDP-----RKSSTFKTVPCDSQPCTLL-- 155

Query: 1393 DLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFG----EGGFRDSHDQEFKVKYG 1226
            P +C Y+ IY D + G G G +++ + K+ +G
Sbjct: 156  --------PPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFG 207

Query: 1225 VS---TDTGPKEKNSIGVVGLGRGDLSLFQQRK-NVDFKFSYCLPQYEEKDQSNENALAT 1058
            + DT + K ++G+VGLG G LSL Q + KFSYC P +T
Sbjct: 208  CTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSN--------ST 259

Query: 1057 SKLVFGSQVNTNPETSIKFLDKYEATDK--KECATHLYCISLTSIYVKGHDKEEPEKKIT 884
            SK+ FG+ + +K + +T K Y ++L + + KK+
Sbjct: 260  SKMRFGN------DAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGN-------KKVK 306

Query: 883  VKEKGTTEVMIIDSGTTFTYLRGDVFDRFLDHVKQQIGDWENLENPYGYEHCF-LKGSAE 707
            E T ++IDSGT+FT L+ +++F+ VK+ G P Y CF KG +
Sbjct: 307  TSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFENKGKRK 366

Query: 706  KLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGKDYRCLTVKKT--DDVHILGSRA 533
            + V F KV + N+F+ + N C+ T +D I G+ A
Sbjct: 367  RFPDVVFLFTGA--------KVRVDASNLFEAEDNN---LLCMVALPTSDEDDSIFGNHA 415

Query: 532  QVDFEVKFDL 503
            Q+ ++V++DL
Sbjct: 416  QIGYQVEYDL 425

>ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags: Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score = 85.9 bits (211), Expect = 4e-14
 Identities = 105/399 (26%), Positives = 164/399 (41%), Gaps = 15/399 (3%)
 Frame = -1

Query: 1627 GKFFYLMTLQLVLKDHKDDVEAYGTPDTGSNLIWLNLN-CEXXXXXXXXXTDECIKIKEP 1451
            G+FF +T+ ++ + DTGS+L W+ C+ D K+
Sbjct: 83   GEFFMSITIGT------PPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFD-----KKK 131

Query: 1450 ETTFECHSGAEDPCKKMWS-DLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFGE 1274
            +T++ C+ + S + G +E + + C Y+ Y D S+SKG
Sbjct: 132  SSTYKSEPCDSRNCQALSSTERGCDESN-----------NICKYRYSYGDQSFSKGDVAT 180

Query: 1273 GGFRDSHDQEFKVKYGVST------DTGPKEKNSIGVVGLGRGDLSLFQQR-KNVDFKFS 1115
            V + + + G ++ G++GLG G LSL Q ++ KFS
Sbjct: 181  ETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFS 240

Query: 1114 YCLPQYEEKDQSNENALATSKLVFGSQVNTNPETSIKFLDKYEAT-DKKECATHLYCISL 938
            YCL S+++A V N+ P + K KE T+ Y ++L
Sbjct: 241  YCL--------SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYY-LTL 291

Query: 937  TSIYVKGHDKEEPEKKITVKEKG----TTEVMIIDSGTTFTYLRGDVFDRFLDHVKQQIG 770
            +I V + G T+ +IIDSGTT T L FD+F V++ +
Sbjct: 292  EAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVT 351

Query: 769  DWENLENPYG-YEHCFLKGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGK 593
            + + +P G HCF GSAE +G TV V L N F K +
Sbjct: 352  GAKRVSDPQGLLSHCFKSGSAE------IGLPEITVH-FTGADVRLSPINAF---VKLSE 401

Query: 592  DYRCLTVKKTDDVHILGSRAQVDFEVKFDLSKKEKEVSF 476
            D CL++ T +V I G+ AQ+DF V +DL + + VSF
Sbjct: 402  DMVCLSMVPTTEVAIYGNFAQMDFLVGYDL--ETRTVSF 438