BLASTX nr result

ID: Glycyrrhiza24_contig00000382 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00000382
         (1969 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]                           97   1e-17
ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor,...    91   1e-15
dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]               90   3e-15
ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35...    88   1e-14
ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis tha...    86   4e-14

>dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 123/440 (27%), Positives = 187/440 (42%), Gaps = 18/440 (4%)
 Frame = -1

Query: 1762 SPISTDSHINQVIFD---SDHHNSSLPLLIQNNKNEDGGKTYEFNITAGKFFYLMTLQLV 1592
            SPIS   +     FD   S  H S          +    KT E++I  G   Y M + + 
Sbjct: 42   SPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISI- 100

Query: 1591 LKDHKDDVEAYGTPDTGSNLIWLNLNCEXXXXXXXXXTDECIKIKEP------ETTFECH 1430
                   +E     DTGS+LIW+   C+           EC K K P       +T+   
Sbjct: 101  ---GTPPIEVLVIADTGSDLIWVQ--CQPC--------QECYKQKSPIFNPKQSSTYRRV 147

Query: 1429 SGAEDPCKKMWSDLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFGEGGF--RDS 1256
                  C  + SD+        +   +      CGY   Y D S++ GY     F    +
Sbjct: 148  LCETRYCNALNSDM--------RACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGST 199

Query: 1255 HDQEFKVKYGVSTDTGPK-EKNSIGVVGLGRGDLSLFQQR-KNVDFKFSYCLPQYEEKDQ 1082
            ++   ++ +G     G   ++   G+VGLG G LSL  Q    +D KFSYCL    EK  
Sbjct: 200  NNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSN 259

Query: 1081 SNENALATSKLVFGSQVNTNPETSIKFLDKYEATD--KKECATHLYCISLTSIYVKGHDK 908
                  +  K+VFG        + I   D Y +T    KE  T  Y ++L +I V G+++
Sbjct: 260  -----FSLGKIVFGDN------SFISGSDTYVSTPLVSKEPETFYY-LTLEAISV-GNER 306

Query: 907  --EEPEKKITVKEKGTTEVMIIDSGTTFTYLRGDVFDRFLDHVKQQIGDWENLENPYG-Y 737
               E  +     EKG    +IIDSGTT T+L   ++++ L+ V ++  + E + +P G +
Sbjct: 307  LAYENSRNDGNVEKGN---IIIDSGTTLTFLDSKLYNK-LELVLEKAVEGERVSDPNGIF 362

Query: 736  EHCFLKGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGKDYRCLTVKKTDD 557
              CF        +K+ +     TV   D   VELK  N F    K  +D  C T+  ++ 
Sbjct: 363  SICFR-------DKIGIELPIITVHFTDA-DVELKPINTFA---KAEEDLLCFTMIPSNG 411

Query: 556  VHILGSRAQVDFEVKFDLSK 497
            + I G+ AQ++F V +DL K
Sbjct: 412  IAIFGNLAQMNFLVGYDLDK 431


>ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
            communis] gi|223543249|gb|EEF44781.1| Aspartic proteinase
            nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 449

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 112/399 (28%), Positives = 166/399 (41%), Gaps = 17/399 (4%)
 Frame = -1

Query: 1639 NITAGKFFYLMTLQLVLKDHKDDVEAYGTPDTGSNLIWLNLN-CEXXXXXXXXXTDECIK 1463
            +I  G   YLM + +        VE     DTGS+LIW+    CE          D    
Sbjct: 85   DIVPGGGEYLMRISI----GNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDP--- 137

Query: 1462 IKEPETTFECHSGAEDPCKKMWSDLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGY 1283
                 +++       + C K+  D      D    +K+      CGY   Y D S+S G+
Sbjct: 138  --RRSSSYRNVLCGNEFCNKL--DGEARSCDARGFVKT------CGYTYSYGDQSFSDGH 187

Query: 1282 -----FGEGGFRDSHDQEF----KVKYGVSTDTGPK-EKNSIGVVGLGRGDLSLFQQR-K 1136
                 FG G    +         +V +G  T  G   ++   G++GLG G +SL  Q   
Sbjct: 188  LAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGP 247

Query: 1135 NVDFKFSYCLPQYEEKDQSNENALATSKLVFGSQVNTNPET----SIKFLDKYEATDKKE 968
             +  KFSYCL    E  QSN     TSK+ FG+ +N +       S   L K   T    
Sbjct: 248  KLSGKFSYCLVPTSE--QSNY----TSKINFGNDINISGSNYNVVSTPLLPKKPET---- 297

Query: 967  CATHLYCISLTSIYVKGHDKEEPEKKITVKEKGTTEVMIIDSGTTFTYLRGDVFDRFLDH 788
                 Y ++L +I V+              EKG    +IIDSGTT T+L  + F+  LD 
Sbjct: 298  ----YYYLTLEAISVENKRLPYTNLWNGEVEKGN---IIIDSGTTLTFLDSEFFNN-LDS 349

Query: 787  VKQQIGDWENLENPYG-YEHCFLKGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDL 611
              ++    E + +P+G +  CF    A +L  ++  F    VE        L+  N F  
Sbjct: 350  AVEEAVKGERVSDPHGLFNICFKDEKAIELPIITAHFTGADVE--------LQPVNTFA- 400

Query: 610  MNKNGKDYRCLTVKKTDDVHILGSRAQVDFEVKFDLSKK 494
              K  +D  C T+  ++D+ I G+ AQ++F V +DL KK
Sbjct: 401  --KVEEDLLCFTMIPSNDIAIFGNLAQMNFLVGYDLEKK 437


>dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 89.7 bits (221), Expect = 3e-15
 Identities = 104/376 (27%), Positives = 158/376 (42%), Gaps = 24/376 (6%)
 Frame = -1

Query: 1549 DTGSNLIWLNLNCEXXXXXXXXXTDECIKIKEP------ETTFECHSGAEDPCKKMWSDL 1388
            DTGS+L WL               D+C   K P       TTF        PC  +    
Sbjct: 98   DTGSDLTWLQSK----------PCDQCYPQKGPIFDPSNSTTFHKLPCTTAPCNAL---- 143

Query: 1387 GMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFGEGGFR--DSHDQEFKVKYGVSTD 1214
                   D+  +S      CGY   Y D SY+ GY         ++  Q   V +G  T 
Sbjct: 144  -------DESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRNVAFGCGTR 196

Query: 1213 TGPK-EKNSIGVVGLGRGDLSLFQQRKN-VDFKFSYCL-PQYEEKDQSNENALATSKLVF 1043
             G   ++   G+VGLG G+LS   Q  + +  KFSYCL P   E      ++ ATS++VF
Sbjct: 197  NGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVF 256

Query: 1042 GSQVNTNPETSIKFLDKYEATDKKECATHLYCISLTSIYVKGHDK---EEPEKKITVKEK 872
            G     +  ++   +        KE +T+ Y +++ +I V G  K        K    + 
Sbjct: 257  GDNPVFSSSSTNGVVFATTPLVNKEPSTYYY-LTIEAITV-GRKKLLYSSSSSKTASYDS 314

Query: 871  GTTEVM-----IIDSGTTFTYLRGDVFDRF----LDHVK-QQIGDWENLENPYGYEHCFL 722
            G+   +     IIDSGTT T+L  + +       ++ +K +++ D +N      +  CF 
Sbjct: 315  GSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSM----FSLCFK 370

Query: 721  KGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGKDYRCLTVKKTDDVHILG 542
             G     E+V L   +  V  +    VELK  N F    +      C T+  T+DV I G
Sbjct: 371  SGK----EEVELPLMK--VHFRGGADVELKPVNTFVRAEEG---LVCFTMLPTNDVGIYG 421

Query: 541  SRAQVDFEVKFDLSKK 494
            + AQ++F V +DL K+
Sbjct: 422  NLAQMNFVVGYDLGKR 437


>ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 95/370 (25%), Positives = 151/370 (40%), Gaps = 14/370 (3%)
 Frame = -1

Query: 1570 VEAYGTPDTGSNLIWLNLN-CEXXXXXXXXXTDECIKIKEPETTFECHSGAEDPCKKMWS 1394
            VE +   DTGS+LIW+    CE          D         +TF+       PC  +  
Sbjct: 103  VERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDP-----RKSSTFKTVPCDSQPCTLL-- 155

Query: 1393 DLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFG----EGGFRDSHDQEFKVKYG 1226
                    P           +C Y+ IY D +   G  G      G +++  +  K+ +G
Sbjct: 156  --------PPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFG 207

Query: 1225 VS---TDTGPKEKNSIGVVGLGRGDLSLFQQRK-NVDFKFSYCLPQYEEKDQSNENALAT 1058
             +    DT  + K ++G+VGLG G LSL  Q    +  KFSYC P             +T
Sbjct: 208  CTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSN--------ST 259

Query: 1057 SKLVFGSQVNTNPETSIKFLDKYEATDK--KECATHLYCISLTSIYVKGHDKEEPEKKIT 884
            SK+ FG+      +  +K +    +T    K      Y ++L  + +         KK+ 
Sbjct: 260  SKMRFGN------DAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGN-------KKVK 306

Query: 883  VKEKGTTEVMIIDSGTTFTYLRGDVFDRFLDHVKQQIGDWENLENPYGYEHCF-LKGSAE 707
              E  T   ++IDSGT+FT L+   +++F+  VK+  G       P  Y  CF  KG  +
Sbjct: 307  TSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFENKGKRK 366

Query: 706  KLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGKDYRCLTVKKT--DDVHILGSRA 533
            +   V   F           KV +   N+F+  + N     C+    T  +D  I G+ A
Sbjct: 367  RFPDVVFLFTGA--------KVRVDASNLFEAEDNN---LLCMVALPTSDEDDSIFGNHA 415

Query: 532  QVDFEVKFDL 503
            Q+ ++V++DL
Sbjct: 416  QIGYQVEYDL 425


>ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName:
            Full=Probable aspartic protease At2g35615; Flags:
            Precursor gi|330254036|gb|AEC09130.1| aspartyl
            protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score = 85.9 bits (211), Expect = 4e-14
 Identities = 105/399 (26%), Positives = 164/399 (41%), Gaps = 15/399 (3%)
 Frame = -1

Query: 1627 GKFFYLMTLQLVLKDHKDDVEAYGTPDTGSNLIWLNLN-CEXXXXXXXXXTDECIKIKEP 1451
            G+FF  +T+          ++ +   DTGS+L W+    C+          D     K+ 
Sbjct: 83   GEFFMSITIGT------PPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFD-----KKK 131

Query: 1450 ETTFECHSGAEDPCKKMWS-DLGMEEQDPDKCIKSTDHKDKCGYKIIYKDGSYSKGYFGE 1274
             +T++        C+ + S + G +E +           + C Y+  Y D S+SKG    
Sbjct: 132  SSTYKSEPCDSRNCQALSSTERGCDESN-----------NICKYRYSYGDQSFSKGDVAT 180

Query: 1273 GGFRDSHDQEFKVKYGVST------DTGPKEKNSIGVVGLGRGDLSLFQQR-KNVDFKFS 1115
                        V +  +       + G  ++   G++GLG G LSL  Q   ++  KFS
Sbjct: 181  ETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFS 240

Query: 1114 YCLPQYEEKDQSNENALATSKLVFGSQVNTNPETSIKFLDKYEAT-DKKECATHLYCISL 938
            YCL        S+++A      V     N+ P +  K           KE  T+ Y ++L
Sbjct: 241  YCL--------SHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYY-LTL 291

Query: 937  TSIYVKGHDKEEPEKKITVKEKG----TTEVMIIDSGTTFTYLRGDVFDRFLDHVKQQIG 770
             +I V               + G    T+  +IIDSGTT T L    FD+F   V++ + 
Sbjct: 292  EAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVT 351

Query: 769  DWENLENPYG-YEHCFLKGSAEKLEKVSLGFKRTTVEPKDELKVELKRENIFDLMNKNGK 593
              + + +P G   HCF  GSAE      +G    TV       V L   N F    K  +
Sbjct: 352  GAKRVSDPQGLLSHCFKSGSAE------IGLPEITVH-FTGADVRLSPINAF---VKLSE 401

Query: 592  DYRCLTVKKTDDVHILGSRAQVDFEVKFDLSKKEKEVSF 476
            D  CL++  T +V I G+ AQ+DF V +DL  + + VSF
Sbjct: 402  DMVCLSMVPTTEVAIYGNFAQMDFLVGYDL--ETRTVSF 438

