BLASTX nr result

ID: Glycyrrhiza23_contig00007762 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00007762
         (1703 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1...   640   0.0  
ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1...   620   e-175
ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor,...   531   e-148
ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|2...   530   e-148
ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|2...   527   e-147

>ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  640 bits (1651), Expect = 0.0
 Identities = 340/467 (72%), Positives = 368/467 (78%), Gaps = 2/467 (0%)
 Frame = +2

Query: 191  LIMATLQQHCSSSSCVPLFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALR 370
            ++MA L+     SS V L   LL   L V PTSSTSR + L            GF+V LR
Sbjct: 1    MVMAKLKH---PSSFVTLVALLLAVSLFVAPTSSTSRKTIL----KHHPYPTKGFRVMLR 53

Query: 371  HMDSDKNLTKLERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLM 550
            H+DS KNLTKLERVQHGIKRGK+RLQRLNAMVLAA+T     S  QLEAPIHAGNGEYLM
Sbjct: 54   HVDSGKNLTKLERVQHGIKRGKSRLQRLNAMVLAASTL---DSEDQLEAPIHAGNGEYLM 110

Query: 551  ELSIGTPPESYPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSA 730
            EL+IGTPP SYPAVLDTGSDLIWTQCKPC+QCYKQPTPIFDP             +LCSA
Sbjct: 111  ELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSA 170

Query: 731  LPSSTCSSDGCNYVYSYGDYSMTQGVLATETFTFGD--DKVSIKNIGFGCGEDNEGDGFE 904
            +PSSTCS DGC YVYSYGDYSMTQGVLATETFTFG   +KVS+ NIGFGCGEDNEGDGFE
Sbjct: 171  VPSSTCS-DGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFE 229

Query: 905  QASGLVGLGRGPLSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPL 1084
            QASGLVGLGRGPLSLVSQLKEP+FSYCLTPM D+K S+LLLGSL   K+      VTTPL
Sbjct: 230  QASGLVGLGRGPLSLVSQLKEPRFSYCLTPMDDTKESILLLGSLGKVKDAKE--VVTTPL 287

Query: 1085 LTNPSQPSFYYLSLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDAL 1264
            L NP QPSFYYLSLEGISVGDTRLSIEKSTFE                  YIE+ AF+AL
Sbjct: 288  LKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEAL 347

Query: 1265 KKELISQTKLPLDKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADS 1444
            KKE ISQTKLPLDK+ +TGLD+CF+LPSG +TQVEIPK+VFHFKGGDLELPAENYMI DS
Sbjct: 348  KKEFISQTKLPLDKTSSTGLDLCFSLPSG-STQVEIPKIVFHFKGGDLELPAENYMIGDS 406

Query: 1445 GLGVACLAMGASNGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1585
             LGVACLAMGAS+GMSI GNVQQQNILVNHDL+K TISFVPT CDQL
Sbjct: 407  NLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCDQL 453


>ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  620 bits (1600), Expect = e-175
 Identities = 324/446 (72%), Positives = 359/446 (80%), Gaps = 2/446 (0%)
 Frame = +2

Query: 254  LLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTKLERVQHGIKRG 433
            L++A L + PTSSTSR +S             GF+V LRH+DS KNLTKLERVQHGIKRG
Sbjct: 16   LVLACLFIAPTSSTSRKTSFKQQHPCPTTN--GFRVMLRHVDSGKNLTKLERVQHGIKRG 73

Query: 434  KTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPESYPAVLDTGSDL 613
            K+RLQ+LNAMVLAA++T +  S  QLEAPIHAGNGEYL+EL+IGTPP SYPAVLDTGSDL
Sbjct: 74   KSRLQKLNAMVLAASSTPD--SEDQLEAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDL 131

Query: 614  IWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDGCNYVYSYGDYS 793
            IWTQCKPC++CYKQPTPIFDP             +LCSALPSSTCS DGC YVYSYGDYS
Sbjct: 132  IWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTCS-DGCEYVYSYGDYS 190

Query: 794  MTQGVLATETFTFGD--DKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE 967
            MTQGVLATETFTFG   +KVS+ NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE
Sbjct: 191  MTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE 250

Query: 968  PKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYLSLEGISVGD 1147
             +FSYCLTP+ D+K S+LLLGSL   K+      VTTPLL NP QPSFYYLSLE ISVGD
Sbjct: 251  QRFSYCLTPIDDTKESVLLLGSLGKVKDAKE--VVTTPLLKNPLQPSFYYLSLEAISVGD 308

Query: 1148 TRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPLDKSGTTGLD 1327
            TRLSIEKSTFE                  Y+++ A++ALKKE ISQTKL LDK+ +TGLD
Sbjct: 309  TRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLD 368

Query: 1328 VCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGASNGMSIIGNV 1507
            +CF+LPSG +TQVEIPKLVFHFKGGDLELPAENYMI DS LGVACLAMGAS+GMSI GNV
Sbjct: 369  LCFSLPSG-STQVEIPKLVFHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNV 427

Query: 1508 QQQNILVNHDLQKGTISFVPTQCDQL 1585
            QQQNILVNHDL+K TISFVPT CDQL
Sbjct: 428  QQQNILVNHDLEKETISFVPTSCDQL 453


>ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537841|gb|EEF39457.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 442

 Score =  531 bits (1367), Expect = e-148
 Identities = 270/448 (60%), Positives = 328/448 (73%)
 Frame = +2

Query: 242  LFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTKLERVQHG 421
            + ++LL+  L V P  STSR +              GF++ L+H+DSDKNLTK +R+QHG
Sbjct: 11   VLLSLLILSLSVYPAFSTSRRA-----LSYPAQLKNGFRITLKHVDSDKNLTKFQRIQHG 65

Query: 422  IKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPESYPAVLDT 601
            IKR   RL+RLNAMVLAA      SS  ++ +P+ +GNGE+LM L+IGTPPE+Y A++DT
Sbjct: 66   IKRANHRLERLNAMVLAA------SSNAEINSPVLSGNGEFLMNLAIGTPPETYSAIMDT 119

Query: 602  GSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDGCNYVYSY 781
            GSDLIWTQCKPC+QC+ QP+PIFDP              LC ALP S+CS D C Y+Y+Y
Sbjct: 120  GSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCS-DSCEYLYTY 178

Query: 782  GDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQL 961
            GDYS TQG +ATETFTFG  KVSI N+GFGCGEDNEGDGF Q SGLVGLGRGPLSLVSQL
Sbjct: 179  GDYSSTQGTMATETFTFG--KVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQL 236

Query: 962  KEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYLSLEGISV 1141
            KE KFSYCLT + D+KTS LL+GSLA+    S ++  TTPL+ NP QPSFYYLSLEGISV
Sbjct: 237  KEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIR-TTPLIQNPLQPSFYYLSLEGISV 295

Query: 1142 GDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPLDKSGTTG 1321
            G TRL I++STF+                  Y+EE+AFD +KKE  SQ  LP+D SG TG
Sbjct: 296  GGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATG 355

Query: 1322 LDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGASNGMSIIG 1501
            L++C+ LPS DT+++E+PKLV HF G DLELP ENYMIADS +GV CLAMG+S GMSI G
Sbjct: 356  LELCYNLPS-DTSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFG 414

Query: 1502 NVQQQNILVNHDLQKGTISFVPTQCDQL 1585
            NVQQQN+ V+HDL+K T+SF+PT C QL
Sbjct: 415  NVQQQNMFVSHDLEKETLSFLPTNCGQL 442


>ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|222870144|gb|EEF07275.1|
            predicted protein [Populus trichocarpa]
          Length = 439

 Score =  530 bits (1366), Expect = e-148
 Identities = 271/455 (59%), Positives = 327/455 (71%)
 Frame = +2

Query: 221  SSSSCVPLFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTK 400
            ++ S + L V L +   +     STSR                GF+  L+H+DS KNLTK
Sbjct: 2    ANMSSLSLVVALAIFAFVFSHAFSTSR------RVLEHPKVQNGFRAKLKHVDSGKNLTK 55

Query: 401  LERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPES 580
             ER+QHG+KRG+ RLQR  AM L A      SS  +++AP+  GNGE+LM+L+IGTPPE+
Sbjct: 56   FERIQHGVKRGRHRLQRFKAMALVA------SSNSEIDAPVLPGNGEFLMKLAIGTPPET 109

Query: 581  YPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDG 760
            Y A++DTGSDLIWTQCKPC+QC+ QPTPIFDP              LC ALP STCS DG
Sbjct: 110  YSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCS-DG 168

Query: 761  CNYVYSYGDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGP 940
            C Y+Y YGDYS TQG+LA+ET TFG  KVS+  + FGCGEDNEG GF Q SGLVGLGRGP
Sbjct: 169  CEYLYGYGDYSSTQGMLASETLTFG--KVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGP 226

Query: 941  LSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYL 1120
            LSLVSQLKEPKFSYCLT + D+K S LL+GSLA+ K  S+S   TTPL+ N +QPSFYYL
Sbjct: 227  LSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLASVKA-SDSEIKTTPLIQNSAQPSFYYL 285

Query: 1121 SLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPL 1300
            SLEGISVGDT L I+KSTF                   Y+E++AFD + KE  SQ  LP+
Sbjct: 286  SLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPV 345

Query: 1301 DKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGAS 1480
            D SG+TGL+VCFTLPSG +T +E+PKLVFHF G DLELPAENYMIAD+ +GVACLAMG+S
Sbjct: 346  DNSGSTGLEVCFTLPSG-STDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMGSS 404

Query: 1481 NGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1585
            +GMSI GN+QQQN+LV HDL+K T+SF+PTQCD+L
Sbjct: 405  SGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439


>ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|222847473|gb|EEE85020.1|
            predicted protein [Populus trichocarpa]
          Length = 439

 Score =  527 bits (1358), Expect = e-147
 Identities = 261/412 (63%), Positives = 317/412 (76%)
 Frame = +2

Query: 350  GFQVALRHMDSDKNLTKLERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHA 529
            GF+V L+H+DS KNLTKLER++HG+KRG+ RLQRL AM L A      SS+ ++EAP+  
Sbjct: 39   GFRVRLKHVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVA------SSSSEIEAPVLP 92

Query: 530  GNGEYLMELSIGTPPESYPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXX 709
            GNGE+LM+L+IGTPPE+Y A+LDTGSDLIWTQCKPC+QC+ Q TPIFDP           
Sbjct: 93   GNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSC 152

Query: 710  XXNLCSALPSSTCSSDGCNYVYSYGDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNE 889
               LC ALP S+C++ GC Y+YSYGDYS TQG+LA+ET TFG  K S+ N+ FGCG DNE
Sbjct: 153  SSQLCEALPQSSCNN-GCEYLYSYGDYSSTQGILASETLTFG--KASVPNVAFGCGADNE 209

Query: 890  GDGFEQASGLVGLGRGPLSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLT 1069
            G GF Q +GLVGLGRGPLSLVSQLKEPKFSYCLT + D+KTS LL+GSLA+    S+++ 
Sbjct: 210  GSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLASVNASSSAIK 269

Query: 1070 VTTPLLTNPSQPSFYYLSLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEEN 1249
             TTPL+ +P+ PSFYYLSLEGISVGDTRL I+KSTF                   Y+EE+
Sbjct: 270  -TTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEES 328

Query: 1250 AFDALKKELISQTKLPLDKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENY 1429
            AF+ + KE  ++  LP+D SG+TGLDVCFTLPSG +T +E+PKLVFHF G DLELPAENY
Sbjct: 329  AFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSG-STNIEVPKLVFHFDGADLELPAENY 387

Query: 1430 MIADSGLGVACLAMGASNGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1585
            MI DS +GVACLAMG+S+GMSI GNVQQQN+LV HDL+K T+SF+PTQCD L
Sbjct: 388  MIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQCDLL 439


Top