BLASTX nr result
ID: Glycyrrhiza24_contig00005633
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza24_contig00005633 (1579 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1... 640 0.0 ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1... 620 e-175 ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor,... 531 e-148 ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|2... 530 e-148 ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|2... 527 e-147 >ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max] Length = 453 Score = 640 bits (1651), Expect = 0.0 Identities = 340/467 (72%), Positives = 368/467 (78%), Gaps = 2/467 (0%) Frame = +3 Query: 54 LIMATLQQHCSSSSCVPLFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALR 233 ++MA L+ SS V L LL L V PTSSTSR + L GF+V LR Sbjct: 1 MVMAKLKH---PSSFVTLVALLLAVSLFVAPTSSTSRKTIL----KHHPYPTKGFRVMLR 53 Query: 234 HMDSDKNLTKLERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLM 413 H+DS KNLTKLERVQHGIKRGK+RLQRLNAMVLAA+T S QLEAPIHAGNGEYLM Sbjct: 54 HVDSGKNLTKLERVQHGIKRGKSRLQRLNAMVLAASTL---DSEDQLEAPIHAGNGEYLM 110 Query: 414 ELSIGTPPESYPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSA 593 EL+IGTPP SYPAVLDTGSDLIWTQCKPC+QCYKQPTPIFDP +LCSA Sbjct: 111 ELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSA 170 Query: 594 LPSSTCSSDGCNYVYSYGDYSMTQGVLATETFTFGD--DKVSIKNIGFGCGEDNEGDGFE 767 +PSSTCS DGC YVYSYGDYSMTQGVLATETFTFG +KVS+ NIGFGCGEDNEGDGFE Sbjct: 171 VPSSTCS-DGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFE 229 Query: 768 QASGLVGLGRGPLSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPL 947 QASGLVGLGRGPLSLVSQLKEP+FSYCLTPM D+K S+LLLGSL K+ VTTPL Sbjct: 230 QASGLVGLGRGPLSLVSQLKEPRFSYCLTPMDDTKESILLLGSLGKVKDAKE--VVTTPL 287 Query: 948 LTNPSQPSFYYLSLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDAL 1127 L NP QPSFYYLSLEGISVGDTRLSIEKSTFE YIE+ AF+AL Sbjct: 288 LKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEAL 347 Query: 1128 KKELISQTKLPLDKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADS 1307 KKE ISQTKLPLDK+ +TGLD+CF+LPSG +TQVEIPK+VFHFKGGDLELPAENYMI DS Sbjct: 348 KKEFISQTKLPLDKTSSTGLDLCFSLPSG-STQVEIPKIVFHFKGGDLELPAENYMIGDS 406 Query: 1308 GLGVACLAMGASNGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1448 LGVACLAMGAS+GMSI GNVQQQNILVNHDL+K TISFVPT CDQL Sbjct: 407 NLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCDQL 453 >ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max] Length = 453 Score = 620 bits (1600), Expect = e-175 Identities = 324/446 (72%), Positives = 359/446 (80%), Gaps = 2/446 (0%) Frame = +3 Query: 117 LLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTKLERVQHGIKRG 296 L++A L + PTSSTSR +S GF+V LRH+DS KNLTKLERVQHGIKRG Sbjct: 16 LVLACLFIAPTSSTSRKTSFKQQHPCPTTN--GFRVMLRHVDSGKNLTKLERVQHGIKRG 73 Query: 297 KTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPESYPAVLDTGSDL 476 K+RLQ+LNAMVLAA++T + S QLEAPIHAGNGEYL+EL+IGTPP SYPAVLDTGSDL Sbjct: 74 KSRLQKLNAMVLAASSTPD--SEDQLEAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDL 131 Query: 477 IWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDGCNYVYSYGDYS 656 IWTQCKPC++CYKQPTPIFDP +LCSALPSSTCS DGC YVYSYGDYS Sbjct: 132 IWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTCS-DGCEYVYSYGDYS 190 Query: 657 MTQGVLATETFTFGD--DKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE 830 MTQGVLATETFTFG +KVS+ NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE Sbjct: 191 MTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE 250 Query: 831 PKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYLSLEGISVGD 1010 +FSYCLTP+ D+K S+LLLGSL K+ VTTPLL NP QPSFYYLSLE ISVGD Sbjct: 251 QRFSYCLTPIDDTKESVLLLGSLGKVKDAKE--VVTTPLLKNPLQPSFYYLSLEAISVGD 308 Query: 1011 TRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPLDKSGTTGLD 1190 TRLSIEKSTFE Y+++ A++ALKKE ISQTKL LDK+ +TGLD Sbjct: 309 TRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLD 368 Query: 1191 VCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGASNGMSIIGNV 1370 +CF+LPSG +TQVEIPKLVFHFKGGDLELPAENYMI DS LGVACLAMGAS+GMSI GNV Sbjct: 369 LCFSLPSG-STQVEIPKLVFHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNV 427 Query: 1371 QQQNILVNHDLQKGTISFVPTQCDQL 1448 QQQNILVNHDL+K TISFVPT CDQL Sbjct: 428 QQQNILVNHDLEKETISFVPTSCDQL 453 >ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length = 442 Score = 531 bits (1367), Expect = e-148 Identities = 270/448 (60%), Positives = 328/448 (73%) Frame = +3 Query: 105 LFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTKLERVQHG 284 + ++LL+ L V P STSR + GF++ L+H+DSDKNLTK +R+QHG Sbjct: 11 VLLSLLILSLSVYPAFSTSRRA-----LSYPAQLKNGFRITLKHVDSDKNLTKFQRIQHG 65 Query: 285 IKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPESYPAVLDT 464 IKR RL+RLNAMVLAA SS ++ +P+ +GNGE+LM L+IGTPPE+Y A++DT Sbjct: 66 IKRANHRLERLNAMVLAA------SSNAEINSPVLSGNGEFLMNLAIGTPPETYSAIMDT 119 Query: 465 GSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDGCNYVYSY 644 GSDLIWTQCKPC+QC+ QP+PIFDP LC ALP S+CS D C Y+Y+Y Sbjct: 120 GSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCS-DSCEYLYTY 178 Query: 645 GDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQL 824 GDYS TQG +ATETFTFG KVSI N+GFGCGEDNEGDGF Q SGLVGLGRGPLSLVSQL Sbjct: 179 GDYSSTQGTMATETFTFG--KVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQL 236 Query: 825 KEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYLSLEGISV 1004 KE KFSYCLT + D+KTS LL+GSLA+ S ++ TTPL+ NP QPSFYYLSLEGISV Sbjct: 237 KEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIR-TTPLIQNPLQPSFYYLSLEGISV 295 Query: 1005 GDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPLDKSGTTG 1184 G TRL I++STF+ Y+EE+AFD +KKE SQ LP+D SG TG Sbjct: 296 GGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATG 355 Query: 1185 LDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGASNGMSIIG 1364 L++C+ LPS DT+++E+PKLV HF G DLELP ENYMIADS +GV CLAMG+S GMSI G Sbjct: 356 LELCYNLPS-DTSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFG 414 Query: 1365 NVQQQNILVNHDLQKGTISFVPTQCDQL 1448 NVQQQN+ V+HDL+K T+SF+PT C QL Sbjct: 415 NVQQQNMFVSHDLEKETLSFLPTNCGQL 442 >ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa] Length = 439 Score = 530 bits (1366), Expect = e-148 Identities = 271/455 (59%), Positives = 327/455 (71%) Frame = +3 Query: 84 SSSSCVPLFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTK 263 ++ S + L V L + + STSR GF+ L+H+DS KNLTK Sbjct: 2 ANMSSLSLVVALAIFAFVFSHAFSTSR------RVLEHPKVQNGFRAKLKHVDSGKNLTK 55 Query: 264 LERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPES 443 ER+QHG+KRG+ RLQR AM L A SS +++AP+ GNGE+LM+L+IGTPPE+ Sbjct: 56 FERIQHGVKRGRHRLQRFKAMALVA------SSNSEIDAPVLPGNGEFLMKLAIGTPPET 109 Query: 444 YPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDG 623 Y A++DTGSDLIWTQCKPC+QC+ QPTPIFDP LC ALP STCS DG Sbjct: 110 YSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCS-DG 168 Query: 624 CNYVYSYGDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGP 803 C Y+Y YGDYS TQG+LA+ET TFG KVS+ + FGCGEDNEG GF Q SGLVGLGRGP Sbjct: 169 CEYLYGYGDYSSTQGMLASETLTFG--KVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGP 226 Query: 804 LSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYL 983 LSLVSQLKEPKFSYCLT + D+K S LL+GSLA+ K S+S TTPL+ N +QPSFYYL Sbjct: 227 LSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLASVKA-SDSEIKTTPLIQNSAQPSFYYL 285 Query: 984 SLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPL 1163 SLEGISVGDT L I+KSTF Y+E++AFD + KE SQ LP+ Sbjct: 286 SLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPV 345 Query: 1164 DKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGAS 1343 D SG+TGL+VCFTLPSG +T +E+PKLVFHF G DLELPAENYMIAD+ +GVACLAMG+S Sbjct: 346 DNSGSTGLEVCFTLPSG-STDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMGSS 404 Query: 1344 NGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1448 +GMSI GN+QQQN+LV HDL+K T+SF+PTQCD+L Sbjct: 405 SGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439 >ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa] Length = 439 Score = 527 bits (1358), Expect = e-147 Identities = 261/412 (63%), Positives = 317/412 (76%) Frame = +3 Query: 213 GFQVALRHMDSDKNLTKLERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHA 392 GF+V L+H+DS KNLTKLER++HG+KRG+ RLQRL AM L A SS+ ++EAP+ Sbjct: 39 GFRVRLKHVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVA------SSSSEIEAPVLP 92 Query: 393 GNGEYLMELSIGTPPESYPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXX 572 GNGE+LM+L+IGTPPE+Y A+LDTGSDLIWTQCKPC+QC+ Q TPIFDP Sbjct: 93 GNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSC 152 Query: 573 XXNLCSALPSSTCSSDGCNYVYSYGDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNE 752 LC ALP S+C++ GC Y+YSYGDYS TQG+LA+ET TFG K S+ N+ FGCG DNE Sbjct: 153 SSQLCEALPQSSCNN-GCEYLYSYGDYSSTQGILASETLTFG--KASVPNVAFGCGADNE 209 Query: 753 GDGFEQASGLVGLGRGPLSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLT 932 G GF Q +GLVGLGRGPLSLVSQLKEPKFSYCLT + D+KTS LL+GSLA+ S+++ Sbjct: 210 GSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLASVNASSSAIK 269 Query: 933 VTTPLLTNPSQPSFYYLSLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEEN 1112 TTPL+ +P+ PSFYYLSLEGISVGDTRL I+KSTF Y+EE+ Sbjct: 270 -TTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEES 328 Query: 1113 AFDALKKELISQTKLPLDKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENY 1292 AF+ + KE ++ LP+D SG+TGLDVCFTLPSG +T +E+PKLVFHF G DLELPAENY Sbjct: 329 AFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSG-STNIEVPKLVFHFDGADLELPAENY 387 Query: 1293 MIADSGLGVACLAMGASNGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1448 MI DS +GVACLAMG+S+GMSI GNVQQQN+LV HDL+K T+SF+PTQCD L Sbjct: 388 MIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQCDLL 439