BLASTX nr result
ID: Glycyrrhiza23_contig00007762
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00007762 (1703 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1... 640 0.0 ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1... 620 e-175 ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor,... 531 e-148 ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|2... 530 e-148 ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|2... 527 e-147 >ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max] Length = 453 Score = 640 bits (1651), Expect = 0.0 Identities = 340/467 (72%), Positives = 368/467 (78%), Gaps = 2/467 (0%) Frame = +2 Query: 191 LIMATLQQHCSSSSCVPLFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALR 370 ++MA L+ SS V L LL L V PTSSTSR + L GF+V LR Sbjct: 1 MVMAKLKH---PSSFVTLVALLLAVSLFVAPTSSTSRKTIL----KHHPYPTKGFRVMLR 53 Query: 371 HMDSDKNLTKLERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLM 550 H+DS KNLTKLERVQHGIKRGK+RLQRLNAMVLAA+T S QLEAPIHAGNGEYLM Sbjct: 54 HVDSGKNLTKLERVQHGIKRGKSRLQRLNAMVLAASTL---DSEDQLEAPIHAGNGEYLM 110 Query: 551 ELSIGTPPESYPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSA 730 EL+IGTPP SYPAVLDTGSDLIWTQCKPC+QCYKQPTPIFDP +LCSA Sbjct: 111 ELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSA 170 Query: 731 LPSSTCSSDGCNYVYSYGDYSMTQGVLATETFTFGD--DKVSIKNIGFGCGEDNEGDGFE 904 +PSSTCS DGC YVYSYGDYSMTQGVLATETFTFG +KVS+ NIGFGCGEDNEGDGFE Sbjct: 171 VPSSTCS-DGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFE 229 Query: 905 QASGLVGLGRGPLSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPL 1084 QASGLVGLGRGPLSLVSQLKEP+FSYCLTPM D+K S+LLLGSL K+ VTTPL Sbjct: 230 QASGLVGLGRGPLSLVSQLKEPRFSYCLTPMDDTKESILLLGSLGKVKDAKE--VVTTPL 287 Query: 1085 LTNPSQPSFYYLSLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDAL 1264 L NP QPSFYYLSLEGISVGDTRLSIEKSTFE YIE+ AF+AL Sbjct: 288 LKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEAL 347 Query: 1265 KKELISQTKLPLDKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADS 1444 KKE ISQTKLPLDK+ +TGLD+CF+LPSG +TQVEIPK+VFHFKGGDLELPAENYMI DS Sbjct: 348 KKEFISQTKLPLDKTSSTGLDLCFSLPSG-STQVEIPKIVFHFKGGDLELPAENYMIGDS 406 Query: 1445 GLGVACLAMGASNGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1585 LGVACLAMGAS+GMSI GNVQQQNILVNHDL+K TISFVPT CDQL Sbjct: 407 NLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCDQL 453 >ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max] Length = 453 Score = 620 bits (1600), Expect = e-175 Identities = 324/446 (72%), Positives = 359/446 (80%), Gaps = 2/446 (0%) Frame = +2 Query: 254 LLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTKLERVQHGIKRG 433 L++A L + PTSSTSR +S GF+V LRH+DS KNLTKLERVQHGIKRG Sbjct: 16 LVLACLFIAPTSSTSRKTSFKQQHPCPTTN--GFRVMLRHVDSGKNLTKLERVQHGIKRG 73 Query: 434 KTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPESYPAVLDTGSDL 613 K+RLQ+LNAMVLAA++T + S QLEAPIHAGNGEYL+EL+IGTPP SYPAVLDTGSDL Sbjct: 74 KSRLQKLNAMVLAASSTPD--SEDQLEAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDL 131 Query: 614 IWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDGCNYVYSYGDYS 793 IWTQCKPC++CYKQPTPIFDP +LCSALPSSTCS DGC YVYSYGDYS Sbjct: 132 IWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTCS-DGCEYVYSYGDYS 190 Query: 794 MTQGVLATETFTFGD--DKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE 967 MTQGVLATETFTFG +KVS+ NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE Sbjct: 191 MTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE 250 Query: 968 PKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYLSLEGISVGD 1147 +FSYCLTP+ D+K S+LLLGSL K+ VTTPLL NP QPSFYYLSLE ISVGD Sbjct: 251 QRFSYCLTPIDDTKESVLLLGSLGKVKDAKE--VVTTPLLKNPLQPSFYYLSLEAISVGD 308 Query: 1148 TRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPLDKSGTTGLD 1327 TRLSIEKSTFE Y+++ A++ALKKE ISQTKL LDK+ +TGLD Sbjct: 309 TRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLD 368 Query: 1328 VCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGASNGMSIIGNV 1507 +CF+LPSG +TQVEIPKLVFHFKGGDLELPAENYMI DS LGVACLAMGAS+GMSI GNV Sbjct: 369 LCFSLPSG-STQVEIPKLVFHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNV 427 Query: 1508 QQQNILVNHDLQKGTISFVPTQCDQL 1585 QQQNILVNHDL+K TISFVPT CDQL Sbjct: 428 QQQNILVNHDLEKETISFVPTSCDQL 453 >ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length = 442 Score = 531 bits (1367), Expect = e-148 Identities = 270/448 (60%), Positives = 328/448 (73%) Frame = +2 Query: 242 LFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTKLERVQHG 421 + ++LL+ L V P STSR + GF++ L+H+DSDKNLTK +R+QHG Sbjct: 11 VLLSLLILSLSVYPAFSTSRRA-----LSYPAQLKNGFRITLKHVDSDKNLTKFQRIQHG 65 Query: 422 IKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPESYPAVLDT 601 IKR RL+RLNAMVLAA SS ++ +P+ +GNGE+LM L+IGTPPE+Y A++DT Sbjct: 66 IKRANHRLERLNAMVLAA------SSNAEINSPVLSGNGEFLMNLAIGTPPETYSAIMDT 119 Query: 602 GSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDGCNYVYSY 781 GSDLIWTQCKPC+QC+ QP+PIFDP LC ALP S+CS D C Y+Y+Y Sbjct: 120 GSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCS-DSCEYLYTY 178 Query: 782 GDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQL 961 GDYS TQG +ATETFTFG KVSI N+GFGCGEDNEGDGF Q SGLVGLGRGPLSLVSQL Sbjct: 179 GDYSSTQGTMATETFTFG--KVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQL 236 Query: 962 KEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYLSLEGISV 1141 KE KFSYCLT + D+KTS LL+GSLA+ S ++ TTPL+ NP QPSFYYLSLEGISV Sbjct: 237 KEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIR-TTPLIQNPLQPSFYYLSLEGISV 295 Query: 1142 GDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPLDKSGTTG 1321 G TRL I++STF+ Y+EE+AFD +KKE SQ LP+D SG TG Sbjct: 296 GGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATG 355 Query: 1322 LDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGASNGMSIIG 1501 L++C+ LPS DT+++E+PKLV HF G DLELP ENYMIADS +GV CLAMG+S GMSI G Sbjct: 356 LELCYNLPS-DTSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFG 414 Query: 1502 NVQQQNILVNHDLQKGTISFVPTQCDQL 1585 NVQQQN+ V+HDL+K T+SF+PT C QL Sbjct: 415 NVQQQNMFVSHDLEKETLSFLPTNCGQL 442 >ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa] Length = 439 Score = 530 bits (1366), Expect = e-148 Identities = 271/455 (59%), Positives = 327/455 (71%) Frame = +2 Query: 221 SSSSCVPLFVTLLVAGLLVDPTSSTSRGSSLLXXXXXXXXXXAGFQVALRHMDSDKNLTK 400 ++ S + L V L + + STSR GF+ L+H+DS KNLTK Sbjct: 2 ANMSSLSLVVALAIFAFVFSHAFSTSR------RVLEHPKVQNGFRAKLKHVDSGKNLTK 55 Query: 401 LERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHAGNGEYLMELSIGTPPES 580 ER+QHG+KRG+ RLQR AM L A SS +++AP+ GNGE+LM+L+IGTPPE+ Sbjct: 56 FERIQHGVKRGRHRLQRFKAMALVA------SSNSEIDAPVLPGNGEFLMKLAIGTPPET 109 Query: 581 YPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXXXXNLCSALPSSTCSSDG 760 Y A++DTGSDLIWTQCKPC+QC+ QPTPIFDP LC ALP STCS DG Sbjct: 110 YSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCS-DG 168 Query: 761 CNYVYSYGDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNEGDGFEQASGLVGLGRGP 940 C Y+Y YGDYS TQG+LA+ET TFG KVS+ + FGCGEDNEG GF Q SGLVGLGRGP Sbjct: 169 CEYLYGYGDYSSTQGMLASETLTFG--KVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGP 226 Query: 941 LSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLTVTTPLLTNPSQPSFYYL 1120 LSLVSQLKEPKFSYCLT + D+K S LL+GSLA+ K S+S TTPL+ N +QPSFYYL Sbjct: 227 LSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLASVKA-SDSEIKTTPLIQNSAQPSFYYL 285 Query: 1121 SLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEENAFDALKKELISQTKLPL 1300 SLEGISVGDT L I+KSTF Y+E++AFD + KE SQ LP+ Sbjct: 286 SLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPV 345 Query: 1301 DKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENYMIADSGLGVACLAMGAS 1480 D SG+TGL+VCFTLPSG +T +E+PKLVFHF G DLELPAENYMIAD+ +GVACLAMG+S Sbjct: 346 DNSGSTGLEVCFTLPSG-STDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMGSS 404 Query: 1481 NGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1585 +GMSI GN+QQQN+LV HDL+K T+SF+PTQCD+L Sbjct: 405 SGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439 >ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa] Length = 439 Score = 527 bits (1358), Expect = e-147 Identities = 261/412 (63%), Positives = 317/412 (76%) Frame = +2 Query: 350 GFQVALRHMDSDKNLTKLERVQHGIKRGKTRLQRLNAMVLAATTTTEDSSTQQLEAPIHA 529 GF+V L+H+DS KNLTKLER++HG+KRG+ RLQRL AM L A SS+ ++EAP+ Sbjct: 39 GFRVRLKHVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVA------SSSSEIEAPVLP 92 Query: 530 GNGEYLMELSIGTPPESYPAVLDTGSDLIWTQCKPCSQCYKQPTPIFDPXXXXXXXXXXX 709 GNGE+LM+L+IGTPPE+Y A+LDTGSDLIWTQCKPC+QC+ Q TPIFDP Sbjct: 93 GNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSC 152 Query: 710 XXNLCSALPSSTCSSDGCNYVYSYGDYSMTQGVLATETFTFGDDKVSIKNIGFGCGEDNE 889 LC ALP S+C++ GC Y+YSYGDYS TQG+LA+ET TFG K S+ N+ FGCG DNE Sbjct: 153 SSQLCEALPQSSCNN-GCEYLYSYGDYSSTQGILASETLTFG--KASVPNVAFGCGADNE 209 Query: 890 GDGFEQASGLVGLGRGPLSLVSQLKEPKFSYCLTPMGDSKTSLLLLGSLANTKEDSNSLT 1069 G GF Q +GLVGLGRGPLSLVSQLKEPKFSYCLT + D+KTS LL+GSLA+ S+++ Sbjct: 210 GSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLASVNASSSAIK 269 Query: 1070 VTTPLLTNPSQPSFYYLSLEGISVGDTRLSIEKSTFEXXXXXXXXXXXXXXXXXXYIEEN 1249 TTPL+ +P+ PSFYYLSLEGISVGDTRL I+KSTF Y+EE+ Sbjct: 270 -TTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEES 328 Query: 1250 AFDALKKELISQTKLPLDKSGTTGLDVCFTLPSGDTTQVEIPKLVFHFKGGDLELPAENY 1429 AF+ + KE ++ LP+D SG+TGLDVCFTLPSG +T +E+PKLVFHF G DLELPAENY Sbjct: 329 AFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSG-STNIEVPKLVFHFDGADLELPAENY 387 Query: 1430 MIADSGLGVACLAMGASNGMSIIGNVQQQNILVNHDLQKGTISFVPTQCDQL 1585 MI DS +GVACLAMG+S+GMSI GNVQQQN+LV HDL+K T+SF+PTQCD L Sbjct: 388 MIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQCDLL 439