BLASTX nr result
ID: Glycyrrhiza23_contig00000417
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00000417
         (1817 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor,...   112   3e-22
gb|ABK28718.1| unknown [Arabidopsis thaliana]                         112   4e-22
ref|NP_198319.1| aspartyl protease family protein [Arabidopsis t...   112   4e-22
ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp....   107   8e-21
ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis tha...   107   1e-20

>ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 449

 Score = 112 bits (280), Expect = 3e-22
 Identities = 115/415 (27%), Positives = 186/415 (44%), Gaps = 26/415 (6%)
 Frame = +3

Query: 294  YLMSLQVRTEDNKFVKAYATPDTGSDLIWLE--PTCKTTTANACIKEPETPFK-----CG 452
            YLM + + N V+ A DTGSDLIW++ P N+ I +P CG
Sbjct: 93   YLMRISI---GNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCG 149

Query: 453  DGDEDEYCKKMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYL-----GKGTFSD 617
            + E+C K L EA+ ++ + CGY Y D++ +G+L G G+ +
Sbjct: 150  N----EFCNK----LDGEARSCDARGFV-KTCGYTYSYGDQSFSDGHLAIERFGIGSTNS 200

Query: 618  SHDQKL---ENMEYGVST---GTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYC 779
            + + + + +G T GT ++ G++GLG G +SL QL ++ KFSYC
Sbjct: 201  NTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLG----PKLSGKFSYC 256

Query: 780  LPQYEKKVDSNKNAQYATGKLVFGSQVNTNPET----STPLLDENPEEKAKDPGKKAEDY 947
            L V +++ + Y T K+ FG+ +N + STPLL + PE
Sbjct: 257  L------VPTSEQSNY-TSKINFGNDINISGSNYNVVSTPLLPKKPET------------ 297

Query: 948  CKTRYYCVNLTSIKVDGRQGILVKDTATTEV----MIIDSGSTFTSLRGELFKEFLKRVE 1115
            YY + L +I V+ ++ + + EV +IIDSG+T T L E F VE
Sbjct: 298  ----YYYLTLEAISVENKR-LPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVE 352

Query: 1116 QQIGDKEEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKK 1295
            + + + + CF A +L ++ F G VEL+ N F + ++
Sbjct: 353  EAVKGERVSDPHGLFNICFKDEKAIELPIITAHFTGADVELQPVNTFAKV--------EE 404

Query: 1296 DYLCLTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSFVKVETCNQ 1460
            D LC T+ ++ + I G+ AQM+F V +D+ K+ VSF+ + Q
Sbjct: 405  DLLCFTMIPSND----------IAIFGNLAQMNFLVGYDLEKKAVSFLPTDCTKQ 449

>gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score = 112 bits (279), Expect = 4e-22
 Identities = 109/403 (27%), Positives = 175/403 (43%), Gaps = 12/403 (2%)
 Frame = +3

Query: 264  EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 443
            + ++T+ YLM++ + T + A DTGSDL+W + + C + + F
Sbjct: 80   QIDLTSNSGEYLMNVSIGTPPFPIM---AIADTGSDLLWTQ----CAPCDDCYTQVDPLF 132

Query: 444  --KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTF- 611
            K +D C + L +A C ST+ D C Y + Y D + +G + T
Sbjct: 133  DPKTSSTYKDVSCSSSQCTALENQASC--STN--DNTCSYSLSYGDNSYTKGNIAVDTLT 188

Query: 612  ---SDSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFS 773
            SD+ +L+N+ G + GT K G+VGLG G +SL +QL +S ++ KFS
Sbjct: 189  LGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDS----IDGKFS 244

Query: 774  YCLPQYEKKVDSNKNAQYATGKLVFGSQVNTNPETSTPLLDENPEEKAKDPGKKAEDYCK 953
            YCL K D + T +V GS V STPL+ + +E
Sbjct: 245  YCLVPLTSKKDQTSKINFGTNAIVSGSGV-----VSTPLIAKASQET------------- 286

Query: 954  TRYYCVNLTSIKVDGRQ-GILVKDTATTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIG 1127
            +Y + L SI V +Q D+ ++E +IIDSG+T T L E + E V I
Sbjct: 287  --FYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSID 344

Query: 1128 DKEEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLC 1307
            ++++ C+ K+ +++ F+G V+L N F + +D +C
Sbjct: 345  AEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSSNAFVQV--------SEDLVC 396

Query: 1308 LTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 1436
            + P+ I G+ AQM+F V +D + VSF
Sbjct: 397  FAFRG----------SPSFSIYGNVAQMNFLVGYDTVSKTVSF 429

>ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score = 112 bits (279), Expect = 4e-22
 Identities = 109/403 (27%), Positives = 175/403 (43%), Gaps = 12/403 (2%)
 Frame = +3

Query: 264  EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 443
            + ++T+ YLM++ + T + A DTGSDL+W + + C + + F
Sbjct: 80   QIDLTSNSGEYLMNVSIGTPPFPIM---AIADTGSDLLWTQ----CAPCDDCYTQVDPLF 132

Query: 444  --KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTF- 611
            K +D C + L +A C ST+ D C Y + Y D + +G + T
Sbjct: 133  DPKTSSTYKDVSCSSSQCTALENQASC--STN--DNTCSYSLSYGDNSYTKGNIAVDTLT 188

Query: 612  ---SDSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFS 773
            SD+ +L+N+ G + GT K G+VGLG G +SL +QL +S ++ KFS
Sbjct: 189  LGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDS----IDGKFS 244

Query: 774  YCLPQYEKKVDSNKNAQYATGKLVFGSQVNTNPETSTPLLDENPEEKAKDPGKKAEDYCK 953
            YCL K D + T +V GS V STPL+ + +E
Sbjct: 245  YCLVPLTSKKDQTSKINFGTNAIVSGSGV-----VSTPLIAKASQET------------- 286

Query: 954  TRYYCVNLTSIKVDGRQ-GILVKDTATTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIG 1127
            +Y + L SI V +Q D+ ++E +IIDSG+T T L E + E V I
Sbjct: 287  --FYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSID 344

Query: 1128 DKEEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLC 1307
            ++++ C+ K+ +++ F+G V+L N F + +D +C
Sbjct: 345  AEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSSNAFVQV--------SEDLVC 396

Query: 1308 LTVKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 1436
            + P+ I G+ AQM+F V +D + VSF
Sbjct: 397  FAFRG----------SPSFSIYGNVAQMNFLVGYDTVSKTVSF 429

>ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score = 107 bits (268), Expect = 8e-21
 Identities = 109/401 (27%), Positives = 172/401 (42%), Gaps = 10/401 (2%)
 Frame = +3

Query: 264  EFNVTALRHTYLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF 443
            + ++T+ YLM++ + T + A DTGSDL+W + CK +P
Sbjct: 84   QIDLTSNSGEYLMNISLGTPPFPIM---AIADTGSDLLWTQ--CKPCDDCYTQVDPLFDP 138

Query: 444  KCGDGDEDEYCKK-MWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS-- 614
            K +D C + L +A C ST+ D C Y Y DR+ +G + T +
Sbjct: 139  KASSTYKDVSCSSSQCTALENQASC--STE--DNTCSYSTSYGDRSYTKGNIAVDTLTLG 194

Query: 615  --DSHDQKLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYC 779
            D+ +L+N+ G + GT K G+VGLG G +SL QL +S ++ KFSYC
Sbjct: 195  STDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDS----IDGKFSYC 250

Query: 780  LPQYEKKVDSNKNAQYATGKLVFGSQVNTNPETSTPLLDENPEEKAKDPGKKAEDYCKTR 959
            L + D + T +V G+ V STPL+ ++ E
Sbjct: 251  LVPLTSENDRTSKINFGTNAVVSGTGV-----VSTPLIAKSQET---------------- 289

Query: 960  YYCVNLTSIKVDGRQGILV-KDTATTEV-MIIDSGSTFTSLRGELFKEFLKRVEQQIGDK 1133
            +Y + L SI V ++ D+ + E +IIDSG+T T L E + E V I +
Sbjct: 290  FYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAE 349

Query: 1134 EEKPISDDYMHCFLKGSADKLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLT 1313
            +++ C+ K+ +++ F+G V LK N F I +D +C
Sbjct: 350  KKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPSNCFVQI--------SEDLVCFA 401

Query: 1314 VKKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 1436
            + P+ I G+ AQM+F V +D + VSF
Sbjct: 402  FRG----------SPSFSIYGNVAQMNFLVGYDTVSKTVSF 432

>ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags: Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score = 107 bits (267), Expect = 1e-20
 Identities = 118/400 (29%), Positives = 167/400 (41%), Gaps = 19/400 (4%)
 Frame = +3

Query: 294  YLMSLQVRTEDNKFVKAYATPDTGSDLIWLEPTCKTTTANACIKEPETPF--KCGDGDED 467
            + MS+ + T +K +A DTGSDL W++ CK C KE F K +
Sbjct: 85   FFMSITIGTPP---IKVFAIADTGSDLTWVQ--CKP--CQQCYKENGPIFDKKKSSTYKS 137

Query: 468  EYC--KKMWSYLGMEAKCIESTDHKDEYCGYKIVYKDRAGYEGYLGKGTFS----DSHDQ 629
            E C + + E C ES + C Y+ Y D++ +G + T S
Sbjct: 138  EPCDSRNCQALSSTERGCDESNN----ICKYRYSYGDQSFSKGDVATETVSIDSASGSPV 193

Query: 630  KLENMEYGV---STGTKEKNSKGVVGLGRGELSLFQQLNNSARARVEFKFSYCLPQYEKK 800
            +G + GT ++ G++GLG G LSL QL +S + KFSYCL
Sbjct: 194  SFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSS----ISKKFSYCLSHKSAT 249

Query: 801  VDSNKNAQYATGKLVFGSQVNTNPETSTPLLDENP------EEKAKDPGKKAEDYCKTRY 962
            + T + S + STPL+D+ P +A GKK Y + Y
Sbjct: 250  TNGTSVINLGTNSIP-SSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSY 308

Query: 963  YCVNLTSIKVDGRQGILVKDTATTEVMIIDSGSTFTSLRGELFKEFLKRVEQQI-GDKEE 1139
            GIL + T+ +IIDSG+T T L F +F VE+ + G K
Sbjct: 309  N---------PNDDGIL---SETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRV 356

Query: 1140 KPISDDYMHCFLKGSAD-KLEKVSLGFEGTTVELKRENIFDHIVKKGEGEEKKDYLCLTV 1316
            HCF GSA+ L ++++ F G V L N F VK E D +CL++
Sbjct: 357  SDPQGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAF---VKLSE-----DMVCLSM 408

Query: 1317 KKLDEGKMNGYVVPNVHILGSRAQMDFKVAFDVPKREVSF 1436
            E V I G+ AQMDF V +D+ R VSF
Sbjct: 409  VPTTE----------VAIYGNFAQMDFLVGYDLETRTVSF 438