BLASTX nr result
ID: Coptis23_contig00009119
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis23_contig00009119 (1921 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,... 566 e-159 ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi... 514 e-143 ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis tha... 508 e-141 dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila] 506 e-141 ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein ... 499 e-138 >ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length = 533 Score = 566 bits (1459), Expect = e-159 Identities = 285/495 (57%), Positives = 357/495 (72%), Gaps = 3/495 (0%) Frame = +1 Query: 130 ILLLLLFISRNSFTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRI 309 +LLL+L +S +SF+ YGFGT GF+ HHR+SD VKG+LSVDDLP KG++ YY +MAHRD I Sbjct: 22 LLLLVLMLSSSSFS-YGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRD-I 79 Query: 310 VIHGRGLATSTTGDQQELTFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLF 489 +IHGR L + T LTF +GN+T R LGFLHYANVS+GTPSLS+LVALDTGSDLF Sbjct: 80 LIHGRKLVSDNTSTP--LTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLF 137 Query: 490 WVPCDCKS--CVKALVSNSGARLDLNIYXXXXXXXXXXVPCNSSACDVQGACSGVSRTCP 663 W+PCDC + CV+ L SG ++D NIY +PCN++ C Q C TCP Sbjct: 138 WLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCP 197 Query: 664 YQVEYLSNGTSSAGILIEDVLHLTTDGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFG 843 YQV+YLSNGTSS G+L+ED+LHLTTD +DA+I GCG V+TGSFL+GAAPNGLFG Sbjct: 198 YQVQYLSNGTSSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFG 257 Query: 844 LGMDKSSVPSILSSAGIVANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMPQLQPMYNV 1023 LGM SVPS L+ G +NSFSMCFG DGIGRI+FGD GS QGETPFN+ QL P YNV Sbjct: 258 LGMTNISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNV 317 Query: 1024 SLTHVTVETNQSNIDFSAIFDSGTSFTYLNDPAYTGLSETFNVNVQDRRRKSDPNIPFEY 1203 S+T + V ++++FSAIFDSGTSFTYLNDPAYT +SE+FN+ +++R S +IPFEY Sbjct: 318 SITKINVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEY 377 Query: 1204 CYDVGPGETTILVPNVTLTMKGGSQFFVFDPIIDISDASGTK-YCLAVVKSPDVNIIGQN 1380 CY++ +T + +P V L M+GGSQF V DPI+ + G YCLA+VKS DVNIIGQN Sbjct: 378 CYEMSSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQN 437 Query: 1381 FMTGYNIIFDREKLVLGWEKSSCYNIQDSSSPPIKFPNSTVVPPATAAEPRDIAPGLTKG 1560 FMTGY I+F+RE+ VLGW+ S CY+ D+++ P+ P S +PPATA P+ T G Sbjct: 438 FMTGYRIVFNRERNVLGWKASDCYDDMDTTTFPVD-PISPGIPPATAVNPQ-----ATAG 491 Query: 1561 TGNGSLNSGALCSVG 1605 +GN + SG VG Sbjct: 492 SGNTTEVSGTPPPVG 506 >ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] Length = 513 Score = 514 bits (1323), Expect = e-143 Identities = 262/506 (51%), Positives = 344/506 (67%), Gaps = 3/506 (0%) Frame = +1 Query: 133 LLLLLFISRNSFTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIV 312 L++LL S C GFG GF FHHRFSD+V GVL D LP + + YY+ MAHRDR+ Sbjct: 14 LIILLASSWVLERCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL- 72 Query: 313 IHGRGLATSTTGDQQELTFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFW 492 I GR LA DQ +TF +GN+T+R+ LGFLHYANV+VGTPS FLVALDTGSDLFW Sbjct: 73 IRGRRLANE---DQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFW 129 Query: 493 VPCDCKSCVKALVSNSGARLDLNIYXXXXXXXXXXVPCNSSACDVQGACSGVSRTCPYQV 672 +PCDC +CV+ L + G+ LDLNIY VPCNS+ C C+ CPYQ+ Sbjct: 130 LPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQI 189 Query: 673 EYLSNGTSSAGILIEDVLHLTTDGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGM 852 YLSNGTSS G+L+EDVLHL ++ + + A++TLGCG V+TG F +GAAPNGLFGLG+ Sbjct: 190 RYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGL 249 Query: 853 DKSSVPSILSSAGIVANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMPQLQPMYNVSLT 1032 + SVPS+L+ GI ANSFSMCFG DG GRI+FGDKGSV+Q ETP N+ Q P YN+++T Sbjct: 250 EDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVT 309 Query: 1033 HVTVETNQSNIDFSAIFDSGTSFTYLNDPAYTGLSETFNVNVQDRR-RKSDPNIPFEYCY 1209 ++VE N +++F A+FDSGTSFTYL D AYT +SE+FN D+R + +D +PFEYCY Sbjct: 310 KISVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCY 369 Query: 1210 DVGPGETTILVPNVTLTMKGGSQFFVFDPIIDISDASGTKYCLAVVKSPDVNIIGQNFMT 1389 + P + + P V LTMKGGS + V+ P++ I YCLA++K D++IIGQNFMT Sbjct: 370 ALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAILKIEDISIIGQNFMT 429 Query: 1390 GYNIIFDREKLVLGWEKSSCYNIQDSS-SPPIKFPNSTVVPPATAAEPRDIAPGLTKGTG 1566 GY ++FDREKL+LGW++S CY + S+ + P +S+ PPA++ +P A + Sbjct: 430 GYRVVFDREKLILGWKESDCYTGETSARTLPSNRSSSSARPPASSFDPE--ATNIPSQRP 487 Query: 1567 NGSLNSGAL-CSVGLWSHFSAIGCVL 1641 N S +S A S+ L F +I +L Sbjct: 488 NTSTSSAAYSLSISLSLFFFSILAIL 513 >ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana] gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis thaliana] gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana] Length = 513 Score = 508 bits (1309), Expect = e-141 Identities = 260/506 (51%), Positives = 341/506 (67%), Gaps = 3/506 (0%) Frame = +1 Query: 133 LLLLLFISRNSFTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIV 312 LL+LL S C GFG GF FHHRFSD+V GVL D LP + + YY+ MAHRDR+ Sbjct: 14 LLILLASSWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL- 72 Query: 313 IHGRGLATSTTGDQQELTFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFW 492 I GR LA DQ +TF +GN+T+R+ LGFLHYANV+VGTPS F+VALDTGSDLFW Sbjct: 73 IRGRRLANE---DQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFW 129 Query: 493 VPCDCKSCVKALVSNSGARLDLNIYXXXXXXXXXXVPCNSSACDVQGACSGVSRTCPYQV 672 +PCDC +CV+ L + G+ LDLNIY VPCNS+ C C+ CPYQ+ Sbjct: 130 LPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQI 189 Query: 673 EYLSNGTSSAGILIEDVLHLTTDGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGM 852 YLSNGTSS G+L+EDVLHL ++ + + A++T GCG V+TG F +GAAPNGLFGLG+ Sbjct: 190 RYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGL 249 Query: 853 DKSSVPSILSSAGIVANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMPQLQPMYNVSLT 1032 + SVPS+L+ GI ANSFSMCFG DG GRI+FGDKGSV+Q ETP N+ Q P YN+++T Sbjct: 250 EDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVT 309 Query: 1033 HVTVETNQSNIDFSAIFDSGTSFTYLNDPAYTGLSETFNVNVQDRR-RKSDPNIPFEYCY 1209 ++V N +++F A+FDSGTSFTYL D AYT +SE+FN D+R + +D +PFEYCY Sbjct: 310 KISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCY 369 Query: 1210 DVGPGETTILVPNVTLTMKGGSQFFVFDPIIDISDASGTKYCLAVVKSPDVNIIGQNFMT 1389 + P + + P V LTMKGGS + V+ P++ I YCLA++K D++IIGQNFMT Sbjct: 370 ALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIEDISIIGQNFMT 429 Query: 1390 GYNIIFDREKLVLGWEKSSCYNIQDSS-SPPIKFPNSTVVPPATAAEPRDIAPGLTKGTG 1566 GY ++FDREKL+LGW++S CY + S+ + P +S+ PPA++ +P A + Sbjct: 430 GYRVVFDREKLILGWKESDCYTGETSARTLPSNRSSSSARPPASSFDPE--ATNIPSQRP 487 Query: 1567 NGSLNSGAL-CSVGLWSHFSAIGCVL 1641 N S S A S+ L F +I +L Sbjct: 488 NTSTTSAAYSLSISLSLFFFSILAIL 513 >dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila] Length = 515 Score = 506 bits (1304), Expect = e-141 Identities = 253/473 (53%), Positives = 331/473 (69%), Gaps = 2/473 (0%) Frame = +1 Query: 118 IRYSILLLLLFISRNSFTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAH 297 I + L+L+L S C G G GF FHHRFSD+V GVL D LP + + YY+ MAH Sbjct: 9 IMFMGLILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAH 68 Query: 298 RDRIVIHGRGLATSTTGDQQELTFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTG 477 RDR+ I GR LA+ DQ +TF +GN+T+R+ LGFLHYANV+VGTPS FLVALDTG Sbjct: 69 RDRL-IRGRRLASE---DQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTG 124 Query: 478 SDLFWVPCDCKS-CVKALVSNSGARLDLNIYXXXXXXXXXXVPCNSSACDVQGACSGVSR 654 SDLFW+PCDC + CV+ L + G+ LDLNIY VPCNS+ C C+ Sbjct: 125 SDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLS 184 Query: 655 TCPYQVEYLSNGTSSAGILIEDVLHLTTDGNHPEVVDAQITLGCGLVETGSFLNGAAPNG 834 CPYQ+ YLSNGTSS G+L+EDVLHL + + + + A+ITLGCGLV+TG F +GAAPNG Sbjct: 185 DCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNG 244 Query: 835 LFGLGMDKSSVPSILSSAGIVANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMPQLQPM 1014 LFGLG++ SVPS+L+ GI ANSFSMCFG DG GRI+FGDKGSV+Q ETP N+ Q P Sbjct: 245 LFGLGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPT 304 Query: 1015 YNVSLTHVTVETNQSNIDFSAIFDSGTSFTYLNDPAYTGLSETFNVNVQDRRRKSDPNIP 1194 YNV++T ++V N +++F A+FD+GTSFTYL D YT +SE+FN D+R ++D +P Sbjct: 305 YNVTVTQISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELP 364 Query: 1195 FEYCYDVGPGETTILVPNVTLTMKGGSQFFVFDPIIDISDASGTKYCLAVVKSPDVNIIG 1374 FEYCY V P + + P+V LTMKGGS + V+ P+I + YCLA++KS D++IIG Sbjct: 365 FEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSEDISIIG 424 Query: 1375 QNFMTGYNIIFDREKLVLGWEKSSCYNIQDSS-SPPIKFPNSTVVPPATAAEP 1530 QNFMTGY ++FDREKL+LGW++S C + S+ + P +S+ PPA++ +P Sbjct: 425 QNFMTGYRVVFDREKLILGWKESDCSTGETSARTQPSNRSSSSARPPASSFDP 477 >ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max] Length = 508 Score = 499 bits (1284), Expect = e-138 Identities = 249/468 (53%), Positives = 316/468 (67%), Gaps = 2/468 (0%) Frame = +1 Query: 133 LLLLLFISRNSFTCYGFGTVGFNFHHRFSDEVKGVLSVDDLPRKGTVDYYKAMAHRDRIV 312 LLLLL +S S +C+ + GF+ HHRFSD VK +L V DLP KGT YY AMAHRDRI Sbjct: 11 LLLLLLLSLASQSCHALHSFGFDIHHRFSDPVKEILGVHDLPDKGTRQYYVAMAHRDRI- 69 Query: 313 IHGRGLATSTTGDQQELTFENGNDTLRIPQLGFLHYANVSVGTPSLSFLVALDTGSDLFW 492 GR LA G LTF N+T +I GFLH+ANVSVGTP LSFLVALDTGSDLFW Sbjct: 70 FRGRRLAA---GYHSPLTFIPSNETYQIEAFGFLHFANVSVGTPPLSFLVALDTGSDLFW 126 Query: 493 VPCDCKSCVKALVSNSGARLDLNIYXXXXXXXXXXVPCNSSACDVQGACSGVSRTCPYQV 672 +PC+C CV + ++G ++ NIY V CNSS C++Q C CPY+V Sbjct: 127 LPCNCTKCVHGIGLSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEV 186 Query: 673 EYLSNGTSSAGILIEDVLHLTTDGNHPEVVDAQITLGCGLVETGSFLNGAAPNGLFGLGM 852 YLSNGTS+ G L+EDVLHL TD + + D +IT GCG V+TG+FL+GAAPNGLFGLGM Sbjct: 187 NYLSNGTSTTGFLVEDVLHLITDDDKTKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGM 246 Query: 853 DKSSVPSILSSAGIVANSFSMCFGPDGIGRINFGDKGSVNQGETPFNMPQLQPMYNVSLT 1032 SVPSIL+ G+ +NSFSMCFG DG+GRI FGD S+ QG+TPFN+ L P YN+++T Sbjct: 247 SNESVPSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVT 306 Query: 1033 HVTVETNQSNIDFSAIFDSGTSFTYLNDPAYTGLSETFN--VNVQDRRRKSDPNIPFEYC 1206 + V +++F AIFDSGTSFTYLNDPAY ++ +FN + +Q S +PFEYC Sbjct: 307 QIIVGEKVDDLEFHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYC 366 Query: 1207 YDVGPGETTILVPNVTLTMKGGSQFFVFDPIIDISDASGTKYCLAVVKSPDVNIIGQNFM 1386 Y++ P +T L ++ LTMKGG + V DPI+ +S CL V+KS +VNIIGQNFM Sbjct: 367 YELSPNQTVEL--SINLTMKGGDNYLVTDPIVTVSGEGINLLCLGVLKSNNVNIIGQNFM 424 Query: 1387 TGYNIIFDREKLVLGWEKSSCYNIQDSSSPPIKFPNSTVVPPATAAEP 1530 TGY I+FDRE ++LGW +S+CY+ + S+ PI N+ + PA A P Sbjct: 425 TGYRIVFDRENMILGWRESNCYD-DELSTLPINRSNTPAISPAIAVNP 471