BLASTX nr result
ID: Coptis24_contig00004227
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00004227
         (1792 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2...   606   e-171
ref|XP_002528927.1| pepsin A, putative [Ricinus communis] gi|223...   573   e-161
ref|XP_002304273.1| predicted protein [Populus trichocarpa] gi|2...   568   e-159
ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1...   556   e-156
ref|XP_002334311.1| predicted protein [Populus trichocarpa] gi|2...   556   e-156

>ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score = 606 bits (1563), Expect = e-171
 Identities = 305/476 (64%), Positives = 364/476 (76%), Gaps = 3/476 (0%)
 Frame = +2

Query: 47   CSCQIVLPLTHSLSKSHFTNNTHHLLKTSSLRSTKRFNXXXXXXXXXXXXISLPLSPGSD 226
            CS ++LPLTHSLSKS F N+T HLLK +S RS RF+ ISLPLSPGSD
Sbjct: 21   CSAIVLLPLTHSLSKSQF-NSTPHLLKFTSARSATRFHHRHRQ-------ISLPLSPGSD 72

Query: 227  YILTFSFGSKQKHQQSISLYMDTGSDLVWFPCSPFECILCENKYDSNIPS--TPFNVSYA 400
            Y L+F+ GS Q ISLYMDTGSDLVWFPC+PFECILCE KYD+ +P N++ +
Sbjct: 73   YTLSFNLGSHPP--QPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSS 130

Query: 401  SHVXXXXXXXXXXXXXXXXXDLCAISHCPLETIETSDCSSFHCPNFYYAYGDGSLIAKLY 580
            + V DLCA++ CPLE IETSDCSSF CP FYYAYGDGSL+A+LY
Sbjct: 131  ASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLY 190

Query: 581  QDKLSLPFASSSLNLPNFTFGCAHTTLGEPIGVAGFGQGVLSLPAQLAKHSPQLANSFSY 760
            +D LS+P ASS L L NFTFGCAHT LGEP+GVAGFG+GVLSLPAQLA SP L N FSY
Sbjct: 191  RDSLSMP-ASSPLVLHNFTFGCAHTALGEPVGVAGFGRGVLSLPAQLASFSPHLGNQFSY 249

Query: 761  CLISHSFDVKKVHKPSPLILGRFSIEDEEKKALVDGGSEFLYTPMLQNPKHPYFYCVGLE 940
            CL+SHSFD +V +PSPLILGR+S++DE+KK + EF+YT ML NPKHPYFYCVGLE
Sbjct: 250  CLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLE 309

Query: 941  AISVGNKRIPATESLKRVNKEGDGGMVVDSGTTFTMLPTKMYEAVVTEFENRVGQVLSRA 1120
            I+VGN++IP E LKRV++ G+GGMVVDSGTTFTMLP +YE++VTEF +R+G+V RA
Sbjct: 310  GITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRA 369

Query: 1121 SEVEGQTGLSPCYYYDNDLVSLSKVPRLVLHFIGNSSVILPRKNYFFGFTNG-DDVKMKK 1297
            +++E +TGL PCYY D+ S +KVP + LHF+GNS+VILPR NY++ F +G D K K+
Sbjct: 370  TQIEERTGLGPCYYSDD---SAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKR 426

Query: 1298 KVGCLMLMNGGDEEESGGPVATLGNYQQQGFEVVYDLEKRRVGFAKRQCASLWNSL 1465
            KVGCLMLMNGGDE ESGGP ATLGNYQQQGFEVVYDLEK RVGFA+R+CA LW+SL
Sbjct: 427  KVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCALLWDSL 482

>ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score = 573 bits (1478), Expect = e-161
 Identities = 292/476 (61%), Positives = 352/476 (73%), Gaps = 6/476 (1%)
 Frame = +2

Query: 59   IVLPLTHSLSKSHFTNNTHHLLKTSSLRSTKRF-NXXXXXXXXXXXXISLPLSPGSDYIL 235
            + LPLTHSLS + FT+ THHLLK++S RS RF + +SLPLSPGSDY L
Sbjct: 26   LYLPLTHSLSNTQFTS-THHLLKSTSSRSASRFQHQHQKRHLRNRHQVSLPLSPGSDYTL 84

Query: 236  TFSFGSKQKHQQSISLYMDTGSDLVWFPCSPFECILCENKYDSNIPSTPFN--VSYASHV 409
            +F+ S Q +SLY+DTGSDLVWFPC PFECILCE K ++ STP S A V
Sbjct: 85   SFTLNSNPP--QHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSV 142

Query: 410  XXXXXXXXXXXXXXXXXDLCAISHCPLETIETSDCSSFHCPNFYYAYGDGSLIAKLYQDK 589
            DLCAI+ CPLE+IETSDC SF CP+FYYAYGDGSL+A+LY D
Sbjct: 143  HCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDS 202

Query: 590  LSLPFASSSLNLPNFTFGCAHTTLGEPIGVAGFGQGVLSLPAQLAKHSPQLANSFSYCLI 769
            + LP A+ SL+L NFTFGCAHT L EP+GVAGFG+GVLSLPAQLA +PQL N FSYCL+
Sbjct: 203  IKLPLATPSLSLHNFTFGCAHTALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLV 262

Query: 770  SHSFDVKKVHKPSPLILGRFSIEDEEKKALVDGGSEFLYTPMLQNPKHPYFYCVGLEAIS 949
            SHSF+ ++ PSPLILG D+++K + +F+YT ML NPKHPYFYCVGLE IS
Sbjct: 263  SHSFNSDRLRLPSPLILGH---SDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGIS 319

Query: 950  VGNKRIPATESLKRVNKEGDGGMVVDSGTTFTMLPTKMYEAVVTEFENRVGQVLSRASEV 1129
            +G K+IPA E LKRV++EG GG+VVDSGTTFTMLP +Y +VV EF+NRVG+V RA EV
Sbjct: 320  IGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEV 379

Query: 1130 EGQTGLSPCYYYDNDLVSLSKVPRLVLHFIGN-SSVILPRKNYFFGFTN-GDDVKMKKKV 1303
            E +TGL PCYYYD ++ +P LVLHF+GN SSV+LP+KNYF+ F + GD V+ K++V
Sbjct: 380  EDKTGLGPCYYYD----TVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRV 435

Query: 1304 GCLMLMNGGDEEE-SGGPVATLGNYQQQGFEVVYDLEKRRVGFAKRQCASLWNSLN 1468
            GCLMLMNGG+E E +GGP ATLGNYQQ GFEVVYDLE+RRVGFA+R+CASLW SLN
Sbjct: 436  GCLMLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCASLWESLN 491

>ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 568 bits (1464), Expect = e-159
 Identities = 296/478 (61%), Positives = 357/478 (74%), Gaps = 8/478 (1%)
 Frame = +2

Query: 59   IVLPLTHSLSKSHFTNNTHHLLKTSSLRSTKRFNXXXXXXXXXXXX-ISLPLSPGSDYIL 235
            + LPL HSLSK+ FT+ THHLLK++S RST RF+ +SLPLSPGSDY L
Sbjct: 26   LFLPLIHSLSKTQFTS-THHLLKSTSTRSTTRFHHHHHNKNSHNHRQVSLPLSPGSDYTL 84

Query: 236  TFSFGSKQKHQQSISLYMDTGSDLVWFPCSPFECILCENKYDS-NIPSTPFNV--SYASH 406
            +F+ S Q ISLY+DTGSDLVWFPC PFECILCE K ++ ++ STP A+
Sbjct: 85   SFTINS-----QPISLYLDTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTATP 139

Query: 407  VXXXXXXXXXXXXXXXXXDLCAISHCPLETIETSDCSSFHCPNFYYAYGDGSLIAKLYQD 586
            V DLCAIS+CPLE+IE SDC CP FYYAYGDGSLIA+LY+D
Sbjct: 140  VSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSLIARLYRD 199

Query: 587  KLSLPFAS-SSLNLPNFTFGCAHTTLGEPIGVAGFGQGVLSLPAQLAKHSPQLANSFSYC 763
            + LP ++ ++L NFTFGCAHTTL EPIGVAGFG+GVLSLPAQLA SPQL N FSYC
Sbjct: 200  SIRLPLSNQTNLIFNNFTFGCAHTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYC 259

Query: 764  LISHSFDVKKVHKPSPLILGRFSIEDEEKKALVDGGSEFLYTPMLQNPKHPYFYCVGLEA 943
            L+SHSFD +V +PSPLILGR+ +++E++ F+YT ML NP+HPYFYCVGLE
Sbjct: 260  LVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCVGLEG 319

Query: 944  ISVGNKRIPATESLKRVNKEGDGGMVVDSGTTFTMLPTKMYEAVVTEFENRVGQVLSRAS 1123
            IS+G K+IPA + L++V+++G GG+VVDSGTTFTMLP +Y+ VV EFENRVG+V RAS
Sbjct: 320  ISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERAS 379

Query: 1124 EVEGQTGLSPCYYYDNDLVSLSKVPRLVLHFIGN-SSVILPRKNYFFGFTNGDDVKMKK- 1297
            +E TGLSPCYY+DN++V+ VPR+VLHF+GN SSV+LPR+NYF+ F +G K KK
Sbjct: 380  VIEENTGLSPCYYFDNNVVN---VPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKR 436

Query: 1298 KVGCLMLMNGGDEEE-SGGPVATLGNYQQQGFEVVYDLEKRRVGFAKRQCASLWNSLN 1468
            KVGCLMLMNGGDE E SGGP ATLGNYQQQGFEVVYDLE RRVGFA+RQCASLW +LN
Sbjct: 437  KVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQCASLWEALN 494

>ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score = 556 bits (1433), Expect = e-156
 Identities = 291/474 (61%), Positives = 350/474 (73%), Gaps = 4/474 (0%)
 Frame = +2

Query: 59   IVLPLTHSLSKSHFTNNTHHLLKTSSLRSTKRFNXXXXXXXXXXXXISLPLSPGSDYILT 238
            +++PLTH+LSK+ F N+THHLLK++S RS KRF +SLPLSPGSDY L+
Sbjct: 25   VLVPLTHTLSKAQF-NSTHHLLKSTSTRSAKRFRRQ----------LSLPLSPGSDYTLS 73

Query: 239  FSFGSKQKHQQSISLYMDTGSDLVWFPCSPFECILCENKYDSNIPSTPFNVSYASHVXXX 418
            F+ G Q Q I+LYMDTGSDLVWFPC+PF+CILCE K + S P N++ + V
Sbjct: 74   FNLGP-QAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVAVSCK 132

Query: 419  XXXXXXXXXXXXXXDLCAISHCPLETIETSDCSSFHCPNFYYAYGDGSLIAKLYQDKLSL 598
            DLCA + CPLE+IETSDC++F CP FYYAYGDGSLIA+LY+D LSL
Sbjct: 133  SPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDTLSL 192

Query: 599  PFASSSLNLPNFTFGCAHTTLGEPIGVAGFGQGVLSLPAQLAKHSPQLANSFSYCLISHS 778
            SSL L NFTFGCAHTTL EP GVAGFG+G+LSLPAQLA SPQL N FSYCL+SHS
Sbjct: 193  ----SSLFLRNFTFGCAHTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHS 248

Query: 779  FDVKKVHKPSPLILGRFSIEDEEKKALVDGGSEFLYTPMLQNPKHPYFYCVGLEAISVGN 958
            FD ++V KPSPLILGR+ E++EK+ + G +EF+YT ML+NPKHPYFY V L I+VG
Sbjct: 249  FDSERVRKPSPLILGRY--EEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAVGK 306

Query: 959  KRIPATESLKRVNKEGDGGMVVDSGTTFTMLPTKMYEAVVTEFENRVGQVLSRASEVEGQ 1138
            + IPA E L+RVN GDGG+VVDSGTTFTMLP Y +VV EF+ RVG+ RA ++E +
Sbjct: 307  RTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIEEK 366

Query: 1139 TGLSPCYYYDNDLVSLSKVPRLVLHFIG--NSSVILPRKNYFFGFTNGDD-VKMKKKVGC 1309
            TGL+PCYY L S++ VP L L F G NSSV+LPRKNYF+ F++G D K K+KVGC
Sbjct: 367  TGLAPCYY----LNSVADVPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGC 422

Query: 1310 LMLMNGGDEEE-SGGPVATLGNYQQQGFEVVYDLEKRRVGFAKRQCASLWNSLN 1468
            LMLMNGGDE + SGGP ATLGNYQQQGFEV YDLE++RVGFA+RQCA LW LN
Sbjct: 423  LMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCALLWERLN 476

>ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 556 bits (1433), Expect = e-156
 Identities = 291/478 (60%), Positives = 352/478 (73%), Gaps = 8/478 (1%)
 Frame = +2

Query: 59   IVLPLTHSLSKSHFTNNTHHLLKTSSLRSTKRFNXXXXXXXXXXXX-ISLPLSPGSDYIL 235
            + LPLTHSLSK+ FT+ THHL+K++S S RF +SLPLSPGSDY L
Sbjct: 26   LFLPLTHSLSKTQFTS-THHLIKSTSTSSITRFRRHHHQKNTHNHRQVSLPLSPGSDYTL 84

Query: 236  TFSFGSKQKHQQSISLYMDTGSDLVWFPCSPFECILCENKYDS-NIPSTPFNV--SYASH 406
            +F+ S Q I LY+DTGSDLVWFPC PFECILCE K ++ ++ STP A+
Sbjct: 85   SFTLDS-----QPIFLYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATP 139

Query: 407  VXXXXXXXXXXXXXXXXXDLCAISHCPLETIETSDCSSFHCPNFYYAYGDGSLIAKLYQD 586
            V DLCAIS+CPLE+IETSDC CP FYYAYGDGSLIA+LY+D
Sbjct: 140  VSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHSCPQFYYAYGDGSLIARLYRD 199

Query: 587  KLSLPFAS-SSLNLPNFTFGCAHTTLGEPIGVAGFGQGVLSLPAQLAKHSPQLANSFSYC 763
            +SLP ++ ++L + NFTFGCAHT L EPIGVAGFG+GVLSLPAQLA SPQL N FSYC
Sbjct: 200  SISLPLSNPTNLIVNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYC 259

Query: 764  LISHSFDVKKVHKPSPLILGRFSIEDEEKKALVDGGSEFLYTPMLQNPKHPYFYCVGLEA 943
            L+SHSFD ++ +PSPLILGR+ +++E++ F+YT ML N +HPYFYCVGLE
Sbjct: 260  LVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEG 319

Query: 944  ISVGNKRIPATESLKRVNKEGDGGMVVDSGTTFTMLPTKMYEAVVTEFENRVGQVLSRAS 1123
            IS+G K+IPA L++V+ EG GG+VVDSGTTFTMLP +Y +VV EFENRVG+V RA
Sbjct: 320  ISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERAR 379

Query: 1124 EVEGQTGLSPCYYYDNDLVSLSKVPRLVLHFIGN-SSVILPRKNYFFGFTNGDDVKMKK- 1297
            +E TGLSPCYY+DN++V+ VP +VLHF+GN SSV+LPR+NYF+ F +G D K KK
Sbjct: 380  VIEEDTGLSPCYYFDNNVVN---VPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKR 436

Query: 1298 KVGCLMLMNGGDEEE-SGGPVATLGNYQQQGFEVVYDLEKRRVGFAKRQCASLWNSLN 1468
            KVGCLMLMNGGDE E SGGP ATLGNYQQQGFEVVYDLE +RVGFA+RQCASLW +LN
Sbjct: 437  KVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASLWETLN 494