BLASTX nr result
ID: Coptis25_contig00010198
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis25_contig00010198 (2072 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002273997.2| PREDICTED: LOW QUALITY PROTEIN: glyoxysomal ... 572 e-160 ref|XP_002305124.1| predicted protein [Populus trichocarpa] gi|2... 550 e-154 ref|XP_002329829.1| predicted protein [Populus trichocarpa] gi|2... 539 e-150 ref|XP_002509448.1| trypsin domain-containing protein, putative ... 513 e-143 ref|XP_004155645.1| PREDICTED: glyoxysomal processing protease, ... 488 e-135 >ref|XP_002273997.2| PREDICTED: LOW QUALITY PROTEIN: glyoxysomal processing protease, glyoxysomal-like [Vitis vinifera] Length = 753 Score = 572 bits (1474), Expect = e-160 Identities = 322/605 (53%), Positives = 394/605 (65%), Gaps = 8/605 (1%) Frame = +2 Query: 2 VDVPASSFALQSVIEATRGYPEVSSWEVGWSLASLNNSPQALMDGHQTEVGHDIRYPLEK 181 VDVPA S A+QS+IEA+ G E W+VGWSLAS L+D QT+V L Sbjct: 137 VDVPAFSLAVQSIIEASSGSRE-QGWDVGWSLASYTGDSHTLVDAIQTQVS------LAX 189 Query: 182 PSHSGKSKSRNPNI-AMSTIRVAFLGVPSLTTEGLPNINISRSNKRGDLLLAMGSPFGLL 358 H S +P++ ST R+A LGV S+ ++ LPNI IS SNKRGDLLLAMGSPFG+L Sbjct: 190 FLHFMVGDSSHPSLMGKSTARIALLGVSSINSKDLPNIAISPSNKRGDLLLAMGSPFGVL 249 Query: 359 SPAHFINSISVGXXXXXXXXXXXXXXLLMADIRCFAGMEGGPIFSEHAEFIGILNRPLRV 538 SP HF NSISVG LLMADIRC GMEGGP+F+EHA+ IGIL RPLR Sbjct: 250 SPVHFFNSISVGSIANCYTPSPSRRSLLMADIRCLPGMEGGPVFNEHAQLIGILTRPLRQ 309 Query: 539 IASGAEIQLVIPWEAIALALSDLLQNGAPKEGVVSNIEK--INALGNECSTNCPQSNRAL 712 GAEIQLVIPWEAIA A DLLQ EG + + + +NA+G + + S+ Sbjct: 310 KTGGAEIQLVIPWEAIATACCDLLQKEVQNEGEMKHYNRGNLNAVGKKYLFSGHDSDGPF 369 Query: 713 NFVTKKQDCHHLLALRIEKAIASVALVTVGEGTWASGVVLNNNGLILTNAHLLEPGRFHK 892 N + ++ DC IEKA+AS+ LVT+ +G WASGVVLN+ GLILTNAHLLEP RF K Sbjct: 370 NSMHQQPDCCSPPLSLIEKAMASICLVTIDDGVWASGVVLNSQGLILTNAHLLEPWRFGK 429 Query: 893 TNVQGSDVPTSDSFA-LSFQSCASVXXXXXXXXXXXXXXXPNLVNNSNTSLGDDPGGYKL 1069 T +G + + P + + +S+ D GGYK Sbjct: 430 TVARGGRCGAEPEIPFIPSEESVYCRDEGTYSHQKSQDLLPKTLKIAGSSVMDGHGGYKS 489 Query: 1070 GS-YKSFKRICVRLDHMNPWIWCDAKVVYISNGPLDIALLQLESFPKNVCPIVPDFTCPS 1246 S Y+ + I +RLDH +P IWCDA+VVY+S GPLDIALLQLE P +CPI+ DF CPS Sbjct: 490 SSTYRGHRNIRIRLDHTDPRIWCDARVVYVSKGPLDIALLQLEFVPGQLCPIIMDFACPS 549 Query: 1247 PGSKAYVIGHGLFGPRSDLSPSVCSGVVARVVKSQGHIHPIEPGSEKTTTKYLPAMLETT 1426 GSKAYVIGHGLFGPR D PSVC G VA+VVKS+ + + ++ + PAMLETT Sbjct: 550 AGSKAYVIGHGLFGPRCDFFPSVCVGEVAKVVKSKMPL-SCQSSLQENILEDFPAMLETT 608 Query: 1427 ASVYGGGSGGAIVNSNGEMIGLVTSNARFGKGHVIPQMNFSIPCAALEPVFKFSEDMQDI 1606 A+V+ GGSGGA+VNS G MIGL+TSNAR G G VIP +NFSIPCAAL+ V+KFS+DMQ + Sbjct: 609 AAVHAGGSGGAVVNSEGHMIGLITSNARHGGGTVIPHLNFSIPCAALQAVYKFSKDMQGM 668 Query: 1607 SVLQDLDRPDEHLSTVWALTPPESPR---KVPPFLRQPESLQENNTEKKGSRFAKFIAER 1777 S+L DLD+P+EHLS+VWAL PP SP+ +P P+SL E+N E KGSRFAKFIAER Sbjct: 669 SLLLDLDKPNEHLSSVWALMPPLSPKPGPSLPNLPNLPQSLLEDNKEGKGSRFAKFIAER 728 Query: 1778 NGEVF 1792 N EVF Sbjct: 729 N-EVF 732 >ref|XP_002305124.1| predicted protein [Populus trichocarpa] gi|222848088|gb|EEE85635.1| predicted protein [Populus trichocarpa] Length = 752 Score = 550 bits (1416), Expect = e-154 Identities = 325/615 (52%), Positives = 398/615 (64%), Gaps = 15/615 (2%) Frame = +2 Query: 2 VDVPASSFALQSVIEATRGYPEVSSWEVGWSLASLNNSPQALMDGHQTEVGHDIRYPLEK 181 VDVP SS ALQS++EA+ G WEVGWSLAS N Q+ MD QT+ H + + Sbjct: 141 VDVPLSSLALQSLVEASSGSMN-HGWEVGWSLASPENGSQSFMDVVQTQTEHG-NASIAE 198 Query: 182 PSHSGKSKSRNPNI-AMSTIRVAFLGVPSLTTEGLPNINISRSNKRGDLLLAMGSPFGLL 358 + +S NP+I ST RVA LGV L + LPN IS S++RGD LLA+GSPFG+L Sbjct: 199 SQRRAREESSNPSIMGKSTTRVAILGV-FLHLKDLPNFEISASSRRGDFLLAVGSPFGVL 257 Query: 359 SPAHFINSISVGXXXXXXXXXXXXXXLLMADIRCFAGMEGGPIFSEHAEFIGILNRPLRV 538 SP HF NS+SVG LLMADIRC GMEG P+F E++ FIGIL RPLR Sbjct: 258 SPVHFFNSLSVGSIANCYPPRSSDISLLMADIRCLPGMEGSPVFCENSNFIGILIRPLRQ 317 Query: 539 IASGAEIQLVIPWEAIALALSDLL----QNGAPKEGVVSNIEKINALGNECSTNCPQSNR 706 +SGAEIQLVIPWEAIALA SDLL QN ++G+ N E +NA+GN S++ Sbjct: 318 KSSGAEIQLVIPWEAIALACSDLLLKEPQNA--EKGIHINKENLNAVGNAYSSSSDGP-- 373 Query: 707 ALNFVTKKQDCHHLLALR----IEKAIASVALVTVGEGTWASGVVLNNNGLILTNAHLLE 874 F K + HH+ +EKA+AS+ L+T+ E WASGV+LN+ GLILTNAHLLE Sbjct: 374 ---FPLKHE--HHISYCSSPPPVEKAMASICLITIDELVWASGVLLNDQGLILTNAHLLE 428 Query: 875 PGRFHKTNVQGSDVPTS--DSFAL--SFQSCASVXXXXXXXXXXXXXXXPNLVNNSNTSL 1042 P RF KT V G + T D F F + V P +N N+S+ Sbjct: 429 PWRFGKTTVNGGEDGTKLQDPFIPPEEFPRYSEVDGHEKTQRLP-----PKTLNIMNSSV 483 Query: 1043 GDDPGGYKLG-SYKSFKRICVRLDHMNPWIWCDAKVVYISNGPLDIALLQLESFPKNVCP 1219 D+ GYKL SYK I VRLDH +PWIWCDAKVV++ GPLD+ALLQLE P + P Sbjct: 484 ADESKGYKLSLSYKGPMNIRVRLDHADPWIWCDAKVVHVCKGPLDVALLQLEHVPDQLFP 543 Query: 1220 IVPDFTCPSPGSKAYVIGHGLFGPRSDLSPSVCSGVVARVVKSQGHIHPIEPGSEKTTTK 1399 DF C S GSKAYVIGHGLFGPR SPS+CSG V++VVK++ P S + Sbjct: 544 TKVDFECSSLGSKAYVIGHGLFGPRCGFSPSICSGAVSKVVKAKA---PSYCQSVQGGYS 600 Query: 1400 YLPAMLETTASVYGGGSGGAIVNSNGEMIGLVTSNARFGKGHVIPQMNFSIPCAALEPVF 1579 ++PAMLETTA+V+ GGSGGA+VNS G MIGLVTS AR G G VIP +NFSIPCA L P+F Sbjct: 601 HIPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSKARHGGGTVIPHLNFSIPCAVLAPIF 660 Query: 1580 KFSEDMQDISVLQDLDRPDEHLSTVWALTPPESPRKVPPFLRQPES-LQENNTEKKGSRF 1756 F++DM+DIS+LQ+LDRP+EHLS+VWAL PP SP+ PP PES LQ+ + KGSRF Sbjct: 661 DFAKDMRDISLLQNLDRPNEHLSSVWALMPPLSPKPSPPLPSLPESILQDYEKQVKGSRF 720 Query: 1757 AKFIAERNGEVFLGS 1801 AKFIAER ++F G+ Sbjct: 721 AKFIAERE-KLFRGT 734 >ref|XP_002329829.1| predicted protein [Populus trichocarpa] gi|222870891|gb|EEF08022.1| predicted protein [Populus trichocarpa] Length = 716 Score = 539 bits (1388), Expect = e-150 Identities = 315/607 (51%), Positives = 386/607 (63%), Gaps = 7/607 (1%) Frame = +2 Query: 2 VDVPASSFALQSVIEATRGYPEVSSWEVGWSLASLNNSPQALMDGHQTEVGHDIRYPLEK 181 VDVP SS ALQS++EA+ G + WEVGWSLAS + PQ MD TE G+ + Sbjct: 141 VDVPVSSLALQSLVEASSGSMD-HGWEVGWSLASHESGPQPFMD---TEHGNASTVESHR 196 Query: 182 PSHSGKSKSRNPNI-AMSTIRVAFLGVPSLTTEGLPNINISRSNKRGDLLLAMGSPFGLL 358 + G S NP+I T RVA LGV L + LPN I S KRGD LLA+GSPFG+L Sbjct: 197 HARGGSS---NPSIMGRLTTRVAILGV-FLHLKDLPNFKILASRKRGDFLLAVGSPFGIL 252 Query: 359 SPAHFINSISVGXXXXXXXXXXXXXXLLMADIRCFAGMEGGPIFSEHAEFIGILNRPLRV 538 SP HF NS+SVG LLMAD RC GMEG P+F E+++FIGIL RPLR Sbjct: 253 SPVHFFNSLSVGSIANCYPPRSSDISLLMADFRCLPGMEGSPVFGENSDFIGILIRPLRQ 312 Query: 539 IASGAEIQLVIPWEAIALALSDLL----QNGAPKEGVVSNIEKINALGNECSTNCPQSNR 706 ++GAEIQLVIPWEAIA A SDLL QN ++G+ N E +NA N Sbjct: 313 KSTGAEIQLVIPWEAIATACSDLLLKEPQNA--EKGIHFNKENLNAHHNS---------- 360 Query: 707 ALNFVTKKQDCHHLLALRIEKAIASVALVTVGEGTWASGVVLNNNGLILTNAHLLEPGRF 886 H L +EKA+AS+ L+T+ E WASGV+LN+ GLILTNAHLLEP RF Sbjct: 361 -----------HRPSPLPVEKAMASICLITIDEAVWASGVLLNDQGLILTNAHLLEPWRF 409 Query: 887 HKTNVQGSDVPTSDSFALSFQSCASVXXXXXXXXXXXXXXXPNLVNNSNTSLGDDPGGYK 1066 KT V G + T S L F P +N ++ + D+ GYK Sbjct: 410 GKTTVNGREDGTK-SEDLFFPPKEFSRYSEVDGYRKSQRLPPKTMNIVDSLVADERKGYK 468 Query: 1067 LG-SYKSFKRICVRLDHMNPWIWCDAKVVYISNGPLDIALLQLESFPKNVCPIVPDFTCP 1243 L SYK + I VRLDH +PWIWCDAKVVY+ GPLD+ALLQLE P +CP DF P Sbjct: 469 LSLSYKGSRNIRVRLDHADPWIWCDAKVVYVCKGPLDVALLQLEHVPDQLCPTKVDFKSP 528 Query: 1244 SPGSKAYVIGHGLFGPRSDLSPSVCSGVVARVVKSQGHIHPIEPGSEKTTTKYLPAMLET 1423 S GSKAY+IGHGLFGPR SPSVCSGVV++VVK++ P S + ++PAMLET Sbjct: 529 SLGSKAYIIGHGLFGPRCGSSPSVCSGVVSKVVKTKA---PPYCQSLQGRNSHIPAMLET 585 Query: 1424 TASVYGGGSGGAIVNSNGEMIGLVTSNARFGKGHVIPQMNFSIPCAALEPVFKFSEDMQD 1603 TA+V+ GGSGGA++NS G MIGLVTSNAR G G VIP +NFSIPCA L P+F F+++M+D Sbjct: 586 TAAVHPGGSGGAVINSEGHMIGLVTSNARHGGGTVIPHLNFSIPCAVLAPIFDFAKEMRD 645 Query: 1604 ISVLQDLDRPDEHLSTVWALTPPESPRKVPPFLRQPES-LQENNTEKKGSRFAKFIAERN 1780 I++LQ+LD+P+E LS+VWAL PP P+ PP PES LQ+N + KGSRFAKFIAER+ Sbjct: 646 IALLQNLDQPNEDLSSVWALMPPLPPKPTPPLSTLPESILQDNEKQVKGSRFAKFIAERD 705 Query: 1781 GEVFLGS 1801 ++F GS Sbjct: 706 -KLFRGS 711 >ref|XP_002509448.1| trypsin domain-containing protein, putative [Ricinus communis] gi|223549347|gb|EEF50835.1| trypsin domain-containing protein, putative [Ricinus communis] Length = 729 Score = 513 bits (1322), Expect = e-143 Identities = 308/619 (49%), Positives = 388/619 (62%), Gaps = 11/619 (1%) Frame = +2 Query: 2 VDVPASSFALQSVIEATRGYPEVSSWEVGWSLASLNNSPQALMDGHQTEVGHDIRYPLEK 181 VDV SS ALQS++E++ G + WE+GWSLAS +N + MD QT+V Sbjct: 128 VDVAESSLALQSLVESSLGSLD-HGWEIGWSLASHDNGHRNSMDVIQTQV---------- 176 Query: 182 PSHSGKSKSRNPNIAMST-IRVAFLGVPSLTTEGLPNINISRSNKRGDLLLAMGSPFGLL 358 S + +S NP + T R+A LGV SL + LP I IS S RGD LL +GSPFG+L Sbjct: 177 -SKAQVGESGNPTLVSKTSTRIALLGV-SLNLKDLPIITISPSIIRGDSLLTVGSPFGVL 234 Query: 359 SPAHFINSISVGXXXXXXXXXXXXXXLLMADIRCFAGMEGGPIFSEHAEFIGILNRPLRV 538 SP HF NS+S+G L+MADIRC GMEG P F E +FIGIL RPLR Sbjct: 235 SPVHFFNSLSMGSVANCYPARSSNVSLVMADIRCLPGMEGAPAFGECGDFIGILTRPLRQ 294 Query: 539 IASGAEIQLVIPWEAIALALSDLL----QNGAPKEGVVSNIEKINALGNECSTNCPQSNR 706 ++GAEIQLVIPWEAIA A DLL QN +EG+ N E +NA+ N S +S+ Sbjct: 295 KSTGAEIQLVIPWEAIATACGDLLLKEPQNA--EEGIAINKENLNAVENAYSH---ESDG 349 Query: 707 ALNFVTKKQDCHHLLALRIEKAIASVALVTVGEGTWASGVVLNNNGLILTNAHLLEPGRF 886 ++ + + H L +EK +ASV L+T+ EG WASGV+LN+ GL+LTNAHLLEP RF Sbjct: 350 PFSYKYEHFNSHCSSTLPVEKVMASVCLITIDEGIWASGVLLNDQGLVLTNAHLLEPWRF 409 Query: 887 HKTNVQGSDVPTSDSFALSFQSCASVXXXXXXXXXXXXXXXP-NLVNNSNTSLGDDPGGY 1063 KT + G T S AL SV P N ++S+ D G Sbjct: 410 GKTTINGGRNRTK-SGALFLPPEGSVIPGHSNVDSYRGSQMPLNKAKIMDSSVFDQTKGD 468 Query: 1064 KLG-SYKSFKRICVRLDHMNPWIWCDAKVVYISNGPLDIALLQLESFPKNVCPIVPDFTC 1240 +L SY + I VRLDH NPWIWCDAKV+Y+S GPLD+ALLQLE P +CPI D+ C Sbjct: 469 QLSLSYSGHRNIRVRLDHFNPWIWCDAKVIYVSKGPLDVALLQLEYVPDQLCPIKADYAC 528 Query: 1241 PSPGSKAYVIGHGLFGPRSDLSPSVCSGVVARVVKSQG--HIHPIEPGSEKTTTKYLPAM 1414 P GSKAYVIGHGLFGPR PS+CSGV+A++VK + I+ S ++PAM Sbjct: 529 PILGSKAYVIGHGLFGPRCGFFPSICSGVIAKIVKVEAPTFYQSIQGDS------HIPAM 582 Query: 1415 LETTASVYGGGSGGAIVNSNGEMIGLVTSNARFGKGHVIPQMNFSIPCAALEPVFKFSED 1594 LETTA+V+ GGSGGA++NS+G MIGLVTSNAR G G VIP +NFSIPCA L P+F+F+ Sbjct: 583 LETTAAVHPGGSGGAVINSSGHMIGLVTSNARHGGGRVIPHLNFSIPCALLAPIFEFARG 642 Query: 1595 MQDISVLQDLDRPDEHLSTVWALTPPESPRKVPPFLRQPESLQENNTEKKG--SRFAKFI 1768 +DIS+LQ+LDRP++ LS+VWAL P S + PP PESL E++ EK+G S+FAKFI Sbjct: 643 TKDISLLQNLDRPNQQLSSVWALMPSLSHKPSPPLSNLPESLLEDH-EKQGRVSKFAKFI 701 Query: 1769 AERNGEVFLGSNHHMKEGN 1825 AER+ EV S K G+ Sbjct: 702 AERD-EVLRSSTRLGKVGS 719 >ref|XP_004155645.1| PREDICTED: glyoxysomal processing protease, glyoxysomal-like [Cucumis sativus] Length = 747 Score = 488 bits (1255), Expect = e-135 Identities = 284/599 (47%), Positives = 372/599 (62%), Gaps = 8/599 (1%) Frame = +2 Query: 5 DVPASSFALQSVIEATRGYPEVSSWEVGWSLASLNNSPQALMDGHQTEVGHDIRYPLEKP 184 D+P S+ ALQSV++A+ WEVGWSLAS N + D + ++ ++ R + Sbjct: 136 DIPTSATALQSVMDASIDSLH-QRWEVGWSLASYTNGSPSFRDSLRGQIENEKRTSVGSQ 194 Query: 185 SHSG-KSKSRNPNIAMSTIRVAFLGVPSLTTEGLPNINISRSNKRGDLLLAMGSPFGLLS 361 + S+N ++ TIR+A LGVPSL+ + +PNI+IS S +RG LLA+GSPFG+LS Sbjct: 195 KFLDLEGSSKNNDL---TIRIAILGVPSLSKD-MPNISISPSRQRGSFLLAVGSPFGVLS 250 Query: 362 PAHFINSISVGXXXXXXXXXXXXXXLLMADIRCFAGMEGGPIFSEHAEFIGILNRPLRVI 541 P HF+NS+SVG LLMAD+RC GMEG P+F E A IG+L RPL Sbjct: 251 PVHFLNSLSVGSISNCYPPSSLSKSLLMADMRCLPGMEGCPVFDEKARLIGVLIRPLVHY 310 Query: 542 ASGAEIQLVIPWEAIALALSDLLQNGAPKEGVVSNIEK-INALGNECSTNCPQSNRALNF 718 +GAEIQL+IPW AIA A S LL + N + I A+GN + + Sbjct: 311 MTGAEIQLLIPWGAIATACSGLLLGTCNVGERIDNDNRCIGAVGNMAVNKEQKLEGGFSS 370 Query: 719 VTKKQDCHHLLALRIEKAIASVALVTVGEGTWASGVVLNNNGLILTNAHLLEPGRFHKTN 898 + + C +IEKA+ASV LVT+GEG WASGV+LN+ GLILTNAHL+EP RF KTN Sbjct: 371 IQESSGCSRPFPFKIEKAVASVCLVTMGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTN 430 Query: 899 VQGS-DVPTSDSFALSFQSCASVXXXXXXXXXXXXXXXPNLVNNSNTSLGDDPGGYKLGS 1075 V G + + + PN N N L + KL S Sbjct: 431 VGGEKSIENAKLLQSHTEHSPCSMNNSVFGGQEIGNIEPNASKNGNILLHNQLEDNKL-S 489 Query: 1076 YKSFKR--ICVRLDHMNPWIWCDAKVVYISNGPLDIALLQLESFPKNVCPIVPDFTCPSP 1249 + ++ R + VRL H PWIWCDAK++YI G D+ALLQLE P+ + PI D +CP+ Sbjct: 490 FPNYGRRNLHVRLSHAEPWIWCDAKLLYICKGSWDVALLQLEQIPEQLSPITMDCSCPTS 549 Query: 1250 GSKAYVIGHGLFGPRSDLSPSVCSGVVARVVKSQGHIHPIEPGS--EKTTTKYLPAMLET 1423 GSK +VIGHGL GP+S LSPSVCSGVV+ VVK++ P S + + +Y PAMLET Sbjct: 550 GSKIHVIGHGLLGPKSGLSPSVCSGVVSNVVKAK------IPSSYHKGDSLEYFPAMLET 603 Query: 1424 TASVYGGGSGGAIVNSNGEMIGLVTSNARFGKGHVIPQMNFSIPCAALEPVFKFSEDMQD 1603 TA+V+ GGSGGA+VNS G MIGLVTSNAR G+G +IP +NFSIPCAALEP+ +FS+DM+D Sbjct: 604 TAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGVIIPHLNFSIPCAALEPIHRFSKDMED 663 Query: 1604 ISVLQDLDRPDEHLSTVWALTPPESPRKVPPFLRQPESLQENNTEK-KGSRFAKFIAER 1777 +SV++ LD P+E LS++WAL SP+ PP P+ L E++ K KGSRFAKFIAE+ Sbjct: 664 LSVVKVLDEPNEQLSSIWALMSQRSPKPSPP-PGLPQLLGEDHESKGKGSRFAKFIAEQ 721