BLASTX nr result
ID: Coptis21_contig00004534
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis21_contig00004534 (1845 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti... 537 e-150 ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2... 535 e-149 ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glyc... 533 e-149 emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera] 533 e-149 ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c... 530 e-148 >ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera] gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera] Length = 486 Score = 537 bits (1383), Expect = e-150 Identities = 268/429 (62%), Positives = 316/429 (73%), Gaps = 11/429 (2%) Frame = +2 Query: 419 EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598 +K +GR GWT +++++V G SMRR+ ER++G DIWLLG+CY++S EE Sbjct: 63 KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122 Query: 599 SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778 SS+ A ++N L EF DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL Sbjct: 123 SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182 Query: 779 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958 H +GRSWRK KP +YIEIL FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM Sbjct: 183 LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242 Query: 959 CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1129 CRSWETL S ++ + +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ Sbjct: 243 CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302 Query: 1130 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1309 W PI +KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL Sbjct: 303 DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362 Query: 1310 DPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARAS 1489 DPHE Q V+DI+R N E DTSSYHC+++RH+ LD+IDPSLAIGFYCRDKDDFDDFC RAS Sbjct: 363 DPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRAS 422 Query: 1490 KLTDASNGAPLLTITKSHSSPKTV-CKDFLVHSVDDESHGDFEMDNMN-------ESEIS 1645 KL D SNGAPL T+ HS PK + C D + D+ G E D+ + E Sbjct: 423 KLADKSNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEGYEH 477 Query: 1646 TQEDDWQLL 1672 EDDWQLL Sbjct: 478 EHEDDWQLL 486 >ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa] Length = 482 Score = 535 bits (1378), Expect = e-149 Identities = 270/423 (63%), Positives = 314/423 (74%), Gaps = 5/423 (1%) Frame = +2 Query: 419 EKKTSHGRT-YGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSE 595 EKK H R GWT ++K+IV GGSMRR+ E ++G DIWLLG CY++S + Sbjct: 68 EKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGACYKISQD 127 Query: 596 ESSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQAL 775 SS DA N L F+ DFSSRI +TYRKGFDAI DSK TSDV+WGCMLRSSQMLVAQAL Sbjct: 128 NSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQAL 187 Query: 776 LFHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYA 955 LFH LGRSWRKP++KP EY+EIL LFGDSE+SAFSIHNLL+ GKAYGLAAGSW+GPYA Sbjct: 188 LFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYA 247 Query: 956 MCRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQ 1126 +C SWE+LV S ++ + +LSM +YVVSG E+GERGGAPVLCIE+ A+ CSEF +GQ Sbjct: 248 VCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQ 307 Query: 1127 AVWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFY 1306 W PI DK+NPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+ AFY Sbjct: 308 EDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFY 367 Query: 1307 LDPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARA 1486 LDPHEVQ V+++ R++ E +TSSYHC+VVRH+PLD IDPSLAIGFYCRDKDDFDDFC A Sbjct: 368 LDPHEVQPVVNVSRDDVEANTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLA 427 Query: 1487 SKLTDASNGAPLLTITKSHSSPKTVCKDFLVH-SVDDESHGDFEMDNMNESEISTQEDDW 1663 SKLTD SNGAPL T+ S + L H S + S + MN+ E EDDW Sbjct: 428 SKLTDESNGAPLFTVAHS--------RKLLKHDSGEVRSDDSLGVMTMNDVEGCVHEDDW 479 Query: 1664 QLL 1672 QLL Sbjct: 480 QLL 482 >ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max] Length = 485 Score = 533 bits (1374), Expect = e-149 Identities = 260/420 (61%), Positives = 312/420 (74%), Gaps = 2/420 (0%) Frame = +2 Query: 419 EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598 EKK R+ GW +++++V GGSMRR ER++G DIWLLGVC+++S +E Sbjct: 69 EKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVCHKISQQE 128 Query: 599 SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778 S+ +N L F DFSS+I +TYRKGFDAIGD+K+TSDVNWGCMLRSSQMLVAQALL Sbjct: 129 STGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQMLVAQALL 188 Query: 779 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958 FH LGRSWRKP++KP EYI++L LFGDSEASAFSIHNLLQ GK YGLA GSW+GPYAM Sbjct: 189 FHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGSWVGPYAM 248 Query: 959 CRSWETLVNSTQQSDKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQAVWK 1138 CR+WE L +P L M IYVVSGDE+GERGGAPV+CIED +K CSEF G AVW Sbjct: 249 CRTWEVLARKKNDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCSEFSSGLAVWT 308 Query: 1139 PIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYLDPH 1318 P+ DKVNPRYIP L +TF FPQSLGI+GGKPG STYI+GVQ++KAFYLDPH Sbjct: 309 PLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQNEKAFYLDPH 368 Query: 1319 EVQQVMDIKRNNSE-VDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARASKL 1495 +VQQV++I + E TSSYHC+V+RH+PLD+IDPSLAIGFYCRDKDDFDDFC++ASKL Sbjct: 369 DVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKL 428 Query: 1496 TDASNGAPLLTITKSHSSPKTVCKDFLVHSVDDESHGDFEMDNMN-ESEISTQEDDWQLL 1672 + SNGAPL T+ KS S K V D S D+ + + M+ ++ T EDDWQLL Sbjct: 429 AEESNGAPLFTVAKSRSFSKQVSNDV---SGDNTGFQEDDFPGMDCGNDTVTNEDDWQLL 485 >emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera] Length = 489 Score = 533 bits (1372), Expect = e-149 Identities = 269/432 (62%), Positives = 316/432 (73%), Gaps = 14/432 (3%) Frame = +2 Query: 419 EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598 +K +GR GWT +++++V G SMRR+ ER++G DIWLLG+CY++S EE Sbjct: 63 KKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEE 122 Query: 599 SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778 SS+ A ++N L EF DFSSRI MTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL Sbjct: 123 SSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALL 182 Query: 779 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958 H +GRSWRK KP +YIEIL FGDS+ASAFSIHN+LQ GKAYGLAAGSW+GPYAM Sbjct: 183 LHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAM 242 Query: 959 CRSWETLVNSTQQS---DKPTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1129 CRSWETL S ++ + +L M IY+VSGDE+GERGGAPV+ IE+ ++ C EF +GQ Sbjct: 243 CRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQV 302 Query: 1130 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1309 W PI +KVNPRYIPSL ATFTFPQSLGILGGKPG STYIVGVQD+KAFYL Sbjct: 303 DWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYL 362 Query: 1310 DPHEVQQVMDIKRNNSEVDTSSYHC---SVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCA 1480 DPHE Q V+DI+R N E DTSSYHC S++RH+ LD+IDPSLAIGFYCRDKDDFDDFC Sbjct: 363 DPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCI 422 Query: 1481 RASKLTDASNGAPLLTITKSHSSPKTV-CKDFLVHSVDDESHGDFEMDNMN-------ES 1636 RASKL D SNGAPL T+ HS PK + C D + D+ G E D+ + E Sbjct: 423 RASKLADESNGAPLFTVAHIHSLPKPISCSDGM-----DDCSGFREDDSFDVVSNKGAEG 477 Query: 1637 EISTQEDDWQLL 1672 EDDWQLL Sbjct: 478 YEHEHEDDWQLL 489 >ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis] gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis] Length = 489 Score = 530 bits (1366), Expect = e-148 Identities = 264/426 (61%), Positives = 312/426 (73%), Gaps = 8/426 (1%) Frame = +2 Query: 419 EKKTSHGRTYGWTFSLKRIVGGGSMRRLHERLIGXXXXXXXXXXXDIWLLGVCYRVSSEE 598 EKK SH R GWT ++K+IV GGSMRR+HER++G DIWLLGVCY++S +E Sbjct: 65 EKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVCYKISEDE 124 Query: 599 SSDDACNANRLYEFSVDFSSRIWMTYRKGFDAIGDSKFTSDVNWGCMLRSSQMLVAQALL 778 S + A N L EF+ D+SSRI MTYR+GFDAIGDSK+ SDV WGCMLRSSQMLVAQALL Sbjct: 125 SGN-ADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLVAQALL 183 Query: 779 FHNLGRSWRKPMEKPFCSEYIEILDLFGDSEASAFSIHNLLQVGKAYGLAAGSWIGPYAM 958 FH LGR+W KP +KP Y+EIL LFGDSEA+ FSIHNL+Q GKAY LAAGSW+GPYAM Sbjct: 184 FHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWVGPYAM 243 Query: 959 CRSWETLVNSTQQSDK---PTLSMTIYVVSGDENGERGGAPVLCIEDVAKLCSEFCEGQA 1129 CRSWE+L S ++ + +L M +YVVSGDE+GERGGAPV+ IED ++ C EF GQA Sbjct: 244 CRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEFSRGQA 303 Query: 1130 VWKPIXXXXXXXXXXDKVNPRYIPSLCATFTFPQSLGILGGKPGVSTYIVGVQDDKAFYL 1309 W PI DKVNPRYIPSL ATFTF QSLGI+GGKPG STYIVGVQDD AFYL Sbjct: 304 DWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQDDNAFYL 363 Query: 1310 DPHEVQQVMDIKRNNSEVDTSSYHCSVVRHLPLDTIDPSLAIGFYCRDKDDFDDFCARAS 1489 DPHEVQ V++I R++ E DTSSYH +VRH+PL +IDPSLAIGFYCRDKDDFD+FC AS Sbjct: 364 DPHEVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFDEFCLLAS 423 Query: 1490 KLTDASNGAPLLTITKSHSSPKTVCKDFLVHSVDDESHGDFEMD-----NMNESEISTQE 1654 KL D S GAPL T+ H PK V ++++ DDE D ++ N + QE Sbjct: 424 KLADDSQGAPLFTVAHCHKLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDAEGGGAQE 483 Query: 1655 DDWQLL 1672 D+WQLL Sbjct: 484 DEWQLL 489