BLASTX nr result
ID: Coptis24_contig00006288
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00006288 (1575 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vit... 533 e-149 ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th... 526 e-147 ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab... 518 e-144 emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera] 512 e-143 emb|CAB41090.1| cysteine proteinase precursor-like protein [Arab... 509 e-142 >ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera] Length = 375 Score = 533 bits (1374), Expect = e-149 Identities = 269/379 (70%), Positives = 311/379 (82%), Gaps = 9/379 (2%) Frame = +2 Query: 278 MRGSIIFTLSVT-LLTYALSVSA---NFYD---DAKIYQVTDGN--RKFGENFILRKHEE 430 M G + L V LLT AL+ SA + +D D I QVTDG+ RKFG + +L +E Sbjct: 1 MGGGLTCALGVAALLTCALAASAISLHEHDTPWDPNIVQVTDGHSHRKFGVDGVLGTEKE 60 Query: 431 NFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEM 610 F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDPTA+HGVTPFSDLSEEEFE Sbjct: 61 -FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSDLSEEEFER 119 Query: 611 KFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTT 790 F G+ P + GV T AA +EV LPE FDWREKGAVTEVKMQGTCGSCWAFSTT Sbjct: 120 MFTGV--VGRPHMKGGVAETAAA-LEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTT 176 Query: 791 GAVEGANFLATGKLISLSEQQLVDCDHKCDPKEKDACDNGCGGGLMTNAYKYLIEAGGLE 970 GAVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGGLE Sbjct: 177 GAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLE 236 Query: 971 EEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTY 1150 EE +YPY GK G+CKFKP++V RVVNFT +P++E+QIAANLV HGPLAVGLNA+FMQTY Sbjct: 237 EESSYPYTGKHGECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTY 296 Query: 1151 IGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCR 1330 IGGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW K+WGE+GYY+LCR Sbjct: 297 IGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWGEHGYYRLCR 356 Query: 1331 GQAVCGMNSMVSAVATATS 1387 G +CGMN+MVSAV T TS Sbjct: 357 GHGMCGMNTMVSAVVTQTS 375 >ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana] gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana] gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana] Length = 367 Score = 526 bits (1354), Expect = e-147 Identities = 250/345 (72%), Positives = 282/345 (81%) Frame = +2 Query: 353 DDAKIYQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 532 +D I QVT NR+ N + E F++FM YGK YSTREEY+HRL IFAKNV++AA Sbjct: 24 EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83 Query: 533 EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDF 712 EHQ +DP+AVHGVT FSDL+EEEF+ + G+ G V A +EV LPEDF Sbjct: 84 EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS--RGGTVGAEAPMVEVDGLPEDF 141 Query: 713 DWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHKCDPKEK 892 DWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD CDPK+K Sbjct: 142 DWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDK 201 Query: 893 DACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLD 1072 ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV RV+NFT IPLD Sbjct: 202 KACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLD 261 Query: 1073 EDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRL 1252 E+QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL Sbjct: 262 ENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRL 321 Query: 1253 GDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNSMVSAVATATS 1387 ++PYWIIKNSW KKWGENGYYKLCRG +CG+NSMVSAVAT S Sbjct: 322 SNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 366 >ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata] gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata] Length = 368 Score = 518 bits (1335), Expect = e-144 Identities = 253/361 (70%), Positives = 287/361 (79%), Gaps = 1/361 (0%) Frame = +2 Query: 308 VTLLTYALSVSANFYDDAKIYQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEY 487 +T + + V A+ +D I QVT R+ N + E F++FM YGK YSTREEY Sbjct: 10 ITCIIFFCHVVASV-EDLTIRQVTADERRVRPNLLGTHTESKFRVFMSDYGKNYSTREEY 68 Query: 488 VHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPSLNDGVVT 667 +HRL IFAKNV++AAEHQ +DPTAVHGVT FSDL+EEEF+ + G+ V Sbjct: 69 IHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGS--RGHAVG 126 Query: 668 TNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSE 847 A +EV LPEDFDWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSE Sbjct: 127 AEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSE 186 Query: 848 QQLVDCDHK-CDPKEKDACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKP 1024 QQLVDCD CDPK+K ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF P Sbjct: 187 QQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDP 246 Query: 1025 EKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHG 1204 EKV RVVNFT IPLDEDQIAANLVR GPLAVGLNAVFMQTYIGGVSCPLICSKR++NHG Sbjct: 247 EKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLICSKRKVNHG 306 Query: 1205 VLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNSMVSAVATAT 1384 VLLVGYG+KGFSILRL ++PYWIIKNSW KKWGENGYYKLCRG +CG+NSMVSAVAT Sbjct: 307 VLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQV 366 Query: 1385 S 1387 S Sbjct: 367 S 367 >emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera] Length = 321 Score = 512 bits (1319), Expect = e-143 Identities = 243/321 (75%), Positives = 277/321 (86%) Frame = +2 Query: 425 EENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEF 604 E+ F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDP A+HGVTPFSDLSEEEF Sbjct: 4 EKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSEEEF 63 Query: 605 EMKFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFS 784 E F G+ P + GV T AA +EV LPE FDWREKGAVTEVKMQGTCGSCWAFS Sbjct: 64 ERMFTGV--VGRPHMKGGVAETAAA-LEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFS 120 Query: 785 TTGAVEGANFLATGKLISLSEQQLVDCDHKCDPKEKDACDNGCGGGLMTNAYKYLIEAGG 964 TTGAVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGG Sbjct: 121 TTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGG 180 Query: 965 LEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQ 1144 LEEE +YPY GK G+CKFKP++V RVVNFT +P++E+QIAANLV HGPLAVGLNA FMQ Sbjct: 181 LEEESSYPYTGKHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQ 240 Query: 1145 TYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKL 1324 TYIGGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW +WGE+GYY+L Sbjct: 241 TYIGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWGEHGYYRL 300 Query: 1325 CRGQAVCGMNSMVSAVATATS 1387 CRG +CGMN+MVSAV T TS Sbjct: 301 CRGHGMCGMNTMVSAVVTQTS 321 >emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana] Length = 363 Score = 509 bits (1312), Expect = e-142 Identities = 246/345 (71%), Positives = 278/345 (80%) Frame = +2 Query: 353 DDAKIYQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 532 +D I QVT NR+ N + E F++FM YGK YSTREEY+HRL IFAKNV++AA Sbjct: 24 EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83 Query: 533 EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDF 712 EHQ +DP+AVHGVT FSDL+EEEF+ + G+ G V A +EV LPEDF Sbjct: 84 EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS--RGGTVGAEAPMVEVDGLPEDF 141 Query: 713 DWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHKCDPKEK 892 DWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD +K Sbjct: 142 DWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA----DK 197 Query: 893 DACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLD 1072 ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV RV+NFT IPLD Sbjct: 198 KACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLD 257 Query: 1073 EDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRL 1252 E+QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL Sbjct: 258 ENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRL 317 Query: 1253 GDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNSMVSAVATATS 1387 ++PYWIIKNSW KKWGENGYYKLCRG +CG+NSMVSAVAT S Sbjct: 318 SNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 362