BLASTX nr result
ID: Coptis23_contig00000995
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00000995
         (1415 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vit...   531   e-148
ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th...   525   e-147
ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab...   518   e-144
emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]   511   e-142
emb|CAB41090.1| cysteine proteinase precursor-like protein [Arab...   509   e-142

>ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score = 531 bits (1368), Expect = e-148
 Identities = 268/378 (70%), Positives = 308/378 (81%), Gaps = 9/378 (2%)
 Frame = -1

Query: 1334 MRGSIIFTLSVT-LLTYALSVSA---NFYD---DAKINQVTDGN--RKFGENFILRKHEE 1182
M G + L V LLT AL+ SA + +D D I QVTDG+ RKFG + +L +E
Sbjct: 1    MGGGLTCALGVAALLTCALAASAISLHEHDTPWDPNIVQVTDGHSHRKFGVDGVLGTEKE 60

Query: 1181 NFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEM 1002
F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDPTA+HGVTPFSDLSEEEFE
Sbjct: 61   -FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSDLSEEEFER 119

Query: 1001 KFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTTG 822
F G+ P G V AA +EV LPE FDWREKGAVTEVKMQGTCGSCWAFSTTG
Sbjct: 120  MFTGV--VGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTG 177

Query: 821  AVEGANFLATGKLISLSEQQLVDCDHTCDPKEKDACDNGCGGGLMTNAYKYLIEAGGLEE 642
AVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGGLEE
Sbjct: 178  AVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEE 237

Query: 641  EVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTYI 462
E +YPY GK G+CKFKP++V RVVNFT +P++E+QIAANLV HGPLAVGLNA+FMQTYI
Sbjct: 238  ESSYPYTGKHGECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYI 297

Query: 461  GGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCRG 282
GGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW K+WGE+GYY+LCRG
Sbjct: 298  GGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWGEHGYYRLCRG 357

Query: 281  QAVCGMNRMVSAVATATS 228
+CGMN MVSAV T TS
Sbjct: 358  HGMCGMNTMVSAVVTQTS 375

>ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
 gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score = 525 bits (1353), Expect = e-147
 Identities = 249/344 (72%), Positives = 281/344 (81%)
 Frame = -1

Query: 1259 DDAKINQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 1080
+D I QVT NR+ N + E F++FM YGK YSTREEY+HRL IFAKNV++AA
Sbjct: 24   EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83

Query: 1079 EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFD 900
EHQ +DP+AVHGVT FSDL+EEEF+ + G+ G V A +EV LPEDFD
Sbjct: 84   EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS-RGGTVGAEAPMVEVDGLPEDFD 142

Query: 899  WREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHTCDPKEKD 720
WREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD CDPK+K
Sbjct: 143  WREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKK 202

Query: 719  ACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDE 540
ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV RV+NFT IPLDE
Sbjct: 203  ACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLDE 262

Query: 539  DQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLG 360
+QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL
Sbjct: 263  NQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLS 322

Query: 359  DRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNRMVSAVATATS 228
Sbjct: 323  NKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 366

>ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata]
 gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata]
          Length = 368

 Score = 518 bits (1334), Expect = e-144
 Identities = 252/360 (70%), Positives = 286/360 (79%), Gaps = 1/360 (0%)
 Frame = -1

Query: 1304 VTLLTYALSVSANFYDDAKINQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEY 1125
+T + + V A+ +D I QVT R+ N + E F++FM YGK YSTREEY
Sbjct: 10   ITCIIFFCHVVASV-EDLTIRQVTADERRVRPNLLGTHTESKFRVFMSDYGKNYSTREEY 68

Query: 1124 VHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPLNDGVVTT 945
+HRL IFAKNV++AAEHQ +DPTAVHGVT FSDL+EEEF+ + G+ V
Sbjct: 69   IHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGHAV-GA 127

Query: 944  NAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQ 765
A +EV LPEDFDWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQ
Sbjct: 128  EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187

Query: 764  QLVDCDHT-CDPKEKDACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPE 588
QLVDCD CDPK+K ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PE
Sbjct: 188  QLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPE 247

Query: 587  KVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGV 408
KV RVVNFT IPLDEDQIAANLVR GPLAVGLNAVFMQTYIGGVSCPLICSKR++NHGV
Sbjct: 248  KVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLICSKRKVNHGV 307

Query: 407  LLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNRMVSAVATATS 228
LLVGYG+KGFSILRL ++PYWIIKNSW KKWGENGYYKLCRG +CG+N MVSAVAT S
Sbjct: 308  LLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 367

>emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score = 511 bits (1315), Expect = e-142
 Identities = 242/320 (75%), Positives = 274/320 (85%)
 Frame = -1

Query: 1187 EENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEF 1008
E+ F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDP A+HGVTPFSDLSEEEF
Sbjct: 4    EKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSEEEF 63

Query: 1007 EMKFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFST 828
E F G+ P G V AA +EV LPE FDWREKGAVTEVKMQGTCGSCWAFST
Sbjct: 64   ERMFTGV--VGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFST 121

Query: 827  TGAVEGANFLATGKLISLSEQQLVDCDHTCDPKEKDACDNGCGGGLMTNAYKYLIEAGGL 648
TGAVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGGL
Sbjct: 122  TGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGGL 181

Query: 647  EEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQT 468
EEE +YPY GK G+CKFKP++V RVVNFT +P++E+QIAANLV HGPLAVGLNA FMQT
Sbjct: 182  EEESSYPYTGKHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQT 241

Query: 467  YIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLC 288
YIGGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW +WGE+GYY+LC
Sbjct: 242  YIGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWGEHGYYRLC 301

Query: 287  RGQAVCGMNRMVSAVATATS 228
RG +CGMN MVSAV T TS
Sbjct: 302  RGHGMCGMNTMVSAVVTQTS 321

>emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
          Length = 363

 Score = 509 bits (1311), Expect = e-142
 Identities = 245/344 (71%), Positives = 277/344 (80%)
 Frame = -1

Query: 1259 DDAKINQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 1080
+D I QVT NR+ N + E F++FM YGK YSTREEY+HRL IFAKNV++AA
Sbjct: 24   EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83

Query: 1079 EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFD 900
EHQ +DP+AVHGVT FSDL+EEEF+ + G+ G V A +EV LPEDFD
Sbjct: 84   EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS-RGGTVGAEAPMVEVDGLPEDFD 142

Query: 899  WREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHTCDPKEKD 720
WREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD +K
Sbjct: 143  WREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA----DKK 198

Query: 719  ACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDE 540
ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV RV+NFT IPLDE
Sbjct: 199  ACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLDE 258

Query: 539  DQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLG 360
+QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL
Sbjct: 259  NQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLS 318

Query: 359  DRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNRMVSAVATATS 228
++PYWIIKNSW KKWGENGYYKLCRG +CG+N MVSAVAT S
Sbjct: 319  NKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 362