BLASTX nr result

ID: Coptis23_contig00000995 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00000995
         (1415 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vit...   531   e-148
ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th...   525   e-147
ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab...   518   e-144
emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]   511   e-142
emb|CAB41090.1| cysteine proteinase precursor-like protein [Arab...   509   e-142

>ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  531 bits (1368), Expect = e-148
 Identities = 268/378 (70%), Positives = 308/378 (81%), Gaps = 9/378 (2%)
 Frame = -1

Query: 1334 MRGSIIFTLSVT-LLTYALSVSA---NFYD---DAKINQVTDGN--RKFGENFILRKHEE 1182
            M G +   L V  LLT AL+ SA   + +D   D  I QVTDG+  RKFG + +L   +E
Sbjct: 1    MGGGLTCALGVAALLTCALAASAISLHEHDTPWDPNIVQVTDGHSHRKFGVDGVLGTEKE 60

Query: 1181 NFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEM 1002
             F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDPTA+HGVTPFSDLSEEEFE 
Sbjct: 61   -FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSDLSEEEFER 119

Query: 1001 KFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTTG 822
             F G+     P   G V   AA +EV  LPE FDWREKGAVTEVKMQGTCGSCWAFSTTG
Sbjct: 120  MFTGV--VGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTG 177

Query: 821  AVEGANFLATGKLISLSEQQLVDCDHTCDPKEKDACDNGCGGGLMTNAYKYLIEAGGLEE 642
            AVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGGLEE
Sbjct: 178  AVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEE 237

Query: 641  EVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTYI 462
            E +YPY GK G+CKFKP++V  RVVNFT +P++E+QIAANLV HGPLAVGLNA+FMQTYI
Sbjct: 238  ESSYPYTGKHGECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYI 297

Query: 461  GGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCRG 282
            GGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW K+WGE+GYY+LCRG
Sbjct: 298  GGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWGEHGYYRLCRG 357

Query: 281  QAVCGMNRMVSAVATATS 228
              +CGMN MVSAV T TS
Sbjct: 358  HGMCGMNTMVSAVVTQTS 375


>ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
            gi|17979125|gb|AAL49820.1| putative cysteine proteinase
            [Arabidopsis thaliana] gi|332645795|gb|AEE79316.1| Papain
            family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  525 bits (1353), Expect = e-147
 Identities = 249/344 (72%), Positives = 281/344 (81%)
 Frame = -1

Query: 1259 DDAKINQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 1080
            +D  I QVT  NR+   N +    E  F++FM  YGK YSTREEY+HRL IFAKNV++AA
Sbjct: 24   EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83

Query: 1079 EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFD 900
            EHQ +DP+AVHGVT FSDL+EEEF+  + G+         G V   A  +EV  LPEDFD
Sbjct: 84   EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS-RGGTVGAEAPMVEVDGLPEDFD 142

Query: 899  WREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHTCDPKEKD 720
            WREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD  CDPK+K 
Sbjct: 143  WREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKK 202

Query: 719  ACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDE 540
            ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV  RV+NFT IPLDE
Sbjct: 203  ACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLDE 262

Query: 539  DQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLG 360
            +QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL 
Sbjct: 263  NQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLS 322

Query: 359  DRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNRMVSAVATATS 228
            ++PYWIIKNSW KKWGENGYYKLCRG  +CG+N MVSAVAT  S
Sbjct: 323  NKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 366


>ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
            lyrata] gi|297322116|gb|EFH52537.1| hypothetical protein
            ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata]
          Length = 368

 Score =  518 bits (1334), Expect = e-144
 Identities = 252/360 (70%), Positives = 286/360 (79%), Gaps = 1/360 (0%)
 Frame = -1

Query: 1304 VTLLTYALSVSANFYDDAKINQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEY 1125
            +T + +   V A+  +D  I QVT   R+   N +    E  F++FM  YGK YSTREEY
Sbjct: 10   ITCIIFFCHVVASV-EDLTIRQVTADERRVRPNLLGTHTESKFRVFMSDYGKNYSTREEY 68

Query: 1124 VHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPLNDGVVTT 945
            +HRL IFAKNV++AAEHQ +DPTAVHGVT FSDL+EEEF+  + G+          V   
Sbjct: 69   IHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGHAV-GA 127

Query: 944  NAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQ 765
             A  +EV  LPEDFDWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQ
Sbjct: 128  EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187

Query: 764  QLVDCDHT-CDPKEKDACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPE 588
            QLVDCD   CDPK+K ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PE
Sbjct: 188  QLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPE 247

Query: 587  KVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGV 408
            KV  RVVNFT IPLDEDQIAANLVR GPLAVGLNAVFMQTYIGGVSCPLICSKR++NHGV
Sbjct: 248  KVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLICSKRKVNHGV 307

Query: 407  LLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNRMVSAVATATS 228
            LLVGYG+KGFSILRL ++PYWIIKNSW KKWGENGYYKLCRG  +CG+N MVSAVAT  S
Sbjct: 308  LLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 367


>emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score =  511 bits (1315), Expect = e-142
 Identities = 242/320 (75%), Positives = 274/320 (85%)
 Frame = -1

Query: 1187 EENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEF 1008
            E+ F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDP A+HGVTPFSDLSEEEF
Sbjct: 4    EKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSEEEF 63

Query: 1007 EMKFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFST 828
            E  F G+     P   G V   AA +EV  LPE FDWREKGAVTEVKMQGTCGSCWAFST
Sbjct: 64   ERMFTGV--VGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFST 121

Query: 827  TGAVEGANFLATGKLISLSEQQLVDCDHTCDPKEKDACDNGCGGGLMTNAYKYLIEAGGL 648
            TGAVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGGL
Sbjct: 122  TGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGGL 181

Query: 647  EEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQT 468
            EEE +YPY GK G+CKFKP++V  RVVNFT +P++E+QIAANLV HGPLAVGLNA FMQT
Sbjct: 182  EEESSYPYTGKHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQT 241

Query: 467  YIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLC 288
            YIGGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW  +WGE+GYY+LC
Sbjct: 242  YIGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWGEHGYYRLC 301

Query: 287  RGQAVCGMNRMVSAVATATS 228
            RG  +CGMN MVSAV T TS
Sbjct: 302  RGHGMCGMNTMVSAVVTQTS 321


>emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
          Length = 363

 Score =  509 bits (1311), Expect = e-142
 Identities = 245/344 (71%), Positives = 277/344 (80%)
 Frame = -1

Query: 1259 DDAKINQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 1080
            +D  I QVT  NR+   N +    E  F++FM  YGK YSTREEY+HRL IFAKNV++AA
Sbjct: 24   EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83

Query: 1079 EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPLNDGVVTTNAADMEVGDLPEDFD 900
            EHQ +DP+AVHGVT FSDL+EEEF+  + G+         G V   A  +EV  LPEDFD
Sbjct: 84   EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS-RGGTVGAEAPMVEVDGLPEDFD 142

Query: 899  WREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHTCDPKEKD 720
            WREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD      +K 
Sbjct: 143  WREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA----DKK 198

Query: 719  ACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDE 540
            ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV  RV+NFT IPLDE
Sbjct: 199  ACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLDE 258

Query: 539  DQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLG 360
            +QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL 
Sbjct: 259  NQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLS 318

Query: 359  DRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNRMVSAVATATS 228
            ++PYWIIKNSW KKWGENGYYKLCRG  +CG+N MVSAVAT  S
Sbjct: 319  NKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 362

