BLASTX nr result

ID: Coptis24_contig00006288 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00006288
         (1575 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vit...   533   e-149
ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th...   526   e-147
ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab...   518   e-144
emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]   512   e-143
emb|CAB41090.1| cysteine proteinase precursor-like protein [Arab...   509   e-142

>ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  533 bits (1374), Expect = e-149
 Identities = 269/379 (70%), Positives = 311/379 (82%), Gaps = 9/379 (2%)
 Frame = +2

Query: 278  MRGSIIFTLSVT-LLTYALSVSA---NFYD---DAKIYQVTDGN--RKFGENFILRKHEE 430
            M G +   L V  LLT AL+ SA   + +D   D  I QVTDG+  RKFG + +L   +E
Sbjct: 1    MGGGLTCALGVAALLTCALAASAISLHEHDTPWDPNIVQVTDGHSHRKFGVDGVLGTEKE 60

Query: 431  NFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEM 610
             F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDPTA+HGVTPFSDLSEEEFE 
Sbjct: 61   -FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSDLSEEEFER 119

Query: 611  KFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTT 790
             F G+     P +  GV  T AA +EV  LPE FDWREKGAVTEVKMQGTCGSCWAFSTT
Sbjct: 120  MFTGV--VGRPHMKGGVAETAAA-LEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTT 176

Query: 791  GAVEGANFLATGKLISLSEQQLVDCDHKCDPKEKDACDNGCGGGLMTNAYKYLIEAGGLE 970
            GAVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGGLE
Sbjct: 177  GAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLE 236

Query: 971  EEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTY 1150
            EE +YPY GK G+CKFKP++V  RVVNFT +P++E+QIAANLV HGPLAVGLNA+FMQTY
Sbjct: 237  EESSYPYTGKHGECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTY 296

Query: 1151 IGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCR 1330
            IGGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW K+WGE+GYY+LCR
Sbjct: 297  IGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWGEHGYYRLCR 356

Query: 1331 GQAVCGMNSMVSAVATATS 1387
            G  +CGMN+MVSAV T TS
Sbjct: 357  GHGMCGMNTMVSAVVTQTS 375


>ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
            gi|17979125|gb|AAL49820.1| putative cysteine proteinase
            [Arabidopsis thaliana] gi|332645795|gb|AEE79316.1| Papain
            family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  526 bits (1354), Expect = e-147
 Identities = 250/345 (72%), Positives = 282/345 (81%)
 Frame = +2

Query: 353  DDAKIYQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 532
            +D  I QVT  NR+   N +    E  F++FM  YGK YSTREEY+HRL IFAKNV++AA
Sbjct: 24   EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83

Query: 533  EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDF 712
            EHQ +DP+AVHGVT FSDL+EEEF+  + G+          G V   A  +EV  LPEDF
Sbjct: 84   EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS--RGGTVGAEAPMVEVDGLPEDF 141

Query: 713  DWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHKCDPKEK 892
            DWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD  CDPK+K
Sbjct: 142  DWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDK 201

Query: 893  DACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLD 1072
             ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV  RV+NFT IPLD
Sbjct: 202  KACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLD 261

Query: 1073 EDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRL 1252
            E+QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL
Sbjct: 262  ENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRL 321

Query: 1253 GDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNSMVSAVATATS 1387
             ++PYWIIKNSW KKWGENGYYKLCRG  +CG+NSMVSAVAT  S
Sbjct: 322  SNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 366


>ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
            lyrata] gi|297322116|gb|EFH52537.1| hypothetical protein
            ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata]
          Length = 368

 Score =  518 bits (1335), Expect = e-144
 Identities = 253/361 (70%), Positives = 287/361 (79%), Gaps = 1/361 (0%)
 Frame = +2

Query: 308  VTLLTYALSVSANFYDDAKIYQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEY 487
            +T + +   V A+  +D  I QVT   R+   N +    E  F++FM  YGK YSTREEY
Sbjct: 10   ITCIIFFCHVVASV-EDLTIRQVTADERRVRPNLLGTHTESKFRVFMSDYGKNYSTREEY 68

Query: 488  VHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPSLNDGVVT 667
            +HRL IFAKNV++AAEHQ +DPTAVHGVT FSDL+EEEF+  + G+            V 
Sbjct: 69   IHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGS--RGHAVG 126

Query: 668  TNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSE 847
              A  +EV  LPEDFDWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSE
Sbjct: 127  AEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSE 186

Query: 848  QQLVDCDHK-CDPKEKDACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKP 1024
            QQLVDCD   CDPK+K ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF P
Sbjct: 187  QQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDP 246

Query: 1025 EKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHG 1204
            EKV  RVVNFT IPLDEDQIAANLVR GPLAVGLNAVFMQTYIGGVSCPLICSKR++NHG
Sbjct: 247  EKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLICSKRKVNHG 306

Query: 1205 VLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNSMVSAVATAT 1384
            VLLVGYG+KGFSILRL ++PYWIIKNSW KKWGENGYYKLCRG  +CG+NSMVSAVAT  
Sbjct: 307  VLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQV 366

Query: 1385 S 1387
            S
Sbjct: 367  S 367


>emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score =  512 bits (1319), Expect = e-143
 Identities = 243/321 (75%), Positives = 277/321 (86%)
 Frame = +2

Query: 425  EENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAAEHQALDPTAVHGVTPFSDLSEEEF 604
            E+ F++FM+KYGKEYS+REEYVHRL IFAKN++RAAEHQALDP A+HGVTPFSDLSEEEF
Sbjct: 4    EKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSEEEF 63

Query: 605  EMKFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDFDWREKGAVTEVKMQGTCGSCWAFS 784
            E  F G+     P +  GV  T AA +EV  LPE FDWREKGAVTEVKMQGTCGSCWAFS
Sbjct: 64   ERMFTGV--VGRPHMKGGVAETAAA-LEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFS 120

Query: 785  TTGAVEGANFLATGKLISLSEQQLVDCDHKCDPKEKDACDNGCGGGLMTNAYKYLIEAGG 964
            TTGAVEGA+F++T KL++LSEQQLVDCDH CD ++K ACD+GC GGLMTNAYKYLIEAGG
Sbjct: 121  TTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGG 180

Query: 965  LEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLDEDQIAANLVRHGPLAVGLNAVFMQ 1144
            LEEE +YPY GK G+CKFKP++V  RVVNFT +P++E+QIAANLV HGPLAVGLNA FMQ
Sbjct: 181  LEEESSYPYTGKHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQ 240

Query: 1145 TYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRLGDRPYWIIKNSWSKKWGENGYYKL 1324
            TYIGGVSCPLIC KR INHGVLLVGYGAKG+SILR G +PYWIIKNSW  +WGE+GYY+L
Sbjct: 241  TYIGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWGEHGYYRL 300

Query: 1325 CRGQAVCGMNSMVSAVATATS 1387
            CRG  +CGMN+MVSAV T TS
Sbjct: 301  CRGHGMCGMNTMVSAVVTQTS 321


>emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
          Length = 363

 Score =  509 bits (1312), Expect = e-142
 Identities = 246/345 (71%), Positives = 278/345 (80%)
 Frame = +2

Query: 353  DDAKIYQVTDGNRKFGENFILRKHEENFQIFMQKYGKEYSTREEYVHRLVIFAKNVMRAA 532
            +D  I QVT  NR+   N +    E  F++FM  YGK YSTREEY+HRL IFAKNV++AA
Sbjct: 24   EDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAA 83

Query: 533  EHQALDPTAVHGVTPFSDLSEEEFEMKFMGLRSTESPSLNDGVVTTNAADMEVGDLPEDF 712
            EHQ +DP+AVHGVT FSDL+EEEF+  + G+          G V   A  +EV  LPEDF
Sbjct: 84   EHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGS--RGGTVGAEAPMVEVDGLPEDF 141

Query: 713  DWREKGAVTEVKMQGTCGSCWAFSTTGAVEGANFLATGKLISLSEQQLVDCDHKCDPKEK 892
            DWREKG VTEVK QG CGSCWAFSTTGA EGA+F++TGKL+SLSEQQLVDCD      +K
Sbjct: 142  DWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA----DK 197

Query: 893  DACDNGCGGGLMTNAYKYLIEAGGLEEEVAYPYVGKKGDCKFKPEKVVARVVNFTNIPLD 1072
             ACDNGCGGGLMTNAY+YL+EAGGLEEE +YPY GK+G CKF PEKV  RV+NFT IPLD
Sbjct: 198  KACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLD 257

Query: 1073 EDQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRRINHGVLLVGYGAKGFSILRL 1252
            E+QIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKR +NHGVLLVGYG+KGFSILRL
Sbjct: 258  ENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRL 317

Query: 1253 GDRPYWIIKNSWSKKWGENGYYKLCRGQAVCGMNSMVSAVATATS 1387
             ++PYWIIKNSW KKWGENGYYKLCRG  +CG+NSMVSAVAT  S
Sbjct: 318  SNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVATQVS 362

