BLASTX nr result

ID: Coptis25_contig00002416 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00002416
         (1791 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002326284.1| predicted protein [Populus trichocarpa] gi|1...   500   e-139
gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]            492   e-136
ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isof...   492   e-136
ref|NP_566880.1| thiol protease aleurain-like protein [Arabidops...   491   e-136
ref|NP_001030812.1| thiol protease aleurain-like protein [Arabid...   484   e-134

>ref|XP_002326284.1| predicted protein [Populus trichocarpa] gi|118482340|gb|ABK93094.1|
            unknown [Populus trichocarpa] gi|222833477|gb|EEE71954.1|
            predicted protein [Populus trichocarpa]
          Length = 358

 Score =  500 bits (1287), Expect = e-139
 Identities = 244/357 (68%), Positives = 284/357 (79%), Gaps = 1/357 (0%)
 Frame = +2

Query: 104  MALFHSSAVFSILFLFCCIFAVHAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXX 283
            MA      V SILFL CC   V A ++FD+ NPIKLV+D+ ++F+++ ++ +G +     
Sbjct: 1    MARVAGLVVSSILFLLCC---VAAGSSFDESNPIKLVSDRLHDFESSFVKVLGQSRRALS 57

Query: 284  XXXXXXXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKH 463
                    GKRYET  E+K RF+ F ES++LIRSTN+KGL Y L LN+FAD +W+EFQK+
Sbjct: 58   FARFAHRHGKRYETEGEMKLRFAIFSESLDLIRSTNKKGLPYTLGLNQFADWTWQEFQKY 117

Query: 464  KLGAAQECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAY 640
            +LGAAQ CSAT +GNH LT+A LP  KDWREEGIVSPVK+QGHCGSCWTFSTTGALEAAY
Sbjct: 118  RLGAAQNCSATTRGNHKLTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTGALEAAY 177

Query: 641  KQAFGKDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSC 820
             QAFGK ISLSEQQLVD             LPSQAFEYIK+NGGL+ E AYPYTGKD +C
Sbjct: 178  HQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKDDAC 237

Query: 821  KFSSENAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGT 1000
            KFSSEN  VRVV+SVNIT GAEDELKHAVA VRPVSVAF+V+  FRLY  GV+T+++CG+
Sbjct: 238  KFSSENVGVRVVESVNITLGAEDELKHAVAFVRPVSVAFEVVGSFRLYKEGVYTTSTCGS 297

Query: 1001 SPMDVNHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1171
            +PMDVNHAVLAVGYGVENGIPYWL+KNSWG DWGDNGYFKMEMGKNMCGIATCASYP
Sbjct: 298  TPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWGDNGYFKMEMGKNMCGIATCASYP 354


>gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
          Length = 358

 Score =  492 bits (1267), Expect = e-136
 Identities = 234/347 (67%), Positives = 276/347 (79%), Gaps = 2/347 (0%)
 Frame = +2

Query: 137  ILFLFCCIFAVHAATNFDDQNPIK-LVTDQFNEFQNTLLQTIGDTXXXXXXXXXXXXXGK 313
            ++ L  C+    +A+ FDD+NPI+ +V+D   EF+ ++L  +GD+             GK
Sbjct: 9    LIILIACVAGASSASTFDDENPIRTVVSDALREFETSILSVLGDSRHALSFARFAHRYGK 68

Query: 314  RYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKLGAAQECSA 493
            RYET +E K RF+ F E+++LIRS N+KGLSY L +N FAD +WEEF++H+LGAAQ CSA
Sbjct: 69   RYETAEETKLRFAIFSENLKLIRSHNKKGLSYTLGVNHFADWTWEEFRRHRLGAAQNCSA 128

Query: 494  T-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGKDISL 670
            T KGNH LT+  LP +KDWR  GIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGK ISL
Sbjct: 129  TTKGNHKLTEEALPEMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGKGISL 188

Query: 671  SEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKFSSENAAVR 850
            SEQQLVD             LPSQAFEY+KYNGGL+ E AYPYTGK+G CKFSSEN  V+
Sbjct: 189  SEQQLVDCAGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYPYTGKNGECKFSSENVGVQ 248

Query: 851  VVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSPMDVNHAVL 1030
            V+DSVNIT GAEDELKHAVA VRPVSVAFQV++ FRLY  GV+TS++CG +PMDVNHAVL
Sbjct: 249  VLDSVNITLGAEDELKHAVAFVRPVSVAFQVVNGFRLYKEGVYTSDTCGRTPMDVNHAVL 308

Query: 1031 AVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1171
            AVGYGVENG+PYWL+KNSWGADWGD+GYFKMEMGKNMCG+ATCASYP
Sbjct: 309  AVGYGVENGVPYWLIKNSWGADWGDSGYFKMEMGKNMCGVATCASYP 355


>ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
            gi|147826441|emb|CAN62278.1| hypothetical protein
            VITISV_031382 [Vitis vinifera]
            gi|297738562|emb|CBI27807.3| unnamed protein product
            [Vitis vinifera]
          Length = 362

 Score =  492 bits (1266), Expect = e-136
 Identities = 234/355 (65%), Positives = 281/355 (79%), Gaps = 5/355 (1%)
 Frame = +2

Query: 122  SAVFSILFLFCCIFAV----HAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXXXX 289
            S V ++L L C + +     H  ++FD++NPI+LV+D   + ++++L+ IGDT       
Sbjct: 5    SVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDLESSVLRLIGDTRHAHSFA 64

Query: 290  XXXXXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKL 469
                  GK Y+TVDEIK RF  F E+++LIRSTNRKGL Y L++N+FAD +WEEF++H+L
Sbjct: 65   SFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLAVNQFADWTWEEFRRHRL 124

Query: 470  GAAQECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQ 646
            GAAQ CSAT KGNH LTD  LP  KDWRE+GIVSP+KDQGHCGSCWTFSTTGALEAAY Q
Sbjct: 125  GAAQNCSATLKGNHKLTDVILPETKDWREDGIVSPIKDQGHCGSCWTFSTTGALEAAYAQ 184

Query: 647  AFGKDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKF 826
            AFGK ISLSEQQLVD             LPSQAFEYIKYNGGL+ E AYPYTG DG+CKF
Sbjct: 185  AFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGLDGTCKF 244

Query: 827  SSENAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSP 1006
            SSEN  V+V+DSVNIT GAEDELKHAVA VRPVSVAF+V+H+FR Y  GV+TS +CG++P
Sbjct: 245  SSENIGVQVLDSVNITLGAEDELKHAVAFVRPVSVAFEVVHDFRFYKKGVYTSGTCGSTP 304

Query: 1007 MDVNHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1171
            MDVNHAVLAVGYGVE+G+ YWL+KNSWG +WGDNGYFKME+GKNMCG+ATC+SYP
Sbjct: 305  MDVNHAVLAVGYGVEDGVAYWLIKNSWGENWGDNGYFKMELGKNMCGVATCSSYP 359


>ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
            gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol
            protease aleurain-like; Flags: Precursor
            gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70
            [Arabidopsis thaliana] gi|332644500|gb|AEE78021.1| thiol
            protease aleurain-like protein [Arabidopsis thaliana]
          Length = 358

 Score =  491 bits (1265), Expect = e-136
 Identities = 234/352 (66%), Positives = 281/352 (79%), Gaps = 1/352 (0%)
 Frame = +2

Query: 119  SSAVFSILFLFCCIFAVHAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXXXXXXX 298
            SS++  ILF      A      FD+ NPIK+V+D  +E ++T++Q +G +          
Sbjct: 8    SSSILLILFAA----AASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 299  XXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKLGAA 478
               GK+Y++V+E+K RFS F E+++LIRSTN+KGLSYKLSLN+FAD++W+EFQ++KLGAA
Sbjct: 64   HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 479  QECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFG 655
            Q CSAT KG+H +T+A +P  KDWRE+GIVSPVK+QGHCGSCWTFSTTGALEAAY QAFG
Sbjct: 124  QNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFG 183

Query: 656  KDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKFSSE 835
            K ISLSEQQLVD             LPSQAFEYIKYNGGL+ E AYPYTGKDG CKFS++
Sbjct: 184  KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSAK 243

Query: 836  NAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSPMDV 1015
            N  V+V DSVNIT GAEDELKHAV LVRPVSVAF+V+HEFR Y  GVFTSN+CG +PMDV
Sbjct: 244  NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV 303

Query: 1016 NHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1171
            NHAVLAVGYGVE+ +PYWL+KNSWG +WGDNGYFKMEMGKNMCG+ATC+SYP
Sbjct: 304  NHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVATCSSYP 355


>ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
            gi|332644501|gb|AEE78022.1| thiol protease aleurain-like
            protein [Arabidopsis thaliana]
          Length = 357

 Score =  484 bits (1247), Expect = e-134
 Identities = 233/352 (66%), Positives = 280/352 (79%), Gaps = 1/352 (0%)
 Frame = +2

Query: 119  SSAVFSILFLFCCIFAVHAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXXXXXXX 298
            SS++  ILF      A      FD+ NPIK+V+D  +E ++T++Q +G +          
Sbjct: 8    SSSILLILFAA----AASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 299  XXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKLGAA 478
               GK+Y++V+E+K RFS F E+++LIRSTN+KGLSYKLSLN+FAD++W+EFQ++KLGAA
Sbjct: 64   HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 479  QECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFG 655
            Q CSAT KG+H +T+A +P  KDWRE+GIVSPVK+QGHCGSCWTFSTTGALEAAY QAFG
Sbjct: 124  QNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFG 183

Query: 656  KDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKFSSE 835
            K ISLSEQQLVD             LPSQAFEYIKYNGGL+ E AYPYTGKDG CKFS++
Sbjct: 184  KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSAK 243

Query: 836  NAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSPMDV 1015
            N  V+V DSVNIT GAEDELKHAV LVRPVSVAF+V+HEFR Y  GVFTSN+CG +PMDV
Sbjct: 244  NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV 303

Query: 1016 NHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1171
            NHAVLAVGYGVE+ +PYWL+KNSWG +WGDNGYFKMEMGKNMC +ATC+SYP
Sbjct: 304  NHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC-VATCSSYP 354


Top