BLASTX nr result

ID: Coptis24_contig00000071

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00000071
         (1820 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002326284.1| predicted protein [Populus trichocarpa] gi|1...   500   e-139
gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]            492   e-136
ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isof...   492   e-136
ref|NP_566880.1| thiol protease aleurain-like protein [Arabidops...   491   e-136
ref|NP_001030812.1| thiol protease aleurain-like protein [Arabid...   484   e-134

>ref|XP_002326284.1| predicted protein [Populus trichocarpa] gi|118482340|gb|ABK93094.1|
            unknown [Populus trichocarpa] gi|222833477|gb|EEE71954.1|
            predicted protein [Populus trichocarpa]
          Length = 358

 Score =  500 bits (1287), Expect = e-139
 Identities = 244/357 (68%), Positives = 284/357 (79%), Gaps = 1/357 (0%)
 Frame = +1

Query: 109  MALFHSSAVFSILFLFCCIFAVHAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXX 288
            MA      V SILFL CC   V A ++FD+ NPIKLV+D+ ++F+++ ++ +G +     
Sbjct: 1    MARVAGLVVSSILFLLCC---VAAGSSFDESNPIKLVSDRLHDFESSFVKVLGQSRRALS 57

Query: 289  XXXXXXXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKH 468
                    GKRYET  E+K RF+ F ES++LIRSTN+KGL Y L LN+FAD +W+EFQK+
Sbjct: 58   FARFAHRHGKRYETEGEMKLRFAIFSESLDLIRSTNKKGLPYTLGLNQFADWTWQEFQKY 117

Query: 469  KLGAAQECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAY 645
            +LGAAQ CSAT +GNH LT+A LP  KDWREEGIVSPVK+QGHCGSCWTFSTTGALEAAY
Sbjct: 118  RLGAAQNCSATTRGNHKLTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTGALEAAY 177

Query: 646  KQAFGKDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSC 825
             QAFGK ISLSEQQLVD             LPSQAFEYIK+NGGL+ E AYPYTGKD +C
Sbjct: 178  HQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKDDAC 237

Query: 826  KFSSENAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGT 1005
            KFSSEN  VRVV+SVNIT GAEDELKHAVA VRPVSVAF+V+  FRLY  GV+T+++CG+
Sbjct: 238  KFSSENVGVRVVESVNITLGAEDELKHAVAFVRPVSVAFEVVGSFRLYKEGVYTTSTCGS 297

Query: 1006 SPMDVNHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1176
            +PMDVNHAVLAVGYGVENGIPYWL+KNSWG DWGDNGYFKMEMGKNMCGIATCASYP
Sbjct: 298  TPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWGDNGYFKMEMGKNMCGIATCASYP 354


>gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
          Length = 358

 Score =  492 bits (1267), Expect = e-136
 Identities = 234/347 (67%), Positives = 276/347 (79%), Gaps = 2/347 (0%)
 Frame = +1

Query: 142  ILFLFCCIFAVHAATNFDDQNPIK-LVTDQFNEFQNTLLQTIGDTXXXXXXXXXXXXXGK 318
            ++ L  C+    +A+ FDD+NPI+ +V+D   EF+ ++L  +GD+             GK
Sbjct: 9    LIILIACVAGASSASTFDDENPIRTVVSDALREFETSILSVLGDSRHALSFARFAHRYGK 68

Query: 319  RYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKLGAAQECSA 498
            RYET +E K RF+ F E+++LIRS N+KGLSY L +N FAD +WEEF++H+LGAAQ CSA
Sbjct: 69   RYETAEETKLRFAIFSENLKLIRSHNKKGLSYTLGVNHFADWTWEEFRRHRLGAAQNCSA 128

Query: 499  T-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGKDISL 675
            T KGNH LT+  LP +KDWR  GIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGK ISL
Sbjct: 129  TTKGNHKLTEEALPEMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGKGISL 188

Query: 676  SEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKFSSENAAVR 855
            SEQQLVD             LPSQAFEY+KYNGGL+ E AYPYTGK+G CKFSSEN  V+
Sbjct: 189  SEQQLVDCAGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYPYTGKNGECKFSSENVGVQ 248

Query: 856  VVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSPMDVNHAVL 1035
            V+DSVNIT GAEDELKHAVA VRPVSVAFQV++ FRLY  GV+TS++CG +PMDVNHAVL
Sbjct: 249  VLDSVNITLGAEDELKHAVAFVRPVSVAFQVVNGFRLYKEGVYTSDTCGRTPMDVNHAVL 308

Query: 1036 AVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1176
            AVGYGVENG+PYWL+KNSWGADWGD+GYFKMEMGKNMCG+ATCASYP
Sbjct: 309  AVGYGVENGVPYWLIKNSWGADWGDSGYFKMEMGKNMCGVATCASYP 355


>ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
            gi|147826441|emb|CAN62278.1| hypothetical protein
            VITISV_031382 [Vitis vinifera]
            gi|297738562|emb|CBI27807.3| unnamed protein product
            [Vitis vinifera]
          Length = 362

 Score =  492 bits (1266), Expect = e-136
 Identities = 234/355 (65%), Positives = 281/355 (79%), Gaps = 5/355 (1%)
 Frame = +1

Query: 127  SAVFSILFLFCCIFAV----HAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXXXX 294
            S V ++L L C + +     H  ++FD++NPI+LV+D   + ++++L+ IGDT       
Sbjct: 5    SVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDLESSVLRLIGDTRHAHSFA 64

Query: 295  XXXXXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKL 474
                  GK Y+TVDEIK RF  F E+++LIRSTNRKGL Y L++N+FAD +WEEF++H+L
Sbjct: 65   SFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLAVNQFADWTWEEFRRHRL 124

Query: 475  GAAQECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQ 651
            GAAQ CSAT KGNH LTD  LP  KDWRE+GIVSP+KDQGHCGSCWTFSTTGALEAAY Q
Sbjct: 125  GAAQNCSATLKGNHKLTDVILPETKDWREDGIVSPIKDQGHCGSCWTFSTTGALEAAYAQ 184

Query: 652  AFGKDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKF 831
            AFGK ISLSEQQLVD             LPSQAFEYIKYNGGL+ E AYPYTG DG+CKF
Sbjct: 185  AFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGLDGTCKF 244

Query: 832  SSENAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSP 1011
            SSEN  V+V+DSVNIT GAEDELKHAVA VRPVSVAF+V+H+FR Y  GV+TS +CG++P
Sbjct: 245  SSENIGVQVLDSVNITLGAEDELKHAVAFVRPVSVAFEVVHDFRFYKKGVYTSGTCGSTP 304

Query: 1012 MDVNHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1176
            MDVNHAVLAVGYGVE+G+ YWL+KNSWG +WGDNGYFKME+GKNMCG+ATC+SYP
Sbjct: 305  MDVNHAVLAVGYGVEDGVAYWLIKNSWGENWGDNGYFKMELGKNMCGVATCSSYP 359


>ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
            gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol
            protease aleurain-like; Flags: Precursor
            gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70
            [Arabidopsis thaliana] gi|332644500|gb|AEE78021.1| thiol
            protease aleurain-like protein [Arabidopsis thaliana]
          Length = 358

 Score =  491 bits (1265), Expect = e-136
 Identities = 234/352 (66%), Positives = 281/352 (79%), Gaps = 1/352 (0%)
 Frame = +1

Query: 124  SSAVFSILFLFCCIFAVHAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXXXXXXX 303
            SS++  ILF      A      FD+ NPIK+V+D  +E ++T++Q +G +          
Sbjct: 8    SSSILLILFAA----AASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 304  XXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKLGAA 483
               GK+Y++V+E+K RFS F E+++LIRSTN+KGLSYKLSLN+FAD++W+EFQ++KLGAA
Sbjct: 64   HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 484  QECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFG 660
            Q CSAT KG+H +T+A +P  KDWRE+GIVSPVK+QGHCGSCWTFSTTGALEAAY QAFG
Sbjct: 124  QNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFG 183

Query: 661  KDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKFSSE 840
            K ISLSEQQLVD             LPSQAFEYIKYNGGL+ E AYPYTGKDG CKFS++
Sbjct: 184  KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSAK 243

Query: 841  NAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSPMDV 1020
            N  V+V DSVNIT GAEDELKHAV LVRPVSVAF+V+HEFR Y  GVFTSN+CG +PMDV
Sbjct: 244  NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV 303

Query: 1021 NHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1176
            NHAVLAVGYGVE+ +PYWL+KNSWG +WGDNGYFKMEMGKNMCG+ATC+SYP
Sbjct: 304  NHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVATCSSYP 355


>ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
            gi|332644501|gb|AEE78022.1| thiol protease aleurain-like
            protein [Arabidopsis thaliana]
          Length = 357

 Score =  484 bits (1247), Expect = e-134
 Identities = 233/352 (66%), Positives = 280/352 (79%), Gaps = 1/352 (0%)
 Frame = +1

Query: 124  SSAVFSILFLFCCIFAVHAATNFDDQNPIKLVTDQFNEFQNTLLQTIGDTXXXXXXXXXX 303
            SS++  ILF      A      FD+ NPIK+V+D  +E ++T++Q +G +          
Sbjct: 8    SSSILLILFAA----AASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 304  XXXGKRYETVDEIKKRFSNFVESMELIRSTNRKGLSYKLSLNKFADMSWEEFQKHKLGAA 483
               GK+Y++V+E+K RFS F E+++LIRSTN+KGLSYKLSLN+FAD++W+EFQ++KLGAA
Sbjct: 64   HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 484  QECSAT-KGNHLLTDANLPPVKDWREEGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFG 660
            Q CSAT KG+H +T+A +P  KDWRE+GIVSPVK+QGHCGSCWTFSTTGALEAAY QAFG
Sbjct: 124  QNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFG 183

Query: 661  KDISLSEQQLVDXXXXXXXXXXXXXLPSQAFEYIKYNGGLEAEGAYPYTGKDGSCKFSSE 840
            K ISLSEQQLVD             LPSQAFEYIKYNGGL+ E AYPYTGKDG CKFS++
Sbjct: 184  KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSAK 243

Query: 841  NAAVRVVDSVNITQGAEDELKHAVALVRPVSVAFQVIHEFRLYNGGVFTSNSCGTSPMDV 1020
            N  V+V DSVNIT GAEDELKHAV LVRPVSVAF+V+HEFR Y  GVFTSN+CG +PMDV
Sbjct: 244  NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV 303

Query: 1021 NHAVLAVGYGVENGIPYWLVKNSWGADWGDNGYFKMEMGKNMCGIATCASYP 1176
            NHAVLAVGYGVE+ +PYWL+KNSWG +WGDNGYFKMEMGKNMC +ATC+SYP
Sbjct: 304  NHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC-VATCSSYP 354

