BLASTX nr result

ID: Coptis24_contig00006676 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00006676
         (1480 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi...   417   e-114
ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2...   417   e-114
ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha...   416   e-113
ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,...   415   e-113
ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2...   410   e-112

>ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297336528|gb|EFH66945.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  417 bits (1071), Expect = e-114
 Identities = 220/427 (51%), Positives = 282/427 (66%), Gaps = 13/427 (3%)
 Frame = -3

Query: 1385 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNGTTDLKPM-- 1212
            SS+SL L+ R +V  ++    K +TL RL RD ARV  L  ++  AIN     DLKP+  
Sbjct: 67   SSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTT 126

Query: 1211 --------DIEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCY 1056
                    DIE P+ SG +Q SGEYF R G G P +++Y  +DTGSD+ W+QC PCA CY
Sbjct: 127  MYTTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCY 186

Query: 1055 SQTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYTVDYGDNSMTIGDFVMET 876
             QT+P+F+ S SSSY+PL+C++ QC  LE S C  R  TC Y V YGD S T+GDF  ET
Sbjct: 187  HQTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC--RNATCLYEVSYGDGSYTVGDFATET 244

Query: 875  LTLGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSA 696
            LT+G +  +NVA+GCG  N+GLFV             + PSQ+N  SFSYCLVDR + SA
Sbjct: 245  LTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA 304

Query: 695  STLEFGPLAIPNDAITVPLLRNSRMDTFYYVGLTGISVGGKMLPISPSAFAIDQNGGGGV 516
            ST+EFG  ++P DA+  PLLRN ++DTFYY+GLTGISVGG++L I  S+F +D++G GG+
Sbjct: 305  STVEFG-TSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 363

Query: 515  IVDSGTSVTRLQPQVYNMLRDEFVKGTRGLPPPIRFANFLDTCYNLTSRPA-GIPRVAFH 339
            I+DSGT+VTRLQ  +YN LRD F+KGT  L      A F DTCYNL+++    +P VAFH
Sbjct: 364  IIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMF-DTCYNLSAKTTIEVPTVAFH 422

Query: 338  F-DTKTLELPDKNYLYQA-ADNLFCLAFATPPPGSTFSAVIGNIQQQGMRVSFDTGRSLI 165
            F   K L LP KNY+    +   FCLAFA   P ++  A+IGN+QQQG RV+FD   SLI
Sbjct: 423  FPGGKMLALPAKNYMIPVDSVGTFCLAFA---PTASSLAIIGNVQQQGTRVTFDLANSLI 479

Query: 164  GFSLGKC 144
            GFS  KC
Sbjct: 480  GFSSNKC 486


>ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  417 bits (1071), Expect = e-114
 Identities = 223/425 (52%), Positives = 285/425 (67%), Gaps = 11/425 (2%)
 Frame = -3

Query: 1385 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNGTTDLKPMD- 1209
            SS +L+L+ R +++KS  K  K + L RLERD  RV  L  ++  AI G   +DLKP++ 
Sbjct: 74   SSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEK 133

Query: 1208 ------IEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYSQT 1047
                  +E P+ SG SQ SGEYF+R G G+PPK +Y  +DTGSD+ W+QCAPCA CY Q 
Sbjct: 134  ELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQA 193

Query: 1046 DPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYTVDYGDNSMTIGDFVMETLTL 867
            DP+F+ S SSSY PL C + QC  L+ S C  R ++C Y V YGD S T+GDF  ET+TL
Sbjct: 194  DPIFEPSFSSSYAPLTCETHQCKSLDVSEC--RNDSCLYEVSYGDGSYTVGDFATETITL 251

Query: 866  -GDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAST 690
             G ++  NVAIGCG  N+GLFV             SFPSQIN  SFSYCLV+R   SAST
Sbjct: 252  DGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSAST 311

Query: 689  LEFGPLAIPNDAITVPLLRNSRMDTFYYVGLTGISVGGKMLPISPSAFAIDQNGGGGVIV 510
            LEF    IP+ ++T PLLRN+++DTFYY+G+TGI VGG+ML I  S+F +D++G GG+IV
Sbjct: 312  LEFNS-PIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIV 370

Query: 509  DSGTSVTRLQPQVYNMLRDEFVKGTRGLPPPIRFANFLDTCYNLTSRPA-GIPRVAFHF- 336
            DSGT+VTRLQ  VYN LRD FV+GT+ LP     A F DTCY+L+SR +  +P V+FHF 
Sbjct: 371  DSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALF-DTCYDLSSRSSVEVPTVSFHFP 429

Query: 335  DTKTLELPDKNYLYQA-ADNLFCLAFATPPPGSTFSAVIGNIQQQGMRVSFDTGRSLIGF 159
            D K L LP KNYL    +   FC AFA   P ++  ++IGN+QQQG RVS+D   SL+GF
Sbjct: 430  DGKYLALPAKNYLIPVDSAGTFCFAFA---PTTSALSIIGNVQQQGTRVSYDLSNSLVGF 486

Query: 158  SLGKC 144
            S   C
Sbjct: 487  SPNGC 491


>ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
            protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1|
            unknown protein [Arabidopsis thaliana]
            gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis
            thaliana] gi|110736960|dbj|BAF00436.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332192515|gb|AEE30636.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 483

 Score =  416 bits (1068), Expect = e-113
 Identities = 220/426 (51%), Positives = 281/426 (65%), Gaps = 12/426 (2%)
 Frame = -3

Query: 1385 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNGTTDLKPM-- 1212
            SS+SL L+ R +V  ++    K +TL RL RD ARV  L  ++  AIN     DLKP+  
Sbjct: 65   SSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPIST 124

Query: 1211 -------DIEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYS 1053
                   DIE P+ SG +Q SGEYF R G G P +++Y  +DTGSD+ W+QC PCA CY 
Sbjct: 125  MYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYH 184

Query: 1052 QTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYTVDYGDNSMTIGDFVMETL 873
            QT+P+F+ S SSSY+PL+C++ QC  LE S C  R  TC Y V YGD S T+GDF  ETL
Sbjct: 185  QTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC--RNATCLYEVSYGDGSYTVGDFATETL 242

Query: 872  TLGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAS 693
            T+G +  +NVA+GCG  N+GLFV             + PSQ+N  SFSYCLVDR + SAS
Sbjct: 243  TIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSAS 302

Query: 692  TLEFGPLAIPNDAITVPLLRNSRMDTFYYVGLTGISVGGKMLPISPSAFAIDQNGGGGVI 513
            T++FG    P DA+  PLLRN ++DTFYY+GLTGISVGG++L I  S+F +D++G GG+I
Sbjct: 303  TVDFGTSLSP-DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 361

Query: 512  VDSGTSVTRLQPQVYNMLRDEFVKGTRGLPPPIRFANFLDTCYNLTSR-PAGIPRVAFHF 336
            +DSGT+VTRLQ ++YN LRD FVKGT  L      A F DTCYNL+++    +P VAFHF
Sbjct: 362  IDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMF-DTCYNLSAKTTVEVPTVAFHF 420

Query: 335  -DTKTLELPDKNYLYQA-ADNLFCLAFATPPPGSTFSAVIGNIQQQGMRVSFDTGRSLIG 162
               K L LP KNY+    +   FCLAFA   P ++  A+IGN+QQQG RV+FD   SLIG
Sbjct: 421  PGGKMLALPAKNYMIPVDSVGTFCLAFA---PTASSLAIIGNVQQQGTRVTFDLANSLIG 477

Query: 161  FSLGKC 144
            FS  KC
Sbjct: 478  FSSNKC 483


>ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223531426|gb|EEF33260.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 479

 Score =  415 bits (1067), Expect = e-113
 Identities = 218/426 (51%), Positives = 281/426 (65%), Gaps = 12/426 (2%)
 Frame = -3

Query: 1385 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNGTTDLKPMD- 1209
            S  ++ L+ R +V K++    + +TL RLERD ARV  +N ++  AI+G  T+DLKP+D 
Sbjct: 61   SQLTMELHSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDT 120

Query: 1208 --------IEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYS 1053
                    ++ P+ SG SQ SGEYF+R G G P   +Y  +DTGSD+ WIQCAPCA CY 
Sbjct: 121  DSQFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYH 180

Query: 1052 QTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYTVDYGDNSMTIGDFVMETL 873
            Q DP+F+ + S+SY PL+C+++QC  L+ S C  R NTC Y V YGD S T+GDFV ET+
Sbjct: 181  QADPIFEPASSTSYSPLSCDTKQCQSLDVSEC--RNNTCLYEVSYGDGSYTVGDFVTETI 238

Query: 872  TLGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAS 693
            TLG ++  NVAIGCG  N+GLF+             SFPSQIN  SFSYCLVDR + SAS
Sbjct: 239  TLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSAS 298

Query: 692  TLEFGPLAIPNDAITVPLLRNSRMDTFYYVGLTGISVGGKMLPISPSAFAIDQNGGGGVI 513
            TLEF    +P+ AIT PLLRN  +DTFYYVG+TG+SVGG++L I  S F +D++G GG+I
Sbjct: 299  TLEFNSALLPH-AITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGII 357

Query: 512  VDSGTSVTRLQPQVYNMLRDEFVKGTRGLPPPIRFANFLDTCYNLTSRPA-GIPRVAFHF 336
            +DSGT+VTRLQ   YN LRD FVKGT+ LP     A F DTCY+L+ + +  +P V FH 
Sbjct: 358  IDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALF-DTCYDLSRKTSVEVPTVTFHL 416

Query: 335  -DTKTLELPDKNYLYQA-ADNLFCLAFATPPPGSTFSAVIGNIQQQGMRVSFDTGRSLIG 162
               K L LP  NYL    +D  FC AFA   P S+  ++IGN+QQQG RV FD   SL+G
Sbjct: 417  AGGKVLPLPATNYLIPVDSDGTFCFAFA---PTSSALSIIGNVQQQGTRVGFDLANSLVG 473

Query: 161  FSLGKC 144
            F   +C
Sbjct: 474  FEPRQC 479


>ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1|
            predicted protein [Populus trichocarpa]
          Length = 484

 Score =  410 bits (1055), Expect = e-112
 Identities = 219/426 (51%), Positives = 280/426 (65%), Gaps = 12/426 (2%)
 Frame = -3

Query: 1385 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNGTTDLKPM-- 1212
            S  ++ L  R ++ K+     K +TL RL+RD ARV  L  ++  AIN   ++DLKP+  
Sbjct: 66   SELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLET 125

Query: 1211 -------DIEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYS 1053
                   D++ P+ SG SQ SGEYF+R G G PP Q Y  +DTGSD+ W+QCAPCA CY 
Sbjct: 126  DSEFKPEDLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQ 185

Query: 1052 QTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYTVDYGDNSMTIGDFVMETL 873
            Q DP+F+ + S+S+  L+CN++QC  L+ S C  R +TC Y V YGD S T+GDFV ET+
Sbjct: 186  QADPIFEPASSASFSTLSCNTRQCRSLDVSEC--RNDTCLYEVSYGDGSYTVGDFVTETI 243

Query: 872  TLGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAS 693
            TLG +   NVAIGCG  N+GLFV             SFPSQIN  SFSYCLVDR + SAS
Sbjct: 244  TLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESAS 303

Query: 692  TLEFGPLAIPNDAITVPLLRNSRMDTFYYVGLTGISVGGKMLPISPSAFAIDQNGGGGVI 513
            TLEF    +P +A++ PLLRN  +DTFYYVGLTG+SVGG+++ I  SAF ID++G GGVI
Sbjct: 304  TLEFNS-TLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVI 362

Query: 512  VDSGTSVTRLQPQVYNMLRDEFVKGTRGLPPPIRFANFLDTCYNLTSR-PAGIPRVAFHF 336
            VDSGT++TRLQ  VYN LRD FVK TR LP     A F DTCY+L+S+    +P V+FHF
Sbjct: 363  VDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALF-DTCYDLSSKGNVEVPTVSFHF 421

Query: 335  -DTKTLELPDKNYLYQA-ADNLFCLAFATPPPGSTFSAVIGNIQQQGMRVSFDTGRSLIG 162
             D K L LP KNYL    ++  FC AFA   P ++  ++IGN+QQQG RV +D    L+G
Sbjct: 422  PDGKELPLPAKNYLVPLDSEGTFCFAFA---PTASSLSIIGNVQQQGTRVVYDLVNHLVG 478

Query: 161  FSLGKC 144
            F   KC
Sbjct: 479  FVPNKC 484