BLASTX nr result

ID: Coptis23_contig00008845

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00008845
         (829 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,...   251   2e-64
ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi...   245   1e-62
ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2...   243   5e-62
ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha...   242   8e-62
ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2...   242   8e-62

>ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis] gi|223531426|gb|EEF33260.1| Aspartic
           proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  251 bits (640), Expect = 2e-64
 Identities = 125/250 (50%), Positives = 162/250 (64%), Gaps = 9/250 (3%)
 Frame = -3

Query: 734 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNRTTDLKPMD- 558
           S  ++ L+ R +V K++    + +TL RLERD ARV  +N ++  AI+G  T+DLKP+D 
Sbjct: 61  SQLTMELHSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDT 120

Query: 557 --------IEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYS 402
                   ++ P+ SG SQ SGEYF+R G G P   +Y  +DTGSD+ WIQCAPCA CY 
Sbjct: 121 DSQFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYH 180

Query: 401 QTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYRVDYGDNSMTVGDFVMETL 222
           Q DP+F+ + S+SY PL+C+++QC  L+ S C  R NTC Y V YGD S TVGDFV ET+
Sbjct: 181 QADPIFEPASSTSYSPLSCDTKQCQSLDVSEC--RNNTCLYEVSYGDGSYTVGDFVTETI 238

Query: 221 TFGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAS 42
           T G ++  NVAIGCG  N+GLF+             SFPSQIN  SFSYCLVDR + SAS
Sbjct: 239 TLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSAS 298

Query: 41  TLEFGPLAIP 12
           TLEF    +P
Sbjct: 299 TLEFNSALLP 308


>ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
           gi|297336528|gb|EFH66945.1| aspartyl protease family
           protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  245 bits (625), Expect = 1e-62
 Identities = 126/254 (49%), Positives = 160/254 (62%), Gaps = 10/254 (3%)
 Frame = -3

Query: 734 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNRTTDLKPM-- 561
           SS+SL L+ R +V  ++    K +TL RL RD ARV  L  ++  AIN     DLKP+  
Sbjct: 67  SSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTT 126

Query: 560 --------DIEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCY 405
                   DIE P+ SG +Q SGEYF R G G P +++Y  +DTGSD+ W+QC PCA CY
Sbjct: 127 MYTTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCY 186

Query: 404 SQTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYRVDYGDNSMTVGDFVMET 225
            QT+P+F+ S SSSY+PL+C++ QC  LE S C  R  TC Y V YGD S TVGDF  ET
Sbjct: 187 HQTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC--RNATCLYEVSYGDGSYTVGDFATET 244

Query: 224 LTFGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSA 45
           LT G +  +NVA+GCG  N+GLFV             + PSQ+N  SFSYCLVDR + SA
Sbjct: 245 LTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSA 304

Query: 44  STLEFGPLAIPSDA 3
           ST+EFG  ++P DA
Sbjct: 305 STVEFG-TSLPPDA 317


>ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  243 bits (619), Expect = 5e-62
 Identities = 128/250 (51%), Positives = 159/250 (63%), Gaps = 8/250 (3%)
 Frame = -3

Query: 734 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNRTTDLKPMD- 558
           SS +L+L+ R +++KS  K  K + L RLERD  RV  L  ++  AI G   +DLKP++ 
Sbjct: 74  SSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEK 133

Query: 557 ------IEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYSQT 396
                 +E P+ SG SQ SGEYF+R G G+PPK +Y  +DTGSD+ W+QCAPCA CY Q 
Sbjct: 134 ELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQA 193

Query: 395 DPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYRVDYGDNSMTVGDFVMETLTF 216
           DP+F+ S SSSY PL C + QC  L+ S C  R ++C Y V YGD S TVGDF  ET+T 
Sbjct: 194 DPIFEPSFSSSYAPLTCETHQCKSLDVSEC--RNDSCLYEVSYGDGSYTVGDFATETITL 251

Query: 215 -GDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAST 39
            G ++  NVAIGCG  N+GLFV             SFPSQIN  SFSYCLV+R   SAST
Sbjct: 252 DGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSAST 311

Query: 38  LEFGPLAIPS 9
           LEF    IPS
Sbjct: 312 LEFNS-PIPS 320


>ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
           gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
           protein [Arabidopsis thaliana]
           gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis
           thaliana] gi|23198172|gb|AAN15613.1| unknown protein
           [Arabidopsis thaliana] gi|110736960|dbj|BAF00436.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332192515|gb|AEE30636.1| aspartyl protease-like
           protein [Arabidopsis thaliana]
          Length = 483

 Score =  242 bits (617), Expect = 8e-62
 Identities = 123/250 (49%), Positives = 156/250 (62%), Gaps = 9/250 (3%)
 Frame = -3

Query: 734 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNRTTDLKPM-- 561
           SS+SL L+ R +V  ++    K +TL RL RD ARV  L  ++  AIN     DLKP+  
Sbjct: 65  SSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPIST 124

Query: 560 -------DIEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYS 402
                  DIE P+ SG +Q SGEYF R G G P +++Y  +DTGSD+ W+QC PCA CY 
Sbjct: 125 MYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYH 184

Query: 401 QTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYRVDYGDNSMTVGDFVMETL 222
           QT+P+F+ S SSSY+PL+C++ QC  LE S C  R  TC Y V YGD S TVGDF  ETL
Sbjct: 185 QTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC--RNATCLYEVSYGDGSYTVGDFATETL 242

Query: 221 TFGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAS 42
           T G +  +NVA+GCG  N+GLFV             + PSQ+N  SFSYCLVDR + SAS
Sbjct: 243 TIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSAS 302

Query: 41  TLEFGPLAIP 12
           T++FG    P
Sbjct: 303 TVDFGTSLSP 312


>ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1|
           predicted protein [Populus trichocarpa]
          Length = 484

 Score =  242 bits (617), Expect = 8e-62
 Identities = 123/251 (49%), Positives = 157/251 (62%), Gaps = 9/251 (3%)
 Frame = -3

Query: 734 SSYSLNLYPRNAVYKSQFKSLKEMTLDRLERDVARVNFLNNKIYQAINGNRTTDLKPM-- 561
           S  ++ L  R ++ K+     K +TL RL+RD ARV  L  ++  AIN   ++DLKP+  
Sbjct: 66  SELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLET 125

Query: 560 -------DIEIPVTSGRSQRSGEYFARFGFGTPPKQLYTTIDTGSDITWIQCAPCARCYS 402
                  D++ P+ SG SQ SGEYF+R G G PP Q Y  +DTGSD+ W+QCAPCA CY 
Sbjct: 126 DSEFKPEDLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQ 185

Query: 401 QTDPLFDSSMSSSYKPLACNSQQCTQLEPSGCFIRTNTCAYRVDYGDNSMTVGDFVMETL 222
           Q DP+F+ + S+S+  L+CN++QC  L+ S C  R +TC Y V YGD S TVGDFV ET+
Sbjct: 186 QADPIFEPASSASFSTLSCNTRQCRSLDVSEC--RNDTCLYEVSYGDGSYTVGDFVTETI 243

Query: 221 TFGDSTTRNVAIGCGRINQGLFVXXXXXXXXXXXXXSFPSQINNPSFSYCLVDRTATSAS 42
           T G +   NVAIGCG  N+GLFV             SFPSQIN  SFSYCLVDR + SAS
Sbjct: 244 TLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESAS 303

Query: 41  TLEFGPLAIPS 9
           TLEF     P+
Sbjct: 304 TLEFNSTLPPN 314

