BLASTX nr result

ID: Coptis23_contig00000410 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00000410
         (2187 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1...   587   e-165
ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor,...   550   e-154
ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1...   540   e-151
ref|NP_188636.2| aspartyl protease family protein [Arabidopsis t...   533   e-149
ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata] ...   531   e-148

>ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  587 bits (1514), Expect = e-165
 Identities = 301/480 (62%), Positives = 345/480 (71%), Gaps = 5/480 (1%)
 Frame = +3

Query: 396  LYLVSGITTSNTDNRNTAST-----TSGADIHHLNVQEIITGIKARPSKAMGYHKLSEND 560
            L LVSGIT ++T    +A+T     +S     HLNV+E I G +  P +          D
Sbjct: 12   LQLVSGITATSTTTAASAATAAINNSSYPTFQHLNVKETIAGTRIIPLEV-------SED 64

Query: 561  DASTGSKWKLNLVHRDVISEGNVSDYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXVSY 740
                G KW + +VHRD +S GN  D+     G +KRD KRV                 SY
Sbjct: 65   HEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGG----SY 120

Query: 741  EVNDFGSQVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYRQSDP 920
             V+DFG+ VISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC QCY QSDP
Sbjct: 121  RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDP 180

Query: 921  VFDPANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLDRTV 1100
            VFDPA+SASF GV+CSS+VCDRLENAGC AGRCRY VSYGDGSYT+GTLALET+T  RT+
Sbjct: 181  VFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTM 240

Query: 1101 VHNVAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYGSLV 1280
            V +VAIGCGH+NRGMFV           SMSF+GQL GQTGG F YCLVSRG++S GSLV
Sbjct: 241  VRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLV 300

Query: 1281 FGRDALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVVMDT 1460
            FGR+ALP GA WVPLVRNPRAPSF            +RVP+SE++FRLTE G+GGVVMDT
Sbjct: 301  FGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDT 360

Query: 1461 GTAVTRLPLPAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYFSSG 1640
            GTAVTRLP  AY+A RD+F+A TA LPR   GV+IFDTCYDL  +  VRVPTVSFYFS G
Sbjct: 361  GTAVTRLPTLAYQAFRDAFLAQTANLPRA-TGVAIFDTCYDLLGFVSVRVPTVSFYFSGG 419

Query: 1641 PVLTLPARNFLIPVNEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1820
            P+LTLPARNFLIP+++ GTFCFAFAPS++ LS            SFDGANG+VGFGPN C
Sbjct: 420  PILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479


>ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223548143|gb|EEF49635.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 481

 Score =  550 bits (1418), Expect = e-154
 Identities = 286/478 (59%), Positives = 332/478 (69%), Gaps = 3/478 (0%)
 Frame = +3

Query: 396  LYLVSGITTSNTDNRNTASTTSGADIHHLNVQEIITGIKARPSKAMGYHKLSEN-DDAST 572
            L L+S +T   +    T  T S      LNV+E IT      +KA  Y +L +N +D  T
Sbjct: 13   LLLLSLVTIIASATTTTLKTISYPHFQLLNVKEAIT-----ETKASQYQELFDNQNDTLT 67

Query: 573  GSKWKLNLVHRDVISEGNVSDYHQL--FIGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEV 746
              KWKL LVHRD I+  N S Y     F   ++RD KRV                 SY V
Sbjct: 68   EGKWKLKLVHRDKITAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATS---SYSV 124

Query: 747  NDFGSQVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYRQSDPVF 926
             +FG++V+SGM QGSGEYF+RIGVGSPPR QY+VIDSGSDIVWVQCQPC QCY Q+DPVF
Sbjct: 125  EEFGAEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVF 184

Query: 927  DPANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLDRTVVH 1106
            DPA+SASF GV CSS+VC+R+ENAGC AG CRY V YGDGSYT+GTLALET+T  RTVV 
Sbjct: 185  DPADSASFMGVPCSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVR 244

Query: 1107 NVAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFG 1286
            NVAIGCGH+NRGMFV           SMS +GQL GQTGG F YCLVSRG++S GSL FG
Sbjct: 245  NVAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFG 304

Query: 1287 RDALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGT 1466
            R A+PVGA W+PL+RNPRAPSF            M+VP+SED+F+L E G GGVVMDTGT
Sbjct: 305  RGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGT 364

Query: 1467 AVTRLPLPAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPV 1646
            AVTR+P  AY A RD+FI  T  LPR   GVSIFDTCY+L+ +  VRVPTVSFYF+ GP+
Sbjct: 365  AVTRIPTVAYVAFRDAFIGQTGNLPRAS-GVSIFDTCYNLNGFVSVRVPTVSFYFAGGPI 423

Query: 1647 LTLPARNFLIPVNEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1820
            LTLPARNFLIPV+++GTFCFAFA S + LS            SFDGANGFVGFGPN C
Sbjct: 424  LTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  540 bits (1390), Expect = e-151
 Identities = 278/473 (58%), Positives = 331/473 (69%), Gaps = 4/473 (0%)
 Frame = +3

Query: 414  ITTSNTDNRNTASTTSGADIHHLNVQEIITGIKARPS---KAMGYHKLSENDDASTGSKW 584
            +TTS + N       S      LNV++I+T  K  P+   K + + KL+   +AS+ +K+
Sbjct: 12   LTTSTSHN-----IISYPHFQQLNVKQILTETKLNPTNTYKHLQHQKLNIATEASSPAKY 66

Query: 585  KLNLVHRDVISEGNVS-DYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVNDFGS 761
            KL LVHRD +   N S D+   F   M+RD KRV                 +Y    FGS
Sbjct: 67   KLKLVHRDKVPTFNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKP-----TYAEEAFGS 121

Query: 762  QVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYRQSDPVFDPANS 941
             V+SGMEQGSGEYFVRIGVGSPPR+QY+VIDSGSDI+WVQC+PC QCY QSDPVF+PA+S
Sbjct: 122  DVVSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADS 181

Query: 942  ASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLDRTVVHNVAIG 1121
            +S+AGV+C+S VC  ++NAGC  GRCRY VSYGDGSYT+GTLALET+T  RT++ NVAIG
Sbjct: 182  SSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAIG 241

Query: 1122 CGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGRDALP 1301
            CGH N+GMFV            MSF+GQL GQ GGTF YCLVSRG  S G L FGR+A+P
Sbjct: 242  CGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAVP 301

Query: 1302 VGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTAVTRL 1481
            VGA WVPL+ NPRA SF            +RVP+SED+F+L+E G+GGVVMDTGTAVTRL
Sbjct: 302  VGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRL 361

Query: 1482 PLPAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVLTLPA 1661
            P  AYEA RD+FIA T  LPR   GVSIFDTCYDL  +  VRVPTVSFYFS GP+LTLPA
Sbjct: 362  PTAAYEAFRDAFIAQTTNLPRAS-GVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPA 420

Query: 1662 RNFLIPVNEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1820
            RNFLIPV+++G+FCFAFAPSS+ LS            S DGANGFVGFGPN C
Sbjct: 421  RNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473


>ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein
            ASPARTIC PROTEASE IN GUARD CELL 2; Short=AtASPG2; Flags:
            Precursor gi|11994777|dbj|BAB03167.1| nucleoid
            chloroplast DNA-binding protein-like [Arabidopsis
            thaliana] gi|28392860|gb|AAO41867.1| unknown protein
            [Arabidopsis thaliana] gi|332642798|gb|AEE76319.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  533 bits (1373), Expect = e-149
 Identities = 262/417 (62%), Positives = 302/417 (72%)
 Frame = +3

Query: 570  TGSKWKLNLVHRDVISEGNVSDYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVN 749
            + SK+ L L+HRD        ++H      M+RD  RV                  YEVN
Sbjct: 55   SSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVN 114

Query: 750  DFGSQVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYRQSDPVFD 929
            DFGS ++SGM+QGSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC  CY+QSDPVFD
Sbjct: 115  DFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFD 174

Query: 930  PANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLDRTVVHN 1109
            PA S S+ GV+C S+VCDR+EN+GC +G CRY V YGDGSYT+GTLALET+T  +TVV N
Sbjct: 175  PAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRN 234

Query: 1110 VAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGR 1289
            VA+GCGH+NRGMF+           SMSF+GQL GQTGG FGYCLVSRG++S GSLVFGR
Sbjct: 235  VAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGR 294

Query: 1290 DALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTA 1469
            +ALPVGA WVPLVRNPRAPSF            +R+P+ + +F LTE G+GGVVMDTGTA
Sbjct: 295  EALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTA 354

Query: 1470 VTRLPLPAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVL 1649
            VTRLP  AY A RD F + TA LPR   GVSIFDTCYDLS +  VRVPTVSFYF+ GPVL
Sbjct: 355  VTRLPTAAYVAFRDGFKSQTANLPRAS-GVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVL 413

Query: 1650 TLPARNFLIPVNEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1820
            TLPARNFL+PV++ GT+CFAFA S T LS            SFDGANGFVGFGPN C
Sbjct: 414  TLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470


>ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
            gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata
            subsp. lyrata]
          Length = 471

 Score =  531 bits (1367), Expect = e-148
 Identities = 266/423 (62%), Positives = 307/423 (72%), Gaps = 1/423 (0%)
 Frame = +3

Query: 555  NDDASTGSKWKLNLVHRDVISEGNVSDYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXV 734
            +DD++  SK+ L L+HRD        ++H      M+RD  RV                 
Sbjct: 52   SDDSN--SKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSD 109

Query: 735  S-YEVNDFGSQVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYRQ 911
            S YEVNDFGS V+SGM+QGSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC  CY+Q
Sbjct: 110  SRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQ 169

Query: 912  SDPVFDPANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLD 1091
            SDPVFDPA S S+ GV+C S+VCDR+EN+GC +G CRY V YGDGSYT+GTLALET+T  
Sbjct: 170  SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 229

Query: 1092 RTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYG 1271
            +TVV NVA+GCGH+NRGMF+           SMSF+GQL GQTGG FGYCLVSRG++S G
Sbjct: 230  KTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTG 289

Query: 1272 SLVFGRDALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVV 1451
            SLVFGR+ALPVGA WVPLVRNPRAPSF            +R+P+ + +F LTE G+GGVV
Sbjct: 290  SLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVV 349

Query: 1452 MDTGTAVTRLPLPAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYF 1631
            MDTGTAVTRLP  AY A RD F + TA LPR   GVSIFDTCYDLS +  VRVPTVSFYF
Sbjct: 350  MDTGTAVTRLPTGAYAAFRDGFKSQTANLPRAS-GVSIFDTCYDLSGFVSVRVPTVSFYF 408

Query: 1632 SSGPVLTLPARNFLIPVNEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGP 1811
            + GPVLTLPARNFL+PV++ GT+CFAFA S T LS            SFDGANGFVGFGP
Sbjct: 409  TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 468

Query: 1812 NTC 1820
            N C
Sbjct: 469  NVC 471


Top