BLASTX nr result

ID: Coptis21_contig00003219 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00003219
         (2054 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1...   582   e-163
ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor,...   552   e-154
ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1...   546   e-152
ref|NP_188636.2| aspartyl protease family protein [Arabidopsis t...   540   e-151
ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata] ...   536   e-150

>ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  582 bits (1501), Expect = e-163
 Identities = 292/448 (65%), Positives = 332/448 (74%)
 Frame = +3

Query: 396  HLNVQEIITGIKARPSKAMAYHKLSENDDASTGSKWKLNLVHRDVISEGNVSDYHQLFIG 575
            HLNV+E I G +  P +    H+         G KW + +VHRD +S GN  D+     G
Sbjct: 44   HLNVKETIAGTRIIPLEVSEDHE-------EGGEKWMMKVVHRDQLSFGNSDDHRHRLDG 96

Query: 576  LMKRDLKRVXXXXXXXXXXXXXXXXVSYEVNDFGSEVISGMEQGSGEYFVRIGVGSPPRS 755
             +KRD KRV                 SY V+DFG++VISGMEQGSGEYFVRIGVGSPPRS
Sbjct: 97   RLKRDAKRVASLIRRLSSGGGG----SYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRS 152

Query: 756  QYMVIDSGSDIVWVQCQPCNQCYKQSDPVFDPANSASFAGVACSSNVCDRLENAGCRAGR 935
            QYMVIDSGSDIVWVQCQPC QCY QSDPVFDPA+SASF GV+CSS+VCDRLENAGC AGR
Sbjct: 153  QYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLENAGCHAGR 212

Query: 936  CRYAVSYGDGSYTRGTLALETITLGRTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXSMSF 1115
            CRY VSYGDGSYT+GTLALET+T GRT+V +VAIGCGH+NRGMFV           SMSF
Sbjct: 213  CRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSF 272

Query: 1116 IGQLEGQTGGTFGYCLVSRGSNSYGSLVFGRDALPVGAVWVPLVRNPRAPSFXXXXXXXX 1295
            +GQL GQTGG F YCLVSRG++S GSLVFGR+ALP GA WVPLVRNPRAPSF        
Sbjct: 273  VGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGL 332

Query: 1296 XXXXMRVPVSEDLFRLTEWGEGGVVMDTGTAVTRLPLAAYEALRDSFIAGTAELPRTPVG 1475
                +RVP+SE++FRLTE G+GGVVMDTGTAVTRLP  AY+A RD+F+A TA LPR   G
Sbjct: 333  GVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRA-TG 391

Query: 1476 VSIFDTCYDLSNYNMVRVPTVSFYFSSGPVLTLPARNFLIPVDEMGTFCFAFAPSSTKLS 1655
            V+IFDTCYDL  +  VRVPTVSFYFS GP+LTLPARNFLIP+D+ GTFCFAFAPS++ LS
Sbjct: 392  VAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLS 451

Query: 1656 XXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
                        SFDGANG+VGFGPN C
Sbjct: 452  ILGNIQQEGIQISFDGANGYVGFGPNIC 479


>ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223548143|gb|EEF49635.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 481

 Score =  552 bits (1422), Expect = e-154
 Identities = 282/450 (62%), Positives = 323/450 (71%), Gaps = 3/450 (0%)
 Frame = +3

Query: 399  LNVQEIITGIKARPSKAMAYHKLSEN-DDASTGSKWKLNLVHRDVISEGNVSDYHQL--F 569
            LNV+E IT      +KA  Y +L +N +D  T  KWKL LVHRD I+  N S Y     F
Sbjct: 41   LNVKEAIT-----ETKASQYQELFDNQNDTLTEGKWKLKLVHRDKITAFNKSSYDHSHNF 95

Query: 570  IGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVNDFGSEVISGMEQGSGEYFVRIGVGSPP 749
               ++RD KRV                 SY V +FG+EV+SGM QGSGEYF+RIGVGSPP
Sbjct: 96   HARIQRDKKRVATLIRRLSPRDATS---SYSVEEFGAEVVSGMNQGSGEYFIRIGVGSPP 152

Query: 750  RSQYMVIDSGSDIVWVQCQPCNQCYKQSDPVFDPANSASFAGVACSSNVCDRLENAGCRA 929
            R QY+VIDSGSDIVWVQCQPC QCY Q+DPVFDPA+SASF GV CSS+VC+R+ENAGC A
Sbjct: 153  REQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSVCERIENAGCHA 212

Query: 930  GRCRYAVSYGDGSYTRGTLALETITLGRTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXSM 1109
            G CRY V YGDGSYT+GTLALET+T GRTVV NVAIGCGH+NRGMFV           SM
Sbjct: 213  GGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSM 272

Query: 1110 SFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGRDALPVGAVWVPLVRNPRAPSFXXXXXX 1289
            S +GQL GQTGG F YCLVSRG++S GSL FGR A+PVGA W+PL+RNPRAPSF      
Sbjct: 273  SLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLS 332

Query: 1290 XXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTAVTRLPLAAYEALRDSFIAGTAELPRTP 1469
                  M+VP+SED+F+L E G GGVVMDTGTAVTR+P  AY A RD+FI  T  LPR  
Sbjct: 333  GVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRAS 392

Query: 1470 VGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVLTLPARNFLIPVDEMGTFCFAFAPSSTK 1649
             GVSIFDTCY+L+ +  VRVPTVSFYF+ GP+LTLPARNFLIPVD++GTFCFAFA S + 
Sbjct: 393  -GVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAASPSG 451

Query: 1650 LSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
            LS            SFDGANGFVGFGPN C
Sbjct: 452  LSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  546 bits (1406), Expect = e-152
 Identities = 276/451 (61%), Positives = 327/451 (72%), Gaps = 4/451 (0%)
 Frame = +3

Query: 399  LNVQEIITGIKARPS---KAMAYHKLSENDDASTGSKWKLNLVHRDVISEGNVS-DYHQL 566
            LNV++I+T  K  P+   K + + KL+   +AS+ +K+KL LVHRD +   N S D+   
Sbjct: 29   LNVKQILTETKLNPTNTYKHLQHQKLNIATEASSPAKYKLKLVHRDKVPTFNTSHDHRTR 88

Query: 567  FIGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVNDFGSEVISGMEQGSGEYFVRIGVGSP 746
            F   M+RD KRV                 +Y    FGS+V+SGMEQGSGEYFVRIGVGSP
Sbjct: 89   FNARMQRDTKRVAALRRHLAAGKP-----TYAEEAFGSDVVSGMEQGSGEYFVRIGVGSP 143

Query: 747  PRSQYMVIDSGSDIVWVQCQPCNQCYKQSDPVFDPANSASFAGVACSSNVCDRLENAGCR 926
            PR+QY+VIDSGSDI+WVQC+PC QCY QSDPVF+PA+S+S+AGV+C+S VC  ++NAGC 
Sbjct: 144  PRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCH 203

Query: 927  AGRCRYAVSYGDGSYTRGTLALETITLGRTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXS 1106
             GRCRY VSYGDGSYT+GTLALET+T GRT++ NVAIGCGH N+GMFV            
Sbjct: 204  EGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGP 263

Query: 1107 MSFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGRDALPVGAVWVPLVRNPRAPSFXXXXX 1286
            MSF+GQL GQ GGTF YCLVSRG  S G L FGR+A+PVGA WVPL+ NPRA SF     
Sbjct: 264  MSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGL 323

Query: 1287 XXXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTAVTRLPLAAYEALRDSFIAGTAELPRT 1466
                   +RVP+SED+F+L+E G+GGVVMDTGTAVTRLP AAYEA RD+FIA T  LPR 
Sbjct: 324  SGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRA 383

Query: 1467 PVGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVLTLPARNFLIPVDEMGTFCFAFAPSST 1646
              GVSIFDTCYDL  +  VRVPTVSFYFS GP+LTLPARNFLIPVD++G+FCFAFAPSS+
Sbjct: 384  S-GVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSS 442

Query: 1647 KLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
             LS            S DGANGFVGFGPN C
Sbjct: 443  GLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473


>ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein
            ASPARTIC PROTEASE IN GUARD CELL 2; Short=AtASPG2; Flags:
            Precursor gi|11994777|dbj|BAB03167.1| nucleoid
            chloroplast DNA-binding protein-like [Arabidopsis
            thaliana] gi|28392860|gb|AAO41867.1| unknown protein
            [Arabidopsis thaliana] gi|332642798|gb|AEE76319.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  540 bits (1390), Expect = e-151
 Identities = 265/417 (63%), Positives = 304/417 (72%)
 Frame = +3

Query: 489  TGSKWKLNLVHRDVISEGNVSDYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVN 668
            + SK+ L L+HRD        ++H      M+RD  RV                  YEVN
Sbjct: 55   SSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVN 114

Query: 669  DFGSEVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYKQSDPVFD 848
            DFGS+++SGM+QGSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC  CYKQSDPVFD
Sbjct: 115  DFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFD 174

Query: 849  PANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLGRTVVHN 1028
            PA S S+ GV+C S+VCDR+EN+GC +G CRY V YGDGSYT+GTLALET+T  +TVV N
Sbjct: 175  PAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRN 234

Query: 1029 VAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGR 1208
            VA+GCGH+NRGMF+           SMSF+GQL GQTGG FGYCLVSRG++S GSLVFGR
Sbjct: 235  VAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGR 294

Query: 1209 DALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTA 1388
            +ALPVGA WVPLVRNPRAPSF            +R+P+ + +F LTE G+GGVVMDTGTA
Sbjct: 295  EALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTA 354

Query: 1389 VTRLPLAAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVL 1568
            VTRLP AAY A RD F + TA LPR   GVSIFDTCYDLS +  VRVPTVSFYF+ GPVL
Sbjct: 355  VTRLPTAAYVAFRDGFKSQTANLPRAS-GVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVL 413

Query: 1569 TLPARNFLIPVDEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
            TLPARNFL+PVD+ GT+CFAFA S T LS            SFDGANGFVGFGPN C
Sbjct: 414  TLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470


>ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
            gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata
            subsp. lyrata]
          Length = 471

 Score =  536 bits (1381), Expect = e-150
 Identities = 268/423 (63%), Positives = 308/423 (72%), Gaps = 1/423 (0%)
 Frame = +3

Query: 474  NDDASTGSKWKLNLVHRDVISEGNVSDYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXV 653
            +DD++  SK+ L L+HRD        ++H      M+RD  RV                 
Sbjct: 52   SDDSN--SKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSD 109

Query: 654  S-YEVNDFGSEVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYKQ 830
            S YEVNDFGS+V+SGM+QGSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC  CYKQ
Sbjct: 110  SRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQ 169

Query: 831  SDPVFDPANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLG 1010
            SDPVFDPA S S+ GV+C S+VCDR+EN+GC +G CRY V YGDGSYT+GTLALET+T  
Sbjct: 170  SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 229

Query: 1011 RTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYG 1190
            +TVV NVA+GCGH+NRGMF+           SMSF+GQL GQTGG FGYCLVSRG++S G
Sbjct: 230  KTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTG 289

Query: 1191 SLVFGRDALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVV 1370
            SLVFGR+ALPVGA WVPLVRNPRAPSF            +R+P+ + +F LTE G+GGVV
Sbjct: 290  SLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVV 349

Query: 1371 MDTGTAVTRLPLAAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYF 1550
            MDTGTAVTRLP  AY A RD F + TA LPR   GVSIFDTCYDLS +  VRVPTVSFYF
Sbjct: 350  MDTGTAVTRLPTGAYAAFRDGFKSQTANLPRAS-GVSIFDTCYDLSGFVSVRVPTVSFYF 408

Query: 1551 SSGPVLTLPARNFLIPVDEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGP 1730
            + GPVLTLPARNFL+PVD+ GT+CFAFA S T LS            SFDGANGFVGFGP
Sbjct: 409  TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 468

Query: 1731 NTC 1739
            N C
Sbjct: 469  NVC 471


Top