BLASTX nr result

ID: Cimicifuga21_contig00004907 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00004907
         (2031 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2...   611   e-172
ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,...   598   e-168
ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2...   596   e-168
ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi...   580   e-163
ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha...   577   e-162

>ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  611 bits (1576), Expect = e-172
 Identities = 313/493 (63%), Positives = 377/493 (76%), Gaps = 6/493 (1%)
 Frame = -3

Query: 1849 FVFSIVYLFFCS---RVYSRSLLESTETTFLDVSASIQKTRDLLSFNPETLELQSLDEEK 1679
            F+  +++ FFC+    + +   L    TT LDVS SI+++ ++LS NP+  +++   +E+
Sbjct: 6    FLLCVLFAFFCTWGVSLVNARRLSLPRTTVLDVSGSIRESLNVLSLNPQYEQMEFQHQER 65

Query: 1678 Q---SSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKK 1508
                SS SSS T+ LHSR ++HKSSHKDYKSL L+RLERDS RV+SL  ++DLA+ GI K
Sbjct: 66   SFPSSSSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITK 125

Query: 1507 SDLKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVT 1328
            SDLKP   VE+ +E +    L+ P++SG SQGSGEYFSRVGIGSPPK  Y+V+DTGSDV 
Sbjct: 126  SDLKP---VEKELEAE---ALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVN 179

Query: 1327 WVQSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSY 1148
            WVQ APC DCYQQ DPIFEPS SSSY PL C T QC+ LDVS CRN +CLY+VSYGDGSY
Sbjct: 180  WVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSY 239

Query: 1147 TVGDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSY 968
            TVGDF TET+T   S ++ N+A+GCGHDNE                 SFPSQI +++FSY
Sbjct: 240  TVGDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSY 299

Query: 967  CLVDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLF 788
            CLV+RD+ S STL+F +   P ++VTAPLLRN+QLDTFYY+G+TG+ VGG ML IP S F
Sbjct: 300  CLVNRDTDSASTLEFNSPI-PSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSF 358

Query: 787  SVDESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETS 608
             VDESGNGGIIVDSGTA+TRLQ+D YNSLRD+FV+GT +L STSG ALFDTCYDLSS +S
Sbjct: 359  EVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSS 418

Query: 607  VEVPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVD 428
            VEVPTV F FP GK L+LPAKNYLIPVDSAGTFCFAFAPT++ LSIIGNVQQQGTRV  D
Sbjct: 419  VEVPTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYD 478

Query: 427  LGNSLVGFSLKQC 389
            L NSLVGFS   C
Sbjct: 479  LSNSLVGFSPNGC 491


>ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223531426|gb|EEF33260.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 479

 Score =  598 bits (1542), Expect = e-168
 Identities = 303/464 (65%), Positives = 356/464 (76%)
 Frame = -3

Query: 1780 ETTFLDVSASIQKTRDLLSFNPETLELQSLDEEKQSSDSSSFTMRLHSRDTLHKSSHKDY 1601
            +TT LDV ASIQK   +  F     ++   ++++  + SS  TM LHSR ++ K+ H DY
Sbjct: 24   KTTLLDVEASIQKAEAI--FTSSATKMTPFNQQEIVTSSSQLTMELHSRTSVQKTKHPDY 81

Query: 1600 KSLTLSRLERDSARVKSLNLKLDLAVKGIKKSDLKPSQEVEQLVELKDVGELQGPIISGT 1421
            +SLTLSRLERDSARVKS+N +LDLA+ G+  SDLKP     Q        +LQGPIISGT
Sbjct: 82   RSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQF----RAEDLQGPIISGT 137

Query: 1420 SQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTWVQSAPCTDCYQQTDPIFEPSSSSSYKPL 1241
            SQGSGEYFSRVGIG P  P Y+VLDTGSDV W+Q APC DCY Q DPIFEP+SS+SY PL
Sbjct: 138  SQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPL 197

Query: 1240 GCGTEQCRGLDVSACRNGTCLYQVSYGDGSYTVGDFVTETLTYGNSDAVENIAVGCGHDN 1061
             C T+QC+ LDVS CRN TCLY+VSYGDGSYTVGDFVTET+T G++ +V+N+A+GCGH+N
Sbjct: 198  SCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSA-SVDNVAIGCGHNN 256

Query: 1060 EXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYCLVDRDSTSGSTLQFGANSQPPNAVTAPL 881
            E                 SFPSQI +++FSYCLVDRDS S STL+F + +  P+A+TAPL
Sbjct: 257  EGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNS-ALLPHAITAPL 315

Query: 880  LRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFSVDESGNGGIIVDSGTAITRLQTDAYNSL 701
            LRN +LDTFYYVG+TGLSVGG +L IP S+F +DESGNGGII+DSGTA+TRLQT AYN+L
Sbjct: 316  LRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNAL 375

Query: 700  RDAFVKGTPNLKSTSGFALFDTCYDLSSETSVEVPTVRFDFPGGKSLSLPAKNYLIPVDS 521
            RDAFVKGT +L  TS  ALFDTCYDLS +TSVEVPTV F   GGK L LPA NYLIPVDS
Sbjct: 376  RDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDS 435

Query: 520  AGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDLGNSLVGFSLKQC 389
             GTFCFAFAPTS+ LSIIGNVQQQGTRVG DL NSLVGF  +QC
Sbjct: 436  DGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|222864993|gb|EEF02124.1|
            predicted protein [Populus trichocarpa]
          Length = 484

 Score =  596 bits (1536), Expect = e-168
 Identities = 313/491 (63%), Positives = 370/491 (75%), Gaps = 3/491 (0%)
 Frame = -3

Query: 1852 FFVFSIVYLFFCSRVYSRSLL---ESTETTFLDVSASIQKTRDLLSFNPETLELQSLDEE 1682
            F+VF    LFF S   S S +     +ETT LDV+ASIQ+T+++ S  P+   +   +++
Sbjct: 5    FYVF--FSLFFASPPVSCSRILTPHPSETTVLDVAASIQRTKNIFSSGPK---MSPFNQQ 59

Query: 1681 KQSSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKKSD 1502
            ++ + SS  T+ L SR ++ K++H  YKSLTLSRL+RDSARVKSL  +LDLA+  I  SD
Sbjct: 60   EKETTSSELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSD 119

Query: 1501 LKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTWV 1322
            LKP   +E   E K   +LQ PIISGTSQGSGEYFSRVGIG PP   YL+LDTGSDV WV
Sbjct: 120  LKP---LETDSEFKPE-DLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWV 175

Query: 1321 QSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSYTV 1142
            Q APC DCYQQ DPIFEP+SS+S+  L C T QCR LDVS CRN TCLY+VSYGDGSYTV
Sbjct: 176  QCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTV 235

Query: 1141 GDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYCL 962
            GDFVTET+T G++  V+N+A+GCGH+NE                 SFPSQI +T+FSYCL
Sbjct: 236  GDFVTETITLGSAP-VDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCL 294

Query: 961  VDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFSV 782
            VDRDS S STL+F + + PPNAV+APLLRN  LDTFYYVGLTGLSVGG ++ IP S F +
Sbjct: 295  VDRDSESASTLEFNS-TLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQI 353

Query: 781  DESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETSVE 602
            DESGNGG+IVDSGTAITRLQTD YNSLRDAFVK T +L ST+G ALFDTCYDLSS+ +VE
Sbjct: 354  DESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVE 413

Query: 601  VPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDLG 422
            VPTV F FP GK L LPAKNYL+P+DS GTFCFAFAPT++ LSIIGNVQQQGTRV  DL 
Sbjct: 414  VPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLV 473

Query: 421  NSLVGFSLKQC 389
            N LVGF   +C
Sbjct: 474  NHLVGFVPNKC 484


>ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297336528|gb|EFH66945.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  580 bits (1494), Expect = e-163
 Identities = 295/491 (60%), Positives = 367/491 (74%), Gaps = 2/491 (0%)
 Frame = -3

Query: 1855 SFFVFSIVYLFFCSRVYSRSLLES--TETTFLDVSASIQKTRDLLSFNPETLELQSLDEE 1682
            SF  F + +L   S V+SR L ++  T T+ L+V+ SI +T+   SF        +  EE
Sbjct: 7    SFLFFFVFFLTSHSFVFSRILPKTSVTTTSILNVADSIHRTKYTSSFR------LNQQEE 60

Query: 1681 KQSSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKKSD 1502
            +  S SSSF+++LHSR ++  + H DYKSLTL+RL RD+ARVKSL  +LDLA+  I K+D
Sbjct: 61   QTHSRSSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKAD 120

Query: 1501 LKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTWV 1322
            LKP   +    E +D+   + P+ISGT+QGSGEYF+RVGIG+P +  Y+VLDTGSDV W+
Sbjct: 121  LKPVTTMYTTTEEEDI---EAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWL 177

Query: 1321 QSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSYTV 1142
            Q  PC DCY QT+PIFEPSSSSSY+PL C T QC  L+VS CRN TCLY+VSYGDGSYTV
Sbjct: 178  QCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTV 237

Query: 1141 GDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYCL 962
            GDF TETLT G S  V+N+AVGCGH NE                 + PSQ+ +T+FSYCL
Sbjct: 238  GDFATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCL 296

Query: 961  VDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFSV 782
            VDRDS S ST++FG  S PP+AV APLLRN QLDTFYY+GLTG+SVGG +L IP S F +
Sbjct: 297  VDRDSDSASTVEFGT-SLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEM 355

Query: 781  DESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETSVE 602
            DESG+GGII+DSGTA+TRLQT  YNSLRD+F+KGT +L+  +G A+FDTCY+LS++T++E
Sbjct: 356  DESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIE 415

Query: 601  VPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDLG 422
            VPTV F FPGGK L+LPAKNY+IPVDS GTFC AFAPT++ L+IIGNVQQQGTRV  DL 
Sbjct: 416  VPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLA 475

Query: 421  NSLVGFSLKQC 389
            NSL+GFS  +C
Sbjct: 476  NSLIGFSSNKC 486


>ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
            gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
            protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1|
            unknown protein [Arabidopsis thaliana]
            gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis
            thaliana] gi|110736960|dbj|BAF00436.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332192515|gb|AEE30636.1| aspartyl protease-like
            protein [Arabidopsis thaliana]
          Length = 483

 Score =  577 bits (1486), Expect = e-162
 Identities = 297/492 (60%), Positives = 364/492 (73%), Gaps = 2/492 (0%)
 Frame = -3

Query: 1858 NSFFVFSIVYLFFCSRVYSRSLLESTETT--FLDVSASIQKTRDLLSFNPETLELQSLDE 1685
            N  F F I +L   S V+SR L E++ TT   L+V+ SI +T+   SF        +  E
Sbjct: 4    NYSFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFR------LNQQE 57

Query: 1684 EKQSSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKKS 1505
            E+  S SSSF+++LHSR ++  + H DYKSLTL+RL RD+ARVKSL  +LDLA+  I K+
Sbjct: 58   EQTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKA 117

Query: 1504 DLKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTW 1325
            DLKP   +    E     +++ P+ISGT+QGSGEYF+RVGIG P +  Y+VLDTGSDV W
Sbjct: 118  DLKPISTMYTTEEQ----DIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNW 173

Query: 1324 VQSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSYT 1145
            +Q  PC DCY QT+PIFEPSSSSSY+PL C T QC  L+VS CRN TCLY+VSYGDGSYT
Sbjct: 174  LQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYT 233

Query: 1144 VGDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYC 965
            VGDF TETLT G S  V+N+AVGCGH NE                 + PSQ+ +T+FSYC
Sbjct: 234  VGDFATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYC 292

Query: 964  LVDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFS 785
            LVDRDS S ST+ FG  S  P+AV APLLRN QLDTFYY+GLTG+SVGG +L IP S F 
Sbjct: 293  LVDRDSDSASTVDFGT-SLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFE 351

Query: 784  VDESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETSV 605
            +DESG+GGII+DSGTA+TRLQT+ YNSLRD+FVKGT +L+  +G A+FDTCY+LS++T+V
Sbjct: 352  MDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTV 411

Query: 604  EVPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDL 425
            EVPTV F FPGGK L+LPAKNY+IPVDS GTFC AFAPT++ L+IIGNVQQQGTRV  DL
Sbjct: 412  EVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDL 471

Query: 424  GNSLVGFSLKQC 389
             NSL+GFS  +C
Sbjct: 472  ANSLIGFSSNKC 483


Top