BLASTX nr result
ID: Cimicifuga21_contig00004907
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00004907
         (2031 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2...   611   e-172
ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor,...   598   e-168
ref|XP_002315953.1| predicted protein [Populus trichocarpa] gi|2...   596   e-168
ref|XP_002890686.1| aspartyl protease family protein [Arabidopsi...   580   e-163
ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis tha...   577   e-162

>ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score = 611 bits (1576), Expect = e-172
 Identities = 313/493 (63%), Positives = 377/493 (76%), Gaps = 6/493 (1%)
 Frame = -3

Query: 1849 FVFSIVYLFFCS---RVYSRSLLESTETTFLDVSASIQKTRDLLSFNPETLELQSLDEEK 1679
            F+ +++ FFC+ + + L TT LDVS SI+++ ++LS NP+ +++ +E+
Sbjct: 6    FLLCVLFAFFCTWGVSLVNARRLSLPRTTVLDVSGSIRESLNVLSLNPQYEQMEFQHQER 65

Query: 1678 Q---SSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKK 1508
            SS SSS T+ LHSR ++HKSSHKDYKSL L+RLERDS RV+SL ++DLA+ GI K
Sbjct: 66   SFPSSSSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITK 125

Query: 1507 SDLKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVT 1328
            SDLKP VE+ +E + L+ P++SG SQGSGEYFSRVGIGSPPK Y+V+DTGSDV
Sbjct: 126  SDLKP---VEKELEAE---ALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVN 179

Query: 1327 WVQSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSY 1148
            WVQ APC DCYQQ DPIFEPS SSSY PL C T QC+ LDVS CRN +CLY+VSYGDGSY
Sbjct: 180  WVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSY 239

Query: 1147 TVGDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSY 968
            TVGDF TET+T S ++ N+A+GCGHDNE SFPSQI +++FSY
Sbjct: 240  TVGDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSY 299

Query: 967  CLVDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLF 788
            CLV+RD+ S STL+F + P ++VTAPLLRN+QLDTFYY+G+TG+ VGG ML IP S F
Sbjct: 300  CLVNRDTDSASTLEFNSPI-PSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSF 358

Query: 787  SVDESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETS 608
            VDESGNGGIIVDSGTA+TRLQ+D YNSLRD+FV+GT +L STSG ALFDTCYDLSS +S
Sbjct: 359  EVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSS 418

Query: 607  VEVPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVD 428
            VEVPTV F FP GK L+LPAKNYLIPVDSAGTFCFAFAPT++ LSIIGNVQQQGTRV D
Sbjct: 419  VEVPTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYD 478

Query: 427  LGNSLVGFSLKQC 389
            L NSLVGFS C
Sbjct: 479  LSNSLVGFSPNGC 491

>ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 479

 Score = 598 bits (1542), Expect = e-168
 Identities = 303/464 (65%), Positives = 356/464 (76%)
 Frame = -3

Query: 1780 ETTFLDVSASIQKTRDLLSFNPETLELQSLDEEKQSSDSSSFTMRLHSRDTLHKSSHKDY 1601
            +TT LDV ASIQK + F ++ ++++ + SS TM LHSR ++ K+ H DY
Sbjct: 24   KTTLLDVEASIQKAEAI--FTSSATKMTPFNQQEIVTSSSQLTMELHSRTSVQKTKHPDY 81

Query: 1600 KSLTLSRLERDSARVKSLNLKLDLAVKGIKKSDLKPSQEVEQLVELKDVGELQGPIISGT 1421
            +SLTLSRLERDSARVKS+N +LDLA+ G+ SDLKP Q +LQGPIISGT
Sbjct: 82   RSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQF----RAEDLQGPIISGT 137

Query: 1420 SQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTWVQSAPCTDCYQQTDPIFEPSSSSSYKPL 1241
            SQGSGEYFSRVGIG P P Y+VLDTGSDV W+Q APC DCY Q DPIFEP+SS+SY PL
Sbjct: 138  SQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPL 197

Query: 1240 GCGTEQCRGLDVSACRNGTCLYQVSYGDGSYTVGDFVTETLTYGNSDAVENIAVGCGHDN 1061
            C T+QC+ LDVS CRN TCLY+VSYGDGSYTVGDFVTET+T G++ +V+N+A+GCGH+N
Sbjct: 198  SCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSA-SVDNVAIGCGHNN 256

Query: 1060 EXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYCLVDRDSTSGSTLQFGANSQPPNAVTAPL 881
            E SFPSQI +++FSYCLVDRDS S STL+F + + P+A+TAPL
Sbjct: 257  EGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNS-ALLPHAITAPL 315

Query: 880  LRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFSVDESGNGGIIVDSGTAITRLQTDAYNSL 701
            LRN +LDTFYYVG+TGLSVGG +L IP S+F +DESGNGGII+DSGTA+TRLQT AYN+L
Sbjct: 316  LRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNAL 375

Query: 700  RDAFVKGTPNLKSTSGFALFDTCYDLSSETSVEVPTVRFDFPGGKSLSLPAKNYLIPVDS 521
            RDAFVKGT +L TS ALFDTCYDLS +TSVEVPTV F GGK L LPA NYLIPVDS
Sbjct: 376  RDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDS 435

Query: 520  AGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDLGNSLVGFSLKQC 389
            GTFCFAFAPTS+ LSIIGNVQQQGTRVG DL NSLVGF +QC
Sbjct: 436  DGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479

>ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 596 bits (1536), Expect = e-168
 Identities = 313/491 (63%), Positives = 370/491 (75%), Gaps = 3/491 (0%)
 Frame = -3

Query: 1852 FFVFSIVYLFFCSRVYSRSLL---ESTETTFLDVSASIQKTRDLLSFNPETLELQSLDEE 1682
            F+VF LFF S S S + +ETT LDV+ASIQ+T+++ S P+ + +++
Sbjct: 5    FYVF--FSLFFASPPVSCSRILTPHPSETTVLDVAASIQRTKNIFSSGPK---MSPFNQQ 59

Query: 1681 KQSSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKKSD 1502
            ++ + SS T+ L SR ++ K++H YKSLTLSRL+RDSARVKSL +LDLA+ I SD
Sbjct: 60   EKETTSSELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSD 119

Query: 1501 LKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTWV 1322
            LKP +E E K +LQ PIISGTSQGSGEYFSRVGIG PP YL+LDTGSDV WV
Sbjct: 120  LKP---LETDSEFKPE-DLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWV 175

Query: 1321 QSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSYTV 1142
            Q APC DCYQQ DPIFEP+SS+S+ L C T QCR LDVS CRN TCLY+VSYGDGSYTV
Sbjct: 176  QCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTV 235

Query: 1141 GDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYCL 962
            GDFVTET+T G++ V+N+A+GCGH+NE SFPSQI +T+FSYCL
Sbjct: 236  GDFVTETITLGSAP-VDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCL 294

Query: 961  VDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFSV 782
            VDRDS S STL+F + + PPNAV+APLLRN LDTFYYVGLTGLSVGG ++ IP S F +
Sbjct: 295  VDRDSESASTLEFNS-TLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQI 353

Query: 781  DESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETSVE 602
            DESGNGG+IVDSGTAITRLQTD YNSLRDAFVK T +L ST+G ALFDTCYDLSS+ +VE
Sbjct: 354  DESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVE 413

Query: 601  VPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDLG 422
            VPTV F FP GK L LPAKNYL+P+DS GTFCFAFAPT++ LSIIGNVQQQGTRV DL
Sbjct: 414  VPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLV 473

Query: 421  NSLVGFSLKQC 389
            N LVGF +C
Sbjct: 474  NHLVGFVPNKC 484

>ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 580 bits (1494), Expect = e-163
 Identities = 295/491 (60%), Positives = 367/491 (74%), Gaps = 2/491 (0%)
 Frame = -3

Query: 1855 SFFVFSIVYLFFCSRVYSRSLLES--TETTFLDVSASIQKTRDLLSFNPETLELQSLDEE 1682
            SF F + +L S V+SR L ++ T T+ L+V+ SI +T+ SF + EE
Sbjct: 7    SFLFFFVFFLTSHSFVFSRILPKTSVTTTSILNVADSIHRTKYTSSFR------LNQQEE 60

Query: 1681 KQSSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKKSD 1502
            + S SSSF+++LHSR ++ + H DYKSLTL+RL RD+ARVKSL +LDLA+ I K+D
Sbjct: 61   QTHSRSSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKAD 120

Query: 1501 LKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTWV 1322
            LKP + E +D+ + P+ISGT+QGSGEYF+RVGIG+P + Y+VLDTGSDV W+
Sbjct: 121  LKPVTTMYTTTEEEDI---EAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWL 177

Query: 1321 QSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSYTV 1142
            Q PC DCY QT+PIFEPSSSSSY+PL C T QC L+VS CRN TCLY+VSYGDGSYTV
Sbjct: 178  QCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTV 237

Query: 1141 GDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYCL 962
            GDF TETLT G S V+N+AVGCGH NE + PSQ+ +T+FSYCL
Sbjct: 238  GDFATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCL 296

Query: 961  VDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFSV 782
            VDRDS S ST++FG S PP+AV APLLRN QLDTFYY+GLTG+SVGG +L IP S F +
Sbjct: 297  VDRDSDSASTVEFGT-SLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEM 355

Query: 781  DESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETSVE 602
            DESG+GGII+DSGTA+TRLQT YNSLRD+F+KGT +L+ +G A+FDTCY+LS++T++E
Sbjct: 356  DESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIE 415

Query: 601  VPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDLG 422
            VPTV F FPGGK L+LPAKNY+IPVDS GTFC AFAPT++ L+IIGNVQQQGTRV DL
Sbjct: 416  VPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLA 475

Query: 421  NSLVGFSLKQC 389
            NSL+GFS +C
Sbjct: 476  NSLIGFSSNKC 486

>ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score = 577 bits (1486), Expect = e-162
 Identities = 297/492 (60%), Positives = 364/492 (73%), Gaps = 2/492 (0%)
 Frame = -3

Query: 1858 NSFFVFSIVYLFFCSRVYSRSLLESTETT--FLDVSASIQKTRDLLSFNPETLELQSLDE 1685
            N F F I +L S V+SR L E++ TT L+V+ SI +T+ SF + E
Sbjct: 4    NYSFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFR------LNQQE 57

Query: 1684 EKQSSDSSSFTMRLHSRDTLHKSSHKDYKSLTLSRLERDSARVKSLNLKLDLAVKGIKKS 1505
            E+ S SSSF+++LHSR ++ + H DYKSLTL+RL RD+ARVKSL +LDLA+ I K+
Sbjct: 58   EQTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKA 117

Query: 1504 DLKPSQEVEQLVELKDVGELQGPIISGTSQGSGEYFSRVGIGSPPKPQYLVLDTGSDVTW 1325
            DLKP + E +++ P+ISGT+QGSGEYF+RVGIG P + Y+VLDTGSDV W
Sbjct: 118  DLKPISTMYTTEEQ----DIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNW 173

Query: 1324 VQSAPCTDCYQQTDPIFEPSSSSSYKPLGCGTEQCRGLDVSACRNGTCLYQVSYGDGSYT 1145
            +Q PC DCY QT+PIFEPSSSSSY+PL C T QC L+VS CRN TCLY+VSYGDGSYT
Sbjct: 174  LQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYT 233

Query: 1144 VGDFVTETLTYGNSDAVENIAVGCGHDNEXXXXXXXXXXXXXXXXXSFPSQIKSTAFSYC 965
            VGDF TETLT G S V+N+AVGCGH NE + PSQ+ +T+FSYC
Sbjct: 234  VGDFATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYC 292

Query: 964  LVDRDSTSGSTLQFGANSQPPNAVTAPLLRNSQLDTFYYVGLTGLSVGGAMLPIPPSLFS 785
            LVDRDS S ST+ FG S P+AV APLLRN QLDTFYY+GLTG+SVGG +L IP S F
Sbjct: 293  LVDRDSDSASTVDFGT-SLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFE 351

Query: 784  VDESGNGGIIVDSGTAITRLQTDAYNSLRDAFVKGTPNLKSTSGFALFDTCYDLSSETSV 605
            +DESG+GGII+DSGTA+TRLQT+ YNSLRD+FVKGT +L+ +G A+FDTCY+LS++T+V
Sbjct: 352  MDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTV 411

Query: 604  EVPTVRFDFPGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSNPLSIIGNVQQQGTRVGVDL 425
            EVPTV F FPGGK L+LPAKNY+IPVDS GTFC AFAPT++ L+IIGNVQQQGTRV DL
Sbjct: 412  EVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDL 471

Query: 424  GNSLVGFSLKQC 389
            NSL+GFS +C
Sbjct: 472  ANSLIGFSSNKC 483