BLASTX nr result
ID: Cimicifuga21_contig00009803
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00009803
         (946 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor,...   178   1e-42
ref|XP_002300215.1| predicted protein [Populus trichocarpa] gi|2...   178   2e-42
ref|XP_002876869.1| aspartyl protease family protein [Arabidopsi...   177   2e-42
ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis tha...   176   5e-42
gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein...   176   5e-42

>ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 442

 Score = 178 bits (452), Expect = 1e-42
 Identities = 120/311 (38%), Positives = 158/311 (50%), Gaps = 6/311 (1%)
 Frame = +3

Query: 15  PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNNCKYNITYGDDDTTIGVVVQETLQFRG 194
           PIFDP KSSSF L C CK + + C ++C+Y TYGD +T G + ET F
Sbjct: 140 PIFDPKKSSSFSKLSCSSQLCKALPQSSCS--DSCEYLYTYGDYSSTQGTMATETFTFG- 196

Query: 195 ANENVTLEDIIIGCGYNNTNIMENLGEGIPGLIGLSRDPESFVSQLFYDKFAYCLGDVID 374
           V++ ++ GCG +N + +G GL+GL R P S VSQL KF+YCL ID
Sbjct: 197 KVSIPNVGFGCGEDNEG--DGFTQG-SGLVGLGRGPLSLVSQLKEAKFSYCLTS-ID 249

Query: 375 DEAYGYARFGEAASITGFTTPIQSV----EDNGXXXXXXILEGISVDNTRLSIPNGTFSV 542
           D G AS+ G + I++ LEGISV TRL I TF +
Sbjct: 250 DTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQL 309

Query: 543 NKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKLFELCYELEGFDLN 722
           + G IIDSG T TYL ++A++++ + + GL + ELCY L
Sbjct: 310 QDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQ-MGLPVDNSGATGLELCYNLPSDTSE 368

Query: 723 F-APVITLHFSGADLRLPVWNTWLTPIK-NVYCLSMFPTGGMSILGNFQQQNFNVGHDLD 896
           P + LHF+GADL LP N + V CL+M +GGMSI GN QQQN V HDL+
Sbjct: 369 LEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLE 428

Query: 897 NNVVSFDLTYC 929
           +SF T C
Sbjct: 429 KETLSFLPTNC 439

>ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 178 bits (451), Expect = 2e-42
 Identities = 117/311 (37%), Positives = 159/311 (51%), Gaps = 6/311 (1%)
 Frame = +3

Query: 15  PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNNCKYNITYGDDDTTIGVVVQETLQFRG 194
           PIFDP KSSSF L C C+ + + C+ N C+Y +YGD +T G++ ETL F
Sbjct: 137 PIFDPKKSSSFSKLSCSSQLCEALPQSSCN--NGCEYLYSYGDYSSTQGILASETLTFGK 194

Query: 195 ANENVTLEDIIIGCGYNNTNIMENLGEGIPGLIGLSRDPESFVSQLFYDKFAYCLGDVID 374
           A+ + ++ GCG +N + G G L+GL R P S VSQL KF+YCL +D
Sbjct: 195 AS----VPNVAFGCGADNEGSGFSQGAG---LVGLGRGPLSLVSQLKEPKFSYCL-TTVD 246

Query: 375 DEAYGYARFGEAASITGFTTPIQSV----EDNGXXXXXXILEGISVDNTRLSIPNGTFSV 542
           D G AS+ ++ I++ LEGISV +TRL I TFS+
Sbjct: 247 DTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSL 306

Query: 543 NKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKLFELCYELEGFDLN 722
           + G IIDSG T TYL ++A+N++ + K L S + ++C+ L N
Sbjct: 307 QDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI-NLPVDSSGSTGLDVCFTLPSGSTN 365

Query: 723 F-APVITLHFSGADLRLPVWNTWL-TPIKNVYCLSMFPTGGMSILGNFQQQNFNVGHDLD 896
           P + HF GADL LP N + V CL+M + GMSI GN QQQN V HDL+
Sbjct: 366 IEVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLE 425

Query: 897 NNVVSFDLTYC 929
           +SF T C
Sbjct: 426 KETLSFLPTQC 436

>ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 177 bits (450), Expect = 2e-42
 Identities = 121/323 (37%), Positives = 162/323 (50%), Gaps = 18/323 (5%)
 Frame = +3

Query: 15  PIFDPAKSSSFRPLICDDDECKKIDNNQC-DPHNNCKYNITYGDDDTTIGVVVQETLQFR 191
           PIFDP KSSS+ + C C + + C + ++C+Y TYGD +T G++ ET F
Sbjct: 148 PIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFE 207

Query: 192 GANENVTLEDIIIGCGYNNTNIMENLGEGIP---GLIGLSRDPESFVSQLFYDKFAYCLG 362
           N ++ I GCG +EN G+G GL+GL R P S +SQL KF+YCL
Sbjct: 208 DEN---SISGIGFGCG------VENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLT 258

Query: 363 DVIDDEAYGYARFGEAAS---------ITGFTTPIQSVEDNGXXXXXXILE--GISVDNT 509
           + D EA G AS + G T S+ N LE GI+V
Sbjct: 259 SIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK 318

Query: 510 RLSIPNGTFSVNKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKL-F 686
           RLS+ TF ++++ G IIDSG T TYL +TA+ +L + + V D
Sbjct: 319 RLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS--LPVDDSGSTGL 376

Query: 687 ELCYELEGFDLNFA-PVITLHFSGADLRLPVWNTWLTPIKN-VYCLSMFPTGGMSILGNF 860
           +LC++L N A P + HF GADL LP N + V CL+M + GMSI GN
Sbjct: 377 DLCFKLPNAAKNIAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 436

Query: 861 QQQNFNVGHDLDNNVVSFDLTYC 929
           QQQNFNV HDL+ V+F T C
Sbjct: 437 QQQNFNVLHDLEKETVTFVPTEC 459

>ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 176 bits (447), Expect = 5e-42
 Identities = 122/323 (37%), Positives = 160/323 (49%), Gaps = 18/323 (5%)
 Frame = +3

Query: 15  PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNN-CKYNITYGDDDTTIGVVVQETLQFR 191
           PIFDP KSSS+ + C C + + C+ + C+Y TYGD +T G++ ET F
Sbjct: 147 PIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE 206

Query: 192 GANENVTLEDIIIGCGYNNTNIMENLGEGIP---GLIGLSRDPESFVSQLFYDKFAYCLG 362
           N ++ I GCG +EN G+G GL+GL R P S +SQL KF+YCL
Sbjct: 207 DEN---SISGIGFGCG------VENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLT 257

Query: 363 DVIDDEAYGYARFGE---------AASITGFTTPIQSVEDNGXXXXXXILE--GISVDNT 509
           + D EA G AS+ G T S+ N LE GI+V
Sbjct: 258 SIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK 317

Query: 510 RLSIPNGTFSVNKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKL-F 686
           RLS+ TF + ++ G IIDSG T TYL +TA+ +L + + V D
Sbjct: 318 RLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS--LPVDDSGSTGL 375

Query: 687 ELCYELEGFDLNFA-PVITLHFSGADLRLPVWNTWLTPIKN-VYCLSMFPTGGMSILGNF 860
           +LC++L N A P + HF GADL LP N + V CL+M + GMSI GN
Sbjct: 376 DLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 435

Query: 861 QQQNFNVGHDLDNNVVSFDLTYC 929
           QQQNFNV HDL+ VSF T C
Sbjct: 436 QQQNFNVLHDLEKETVSFVPTEC 458

>gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis thaliana]
          Length = 353

 Score = 176 bits (447), Expect = 5e-42
 Identities = 122/323 (37%), Positives = 160/323 (49%), Gaps = 18/323 (5%)
 Frame = +3

Query: 15  PIFDPAKSSSFRPLICDDDECKKIDNNQCDPHNN-CKYNITYGDDDTTIGVVVQETLQFR 191
           PIFDP KSSS+ + C C + + C+ + C+Y TYGD +T G++ ET F
Sbjct: 39  PIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE 98

Query: 192 GANENVTLEDIIIGCGYNNTNIMENLGEGIP---GLIGLSRDPESFVSQLFYDKFAYCLG 362
           N ++ I GCG +EN G+G GL+GL R P S +SQL KF+YCL
Sbjct: 99  DEN---SISGIGFGCG------VENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLT 149

Query: 363 DVIDDEAYGYARFGE---------AASITGFTTPIQSVEDNGXXXXXXILE--GISVDNT 509
           + D EA G AS+ G T S+ N LE GI+V
Sbjct: 150 SIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK 209

Query: 510 RLSIPNGTFSVNKEYPSGFIIDSGATYTYLNKTAYNILINDLKQKTQGLRKVSDPNKL-F 686
           RLS+ TF + ++ G IIDSG T TYL +TA+ +L + + V D
Sbjct: 210 RLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS--LPVDDSGSTGL 267

Query: 687 ELCYELEGFDLNFA-PVITLHFSGADLRLPVWNTWLTPIKN-VYCLSMFPTGGMSILGNF 860
           +LC++L N A P + HF GADL LP N + V CL+M + GMSI GN
Sbjct: 268 DLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 327

Query: 861 QQQNFNVGHDLDNNVVSFDLTYC 929
           QQQNFNV HDL+ VSF T C
Sbjct: 328 QQQNFNVLHDLEKETVSFVPTEC 350