BLASTX nr result
ID: Atractylodes21_contig00029526
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00029526 (1494 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

Sequences producing significant alignments: Score (bits) E Value

ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1... 534 e-149
ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2... 521 e-145
ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit,... 519 e-145
ref|XP_002311432.1| predicted protein [Populus trichocarpa] gi|2... 509 e-142
ref|NP_189198.1| aspartyl protease family protein [Arabidopsis t... 505 e-140

>ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458

Score = 534 bits (1376), Expect = e-149
Identities = 271/440 (61%), Positives = 322/440 (73%), Gaps = 8/440 (1%)
Frame = +2

Query: 62 FTHICHSATINSDNH-HFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKPNL--PL 232
FT IC++ I + +LKL LLHI P +PSQA H +L P+
Sbjct: 19 FTDICNALPIAQNGTVEYLKLRLLHIKPFTTPSQALSFDSHRLSFFFSALHTPQSLKSPV 78

Query: 233 TSGAYAGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARH 412
SGA G+GQYFV L LGTPPQ LLL+ADTGSDL+WV CSACR+ C+ P SAFLARH
Sbjct: 79 VSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRN-CTRHTPG-SAFLARH 136

Query: 413 SSSFGLHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNS 592
S++F +HC+D ACQLVP P+ CNH RLHSPCRY YSY DGS T+GFF+KE T+ N+
Sbjct: 137 STTFSPNHCYDSACQLVPLPKHH-RCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNT 195

Query: 593 STGKALQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYC 772
S+G+ + +AFGC F+ISGPSVSG SFNGA GVMGLGRG IS +QLG RFGNKFSYC
Sbjct: 196 SSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYC 255

Query: 773 LKDYTITPPPTSYLLIGTGARN-----SRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLR 937
L D+ I+P PTSYLLIG+ + RMR+TPL NPLS TFYYIGI+SVSVD +KL
Sbjct: 256 LMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLP 315

Query: 938 VSPSVWVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFN 1117
++PSVW +D+LGNGGTIVDSGTTLTFLP+ AY +L +R V+LP+P+ P FD+C N
Sbjct: 316 INPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVN 375

Query: 1118 VSGIRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQ 1297
VS I P LPKLSFKLGG+SVFSPP NY ++T E VKCLALQ V +P GFSVIGNLMQQ
Sbjct: 376 VSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQ 435

Query: 1298 GFLFEFDIGKSRLGFSRRGC 1357
GFL EFD ++RLGFSR GC
Sbjct: 436 GFLLEFDKDRTRLGFSRHGC 455

>ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459

Score = 521 bits (1341), Expect = e-145
Identities = 265/439 (60%), Positives = 319/439 (72%), Gaps = 8/439 (1%)
Frame = +2

Query: 65 THICH-SATINSDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKPNLPLTSG 241
TH+ + +AT + FLKLPLLH P SPSQ+ + PL SG
Sbjct: 21 THLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFSRPNPTLKSPLISG 80

Query: 242 AYAGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSSS 421
A G+GQYFV + LGTPPQ LLL+ADTGSDL+WV CSACR+ CS P SAFL RHSSS
Sbjct: 81 ASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRN-CS-HHPPSSAFLPRHSSS 138

Query: 422 FGLHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSSTG 601
F HCFDP C+L+PH P CNHTRLHSPCR+ YSYADGS+++GFF+KE T+ S +G
Sbjct: 139 FSPFHCFDPHCRLLPHA-PHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197

Query: 602 KALQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLKD 781
+ L+FGCGF+ISGPSVSG FNGA+GVMGLGRGSISF +QLGRRFGNKFSYCL D
Sbjct: 198 SEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMD 257

Query: 782 YTITPPPTSYLLIGTGARN------SRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRVS 943
YT++PPPTS+L+IG G + +++ YTPLQ NPLS TFYYI I S+++D VKL ++
Sbjct: 258 YTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPIN 317

Query: 944 PSVWVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFNVS 1123
P+VW ID+ GNGGT+VDSGTTLT+L AY VL + RR VKLP + P FD+C N S
Sbjct: 318 PAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNAS 377

Query: 1124 G-IRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQG 1300
G RRPSLP+L F+LGG +VF+PP NY +ET EGV CLA++ V S GFSVIGNLMQQG
Sbjct: 378 GESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQG 437

Query: 1301 FLFEFDIGKSRLGFSRRGC 1357
FL EFD +SRLGF+RRGC
Sbjct: 438 FLLEFDKEESRLGFTRRGC 456

>ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus communis]
Length = 455

Score = 519 bits (1336), Expect = e-145
Identities = 263/439 (59%), Positives = 320/439 (72%), Gaps = 13/439 (2%)
Frame = +2

Query: 80 SATINSDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXX-----HRKPNL--PLTS 238
SA N+ +LKLPLLH P SPS+A H++ + P+ S
Sbjct: 19 SAAANTTTE-YLKLPLLHKTPFTSPSEALAFDINRRLSLLHHHRHQQQHKQNSFRSPVIS 77

Query: 239 GAYAGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSS 418
GA +G+GQYFV+L +GTPPQ LLL+ADTGSDLIWV CS CR+ CS P SAF ARHS+
Sbjct: 78 GASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRN-CSHRSPG-SAFFARHST 135

Query: 419 SFGLHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSST 598
++ HC+ P CQLVPHP P CN TRLHSPCRY Y+YAD S T GFF+KEA + N+ST
Sbjct: 136 TYSAIHCYSPQCQLVPHPHPN-PCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTST 194

Query: 599 GKALQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLK 778
GK + + L+FGCGF+ISGPS++G SF GAQGVMGLGR ISF +QLGRRFG+KFSYCL
Sbjct: 195 GKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLM 254

Query: 779 DYTITPPPTSYLLIGTGARNSR------MRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRV 940
DYT++PPPTS+L IG GA+N M +TPL NPLS TFYYI I+ V V+ VKL +
Sbjct: 255 DYTLSPPPTSFLTIG-GAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPI 313

Query: 941 SPSVWVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFNV 1120
+PSVW ID LGNGGTI+DSGTTLTF+ + AY +L AF++ VKLP+P+ P FD+C NV
Sbjct: 314 NPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNV 373

Query: 1121 SGIRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQG 1300
SG+ RP+LP++SF L G SVFSPP NY IET + +KCLA+QPV+ GGFSV+GNLMQQG
Sbjct: 374 SGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQG 433

Query: 1301 FLFEFDIGKSRLGFSRRGC 1357
FL EFD KSRLGF+RRGC
Sbjct: 434 FLLEFDRDKSRLGFTRRGC 452

>ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458

Score = 509 bits (1311), Expect = e-142
Identities = 254/440 (57%), Positives = 313/440 (71%), Gaps = 17/440 (3%)
Frame = +2

Query: 89 INSDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKPN-------LPLTSGAY 247
+++ +LKLPLLH P +P Q+ HR N PL SGA
Sbjct: 18 LSTSTTEYLKLPLLHKTPFPTPLQSLSSDLQRLSLLHHSHHRHQNHRRTSSKSPLMSGAS 77

Query: 248 AGAGQYFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSSSFG 427
+G+GQYFV++ LG+PPQ LLL+ADTGSDL WV CSAC+ +CS+ P S FLARHS++F
Sbjct: 78 SGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSI-HPPGSTFLARHSTTFS 136

Query: 428 LHHCFDPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSSTGKA 607
HCF CQLVP P P CNHTRLHS CRY Y Y+DGS T+GFF+KE T+ N+S+G+
Sbjct: 137 PTHCFSSLCQLVPQPNPN-PCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGRE 195

Query: 608 LQHDSLAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLKDYT 787
++ S+AFGCGF SGPS+ G SFNGA GVMGLGRG ISF +QLGRRFG FSYCL DYT
Sbjct: 196 MKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYT 255

Query: 788 ITPPPTSYLLIGTGA-----RNSRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRVSPSV 952
++PPPTSYL+IG S M +TPL NP + TFYYI I+ V VD VKL + PSV
Sbjct: 256 LSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSV 315

Query: 953 WVIDKLGNGGTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTP----SGSPPNFDICFNV 1120
W +D+LGNGGT++DSGTTLTFL + AYR +L+AF+R VKLP+P + + FD+C NV
Sbjct: 316 WSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCVNV 375

Query: 1121 SGIRRPSLPKLSFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGG-FSVIGNLMQQ 1297
+G+ RP P+LS +LGG S++SPP NY I+ +EG+KCLA+QPV + G FSVIGNLMQQ
Sbjct: 376 TGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQ 435

Query: 1298 GFLFEFDIGKSRLGFSRRGC 1357
GFL EFD GKSRLGFSRRGC
Sbjct: 436 GFLLEFDRGKSRLGFSRRGC 455

>ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452

Score = 505 bits (1300), Expect = e-140
Identities = 256/428 (59%), Positives = 307/428 (71%), Gaps = 7/428 (1%)
Frame = +2

Query: 95 SDNHHFLKLPLLHINPLQSPSQAFXXXXXXXXXXXXXXHRKP----NLPLTSGAYAGAGQ 262
S+++ +LKLPLL +P SP+QA RKP P+ SGA +G+GQ
Sbjct: 26 SNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSL--RRKPIPFVKSPVVSGAASGSGQ 83

Query: 263 YFVTLHLGTPPQPLLLIADTGSDLIWVACSACRDDCSLTRPAHSAFLARHSSSFGLHHCF 442
YFV L +G PPQ LLLIADTGSDL+WV CSACR+ CS PA + F RHSS+F HC+
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRN-CSHHSPA-TVFFPRHSSTFSPAHCY 141

Query: 443 DPACQLVPHPRPPVACNHTRLHSPCRYAYSYADGSITNGFFAKEATSFNSSTGKALQHDS 622
DP C+LVP P CNHTR+HS C Y Y YADGS+T+G FA+E TS +S+GK + S
Sbjct: 142 DPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKS 201

Query: 623 LAFGCGFKISGPSVSGPSFNGAQGVMGLGRGSISFVTQLGRRFGNKFSYCLKDYTITPPP 802
+AFGCGF+ISG SVSG SFNGA GVMGLGRG ISF +QLGRRFGNKFSYCL DYT++PPP
Sbjct: 202 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP 261

Query: 803 TSYLLIGTGARN-SRMRYTPLQTNPLSHTFYYIGIQSVSVDNVKLRVSPSVWVIDKLGNG 979
TSYL+IG G S++ +TPL TNPLS TFYY+ ++SV V+ KLR+ PS+W ID GNG
Sbjct: 262 TSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNG 321

Query: 980 GTIVDSGTTLTFLPDIAYRHVLAAFRRLVKLPTPSGSPPNFDICFNVSGIRRPS--LPKL 1153
GT+VDSGTTL FL + AYR V+AA RR VKLP P FD+C NVSG+ +P LP+L
Sbjct: 322 GTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRL 381

Query: 1154 SFKLGGNSVFSPPVGNYLIETAEGVKCLALQPVTSPGGFSVIGNLMQQGFLFEFDIGKSR 1333
F+ G +VF PP NY IET E ++CLA+Q V GFSVIGNLMQQGFLFEFD +SR
Sbjct: 382 KFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSR 441

Query: 1334 LGFSRRGC 1357
LGFSRRGC
Sbjct: 442 LGFSRRGC 449