BLASTX nr result
ID: Atractylodes21_contig00001936
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00001936
         (1825 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]         632   e-179
gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]              511   e-142
ref|XP_002317418.1| predicted protein [Populus trichocarpa] gi|2...     500   e-139
ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|2...     495   e-137
ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|1...     492   e-136

>gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score = 632 bits (1631), Expect = e-179
 Identities = 309/503 (61%), Positives = 371/503 (73%), Gaps = 5/503 (0%)
 Frame = -1

Query: 1804 MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGK 1625
            M TS +++T+L + +VS +I+T LP E S+L E ++ LS KV +LF WKE+HGK
Sbjct: 1    MATSNSMITILIFLTYVSYSISTKTLPSEFSILEGQENDI-LSSAKVSDLFGKWKELHGK 59

Query: 1624 TYEHAEEET-RLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 1448
            TY+H EEE RL NF+KS+K+V+EKNS+RKSE++H VGLNKFADLSNEEFKE Y+SKVKG
Sbjct: 60   TYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMYMSKVKG 119

Query: 1447 SNSNILKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAI 1268
            S SN LKM G VK+N + + +CDAP SLDWR+KGVVTP+KDQGQCGSCWAFSV G+I
Sbjct: 120  SRSNELKMGG--VKRNMSVSS-RTCDAPTSLDWRDKGVVTPMKDQGQCGSCWAFSVSGSI 176

Query: 1267 ESAHALDTGDLIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYTSANG 1088
            ESA+A+ TGDLIRLSEQELV NMDTA+RW+IKNGG+D+E DYPYTS+NG
Sbjct: 177  ESANAIATGDLIRLSEQELVDCDTYDYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSNG 236

Query: 1087 YSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDG 908
            GKC +K S+DSY +VE +E+A+LCAVA PVT+GIVGSAYDFQLYTGG+Y+G
Sbjct: 237  RDGKCDKTKSAKSVVSLDSYVEVESNEDAVLCAVATTPVTIGIVGSAYDFQLYTGGVYNG 296

Query: 907  ECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGINV 728
            +CSS PYDIDHAVL+VGYGSQDG+DYWIVKNSWGTYWG++G+ILM+RNTD KNGVCG+ +
Sbjct: 297  QCSSKPYDIDHAVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILMERNTDIKNGVCGMYL 356

Query: 727  Q----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEFYDY 560
            + SKCGDF YCAA QTCCCIFEFY+Y
Sbjct: 357  EPVYPITAAPTPPGPPPPPAPPSPPHPPPPPTPPAPSKCGDFHYCAADQTCCCIFEFYNY 416

Query: 559  CLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXXMPW 380
            CLIYGCCGYS+AVCCK S+ CCPSDYP+CDV GYC+KNS T GV MPW
Sbjct: 417  CLIYGCCGYSDAVCCKNSAACCPSDYPICDVQAGYCYKNSAKTFGVPAKKRQLAKHKMPW 476

Query: 379  EKTEETVVEEYLPLRWK*NKFEA 311
            EK EET+ EE+ PL W N F A
Sbjct: 477  EKIEETIKEEFQPLAWNRNPFAA 499

>gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score = 511 bits (1315), Expect = e-142
 Identities = 252/506 (49%), Positives = 316/506 (62%), Gaps = 14/506 (2%)
 Frame = -1

Query: 1786 LLTLLFLIIHVSST-IATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGKTYEHA 1610
            L TL+ ++ S T + +S LP E S++ P +++ ++V+ELF+ W E HGK Y+H
Sbjct: 8    LFTLVIFLVWASLTSLISSSLPSEFSIVG--RPGESIAEERVVELFKKWTEKHGKVYKHG 65

Query: 1609 EE-ETRLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKGSNSNI 1433
            +E E + NFR +L+YV+EKN +R + H+VGLNKFAD+SNEEF+E Y+SKVK S
Sbjct: 66   QEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKR 125

Query: 1432 LKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAIESAHA 1253
            + + K A V +CD P SLDWR+ G+VT +KDQG CGSCWAFS GAIE +A
Sbjct: 126  MAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINA 185

Query: 1252 LDTGDLIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYTSANGYSGKC 1073
            L GDLI LSEQELV MD AF WV+ NGGIDTE DYPYT G G C
Sbjct: 186  LANGDLISLSEQELVDCDSTNDGCEGGYMDYAFEWVMSNGGIDTETDYPYT---GEDGTC 242

Query: 1072 KISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDGECSSS 893
            +K+ A SID Y DV +E+AL CAV KQP++VGI G A DFQLYTGGIYDG+CS
Sbjct: 243  NTTKEETKAVSIDGYEDVAEEESALFCAVLKQPISVGIDGGAIDFQLYTGGIYDGDCSDD 302

Query: 892  PYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGINV----- 728
            P DIDHAVLVVGYG++ GE+YWI+KNSWGT WGM G+ +KRNT K GVC IN
Sbjct: 303  PDDIDHAVLVVGYGAESGEEYWIIKNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASYP 362

Query: 727  -------QXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEF 569
            ++CGDFSYCAA +TCCCIFEF
Sbjct: 363  TKESSAPSPYPSPAVPPPPPPPPPPPSPPPPPPPPSPSPTQCGDFSYCAATETCCCIFEF 422

Query: 568  YDYCLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXX 389
            +DYCLIYGCC Y++AVCC G+ YCCP DYP+CD+ +G C +N GD +GV
Sbjct: 423  FDYCLIYGCCDYTDAVCCTGTEYCCPHDYPICDIEEGLCLQNDGDFLGVTAKKRKMAKHK 482

Query: 388  MPWEKTEETVVEEYLPLRWK*NKFEA 311
            PW K E++ + + PL WK N+F A
Sbjct: 483  YPWTKPEDS-AKNHQPLEWKRNRFAA 507

>ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score = 500 bits (1288), Expect = e-139
 Identities = 250/514 (48%), Positives = 320/514 (62%), Gaps = 16/514 (3%)
 Frame = -1

Query: 1804 MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGK 1625
            M + ++ ++L+ ++ T +S LP E ++ + EL +S + ++E+FQ W++ H K
Sbjct: 1    MDSKKSQMSLIIFLLLALLTCLSSSLPGEHPIVVNDFSEL-VSEESIIEIFQQWRDRHQK 59

Query: 1624 TYEHA-EEETRLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 1448
            YEHA E E R NF+++LKY++EK K+ + + H VGLNKFADLSNEEFKE YLSKVK
Sbjct: 60   VYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFADLSNEEFKELYLSKVK- 118

Query: 1447 SNSNILKMRGHGVKKNTTA----AGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSV 1280
            + +K++T + +CDAP+SLDWR+KGVVT +KDQG CGSCW+FS
Sbjct: 119  --------KPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCGSCWSFST 170

Query: 1279 VGAIESAHALDTGDLIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYT 1100
            GAIE +A+ TGDLI LSEQELV MD AF WVI NGGIDTEA+YPYT
Sbjct: 171  TGAIEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDTEANYPYT 230

Query: 1099 SANGYSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGG 920
            G G C +K+ SID YTDV+ ++ALLCA +QP++VG+ GSA DFQLYTGG
Sbjct: 231  ---GVDGTCNTTKEEIKVVSIDGYTDVDETDSALLCATVQQPISVGMDGSALDFQLYTGG 287

Query: 919  IYDGECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVC 740
            IYDG+CS P DIDHAVL+VGYGS++GEDYWIVKNSWGT WGM+G+ +KRNTD GVC
Sbjct: 288  IYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFYIKRNTDLPYGVC 347

Query: 739  GINVQ-----------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQ 593
            IN + S CGDF+YC + +
Sbjct: 348  AINAEASYPTKESSSPSPTSPPSPPSPLSPPPPPPPTPVPPPPCPQPSDCGDFAYCPSDE 407

Query: 592  TCCCIFEFYDYCLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXX 413
            TCCCI + +DYC++YGCC Y NAVCC S YCCPSDYP+CDV +G C K+ GD +GV
Sbjct: 408  TCCCILKVFDYCIVYGCCQYENAVCCADSVYCCPSDYPICDVEEGLCLKSQGDYLGVPAS 467

Query: 412  XXXXXXXXMPWEKTEETVVEEYLPLRWK*NKFEA 311
            PW K EE + LRWK N F+A
Sbjct: 468  KRHMAKHKFPWTKLEEKTTTDRHALRWKRNPFDA 501

>ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score = 495 bits (1274), Expect = e-137
 Identities = 251/499 (50%), Positives = 311/499 (62%), Gaps = 11/499 (2%)
 Frame = -1

Query: 1774 LFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGKTYEHAEE-ET 1598
            L L++ V T +S LP E S++ + EL + ++E+FQ W++ H K Y+HAEE E
Sbjct: 4    LILLLLVGLTSVSSSLPSEYSIVGNDFSELP-PDESIIEIFQQWRDRHQKAYKHAEEAEK 62

Query: 1597 RLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKGSNSNILKMRG 1418
            R NF+++LKY++EK K ++ + H VGLNKFADLSNEEFK+ YLSKVK I K R
Sbjct: 63   RFGNFKRNLKYIIEKTGK-ETTLRHRVGLNKFADLSNEEFKQLYLSKVK---KPINKTRI 118

Query: 1417 HGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAIESAHALDTGD 1238
            ++ + + SCDAP+SLDWR+KGVVT +KDQG CGSCW+FS GAIE +A+ T D
Sbjct: 119  DA--EDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSD 176

Query: 1237 LIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYTSANGYSGKCKISKQ 1058
            LI LSEQELV MD AF WVI NGGIDTEA+YPYT G G C +K+
Sbjct: 177  LISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDTEANYPYT---GVDGTCNTAKE 233

Query: 1057 NNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDGECSSSPYDID 878
            SID Y DV+ ++ALLCA A+QP++VGI GSA DFQLYTGGIYDG+CS P DID
Sbjct: 234  EIKVVSIDGYKDVDETDSALLCAAAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDID 293

Query: 877  HAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGINVQ--------- 725
            HAVL+VGYGS++GEDYWIVKNSWGT WG++G+ +KRNTD GVC IN
Sbjct: 294  HAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYIKRNTDLPYGVCAINAMASYPTKEAS 353

Query: 724  -XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEFYDYCLIY 548
            S CGDFSYC + +TCCCI +DYCL+Y
Sbjct: 354  AQSPTSPPSPPSPPPPPPPPPTPVPPPPSPQPSDCGDFSYCPSDETCCCILNVFDYCLVY 413

Query: 547  GCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXXMPWEKTE 368
            GCC Y NAVCC S YCCPSDYP+CDV +G C K GD +GV PW K +
Sbjct: 414  GCCAYENAVCCADSVYCCPSDYPICDVEEGLCLKGQGDYLGVAASKRHMAKHKFPWTKLQ 473

Query: 367  ETVVEEYLPLRWK*NKFEA 311
            E ++ L+WK N F A
Sbjct: 474  ERAKTDHRVLQWKRNPFAA 492

>ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score = 492 bits (1266), Expect = e-136
 Identities = 250/499 (50%), Positives = 314/499 (62%), Gaps = 7/499 (1%)
 Frame = -1

Query: 1804 MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGK 1625
            M + + LTL L++ +SGLP E S +++ E L+ + + E+F+ WKE H K
Sbjct: 1    MESQKPQLTLFILLLLAPLPCLSSGLPGEYSAVSNDLHE-GLTEEGITEVFKLWKEKHQK 59

Query: 1624 TYEHAEE-ETRLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 1448
            Y+HAEE E R+ NF+++LKY++EKN KRKS +EH VGLNKFADLSNEEF+E YLSKVK
Sbjct: 60   VYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFADLSNEEFREMYLSKVK- 118

Query: 1447 SNSNILKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAI 1268
            I + R H + +CDAP+SLDWR KGVVT +KDQG CGSCW+FS GAI
Sbjct: 119  KPITIEEKRKH--------RHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFSTTGAI 170

Query: 1267 ESAHALDTGDLIRLSEQELVXXXXXXXXXXXXN-MDTAFRWVIKNGGIDTEADYPYTSAN 1091
            E+ +A+ TGDLI LSEQELV MD+AF+WVI NGGIDTEADYPYT
Sbjct: 171  EAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGIDTEADYPYT--- 227

Query: 1090 GYSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYD 911
            G G C +K+ SI+ Y DV+P ++ALLCA +QP++VG+ GSA DFQLYTGGIYD
Sbjct: 228  GVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQPISVGMDGSALDFQLYTGGIYD 287

Query: 910  GECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGIN 731
            G+CS P DIDHA+L+VGYGS++ EDYWIVKNSWGT WGM+G+ ++RNT K GVC IN
Sbjct: 288  GDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYIRRNTSKPYGVCAIN 347

Query: 730  VQ-----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEFY 566
            S CGD S+C + +TCCCI + +
Sbjct: 348  ADASYPTKVPSPPSPPSPPPPPSPPPPPPSPPPPCPQPSDCGDSSFCPSDETCCCILKLF 407

Query: 565  DYCLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXXM 386
            C+IYGCC Y NAVCC S+YCCPSDYP+CDV DG C + GD +GV
Sbjct: 408  SSCIIYGCCPYENAVCCAESTYCCPSDYPICDVDDGLCLRGQGDHLGVAARRRHMANYKF 467

Query: 385  PWEKTEETVVEEYLPLRWK 329
            PW K EE + L+WK
Sbjct: 468  PWTKFEEKKETKQPVLQWK 486