BLASTX nr result
ID: Atractylodes22_contig00012855
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes22_contig00012855 (1777 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla] 737 0.0 ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis v... 556 e-156 gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa] 536 e-150 ref|XP_002317418.1| predicted protein [Populus trichocarpa] gi|2... 534 e-149 ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|2... 534 e-149 >gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla] Length = 501 Score = 737 bits (1903), Expect = 0.0 Identities = 345/478 (72%), Positives = 392/478 (82%), Gaps = 1/478 (0%) Frame = -2 Query: 1650 PTEFLIIDDQETDILSSEKVHELFGKWKEMHGKTYDHEEEETRRLENFRKSLKYILEKNS 1471 P+EF I++ QE DILSS KV +LFGKWKE+HGKTY HEEEE RLENF+KS+K+++EKNS Sbjct: 27 PSEFSILEGQENDILSSAKVSDLFGKWKELHGKTYQHEEEENLRLENFKKSVKFVMEKNS 86 Query: 1470 KRKSETEHMVGLNKFADLSNDEFKNMYFSKIKGPRRNNLKMRGEIRNMTSNSRSCEAPAS 1291 +RKSE +H VGLNKFADLSN+EFK MY SK+KG R N LKM G RNM+ +SR+C+AP S Sbjct: 87 ERKSELDHTVGLNKFADLSNEEFKEMYMSKVKGSRSNELKMGGVKRNMSVSSRTCDAPTS 146 Query: 1290 LDWRDKGVVTPIKDQGQCGSCWAFSVTGSIEGAHAIATGDLLSLSEQELVDCDTNDYGCD 1111 LDWRDKGVVTP+KDQGQCGSCWAFSV+GSIE A+AIATGDL+ LSEQELVDCDT DYGCD Sbjct: 147 LDWRDKGVVTPMKDQGQCGSCWAFSVSGSIESANAIATGDLIRLSEQELVDCDTYDYGCD 206 Query: 1110 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHEDAL 931 GGNMDTA+RWIIKNGGLDSE DYPYTS+NG KC K+K SVVS+DSYVEVES+EDA+ Sbjct: 207 GGNMDTAYRWIIKNGGLDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLDSYVEVESNEDAV 266 Query: 930 
LCAVTKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIVK 751 LCAV PVTIGI GSAYDFQLYTGG+YNG+CSS Y IDHAVL+VGYGSQDG+DYWIVK Sbjct: 267 LCAVATTPVTIGIVGSAYDFQLYTGGVYNGQCSSKPYDIDHAVLIVGYGSQDGKDYWIVK 326 Query: 750 NSWGTYWGMEGYILMKRNTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXXX 571 NSWGTYWG+EGYILM+RNT IKNGVCGMYLEP+Y Sbjct: 327 NSWGTYWGLEGYILMERNTDIKNGVCGMYLEPVY---PITAAPTPPGPPPPPAPPSPPHP 383 Query: 570 XXXXXXXXXSKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDYP 391 SKCG+F YCAADQTCCCIFEFYNYCLI+GCCGY++AVCC+ S+ACCPSDYP Sbjct: 384 PPPPTPPAPSKCGDFHYCAADQTCCCIFEFYNYCLIYGCCGYSDAVCCKNSAACCPSDYP 443 Query: 390 VCDVKAGYCFKKSGDTVGVAAKKRQLAKHKMPWERIEETV-VEYQPLVWKRNRFAAAA 220 +CDV+AGYC+K S T GV AKKRQLAKHKMPWE+IEET+ E+QPL W RN FAAAA Sbjct: 444 ICDVQAGYCYKNSAKTFGVPAKKRQLAKHKMPWEKIEETIKEEFQPLAWNRNPFAAAA 501 >ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera] Length = 501 Score = 556 bits (1434), Expect = e-156 Identities = 268/485 (55%), Positives = 326/485 (67%), Gaps = 10/485 (2%) Frame = -2 Query: 1650 PTEFLIIDDQETDILSSEKVHELFGKWKEMHGKTYDHEEEETRRLENFRKSLKYILEKNS 1471 PTEF I ++ S E+V ELF WKE H + Y H EE +R E F+++LKY++E+NS Sbjct: 26 PTEFYITGEE---FASEERVRELFHLWKERHKRVYKHAEETAKRFEIFKENLKYVIERNS 82 Query: 1470 KRKSETEHMVGLNKFADLSNDEFKNMYFSKIKGP--RRNNLKMRGEIRNMTSNSRSCEAP 1297 K H +G+NKFAD+SN+EFK Y SKIK P ++NN R + SCEAP Sbjct: 83 KGH---RHTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRS--MQQKKGTASCEAP 137 Query: 1296 ASLDWRDKGVVTPIKDQGQCGSCWAFSVTGSIEGAHAIATGDLLSLSEQELVDCDTNDYG 1117 +SLDWR KGVVT IKDQG CGSCWAFS TG++EG +AI TGDL+SLSEQELVDCDT +YG Sbjct: 138 SSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTTNYG 197 Query: 1116 CDGGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHED 937 C+GG MD AF W+I NGG+DSE+DYPYT T+G C +KE T VVSID Y +V+ + Sbjct: 198 CEGGYMDYAFEWVISNGGIDSESDYPYTGTDG---TCNTTKEDTKVVSIDGYKDVDESDS 254 Query: 936 ALLCAVTKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWI 757 ALLCA QP+++G+DGSA DFQLYT GIY G+CS IDHAVL+VGYGS+D EDYWI Sbjct: 255 
ALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSEDYWI 314 Query: 756 VKNSWGTYWGMEGYILMKRNTGIKNGVCGMYLEPIY--------XXXXXXXXXXXXXXXX 601 KNSWGT WGMEGY +KRNT + G C + Y Sbjct: 315 CKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKESSSPSPYPSPAVPPPPPPPP 374 Query: 600 XXXXXXXXXXXXXXXXXXXSKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEG 421 S+CG+FSYC +D+TCCCI+EFY++CLI+GCC Y NAVCC G Sbjct: 375 SPPPPPPPSPPPPSPGPSPSECGDFSYCPSDETCCCIYEFYDFCLIYGCCEYENAVCCTG 434 Query: 420 SSACCPSDYPVCDVKAGYCFKKSGDTVGVAAKKRQLAKHKMPWERIEETVVEYQPLVWKR 241 + CCPSDYP+CDV+ G C K GD +GVAAKKR++AKHK PW +IEET YQPL WKR Sbjct: 435 TEYCCPSDYPICDVEEGLCLKNQGDYLGVAAKKRKMAKHKFPWTKIEETQKTYQPLEWKR 494 Query: 240 NRFAA 226 NRFAA Sbjct: 495 NRFAA 499 >gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa] Length = 509 Score = 536 bits (1380), Expect = e-150 Identities = 246/483 (50%), Positives = 326/483 (67%), Gaps = 8/483 (1%) Frame = -2 Query: 1650 PTEFLIIDDQETDILSSEKVHELFGKWKEMHGKTYDHEEEETRRLENFRKSLKYILEKNS 1471 P+EF I+ + + ++ E+V ELF KW E HGK Y H +E ++ +NFR +L+Y++EKN Sbjct: 29 PSEFSIVG-RPGESIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNG 87 Query: 1470 KRKSETEHMVGLNKFADLSNDEFKNMYFSKIKGPRRNNLKMRGEIRNMTSNSRS---CEA 1300 +R + H+VGLNKFAD+SN+EF+ +Y SK+K P + + + + +++ C+ Sbjct: 88 ERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDG 147 Query: 1299 PASLDWRDKGVVTPIKDQGQCGSCWAFSVTGSIEGAHAIATGDLLSLSEQELVDCDTNDY 1120 P SLDWR G+VT +KDQG CGSCWAFS TG+IEG +A+A GDL+SLSEQELVDCD+ + Sbjct: 148 PTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDSTND 207 Query: 1119 GCDGGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHE 940 GC+GG MD AF W++ NGG+D+E DYPYT +G C +KE+T VSID Y +V E Sbjct: 208 GCEGGYMDYAFEWVMSNGGIDTETDYPYTGEDG---TCNTTKEETKAVSIDGYEDVAEEE 264 Query: 939 DALLCAVTKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYW 760 AL CAV KQP+++GIDG A DFQLYTGGIY+G+CS IDHAVLVVGYG++ GE+YW Sbjct: 265 SALFCAVLKQPISVGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYW 324 Query: 759 
IVKNSWGTYWGMEGYILMKRNTGIKNGVCGMYLEPIY-----XXXXXXXXXXXXXXXXXX 595 I+KNSWGT WGM+GY +KRNT GVC + Y Sbjct: 325 IIKNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASYPTKESSAPSPYPSPAVPPPPPPP 384 Query: 594 XXXXXXXXXXXXXXXXXSKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSS 415 ++CG+FSYCAA +TCCCIFEF++YCLI+GCC YT+AVCC G+ Sbjct: 385 PPPPSPPPPPPPPSPSPTQCGDFSYCAATETCCCIFEFFDYCLIYGCCDYTDAVCCTGTE 444 Query: 414 ACCPSDYPVCDVKAGYCFKKSGDTVGVAAKKRQLAKHKMPWERIEETVVEYQPLVWKRNR 235 CCP DYP+CD++ G C + GD +GV AKKR++AKHK PW + E++ +QPL WKRNR Sbjct: 445 YCCPHDYPICDIEEGLCLQNDGDFLGVTAKKRKMAKHKYPWTKPEDSAKNHQPLEWKRNR 504 Query: 234 FAA 226 FAA Sbjct: 505 FAA 507 >ref|XP_002317418.1| predicted protein [Populus trichocarpa] gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa] Length = 503 Score = 534 bits (1375), Expect = e-149 Identities = 249/480 (51%), Positives = 323/480 (67%), Gaps = 5/480 (1%) Frame = -2 Query: 1650 PTEFLIIDDQETDILSSEKVHELFGKWKEMHGKTYDHEEEETRRLENFRKSLKYILEKNS 1471 P E I+ + ++++S E + E+F +W++ H K Y+H E +R NF+++LKYI+EK Sbjct: 27 PGEHPIVVNDFSELVSEESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86 Query: 1470 KRKSETEHMVGLNKFADLSNDEFKNMYFSKIKGPRRNNLKMRGEIRNMTSNSRSCEAPAS 1291 K+ + H VGLNKFADLSN+EFK +Y SK+K P N+K N ++C+AP+S Sbjct: 87 KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPI--NIKRSTARDWRQRNLQTCDAPSS 144 Query: 1290 LDWRDKGVVTPIKDQGQCGSCWAFSVTGSIEGAHAIATGDLLSLSEQELVDCDTNDYGCD 1111 LDWR KGVVT +KDQG CGSCW+FS TG+IEG +AI TGDL+SLSEQELVDCDT +YGC+ Sbjct: 145 LDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTTNYGCE 204 Query: 1110 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHEDAL 931 GG MD AF W+I NGG+D+EA+YPYT +G C +KE+ VVSID Y +V+ + AL Sbjct: 205 GGYMDYAFEWVINNGGIDTEANYPYTGVDG---TCNTTKEEIKVVSIDGYTDVDETDSAL 261 Query: 930 LCAVTKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIVK 751 LCA +QP+++G+DGSA DFQLYTGGIY+G+CS IDHAVL+VGYGS++GEDYWIVK Sbjct: 262 LCATVQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVK 321 Query: 750 NSWGTYWGMEGYILMKRNTGIKNGVCGMYLEPIY----XXXXXXXXXXXXXXXXXXXXXX 583 
NSWGT WGMEGY +KRNT + GVC + E Y Sbjct: 322 NSWGTEWGMEGYFYIKRNTDLPYGVCAINAEASYPTKESSSPSPTSPPSPPSPLSPPPPP 381 Query: 582 XXXXXXXXXXXXXSKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCP 403 S CG+F+YC +D+TCCCI + ++YC+++GCC Y NAVCC S CCP Sbjct: 382 PPTPVPPPPCPQPSDCGDFAYCPSDETCCCILKVFDYCIVYGCCQYENAVCCADSVYCCP 441 Query: 402 SDYPVCDVKAGYCFKKSGDTVGVAAKKRQLAKHKMPWERIEE-TVVEYQPLVWKRNRFAA 226 SDYP+CDV+ G C K GD +GV A KR +AKHK PW ++EE T + L WKRN F A Sbjct: 442 SDYPICDVEEGLCLKSQGDYLGVPASKRHMAKHKFPWTKLEEKTTTDRHALRWKRNPFDA 501 >ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa] Length = 494 Score = 534 bits (1375), Expect = e-149 Identities = 252/479 (52%), Positives = 326/479 (68%), Gaps = 4/479 (0%) Frame = -2 Query: 1650 PTEFLIIDDQETDILSSEKVHELFGKWKEMHGKTYDHEEEETRRLENFRKSLKYILEKNS 1471 P+E+ I+ + +++ E + E+F +W++ H K Y H EE +R NF+++LKYI+EK Sbjct: 20 PSEYSIVGNDFSELPPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTG 79 Query: 1470 KRKSETEHMVGLNKFADLSNDEFKNMYFSKIKGPRRNNLKMRGEIRNMTSNSRSCEAPAS 1291 K ++ H VGLNKFADLSN+EFK +Y SK+K P N ++ E R+ N +SC+AP+S Sbjct: 80 K-ETTLRHRVGLNKFADLSNEEFKQLYLSKVKKPI-NKTRIDAEDRSRR-NLQSCDAPSS 136 Query: 1290 LDWRDKGVVTPIKDQGQCGSCWAFSVTGSIEGAHAIATGDLLSLSEQELVDCDTNDYGCD 1111 LDWR KGVVT +KDQG CGSCW+FS TG+IEG +AI T DL+SLSEQELVDCDT +YGC+ Sbjct: 137 LDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCE 196 Query: 1110 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHEDAL 931 GG MD AF W+I NGG+D+EA+YPYT +G C +KE+ VVSID Y +V+ + AL Sbjct: 197 GGYMDYAFEWVINNGGIDTEANYPYTGVDG---TCNTAKEEIKVVSIDGYKDVDETDSAL 253 Query: 930 LCAVTKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIVK 751 LCA +QP+++GIDGSA DFQLYTGGIY+G+CS IDHAVL+VGYGS++GEDYWIVK Sbjct: 254 LCAAAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVK 313 Query: 750 NSWGTYWGMEGYILMKRNTGIKNGVC---GMYLEPIYXXXXXXXXXXXXXXXXXXXXXXX 580 NSWGT WG+EGY +KRNT + GVC M P Sbjct: 314 NSWGTSWGIEGYFYIKRNTDLPYGVCAINAMASYPTKEASAQSPTSPPSPPSPPPPPPPP 
373 Query: 579 XXXXXXXXXXXXSKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPS 400 S CG+FSYC +D+TCCCI ++YCL++GCC Y NAVCC S CCPS Sbjct: 374 PTPVPPPPSPQPSDCGDFSYCPSDETCCCILNVFDYCLVYGCCAYENAVCCADSVYCCPS 433 Query: 399 DYPVCDVKAGYCFKKSGDTVGVAAKKRQLAKHKMPWERIEETV-VEYQPLVWKRNRFAA 226 DYP+CDV+ G C K GD +GVAA KR +AKHK PW +++E +++ L WKRN FAA Sbjct: 434 DYPICDVEEGLCLKGQGDYLGVAASKRHMAKHKFPWTKLQERAKTDHRVLQWKRNPFAA 492
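The per-hit statistics in the report above (Score, Expect, Identities, Positives, Gaps) follow a fixed textual pattern, so they can be pulled out with a regular expression. The following is a minimal sketch, assuming the legacy plain-text BLAST layout shown here; the names `STATS_RE`, `parse_evalue`, and `parse_hit_stats` are hypothetical helpers introduced for illustration, not part of BLAST or any library. It recomputes percent identity from the raw counts instead of trusting the rounded figure in parentheses.

```python
import re

# Pattern for the statistics block that legacy BLAST prints for each alignment.
# Whitespace is matched loosely so it works on both the original multi-line
# layout and a flattened single-line copy.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
)


def parse_evalue(text):
    """Convert a legacy BLAST E-value string to float.

    Legacy BLAST abbreviates 1e-156 as "e-156", which float() rejects,
    so a leading "1" is added in that case.
    """
    text = text.rstrip(",")
    return float("1" + text if text.startswith("e") else text)


def parse_hit_stats(report):
    """Return one dict per alignment statistics block found in the report text."""
    hits = []
    for m in STATS_RE.finditer(report):
        length = int(m["length"])
        ident = int(m["ident"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identities": ident,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"]) if m["gaps"] is not None else 0,
            "align_length": length,
            # Percent identity recomputed from the counts, one decimal place.
            "identity_pct": round(100.0 * ident / length, 1),
        })
    return hits


if __name__ == "__main__":
    sample = (
        "Score = 737 bits (1903), Expect = 0.0 Identities = 345/478 (72%), "
        "Positives = 392/478 (82%), Gaps = 1/478 (0%) Frame = -2"
    )
    for hit in parse_hit_stats(sample):
        print(hit)
```

For anything beyond quick inspection, a maintained parser or BLAST's tabular or XML output is likely a more robust choice than a regular expression over the plain-text report.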