BLASTX nr result
ID: Atractylodes21_contig00017842
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes21_contig00017842 (1112 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla] 445 e-123 ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|1... 306 5e-81 ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [C... 237 3e-60 dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil] 234 4e-59 gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi... 232 1e-58 >gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla] Length = 501 Score = 445 bits (1145), Expect = e-123 Identities = 203/289 (70%), Positives = 229/289 (79%) Frame = +3 Query: 3 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHEDAL 182 GGNMDTA+RWIIKNGGLDSE DYPYTS+NG KC K+K SVVS+DSYVEVES+EDA+ Sbjct: 207 GGNMDTAYRWIIKNGGLDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLDSYVEVESNEDAV 266 Query: 183 LCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIVK 362 LCAVA PVTIGI GSAYDFQLYTGG+YNG+CSS Y IDHAVL+VGYGSQDG+DYWIVK Sbjct: 267 LCAVATTPVTIGIVGSAYDFQLYTGGVYNGQCSSKPYDIDHAVLIVGYGSQDGKDYWIVK 326 Query: 363 NSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXXX 542 NSWGTYWG+EGYILM+R T IKNGVCGMYLEP+Y Sbjct: 327 NSWGTYWGLEGYILMERNTDIKNGVCGMYLEPVY---PITAAPTPPGPPPPPAPPSPPHP 383 Query: 543 XXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDYP 722 KCG+F YCAADQTCCCIFEFYNYCLI+GCCGY++AVCC+ S+ACCPSDYP Sbjct: 384 PPPPTPPAPSKCGDFHYCAADQTCCCIFEFYNYCLIYGCCGYSDAVCCKNSAACCPSDYP 443 Query: 723 VCDVKAGYCFKKSSDTVGVAAKKRQLAKHKMPWERIEETVVEEYQPLVW 869 +CDV+AGYC+K S+ T GV AKKRQLAKHKMPWE+IEET+ EE+QPL W Sbjct: 444 ICDVQAGYCYKNSAKTFGVPAKKRQLAKHKMPWEKIEETIKEEFQPLAW 492 >ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa] gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa] Length = 498 Score = 306 bits (785), Expect = 5e-81 Identities = 138/290 (47%), Positives = 185/290 (63%) Frame = +3 Query: 3 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHEDAL 182 GG+MD+AF+W+I NGG+D+EADYPYT +G C +KE+ VVSI+ YV+V+ + AL Sbjct: 202 GGDMDSAFQWVIGNGGIDTEADYPYTGVDG---TCNTAKEEKKVVSIEGYVDVDPSDSAL 258 Query: 183 LCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIVK 362 LCA +QP+++G+DGSA DFQLYTGGIY+G+CS IDHA+L+VGYGS++ EDYWIVK Sbjct: 259 LCATVQQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVK 318 Query: 363 NSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXXX 542 NSWGT WGMEGY ++R T GVC + + Y Sbjct: 319 NSWGTEWGMEGYFYIRRNTSKPYGVCAINADASY--PTKVPSPPSPPSPPPPPSPPPPPP 376 Query: 543 XXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDYP 722 CG+ S+C +D+TCCCI + ++ C+I+GCC Y NAVCC S+ CCPSDYP Sbjct: 377 SPPPPCPQPSDCGDSSFCPSDETCCCILKLFSSCIIYGCCPYENAVCCAESTYCCPSDYP 436 Query: 723 VCDVKAGYCFKKSSDTVGVAAKKRQLAKHKMPWERIEETVVEEYQPLVWK 872 +CDV G C + D +GVAA++R +A +K PW + EE + L WK Sbjct: 437 ICDVDDGLCLRGQGDHLGVAARRRHMANYKFPWTKFEEKKETKQPVLQWK 486 >ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 459 Score = 237 bits (605), Expect = 3e-60 Identities = 122/281 (43%), Positives = 158/281 (56%), Gaps = 1/281 (0%) Frame = +3 Query: 3 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVE-SHEDA 179 GG MD AF +II+NGGLD+E DYPY G+ S CI+ K+ VV+IDSY +V ++E A Sbjct: 193 GGLMDYAFEFIIENGGLDTEEDYPYY---GFDSSCIQYKKNAKVVAIDSYEDVPVNNEKA 249 Query: 180 LLCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIV 359 L AV+KQ V++ I+G FQLY GI+ G C + +DH V VVGYGS+ G DYWIV Sbjct: 250 LQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTD---LDHGVNVVGYGSEGGVDYWIV 306 Query: 360 KNSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXX 539 +NSWG WG GY+ M+R G+CG+ +EP Y Sbjct: 307 RNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTKTGPNPPNPGPTPPSPVKPPSV- 365 Query: 540 XXXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDY 719 C E+ C A +TCCCIF+F N CL GCC +A CC+ +CCP DY Sbjct: 366 ------------CDEYYTCPAAETCCCIFQFSNLCLEWGCCPLESATCCDDHYSCCPHDY 413 Query: 720 PVCDVKAGYCFKKSSDTVGVAAKKRQLAKHKMPWERIEETV 842 PVC+V+AG C K +D GV A +R A + W R + TV Sbjct: 414 PVCNVRAGTCSKSKNDIFGVKAMRRTAAAARPSWARRDVTV 454 >dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil] Length = 474 Score = 234 bits (596), Expect = 4e-59 Identities = 116/266 (43%), Positives = 154/266 (57%), Gaps = 2/266 (0%) Frame = +3 Query: 3 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTS-VVSIDSYVEVE-SHED 176 GG+M AF++IIKNGG+DSE DYPYT +G KC ++ + V SID Y EV ++E Sbjct: 203 GGDMGYAFQFIIKNGGIDSEEDYPYTGKDG---KCDSYRQNNAKVASIDGYEEVPVNNEK 259 Query: 177 ALLCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWI 356 +L AVA QPV++ I+ YDFQLY+ GI+ G C + +DH V VGYG+++G DYWI Sbjct: 260 SLQKAVANQPVSVAIEAGGYDFQLYSSGIFTGSCGTD---LDHGVAAVGYGTENGVDYWI 316 Query: 357 VKNSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXX 536 VKNSWG YWG +GY+ M+R K G+CG+ +E Y Sbjct: 317 VKNSWGDYWGEKGYVRMQRNVKAKTGLCGIAMEASYPTKKGGDNPPPSPPSPPSPTPTPP 376 Query: 537 XXXXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSD 716 C +F+ C A TCCC+F F NYC GCC +AVCC+ +CCP D Sbjct: 377 SPSPSV-------CDKFNACPASTTCCCVFPFGNYCFAWGCCPLDSAVCCDDHYSCCPHD 429 Query: 717 YPVCDVKAGYCFKKSSDTVGVAAKKR 794 YPVC V++G C KK ++ +GV A R Sbjct: 430 YPVCHVRSGTCTKKKNNPLGVKAMTR 455 >gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group] Length = 449 Score = 232 bits (592), Expect = 1e-58 Identities = 117/260 (45%), Positives = 153/260 (58%), Gaps = 1/260 (0%) Frame = +3 Query: 3 GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESH-EDA 179 GG MD A+++++KNGG+D+EADYPY T+G C K+K K VV+ID Y +V ++ ED Sbjct: 190 GGLMDYAYKFVVKNGGIDTEADYPYRETDG---TCNKNKLKRRVVTIDGYKDVPANNEDM 246 Query: 180 LLCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIV 359 LL AVA+QPV++GI GSA FQLY+ GI++G C +S +DHA+L+VGYGS+ G+DYWIV Sbjct: 247 LLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTS---LDHAILIVGYGSEGGKDYWIV 303 Query: 360 KNSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXX 539 KNSWG WGM+GY+ M R TG NGVCG+ P + Sbjct: 304 KNSWGESWGMKGYMYMHRNTGNSNGVCGINQMPSF-------------------PTKSSP 344 Query: 540 XXXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDY 719 KC +YC TCCC + CL CC NAVCC+ + CCP DY Sbjct: 345 NPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKDNRYCCPHDY 404 Query: 720 PVCDVKAGYCFKKSSDTVGV 779 PVCD + CFK ++ V Sbjct: 405 PVCDTASQRCFKANNGNFSV 424