BLASTX nr result

ID: Atractylodes21_contig00017842 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00017842
         (1112 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]       445   e-123
ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|1...   306   5e-81
ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [C...   237   3e-60
dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]           234   4e-59
gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi...   232   1e-58

>gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score =  445 bits (1145), Expect = e-123
 Identities = 203/289 (70%), Positives = 229/289 (79%)
 Frame = +3

Query: 3    GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHEDAL 182
            GGNMDTA+RWIIKNGGLDSE DYPYTS+NG   KC K+K   SVVS+DSYVEVES+EDA+
Sbjct: 207  GGNMDTAYRWIIKNGGLDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLDSYVEVESNEDAV 266

Query: 183  LCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIVK 362
            LCAVA  PVTIGI GSAYDFQLYTGG+YNG+CSS  Y IDHAVL+VGYGSQDG+DYWIVK
Sbjct: 267  LCAVATTPVTIGIVGSAYDFQLYTGGVYNGQCSSKPYDIDHAVLIVGYGSQDGKDYWIVK 326

Query: 363  NSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXXX 542
            NSWGTYWG+EGYILM+R T IKNGVCGMYLEP+Y                          
Sbjct: 327  NSWGTYWGLEGYILMERNTDIKNGVCGMYLEPVY---PITAAPTPPGPPPPPAPPSPPHP 383

Query: 543  XXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDYP 722
                      KCG+F YCAADQTCCCIFEFYNYCLI+GCCGY++AVCC+ S+ACCPSDYP
Sbjct: 384  PPPPTPPAPSKCGDFHYCAADQTCCCIFEFYNYCLIYGCCGYSDAVCCKNSAACCPSDYP 443

Query: 723  VCDVKAGYCFKKSSDTVGVAAKKRQLAKHKMPWERIEETVVEEYQPLVW 869
            +CDV+AGYC+K S+ T GV AKKRQLAKHKMPWE+IEET+ EE+QPL W
Sbjct: 444  ICDVQAGYCYKNSAKTFGVPAKKRQLAKHKMPWEKIEETIKEEFQPLAW 492


>ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|118488173|gb|ABK95906.1|
            unknown [Populus trichocarpa] gi|222860482|gb|EEE98029.1|
            predicted protein [Populus trichocarpa]
          Length = 498

 Score =  306 bits (785), Expect = 5e-81
 Identities = 138/290 (47%), Positives = 185/290 (63%)
 Frame = +3

Query: 3    GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESHEDAL 182
            GG+MD+AF+W+I NGG+D+EADYPYT  +G    C  +KE+  VVSI+ YV+V+  + AL
Sbjct: 202  GGDMDSAFQWVIGNGGIDTEADYPYTGVDG---TCNTAKEEKKVVSIEGYVDVDPSDSAL 258

Query: 183  LCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIVK 362
            LCA  +QP+++G+DGSA DFQLYTGGIY+G+CS     IDHA+L+VGYGS++ EDYWIVK
Sbjct: 259  LCATVQQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVK 318

Query: 363  NSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXXX 542
            NSWGT WGMEGY  ++R T    GVC +  +  Y                          
Sbjct: 319  NSWGTEWGMEGYFYIRRNTSKPYGVCAINADASY--PTKVPSPPSPPSPPPPPSPPPPPP 376

Query: 543  XXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDYP 722
                       CG+ S+C +D+TCCCI + ++ C+I+GCC Y NAVCC  S+ CCPSDYP
Sbjct: 377  SPPPPCPQPSDCGDSSFCPSDETCCCILKLFSSCIIYGCCPYENAVCCAESTYCCPSDYP 436

Query: 723  VCDVKAGYCFKKSSDTVGVAAKKRQLAKHKMPWERIEETVVEEYQPLVWK 872
            +CDV  G C +   D +GVAA++R +A +K PW + EE    +   L WK
Sbjct: 437  ICDVDDGLCLRGQGDHLGVAARRRHMANYKFPWTKFEEKKETKQPVLQWK 486


>ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score =  237 bits (605), Expect = 3e-60
 Identities = 122/281 (43%), Positives = 158/281 (56%), Gaps = 1/281 (0%)
 Frame = +3

Query: 3   GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVE-SHEDA 179
           GG MD AF +II+NGGLD+E DYPY    G+ S CI+ K+   VV+IDSY +V  ++E A
Sbjct: 193 GGLMDYAFEFIIENGGLDTEEDYPYY---GFDSSCIQYKKNAKVVAIDSYEDVPVNNEKA 249

Query: 180 LLCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIV 359
           L  AV+KQ V++ I+G    FQLY  GI+ G C +    +DH V VVGYGS+ G DYWIV
Sbjct: 250 LQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTD---LDHGVNVVGYGSEGGVDYWIV 306

Query: 360 KNSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXX 539
           +NSWG  WG  GY+ M+R      G+CG+ +EP Y                         
Sbjct: 307 RNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPTKTGPNPPNPGPTPPSPVKPPSV- 365

Query: 540 XXXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDY 719
                       C E+  C A +TCCCIF+F N CL  GCC   +A CC+   +CCP DY
Sbjct: 366 ------------CDEYYTCPAAETCCCIFQFSNLCLEWGCCPLESATCCDDHYSCCPHDY 413

Query: 720 PVCDVKAGYCFKKSSDTVGVAAKKRQLAKHKMPWERIEETV 842
           PVC+V+AG C K  +D  GV A +R  A  +  W R + TV
Sbjct: 414 PVCNVRAGTCSKSKNDIFGVKAMRRTAAAARPSWARRDVTV 454


>dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  234 bits (596), Expect = 4e-59
 Identities = 116/266 (43%), Positives = 154/266 (57%), Gaps = 2/266 (0%)
 Frame = +3

Query: 3   GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTS-VVSIDSYVEVE-SHED 176
           GG+M  AF++IIKNGG+DSE DYPYT  +G   KC   ++  + V SID Y EV  ++E 
Sbjct: 203 GGDMGYAFQFIIKNGGIDSEEDYPYTGKDG---KCDSYRQNNAKVASIDGYEEVPVNNEK 259

Query: 177 ALLCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWI 356
           +L  AVA QPV++ I+   YDFQLY+ GI+ G C +    +DH V  VGYG+++G DYWI
Sbjct: 260 SLQKAVANQPVSVAIEAGGYDFQLYSSGIFTGSCGTD---LDHGVAAVGYGTENGVDYWI 316

Query: 357 VKNSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXX 536
           VKNSWG YWG +GY+ M+R    K G+CG+ +E  Y                        
Sbjct: 317 VKNSWGDYWGEKGYVRMQRNVKAKTGLCGIAMEASYPTKKGGDNPPPSPPSPPSPTPTPP 376

Query: 537 XXXXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSD 716
                        C +F+ C A  TCCC+F F NYC   GCC   +AVCC+   +CCP D
Sbjct: 377 SPSPSV-------CDKFNACPASTTCCCVFPFGNYCFAWGCCPLDSAVCCDDHYSCCPHD 429

Query: 717 YPVCDVKAGYCFKKSSDTVGVAAKKR 794
           YPVC V++G C KK ++ +GV A  R
Sbjct: 430 YPVCHVRSGTCTKKKNNPLGVKAMTR 455


>gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  232 bits (592), Expect = 1e-58
 Identities = 117/260 (45%), Positives = 153/260 (58%), Gaps = 1/260 (0%)
 Frame = +3

Query: 3   GGNMDTAFRWIIKNGGLDSEADYPYTSTNGYGSKCIKSKEKTSVVSIDSYVEVESH-EDA 179
           GG MD A+++++KNGG+D+EADYPY  T+G    C K+K K  VV+ID Y +V ++ ED 
Sbjct: 190 GGLMDYAYKFVVKNGGIDTEADYPYRETDG---TCNKNKLKRRVVTIDGYKDVPANNEDM 246

Query: 180 LLCAVAKQPVTIGIDGSAYDFQLYTGGIYNGECSSSAYSIDHAVLVVGYGSQDGEDYWIV 359
           LL AVA+QPV++GI GSA  FQLY+ GI++G C +S   +DHA+L+VGYGS+ G+DYWIV
Sbjct: 247 LLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTS---LDHAILIVGYGSEGGKDYWIV 303

Query: 360 KNSWGTYWGMEGYILMKRKTGIKNGVCGMYLEPIYXXXXXXXXXXXXXXXXXXXXXXXXX 539
           KNSWG  WGM+GY+ M R TG  NGVCG+   P +                         
Sbjct: 304 KNSWGESWGMKGYMYMHRNTGNSNGVCGINQMPSF-------------------PTKSSP 344

Query: 540 XXXXXXXXXXXKCGEFSYCAADQTCCCIFEFYNYCLIHGCCGYTNAVCCEGSSACCPSDY 719
                      KC   +YC    TCCC +     CL   CC   NAVCC+ +  CCP DY
Sbjct: 345 NPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKDNRYCCPHDY 404

Query: 720 PVCDVKAGYCFKKSSDTVGV 779
           PVCD  +  CFK ++    V
Sbjct: 405 PVCDTASQRCFKANNGNFSV 424

