BLASTX nr result

ID: Atractylodes22_contig00006990 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00006990
         (1243 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]       444   e-138
gb|AAO18731.1| cysteine protease [Gossypium hirsutum]                 367   e-110
ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|1...   358   e-107
gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]            356   e-107
ref|XP_002317418.1| predicted protein [Populus trichocarpa] gi|2...   350   e-106

>gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score =  444 bits (1143), Expect(2) = e-138
 Identities = 218/333 (65%), Positives = 266/333 (79%), Gaps = 1/333 (0%)
 Frame = +2

Query: 29   MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQKWKEVHGK 208
            M TS +++T+L  + +VS +I+T  LP E S+L   E ++ LS  KV +LF KWKE+HGK
Sbjct: 1    MATSNSMITILIFLTYVSYSISTKTLPSEFSILEGQENDI-LSSAKVSDLFGKWKELHGK 59

Query: 209  TYEHAEEET-RLVNFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 385
            TY+H EEE  RL NF+KS+K+V+EKNS+RKSE++H VGLNKFADLSNEEFKE Y+SKVKG
Sbjct: 60   TYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMYMSKVKG 119

Query: 386  SNSNILKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAI 565
            S SN LKM G  VK+N + +   +CDAP SLDWR+KGVVTP+KDQGQCGSCWAFSV G+I
Sbjct: 120  SRSNELKMGG--VKRNMSVSS-RTCDAPTSLDWRDKGVVTPMKDQGQCGSCWAFSVSGSI 176

Query: 566  ESAHALDTGDLIRLSEQELVXXXXXXXXXXXXXMDTAFRWVIKNGGIDTEADYPYTSANG 745
            ESA+A+ TGDLIRLSEQELV             MDTA+RW+IKNGG+D+E DYPYTS+NG
Sbjct: 177  ESANAIATGDLIRLSEQELVDCDTYDYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSNG 236

Query: 746  YSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDG 925
              GKC  +K      S+DSY +VE +E+A+LCAVA  PVT+GIVGSAYDFQLYTGG+Y+G
Sbjct: 237  RDGKCDKTKSAKSVVSLDSYVEVESNEDAVLCAVATTPVTIGIVGSAYDFQLYTGGVYNG 296

Query: 926  ECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSW 1024
            +CSS PYDIDHAVL+VGYGSQDG+DYWIVKNSW
Sbjct: 297  QCSSKPYDIDHAVLIVGYGSQDGKDYWIVKNSW 329



 Score = 74.3 bits (181), Expect(2) = e-138
 Identities = 30/43 (69%), Positives = 39/43 (90%)
 Frame = +3

Query: 1113 IVKNSWGTYWGMDGWILMKRNTDKKNGVCGINVQPSYPHSSAP 1241
            IVKNSWGTYWG++G+ILM+RNTD KNGVCG+ ++P YP ++AP
Sbjct: 324  IVKNSWGTYWGLEGYILMERNTDIKNGVCGMYLEPVYPITAAP 366


>gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  367 bits (943), Expect(2) = e-110
 Identities = 194/334 (58%), Positives = 235/334 (70%), Gaps = 2/334 (0%)
 Frame = +2

Query: 29   MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQKWKEVHGK 208
            MG    +L  LFLI+  S T  +S LP E S++ +HE +  LS ++VLE+FQ+WKE H K
Sbjct: 1    MGFQRNILGFLFLIL-ASLTSLSSSLPSEYSIV-EHEIDAFLSEERVLEIFQQWKEKHRK 58

Query: 209  TYEHAEE-ETRLVNFRKSLKYVLEKNSKRKS-EMEHMVGLNKFADLSNEEFKETYLSKVK 382
             Y HAEE E R  NF+ +LKY+LE+N+KRK+ + EH VGLNKFAD+SNEEF++ YLSKVK
Sbjct: 59   VYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKFADMSNEEFRKAYLSKVK 118

Query: 383  GSNSNILKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGA 562
               +  + +  +  +K      V SCDAP+SLDWR  GVVT +KDQG CGSCWAFS  GA
Sbjct: 119  KPINKGITLSRNMRRK------VQSCDAPSSLDWRNYGVVTAVKDQGSCGSCWAFSSTGA 172

Query: 563  IESAHALDTGDLIRLSEQELVXXXXXXXXXXXXXMDTAFRWVIKNGGIDTEADYPYTSAN 742
            +E  +AL TGDLI LSEQELV             MD AF WVI NGGID+E+DYPYT   
Sbjct: 173  MEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINNGGIDSESDYPYT--- 229

Query: 743  GYSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYD 922
            G  G C  +K+     SID Y DVE  ++ALLCAVA+QPV+VGI GSA DFQLYTGGIYD
Sbjct: 230  GVDGTCNTTKEETKVVSIDGYQDVEQSDSALLCAVAQQPVSVGIDGSAIDFQLYTGGIYD 289

Query: 923  GECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSW 1024
            G CS  P DIDHAVL+VGYGS+D E+YWIVKNSW
Sbjct: 290  GSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSW 323



 Score = 57.8 bits (138), Expect(2) = e-110
 Identities = 23/38 (60%), Positives = 28/38 (73%)
 Frame = +3

Query: 1113 IVKNSWGTYWGMDGWILMKRNTDKKNGVCGINVQPSYP 1226
            IVKNSWGT WG+DG+  +KR+TD   GVC +N   SYP
Sbjct: 318  IVKNSWGTSWGIDGYFYLKRDTDLPYGVCAVNAMASYP 355


>ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|118488173|gb|ABK95906.1|
            unknown [Populus trichocarpa] gi|222860482|gb|EEE98029.1|
            predicted protein [Populus trichocarpa]
          Length = 498

 Score =  358 bits (918), Expect(2) = e-107
 Identities = 185/334 (55%), Positives = 233/334 (69%), Gaps = 2/334 (0%)
 Frame = +2

Query: 29   MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQKWKEVHGK 208
            M + +  LTL  L++       +SGLP E S +++   E  L+ + + E+F+ WKE H K
Sbjct: 1    MESQKPQLTLFILLLLAPLPCLSSGLPGEYSAVSNDLHE-GLTEEGITEVFKLWKEKHQK 59

Query: 209  TYEHAEE-ETRLVNFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 385
             Y+HAEE E R+ NF+++LKY++EKN KRKS +EH VGLNKFADLSNEEF+E YLSKVK 
Sbjct: 60   VYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFADLSNEEFREMYLSKVK- 118

Query: 386  SNSNILKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAI 565
                I + R H          + +CDAP+SLDWR KGVVT +KDQG CGSCW+FS  GAI
Sbjct: 119  KPITIEEKRKH--------RHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFSTTGAI 170

Query: 566  ESAHALDTGDLIRLSEQELVXXXXXXXXXXXXX-MDTAFRWVIKNGGIDTEADYPYTSAN 742
            E+ +A+ TGDLI LSEQELV              MD+AF+WVI NGGIDTEADYPYT   
Sbjct: 171  EAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGIDTEADYPYT--- 227

Query: 743  GYSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYD 922
            G  G C  +K+     SI+ Y DV+P ++ALLCA  +QP++VG+ GSA DFQLYTGGIYD
Sbjct: 228  GVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQPISVGMDGSALDFQLYTGGIYD 287

Query: 923  GECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSW 1024
            G+CS  P DIDHA+L+VGYGS++ EDYWIVKNSW
Sbjct: 288  GDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSW 321



 Score = 58.9 bits (141), Expect(2) = e-107
 Identities = 24/38 (63%), Positives = 28/38 (73%)
 Frame = +3

Query: 1113 IVKNSWGTYWGMDGWILMKRNTDKKNGVCGINVQPSYP 1226
            IVKNSWGT WGM+G+  ++RNT K  GVC IN   SYP
Sbjct: 316  IVKNSWGTEWGMEGYFYIRRNTSKPYGVCAINADASYP 353


>gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  356 bits (913), Expect(2) = e-107
 Identities = 178/328 (54%), Positives = 223/328 (67%), Gaps = 2/328 (0%)
 Frame = +2

Query: 47   LLTLLFLIIHVSST-IATSGLPVEISMLNDHEPELALSGQKVLELFQKWKEVHGKTYEHA 223
            L TL+  ++  S T + +S LP E S++    P  +++ ++V+ELF+KW E HGK Y+H 
Sbjct: 8    LFTLVIFLVWASLTSLISSSLPSEFSIVG--RPGESIAEERVVELFKKWTEKHGKVYKHG 65

Query: 224  EE-ETRLVNFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKGSNSNI 400
            +E E +  NFR +L+YV+EKN +R +   H+VGLNKFAD+SNEEF+E Y+SKVK   S  
Sbjct: 66   QEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKR 125

Query: 401  LKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAIESAHA 580
            + +      K   A  V +CD P SLDWR+ G+VT +KDQG CGSCWAFS  GAIE  +A
Sbjct: 126  MAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINA 185

Query: 581  LDTGDLIRLSEQELVXXXXXXXXXXXXXMDTAFRWVIKNGGIDTEADYPYTSANGYSGKC 760
            L  GDLI LSEQELV             MD AF WV+ NGGIDTE DYPYT   G  G C
Sbjct: 186  LANGDLISLSEQELVDCDSTNDGCEGGYMDYAFEWVMSNGGIDTETDYPYT---GEDGTC 242

Query: 761  KISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDGECSSS 940
              +K+   A SID Y DV  +E+AL CAV KQP++VGI G A DFQLYTGGIYDG+CS  
Sbjct: 243  NTTKEETKAVSIDGYEDVAEEESALFCAVLKQPISVGIDGGAIDFQLYTGGIYDGDCSDD 302

Query: 941  PYDIDHAVLVVGYGSQDGEDYWIVKNSW 1024
            P DIDHAVLVVGYG++ GE+YWI+KNSW
Sbjct: 303  PDDIDHAVLVVGYGAESGEEYWIIKNSW 330



 Score = 60.1 bits (144), Expect(2) = e-107
 Identities = 28/45 (62%), Positives = 31/45 (68%), Gaps = 2/45 (4%)
 Frame = +3

Query: 1113 IVKNSWGTYWGMDGWILMKRNTDKKNGVCGINVQPSYP--HSSAP 1241
            I+KNSWGT WGM G+  +KRNT K  GVC IN   SYP   SSAP
Sbjct: 325  IIKNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASYPTKESSAP 369


>ref|XP_002317418.1| predicted protein [Populus trichocarpa] gi|222860483|gb|EEE98030.1|
            predicted protein [Populus trichocarpa]
          Length = 503

 Score =  350 bits (899), Expect(2) = e-106
 Identities = 178/337 (52%), Positives = 232/337 (68%), Gaps = 5/337 (1%)
 Frame = +2

Query: 29   MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQKWKEVHGK 208
            M + ++ ++L+  ++    T  +S LP E  ++ +   EL +S + ++E+FQ+W++ H K
Sbjct: 1    MDSKKSQMSLIIFLLLALLTCLSSSLPGEHPIVVNDFSEL-VSEESIIEIFQQWRDRHQK 59

Query: 209  TYEHA-EEETRLVNFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 385
             YEHA E E R  NF+++LKY++EK  K+ + + H VGLNKFADLSNEEFKE YLSKVK 
Sbjct: 60   VYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFADLSNEEFKELYLSKVK- 118

Query: 386  SNSNILKMRGHGVKKNTTA----AGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSV 553
                    +   +K++T        + +CDAP+SLDWR+KGVVT +KDQG CGSCW+FS 
Sbjct: 119  --------KPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCGSCWSFST 170

Query: 554  VGAIESAHALDTGDLIRLSEQELVXXXXXXXXXXXXXMDTAFRWVIKNGGIDTEADYPYT 733
             GAIE  +A+ TGDLI LSEQELV             MD AF WVI NGGIDTEA+YPYT
Sbjct: 171  TGAIEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDTEANYPYT 230

Query: 734  SANGYSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGG 913
               G  G C  +K+     SID YTDV+  ++ALLCA  +QP++VG+ GSA DFQLYTGG
Sbjct: 231  ---GVDGTCNTTKEEIKVVSIDGYTDVDETDSALLCATVQQPISVGMDGSALDFQLYTGG 287

Query: 914  IYDGECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSW 1024
            IYDG+CS  P DIDHAVL+VGYGS++GEDYWIVKNSW
Sbjct: 288  IYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSW 324



 Score = 61.6 bits (148), Expect(2) = e-106
 Identities = 28/45 (62%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
 Frame = +3

Query: 1113 IVKNSWGTYWGMDGWILMKRNTDKKNGVCGINVQPSYP--HSSAP 1241
            IVKNSWGT WGM+G+  +KRNTD   GVC IN + SYP   SS+P
Sbjct: 319  IVKNSWGTEWGMEGYFYIKRNTDLPYGVCAINAEASYPTKESSSP 363

