BLASTX nr result

ID: Atractylodes21_contig00001936 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00001936
         (1825 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]       632   e-179
gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]            511   e-142
ref|XP_002317418.1| predicted protein [Populus trichocarpa] gi|2...   500   e-139
ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|2...   495   e-137
ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|1...   492   e-136

>gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score =  632 bits (1631), Expect = e-179
 Identities = 309/503 (61%), Positives = 371/503 (73%), Gaps = 5/503 (0%)
 Frame = -1

Query: 1804 MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGK 1625
            M TS +++T+L  + +VS +I+T  LP E S+L   E ++ LS  KV +LF  WKE+HGK
Sbjct: 1    MATSNSMITILIFLTYVSYSISTKTLPSEFSILEGQENDI-LSSAKVSDLFGKWKELHGK 59

Query: 1624 TYEHAEEET-RLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 1448
            TY+H EEE  RL NF+KS+K+V+EKNS+RKSE++H VGLNKFADLSNEEFKE Y+SKVKG
Sbjct: 60   TYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMYMSKVKG 119

Query: 1447 SNSNILKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAI 1268
            S SN LKM G  VK+N + +   +CDAP SLDWR+KGVVTP+KDQGQCGSCWAFSV G+I
Sbjct: 120  SRSNELKMGG--VKRNMSVSS-RTCDAPTSLDWRDKGVVTPMKDQGQCGSCWAFSVSGSI 176

Query: 1267 ESAHALDTGDLIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYTSANG 1088
            ESA+A+ TGDLIRLSEQELV            NMDTA+RW+IKNGG+D+E DYPYTS+NG
Sbjct: 177  ESANAIATGDLIRLSEQELVDCDTYDYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSNG 236

Query: 1087 YSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDG 908
              GKC  +K      S+DSY +VE +E+A+LCAVA  PVT+GIVGSAYDFQLYTGG+Y+G
Sbjct: 237  RDGKCDKTKSAKSVVSLDSYVEVESNEDAVLCAVATTPVTIGIVGSAYDFQLYTGGVYNG 296

Query: 907  ECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGINV 728
            +CSS PYDIDHAVL+VGYGSQDG+DYWIVKNSWGTYWG++G+ILM+RNTD KNGVCG+ +
Sbjct: 297  QCSSKPYDIDHAVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILMERNTDIKNGVCGMYL 356

Query: 727  Q----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEFYDY 560
            +                                   SKCGDF YCAA QTCCCIFEFY+Y
Sbjct: 357  EPVYPITAAPTPPGPPPPPAPPSPPHPPPPPTPPAPSKCGDFHYCAADQTCCCIFEFYNY 416

Query: 559  CLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXXMPW 380
            CLIYGCCGYS+AVCCK S+ CCPSDYP+CDV  GYC+KNS  T GV           MPW
Sbjct: 417  CLIYGCCGYSDAVCCKNSAACCPSDYPICDVQAGYCYKNSAKTFGVPAKKRQLAKHKMPW 476

Query: 379  EKTEETVVEEYLPLRWK*NKFEA 311
            EK EET+ EE+ PL W  N F A
Sbjct: 477  EKIEETIKEEFQPLAWNRNPFAA 499


>gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  511 bits (1315), Expect = e-142
 Identities = 252/506 (49%), Positives = 316/506 (62%), Gaps = 14/506 (2%)
 Frame = -1

Query: 1786 LLTLLFLIIHVSST-IATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGKTYEHA 1610
            L TL+  ++  S T + +S LP E S++    P  +++ ++V+ELF+ W E HGK Y+H 
Sbjct: 8    LFTLVIFLVWASLTSLISSSLPSEFSIVG--RPGESIAEERVVELFKKWTEKHGKVYKHG 65

Query: 1609 EE-ETRLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKGSNSNI 1433
            +E E +  NFR +L+YV+EKN +R +   H+VGLNKFAD+SNEEF+E Y+SKVK   S  
Sbjct: 66   QEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKR 125

Query: 1432 LKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAIESAHA 1253
            + +      K   A  V +CD P SLDWR+ G+VT +KDQG CGSCWAFS  GAIE  +A
Sbjct: 126  MAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINA 185

Query: 1252 LDTGDLIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYTSANGYSGKC 1073
            L  GDLI LSEQELV             MD AF WV+ NGGIDTE DYPYT   G  G C
Sbjct: 186  LANGDLISLSEQELVDCDSTNDGCEGGYMDYAFEWVMSNGGIDTETDYPYT---GEDGTC 242

Query: 1072 KISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDGECSSS 893
              +K+   A SID Y DV  +E+AL CAV KQP++VGI G A DFQLYTGGIYDG+CS  
Sbjct: 243  NTTKEETKAVSIDGYEDVAEEESALFCAVLKQPISVGIDGGAIDFQLYTGGIYDGDCSDD 302

Query: 892  PYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGINV----- 728
            P DIDHAVLVVGYG++ GE+YWI+KNSWGT WGM G+  +KRNT K  GVC IN      
Sbjct: 303  PDDIDHAVLVVGYGAESGEEYWIIKNSWGTDWGMKGYAYIKRNTSKDYGVCAINAMASYP 362

Query: 727  -------QXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEF 569
                                                   ++CGDFSYCAA +TCCCIFEF
Sbjct: 363  TKESSAPSPYPSPAVPPPPPPPPPPPSPPPPPPPPSPSPTQCGDFSYCAATETCCCIFEF 422

Query: 568  YDYCLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXX 389
            +DYCLIYGCC Y++AVCC G+ YCCP DYP+CD+ +G C +N GD +GV           
Sbjct: 423  FDYCLIYGCCDYTDAVCCTGTEYCCPHDYPICDIEEGLCLQNDGDFLGVTAKKRKMAKHK 482

Query: 388  MPWEKTEETVVEEYLPLRWK*NKFEA 311
             PW K E++  + + PL WK N+F A
Sbjct: 483  YPWTKPEDS-AKNHQPLEWKRNRFAA 507


>ref|XP_002317418.1| predicted protein [Populus trichocarpa] gi|222860483|gb|EEE98030.1|
            predicted protein [Populus trichocarpa]
          Length = 503

 Score =  500 bits (1288), Expect = e-139
 Identities = 250/514 (48%), Positives = 320/514 (62%), Gaps = 16/514 (3%)
 Frame = -1

Query: 1804 MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGK 1625
            M + ++ ++L+  ++    T  +S LP E  ++ +   EL +S + ++E+FQ W++ H K
Sbjct: 1    MDSKKSQMSLIIFLLLALLTCLSSSLPGEHPIVVNDFSEL-VSEESIIEIFQQWRDRHQK 59

Query: 1624 TYEHA-EEETRLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 1448
             YEHA E E R  NF+++LKY++EK  K+ + + H VGLNKFADLSNEEFKE YLSKVK 
Sbjct: 60   VYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFADLSNEEFKELYLSKVK- 118

Query: 1447 SNSNILKMRGHGVKKNTTA----AGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSV 1280
                    +   +K++T        + +CDAP+SLDWR+KGVVT +KDQG CGSCW+FS 
Sbjct: 119  --------KPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCGSCWSFST 170

Query: 1279 VGAIESAHALDTGDLIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYT 1100
             GAIE  +A+ TGDLI LSEQELV             MD AF WVI NGGIDTEA+YPYT
Sbjct: 171  TGAIEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDTEANYPYT 230

Query: 1099 SANGYSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGG 920
               G  G C  +K+     SID YTDV+  ++ALLCA  +QP++VG+ GSA DFQLYTGG
Sbjct: 231  ---GVDGTCNTTKEEIKVVSIDGYTDVDETDSALLCATVQQPISVGMDGSALDFQLYTGG 287

Query: 919  IYDGECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVC 740
            IYDG+CS  P DIDHAVL+VGYGS++GEDYWIVKNSWGT WGM+G+  +KRNTD   GVC
Sbjct: 288  IYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFYIKRNTDLPYGVC 347

Query: 739  GINVQ-----------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQ 593
             IN +                                          S CGDF+YC + +
Sbjct: 348  AINAEASYPTKESSSPSPTSPPSPPSPLSPPPPPPPTPVPPPPCPQPSDCGDFAYCPSDE 407

Query: 592  TCCCIFEFYDYCLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXX 413
            TCCCI + +DYC++YGCC Y NAVCC  S YCCPSDYP+CDV +G C K+ GD +GV   
Sbjct: 408  TCCCILKVFDYCIVYGCCQYENAVCCADSVYCCPSDYPICDVEEGLCLKSQGDYLGVPAS 467

Query: 412  XXXXXXXXMPWEKTEETVVEEYLPLRWK*NKFEA 311
                     PW K EE    +   LRWK N F+A
Sbjct: 468  KRHMAKHKFPWTKLEEKTTTDRHALRWKRNPFDA 501


>ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|222848707|gb|EEE86254.1|
            predicted protein [Populus trichocarpa]
          Length = 494

 Score =  495 bits (1274), Expect = e-137
 Identities = 251/499 (50%), Positives = 311/499 (62%), Gaps = 11/499 (2%)
 Frame = -1

Query: 1774 LFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGKTYEHAEE-ET 1598
            L L++ V  T  +S LP E S++ +   EL    + ++E+FQ W++ H K Y+HAEE E 
Sbjct: 4    LILLLLVGLTSVSSSLPSEYSIVGNDFSELP-PDESIIEIFQQWRDRHQKAYKHAEEAEK 62

Query: 1597 RLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKGSNSNILKMRG 1418
            R  NF+++LKY++EK  K ++ + H VGLNKFADLSNEEFK+ YLSKVK     I K R 
Sbjct: 63   RFGNFKRNLKYIIEKTGK-ETTLRHRVGLNKFADLSNEEFKQLYLSKVK---KPINKTRI 118

Query: 1417 HGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAIESAHALDTGD 1238
                ++ +   + SCDAP+SLDWR+KGVVT +KDQG CGSCW+FS  GAIE  +A+ T D
Sbjct: 119  DA--EDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSD 176

Query: 1237 LIRLSEQELVXXXXXXXXXXXXNMDTAFRWVIKNGGIDTEADYPYTSANGYSGKCKISKQ 1058
            LI LSEQELV             MD AF WVI NGGIDTEA+YPYT   G  G C  +K+
Sbjct: 177  LISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDTEANYPYT---GVDGTCNTAKE 233

Query: 1057 NNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYDGECSSSPYDID 878
                 SID Y DV+  ++ALLCA A+QP++VGI GSA DFQLYTGGIYDG+CS  P DID
Sbjct: 234  EIKVVSIDGYKDVDETDSALLCAAAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDID 293

Query: 877  HAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGINVQ--------- 725
            HAVL+VGYGS++GEDYWIVKNSWGT WG++G+  +KRNTD   GVC IN           
Sbjct: 294  HAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYIKRNTDLPYGVCAINAMASYPTKEAS 353

Query: 724  -XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEFYDYCLIY 548
                                            S CGDFSYC + +TCCCI   +DYCL+Y
Sbjct: 354  AQSPTSPPSPPSPPPPPPPPPTPVPPPPSPQPSDCGDFSYCPSDETCCCILNVFDYCLVY 413

Query: 547  GCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXXMPWEKTE 368
            GCC Y NAVCC  S YCCPSDYP+CDV +G C K  GD +GV            PW K +
Sbjct: 414  GCCAYENAVCCADSVYCCPSDYPICDVEEGLCLKGQGDYLGVAASKRHMAKHKFPWTKLQ 473

Query: 367  ETVVEEYLPLRWK*NKFEA 311
            E    ++  L+WK N F A
Sbjct: 474  ERAKTDHRVLQWKRNPFAA 492


>ref|XP_002317417.1| predicted protein [Populus trichocarpa] gi|118488173|gb|ABK95906.1|
            unknown [Populus trichocarpa] gi|222860482|gb|EEE98029.1|
            predicted protein [Populus trichocarpa]
          Length = 498

 Score =  492 bits (1266), Expect = e-136
 Identities = 250/499 (50%), Positives = 314/499 (62%), Gaps = 7/499 (1%)
 Frame = -1

Query: 1804 MGTSETLLTLLFLIIHVSSTIATSGLPVEISMLNDHEPELALSGQKVLELFQNWKEVHGK 1625
            M + +  LTL  L++       +SGLP E S +++   E  L+ + + E+F+ WKE H K
Sbjct: 1    MESQKPQLTLFILLLLAPLPCLSSGLPGEYSAVSNDLHE-GLTEEGITEVFKLWKEKHQK 59

Query: 1624 TYEHAEE-ETRLANFRKSLKYVLEKNSKRKSEMEHMVGLNKFADLSNEEFKETYLSKVKG 1448
             Y+HAEE E R+ NF+++LKY++EKN KRKS +EH VGLNKFADLSNEEF+E YLSKVK 
Sbjct: 60   VYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFADLSNEEFREMYLSKVK- 118

Query: 1447 SNSNILKMRGHGVKKNTTAAGVGSCDAPASLDWREKGVVTPIKDQGQCGSCWAFSVVGAI 1268
                I + R H          + +CDAP+SLDWR KGVVT +KDQG CGSCW+FS  GAI
Sbjct: 119  KPITIEEKRKH--------RHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFSTTGAI 170

Query: 1267 ESAHALDTGDLIRLSEQELVXXXXXXXXXXXXN-MDTAFRWVIKNGGIDTEADYPYTSAN 1091
            E+ +A+ TGDLI LSEQELV              MD+AF+WVI NGGIDTEADYPYT   
Sbjct: 171  EAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGIDTEADYPYT--- 227

Query: 1090 GYSGKCKISKQNNIAASIDSYTDVEPDENALLCAVAKQPVTVGIVGSAYDFQLYTGGIYD 911
            G  G C  +K+     SI+ Y DV+P ++ALLCA  +QP++VG+ GSA DFQLYTGGIYD
Sbjct: 228  GVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQPISVGMDGSALDFQLYTGGIYD 287

Query: 910  GECSSSPYDIDHAVLVVGYGSQDGEDYWIVKNSWGTYWGMDGWILMKRNTDKKNGVCGIN 731
            G+CS  P DIDHA+L+VGYGS++ EDYWIVKNSWGT WGM+G+  ++RNT K  GVC IN
Sbjct: 288  GDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYIRRNTSKPYGVCAIN 347

Query: 730  VQ-----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSKCGDFSYCAAGQTCCCIFEFY 566
                                                  S CGD S+C + +TCCCI + +
Sbjct: 348  ADASYPTKVPSPPSPPSPPPPPSPPPPPPSPPPPCPQPSDCGDSSFCPSDETCCCILKLF 407

Query: 565  DYCLIYGCCGYSNAVCCKGSSYCCPSDYPVCDVYDGYCFKNSGDTVGVXXXXXXXXXXXM 386
              C+IYGCC Y NAVCC  S+YCCPSDYP+CDV DG C +  GD +GV            
Sbjct: 408  SSCIIYGCCPYENAVCCAESTYCCPSDYPICDVDDGLCLRGQGDHLGVAARRRHMANYKF 467

Query: 385  PWEKTEETVVEEYLPLRWK 329
            PW K EE    +   L+WK
Sbjct: 468  PWTKFEEKKETKQPVLQWK 486


Top