BLASTX nr result

ID: Atractylodes22_contig00000243 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00000243
         (1634 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   600   e-169
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   596   e-168
ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucu...   592   e-167
ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glyc...   591   e-166
ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glyc...   589   e-166

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  600 bits (1547), Expect = e-169
 Identities = 307/463 (66%), Positives = 358/463 (77%), Gaps = 15/463 (3%)
 Frame = -2

Query: 1528 SEAGPSNRRSPKASLWSGFLVSAFSVFDTHSESTDCQKKESGT---RSHGWTAAVKRVMN 1358
            SE   S+ +  K SLWS    SAFSVF+T+SES+    ++      R++GWT AV++V+ 
Sbjct: 24   SEPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNNGWTTAVRKVVT 83

Query: 1357 GGSMRRIQERVLGY-KTNVSNSISDIWLLGVCYKICLEDPSQEPVHSNGYAAFSEDFSSR 1181
            G SMRRIQERVLG  KT +S+S SDIWLLG+CYKI  E+ S     SNG A F +DFSSR
Sbjct: 84   GVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSR 143

Query: 1180 ILMTYRKGFVSIGDSKYTSDVNWGCMLRSSQMLMGQALLIHRLGRSWRKPSHQPFDRDYI 1001
            ILMTYRKGF +IGDSK TSDVNWGCMLRSSQML+ QALL+HR+GRSWRK SH+P D+DYI
Sbjct: 144  ILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYI 203

Query: 1000 EILHMFGDSEDSAFSIHNLLQAGEGYSLAPGSWVGPYAMCRTWETLARRKIEENELQDQP 821
            EILH FGDS+ SAFSIHN+LQAG+ Y LA GSWVGPYAMCR+WETLAR K EE +L+ Q 
Sbjct: 204  EILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQS 263

Query: 820  FPMAIYVVSGDEDGERGGAPVLCIQDASRHCSEFSRGQLEWSPIXXXXXXXXXLDKVNPR 641
             PMAIY+VSGDEDGERGGAPV+ I++ASRHC EFS+GQ++W+PI         L+KVNPR
Sbjct: 264  LPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPR 323

Query: 640  YLPLLAATFIFPQSLGIMGGRPGASTYIVGVQDDKAFYLDPHEVQQAVNISKDNLEADTS 461
            Y+P LAATF FPQSLGI+GG+PGASTYIVGVQD+KAFYLDPHE Q  V+I ++NLEADTS
Sbjct: 324  YIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTS 383

Query: 460  SYHCNVIRQIPLESIDPSLAIGFYCRDKADFDDFCSRASELAAESNGAPLFTVTETRHSP 281
            SYHCN+IR I L+SIDPSLAIGFYCRDK DFDDFC RAS+LA +SNGAPLFTV      P
Sbjct: 384  SYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIHSLP 443

Query: 280  K----SSG---SSRTQEVDSFDAVEN-AEEGCA---QDDWQLL 185
            K    S G    S  +E DSFD V N   EG     +DDWQLL
Sbjct: 444  KPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 486


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  596 bits (1537), Expect = e-168
 Identities = 308/466 (66%), Positives = 358/466 (76%), Gaps = 18/466 (3%)
 Frame = -2

Query: 1528 SEAGPSNRRSPKASLWSGFLVSAFSVFDTHSESTDCQKKESGT---RSHGWTAAVKRVMN 1358
            SE   S+ +  K SLWS    SAFSVF+T+SES+    ++      R++GWT AV++V+ 
Sbjct: 24   SEPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNNGWTTAVRKVVT 83

Query: 1357 GGSMRRIQERVLGY-KTNVSNSISDIWLLGVCYKICLEDPSQEPVHSNGYAAFSEDFSSR 1181
            G SMRRIQERVLG  KT +S+S SDIWLLG+CYKI  E+ S     SNG A F +DFSSR
Sbjct: 84   GVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSR 143

Query: 1180 ILMTYRKGFVSIGDSKYTSDVNWGCMLRSSQMLMGQALLIHRLGRSWRKPSHQPFDRDYI 1001
            ILMTYRKGF +IGDSK TSDVNWGCMLRSSQML+ QALL+HR+GRSWRK SH+P D+DYI
Sbjct: 144  ILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYI 203

Query: 1000 EILHMFGDSEDSAFSIHNLLQAGEGYSLAPGSWVGPYAMCRTWETLARRKIEENELQDQP 821
            EILH FGDS+ SAFSIHN+LQAG+ Y LA GSWVGPYAMCR+WETLAR K EE +L+ Q 
Sbjct: 204  EILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQS 263

Query: 820  FPMAIYVVSGDEDGERGGAPVLCIQDASRHCSEFSRGQLEWSPIXXXXXXXXXLDKVNPR 641
             PMAIY+VSGDEDGERGGAPV+ I++ASRHC EFS+GQ++W+PI         L+KVNPR
Sbjct: 264  LPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPR 323

Query: 640  YLPLLAATFIFPQSLGIMGGRPGASTYIVGVQDDKAFYLDPHEVQQAVNISKDNLEADTS 461
            Y+P LAATF FPQSLGI+GG+PGASTYIVGVQD+KAFYLDPHE Q  V+I ++NLEADTS
Sbjct: 324  YIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTS 383

Query: 460  SYHCN---VIRQIPLESIDPSLAIGFYCRDKADFDDFCSRASELAAESNGAPLFTVTETR 290
            SYHCN   +IR I L+SIDPSLAIGFYCRDK DFDDFC RAS+LA ESNGAPLFTV    
Sbjct: 384  SYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTVAHIH 443

Query: 289  HSPK----SSG---SSRTQEVDSFDAVEN-AEEGCA---QDDWQLL 185
              PK    S G    S  +E DSFD V N   EG     +DDWQLL
Sbjct: 444  SLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
            gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine
            protease ATG4-like [Cucumis sativus]
          Length = 483

 Score =  592 bits (1527), Expect = e-167
 Identities = 305/464 (65%), Positives = 345/464 (74%), Gaps = 8/464 (1%)
 Frame = -2

Query: 1552 DRICGSICSEAGPSNRRSPKASLWSGFLVSAFSVFDTHSESTDCQKKESGTRSHGWTAAV 1373
            DR   S+  E G  N  S KAS WSGF  S FS+F+ H +S+  +KK    R + W A V
Sbjct: 21   DRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKDSSVTEKKVFHPRHNVW-ATV 79

Query: 1372 KRVMNGGSMRRIQERVLGYK-TNVSNSISDIWLLGVCYKICLEDPSQEPVHSNGYAAFSE 1196
            ++VM  GSMRRIQER+LG + + V +S  DIWLLGVC+KI  + P  +   S G A + +
Sbjct: 80   RKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQDHPPDDAASSPGVAGYEQ 139

Query: 1195 DFSSRILMTYRKGFVSIGDSKYTSDVNWGCMLRSSQMLMGQALLIHRLGRSWRKPSHQPF 1016
            DFSSRILMTYRKGF  I DSKYTSDVNWGCMLRSSQML+ QALL HRLGRSWRKPS +P 
Sbjct: 140  DFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPSQKPL 199

Query: 1015 DRDYIEILHMFGDSEDSAFSIHNLLQAGEGYSLAPGSWVGPYAMCRTWETLARRKIEENE 836
            D++Y+EILH+FGDSE SAFSIHNLLQAG  Y LA GSWVGPYAMCR+WETL R K E   
Sbjct: 200  DKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRSWETLVRSKRETPI 259

Query: 835  LQDQPFPMAIYVVSGDEDGERGGAPVLCIQDASRHCSEFSRGQLEWSPIXXXXXXXXXLD 656
            LQDQ  PMAIY+VSGDEDGERGGAPVL I DASRHC EFS+GQ +WSPI         L+
Sbjct: 260  LQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHDWSPILLLVPLVLGLE 319

Query: 655  KVNPRYLPLLAATFIFPQSLGIMGGRPGASTYIVGVQDDKAFYLDPHEVQQAVNISKDNL 476
            K+NPRY+P L  TF FPQSLGI+GG+PGASTYIVGVQD+ AFYLDPHEVQQ VNI KD+L
Sbjct: 320  KINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQQVVNIDKDDL 379

Query: 475  EADTSSYHCNVIRQIPLESIDPSLAIGFYCRDKADFDDFCSRASELAAESNGAPLFTVTE 296
            EADTSSYHCNVIR IPLESIDPSLAIGFYCRDK DFD+FC RAS+LA ES+GAPLFTV E
Sbjct: 380  EADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCHRASKLAEESDGAPLFTVAE 439

Query: 295  T------RHSPKSSGSSRTQEVDSFDAVENA-EEGCAQDDWQLL 185
            T      R S   +  SR  E D    V    EE   +DDWQ L
Sbjct: 440  THSTNPGRQSSALNDHSRLVEDDGDGVVHMPNEEESHEDDWQFL 483


>ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  591 bits (1523), Expect = e-166
 Identities = 299/488 (61%), Positives = 366/488 (75%), Gaps = 9/488 (1%)
 Frame = -2

Query: 1621 VMKNIHERXXXXXXXXXXXXXSPDRICGSICSEAGPSNRRSPKASLWSGFLVSAFSVFDT 1442
            V+K + ER             + D     + S+AG SN + PKASLWS    S FSV +T
Sbjct: 2    VLKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSNSKFPKASLWSNIFTSGFSVVET 61

Query: 1441 HSESTDCQKKESGTRSHGWTAAVKRVMNGGSMRRIQERVLGY-KTNVSNSISDIWLLGVC 1265
            +SES+  +KK   +RS GW AAV++V+ GGSMRR QERVLG  +T++S+S  DIWLLGVC
Sbjct: 62   YSESSASEKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVC 121

Query: 1264 YKICLEDPSQEPVHSNGYAAFSEDFSSRILMTYRKGFVSIGDSKYTSDVNWGCMLRSSQM 1085
            +KI  ++ S    +SNG A+F +DFSS+IL+TYRKGF +IGD+KYTSDV+WGCMLRSSQM
Sbjct: 122  HKISQQESSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQM 181

Query: 1084 LMGQALLIHRLGRSWRKPSHQPFDRDYIEILHMFGDSEDSAFSIHNLLQAGEGYSLAPGS 905
            L+ QALL H+LGRSWRKP  +P D++YI++L +FGDSE SAFSIHNLLQAG+GY LA GS
Sbjct: 182  LVAQALLFHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGS 241

Query: 904  WVGPYAMCRTWETLARRKIEENELQDQPFPMAIYVVSGDEDGERGGAPVLCIQDASRHCS 725
            WVGPYAMCRTWE LAR+K   N+L + P PMAIYVVSGDEDGERGGAPV+CI+DAS+ C 
Sbjct: 242  WVGPYAMCRTWEVLARKK---NDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCF 298

Query: 724  EFSRGQLEWSPIXXXXXXXXXLDKVNPRYLPLLAATFIFPQSLGIMGGRPGASTYIVGVQ 545
            EFS G   W+P+         LDKVNPRY+PLL +TF FPQSLGIMGG+PGASTYI+G Q
Sbjct: 299  EFSSGLAAWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGAQ 358

Query: 544  DDKAFYLDPHEVQQAVNISKDNLE-ADTSSYHCNVIRQIPLESIDPSLAIGFYCRDKADF 368
            ++KAFYLDPH+VQQ VNIS D  E   TSSYHCN++R IPL+SIDPSLAIGFYCRDK DF
Sbjct: 359  NEKAFYLDPHDVQQVVNISGDTQEPTSTSSYHCNIMRHIPLDSIDPSLAIGFYCRDKDDF 418

Query: 367  DDFCSRASELAAESNGAPLFTVTETRHSPKS------SGSSRTQEVDSFDAVENAEE-GC 209
            DDFCS+AS+LA ESNGAPLFTVT++R   K       SG +   + + F  ++   + G 
Sbjct: 419  DDFCSQASKLAEESNGAPLFTVTQSRSFSKQVTSNDVSGDNTGFQEEDFPGMDRGNDTGT 478

Query: 208  AQDDWQLL 185
             +DDWQLL
Sbjct: 479  NEDDWQLL 486


>ref|XP_003552483.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 485

 Score =  589 bits (1519), Expect = e-166
 Identities = 300/487 (61%), Positives = 365/487 (74%), Gaps = 8/487 (1%)
 Frame = -2

Query: 1621 VMKNIHERXXXXXXXXXXXXXSPDRICGSICSEAGPSNRRSPKASLWSGFLVSAFSVFDT 1442
            V+K + ER             + D     + S+AG S+ + PKASLWS    S FSV +T
Sbjct: 2    VLKGLCERIVSSKCSSKSSTETVDNTQVPVYSKAGSSDCKFPKASLWSSIFTSGFSVVET 61

Query: 1441 HSESTDCQKKESGTRSHGWTAAVKRVMNGGSMRRIQERVLGY-KTNVSNSISDIWLLGVC 1265
            +SES+  +KK   +RS GW AAV++V+ GGSMRR QERVLG  +T++S+S  DIWLLGVC
Sbjct: 62   YSESSASEKKAVPSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGVC 121

Query: 1264 YKICLEDPSQEPVHSNGYAAFSEDFSSRILMTYRKGFVSIGDSKYTSDVNWGCMLRSSQM 1085
            +KI  ++ +     SNG A+F +DFSS+IL+TYRKGF +IGD+KYTSDVNWGCMLRSSQM
Sbjct: 122  HKISQQESTGGVDTSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVNWGCMLRSSQM 181

Query: 1084 LMGQALLIHRLGRSWRKPSHQPFDRDYIEILHMFGDSEDSAFSIHNLLQAGEGYSLAPGS 905
            L+ QALL H+LGRSWRKP  +P D++YI++L +FGDSE SAFSIHNLLQAG+GY LA GS
Sbjct: 182  LVAQALLFHKLGRSWRKPIDKPLDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVGS 241

Query: 904  WVGPYAMCRTWETLARRKIEENELQDQPFPMAIYVVSGDEDGERGGAPVLCIQDASRHCS 725
            WVGPYAMCRTWE LAR+K   N+L + P PMAIYVVSGDEDGERGGAPV+CI+DAS+ CS
Sbjct: 242  WVGPYAMCRTWEVLARKK---NDLGEPPLPMAIYVVSGDEDGERGGAPVVCIEDASKRCS 298

Query: 724  EFSRGQLEWSPIXXXXXXXXXLDKVNPRYLPLLAATFIFPQSLGIMGGRPGASTYIVGVQ 545
            EFS G   W+P+         LDKVNPRY+PLL +TF FPQSLGIMGG+PGASTYI+GVQ
Sbjct: 299  EFSSGLAVWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGVQ 358

Query: 544  DDKAFYLDPHEVQQAVNISKDNLE-ADTSSYHCNVIRQIPLESIDPSLAIGFYCRDKADF 368
            ++KAFYLDPH+VQQ VNIS D  E   TSSYHCNV+R IPL+SIDPSLAIGFYCRDK DF
Sbjct: 359  NEKAFYLDPHDVQQVVNISGDTQEPTGTSSYHCNVMRHIPLDSIDPSLAIGFYCRDKDDF 418

Query: 367  DDFCSRASELAAESNGAPLFTVTETRHSPKS-----SGSSRTQEVDSFDAVENAEEGCA- 206
            DDFCS+AS+LA ESNGAPLFTV ++R   K      SG +   + D F  ++   +    
Sbjct: 419  DDFCSQASKLAEESNGAPLFTVAKSRSFSKQVSNDVSGDNTGFQEDDFPGMDCGNDTVTN 478

Query: 205  QDDWQLL 185
            +DDWQLL
Sbjct: 479  EDDWQLL 485


Top