BLASTX nr result

ID: Bupleurum21_contig00007369 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00007369
         (1472 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   541   e-151
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   536   e-150
ref|XP_002309707.1| predicted protein [Populus trichocarpa] gi|2...   521   e-145
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   512   e-142
ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucu...   509   e-142

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  541 bits (1394), Expect = e-151
 Identities = 267/401 (66%), Positives = 311/401 (77%), Gaps = 4/401 (0%)
 Frame = -2

Query: 1429 TMRRFQERVLGLNKTGISISHSDIWLLGLCYSLSNEDSSADPIQSHGFAAFVEDFESRIL 1250
            +MRR QERVLG +KTGIS S SDIWLLGLCY +S E+SS     S+G A F +DF SRIL
Sbjct: 86   SMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSRIL 145

Query: 1249 MTYRKGFNAIGETKYTSDVNWGCMIRSSQMLVAQALVIHRLGRNWRKSLEKPLDKDYIEI 1070
            MTYRKGF AIG++K TSDVNWGCM+RSSQMLVAQAL++HR+GR+WRK+  KP+D+DYIEI
Sbjct: 146  MTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEI 205

Query: 1069 LHYFGDSEASVFSVHNLIQSGKL--LCPGSWVGPYAMCRTWEALARSKMKETEPENQSLP 896
            LH+FGDS+AS FS+HN++Q+GK   L  GSWVGPYAMCR+WE LARSK +ET+ E QSLP
Sbjct: 206  LHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQSLP 265

Query: 895  MAMYVVSGDEDGERGGAPVVCIDDASRHCHEFSRGQVDWSPIXXXXXXXXXLDKINTRYI 716
            MA+Y+VSGDEDGERGGAPVV I++ASRHC EFS+GQVDW+PI         L+K+N RYI
Sbjct: 266  MAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYI 325

Query: 715  PLLSATFTFPQSLGILGGRPGVSTYIVGVQDDNAFYLDPHEVKQVVDISRDNLEADTSSY 536
            P L+ATFTFPQSLGILGG+PG STYIVGVQD+ AFYLDPHE + VVDI R+NLEADTSSY
Sbjct: 326  PSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTSSY 385

Query: 535  HCNAIRQISLDQIDPSLALGFYCRDKDDFDDFCSRASDLAAQSNGAPLFTVAQSCNSMKP 356
            HCN IR I LD IDPSLA+GFYCRDKDDFDDFC RAS LA +SNGAPLFTVA   +  KP
Sbjct: 386  HCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIHSLPKP 445

Query: 355  AGQCE--TSCDGAAVHXXXXXXXXXXXXXDNTTQEDEWQIL 239
                +    C G                      ED+WQ+L
Sbjct: 446  ISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 486


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  536 bits (1382), Expect = e-150
 Identities = 267/404 (66%), Positives = 311/404 (76%), Gaps = 7/404 (1%)
 Frame = -2

Query: 1429 TMRRFQERVLGLNKTGISISHSDIWLLGLCYSLSNEDSSADPIQSHGFAAFVEDFESRIL 1250
            +MRR QERVLG +KTGIS S SDIWLLGLCY +S E+SS     S+G A F +DF SRIL
Sbjct: 86   SMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSRIL 145

Query: 1249 MTYRKGFNAIGETKYTSDVNWGCMIRSSQMLVAQALVIHRLGRNWRKSLEKPLDKDYIEI 1070
            MTYRKGF AIG++K TSDVNWGCM+RSSQMLVAQAL++HR+GR+WRK+  KP+D+DYIEI
Sbjct: 146  MTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEI 205

Query: 1069 LHYFGDSEASVFSVHNLIQSGKL--LCPGSWVGPYAMCRTWEALARSKMKETEPENQSLP 896
            LH+FGDS+AS FS+HN++Q+GK   L  GSWVGPYAMCR+WE LARSK +ET+ E QSLP
Sbjct: 206  LHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQSLP 265

Query: 895  MAMYVVSGDEDGERGGAPVVCIDDASRHCHEFSRGQVDWSPIXXXXXXXXXLDKINTRYI 716
            MA+Y+VSGDEDGERGGAPVV I++ASRHC EFS+GQVDW+PI         L+K+N RYI
Sbjct: 266  MAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYI 325

Query: 715  PLLSATFTFPQSLGILGGRPGVSTYIVGVQDDNAFYLDPHEVKQVVDISRDNLEADTSSY 536
            P L+ATFTFPQSLGILGG+PG STYIVGVQD+ AFYLDPHE + VVDI R+NLEADTSSY
Sbjct: 326  PSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTSSY 385

Query: 535  HCNA---IRQISLDQIDPSLALGFYCRDKDDFDDFCSRASDLAAQSNGAPLFTVAQSCNS 365
            HCN    IR I LD IDPSLA+GFYCRDKDDFDDFC RAS LA +SNGAPLFTVA   + 
Sbjct: 386  HCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTVAHIHSL 445

Query: 364  MKPAGQCE--TSCDGAAVHXXXXXXXXXXXXXDNTTQEDEWQIL 239
             KP    +    C G                      ED+WQ+L
Sbjct: 446  PKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_002309707.1| predicted protein [Populus trichocarpa] gi|222852610|gb|EEE90157.1|
            predicted protein [Populus trichocarpa]
          Length = 481

 Score =  521 bits (1342), Expect = e-145
 Identities = 260/399 (65%), Positives = 307/399 (76%), Gaps = 2/399 (0%)
 Frame = -2

Query: 1429 TMRRFQERVLGLNKTGISISHSDIWLLGLCYSLSNEDSSADPIQSHGFAAFVEDFESRIL 1250
            TMRR QERVLG +KTGIS + SDIWLLG  Y +S +DSS +   ++  AAF  DF SRIL
Sbjct: 91   TMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATNALAAFHRDFSSRIL 150

Query: 1249 MTYRKGFNAIGETKYTSDVNWGCMIRSSQMLVAQALVIHRLGRNWRKSLEKPLDKDYIEI 1070
            +TYRKGF+ I ++K TSDVNWGCM+RSSQMLVAQAL+ HRLGR+WRK ++KPLD+DY+EI
Sbjct: 151  ITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPVDKPLDRDYVEI 210

Query: 1069 LHYFGDSEASVFSVHNLIQSGKL--LCPGSWVGPYAMCRTWEALARSKMKETEPENQSLP 896
            LH FGDSEAS FS+HNL+Q+GK   L  GSWVGPYAMCR+WE+LARSK +ET  E Q+LP
Sbjct: 211  LHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARSKREETNLEYQTLP 270

Query: 895  MAMYVVSGDEDGERGGAPVVCIDDASRHCHEFSRGQVDWSPIXXXXXXXXXLDKINTRYI 716
            MA+YVVSG EDGERGGAPV+ I+DA+RHC EFS+G+ DW+PI         LDKIN RYI
Sbjct: 271  MAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKINPRYI 330

Query: 715  PLLSATFTFPQSLGILGGRPGVSTYIVGVQDDNAFYLDPHEVKQVVDISRDNLEADTSSY 536
            P L ATFTFPQSLGILGG+PG STYIVGVQD+NAFYLDPHEV+ VV+ SRD++EA+TSSY
Sbjct: 331  PSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVNFSRDDVEANTSSY 390

Query: 535  HCNAIRQISLDQIDPSLALGFYCRDKDDFDDFCSRASDLAAQSNGAPLFTVAQSCNSMKP 356
            HC+ +R I LD IDPSLA+GFYCRDKDDFDDFCS AS LA +SNGAPLFTVA S  S K 
Sbjct: 391  HCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNGAPLFTVANSYKSSKH 450

Query: 355  AGQCETSCDGAAVHXXXXXXXXXXXXXDNTTQEDEWQIL 239
                    D + V              +    ED+WQ+L
Sbjct: 451  --------DSSEVRDDDPLGVMTMNDAEGCLNEDDWQLL 481


>ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1|
            predicted protein [Populus trichocarpa]
          Length = 482

 Score =  512 bits (1318), Expect = e-142
 Identities = 250/399 (62%), Positives = 304/399 (76%), Gaps = 2/399 (0%)
 Frame = -2

Query: 1429 TMRRFQERVLGLNKTGISISHSDIWLLGLCYSLSNEDSSADPIQSHGFAAFVEDFESRIL 1250
            +MRR QE VLG +KTGIS +  DIWLLG CY +S ++SS D   ++  AAF  DF SRIL
Sbjct: 92   SMRRIQECVLGTSKTGISNTTGDIWLLGACYKISQDNSSGDAAATNALAAFNHDFSSRIL 151

Query: 1249 MTYRKGFNAIGETKYTSDVNWGCMIRSSQMLVAQALVIHRLGRNWRKSLEKPLDKDYIEI 1070
            +TYRKGF+AI ++K TSDV+WGCM+RSSQMLVAQAL+ HRLGR+WRK L+KPLD++Y+EI
Sbjct: 152  ITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQALLFHRLGRSWRKPLDKPLDREYVEI 211

Query: 1069 LHYFGDSEASVFSVHNLIQSGKL--LCPGSWVGPYAMCRTWEALARSKMKETEPENQSLP 896
            LH FGDSE+S FS+HNL+++GK   L  GSWVGPYA+C +WE+L RS+ +ET  E QSL 
Sbjct: 212  LHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYAVCHSWESLVRSRREETNLEYQSLS 271

Query: 895  MAMYVVSGDEDGERGGAPVVCIDDASRHCHEFSRGQVDWSPIXXXXXXXXXLDKINTRYI 716
            MA+YVVSG EDGERGGAPV+CI++A+RHC EFS+GQ DW+PI         LDKIN RYI
Sbjct: 272  MAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQEDWTPILLLVPLVLGLDKINPRYI 331

Query: 715  PLLSATFTFPQSLGILGGRPGVSTYIVGVQDDNAFYLDPHEVKQVVDISRDNLEADTSSY 536
            P L ATFTFPQSLGILGG+PG STYIVGVQD+NAFYLDPHEV+ VV++SRD++EA+TSSY
Sbjct: 332  PSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVNVSRDDVEANTSSY 391

Query: 535  HCNAIRQISLDQIDPSLALGFYCRDKDDFDDFCSRASDLAAQSNGAPLFTVAQSCNSMKP 356
            HCN +R + LD IDPSLA+GFYCRDKDDFDDFC+ AS L  +SNGAPLFTVA S   +K 
Sbjct: 392  HCNVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLASKLTDESNGAPLFTVAHSRKLLKH 451

Query: 355  AGQCETSCDGAAVHXXXXXXXXXXXXXDNTTQEDEWQIL 239
                    D   V              +    ED+WQ+L
Sbjct: 452  --------DSGEVRSDDSLGVMTMNDVEGCVHEDDWQLL 482


>ref|XP_004138950.1| PREDICTED: cysteine protease ATG4-like [Cucumis sativus]
            gi|449512710|ref|XP_004164121.1| PREDICTED: cysteine
            protease ATG4-like [Cucumis sativus]
          Length = 483

 Score =  509 bits (1311), Expect = e-142
 Identities = 250/399 (62%), Positives = 302/399 (75%), Gaps = 2/399 (0%)
 Frame = -2

Query: 1429 TMRRFQERVLGLNKTGISISHSDIWLLGLCYSLSNEDSSADPIQSHGFAAFVEDFESRIL 1250
            +MRR QER+LG  ++G+  S  DIWLLG+C+ +S +    D   S G A + +DF SRIL
Sbjct: 87   SMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKISQDHPPDDAASSPGVAGYEQDFSSRIL 146

Query: 1249 MTYRKGFNAIGETKYTSDVNWGCMIRSSQMLVAQALVIHRLGRNWRKSLEKPLDKDYIEI 1070
            MTYRKGF+ I ++KYTSDVNWGCM+RSSQMLVAQAL+ HRLGR+WRK  +KPLDK+Y+EI
Sbjct: 147  MTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPSQKPLDKEYVEI 206

Query: 1069 LHYFGDSEASVFSVHNLIQSGKL--LCPGSWVGPYAMCRTWEALARSKMKETEPENQSLP 896
            LH FGDSE S FS+HNL+Q+G+   L  GSWVGPYAMCR+WE L RSK +    ++Q LP
Sbjct: 207  LHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGPYAMCRSWETLVRSKRETPILQDQQLP 266

Query: 895  MAMYVVSGDEDGERGGAPVVCIDDASRHCHEFSRGQVDWSPIXXXXXXXXXLDKINTRYI 716
            MA+Y+VSGDEDGERGGAPV+ IDDASRHC EFS+GQ DWSPI         L+KIN RYI
Sbjct: 267  MAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSKGQHDWSPILLLVPLVLGLEKINPRYI 326

Query: 715  PLLSATFTFPQSLGILGGRPGVSTYIVGVQDDNAFYLDPHEVKQVVDISRDNLEADTSSY 536
            P L  TFTFPQSLGILGG+PG STYIVGVQD+NAFYLDPHEV+QVV+I +D+LEADTSSY
Sbjct: 327  PSLRTTFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQQVVNIDKDDLEADTSSY 386

Query: 535  HCNAIRQISLDQIDPSLALGFYCRDKDDFDDFCSRASDLAAQSNGAPLFTVAQSCNSMKP 356
            HCN IR I L+ IDPSLA+GFYCRDKDDFD+FC RAS LA +S+GAPLFTVA++ +S  P
Sbjct: 387  HCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCHRASKLAEESDGAPLFTVAET-HSTNP 445

Query: 355  AGQCETSCDGAAVHXXXXXXXXXXXXXDNTTQEDEWQIL 239
              Q     D + +              +  + ED+WQ L
Sbjct: 446  GRQSSALNDHSRL-VEDDGDGVVHMPNEEESHEDDWQFL 483


Top