BLASTX nr result

ID: Angelica22_contig00011866 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00011866
         (1769 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   529   e-148
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   524   e-146
ref|XP_002309707.1| predicted protein [Populus trichocarpa] gi|2...   509   e-141
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   506   e-141
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   503   e-140

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  529 bits (1363), Expect = e-148
 Identities = 264/406 (65%), Positives = 309/406 (76%), Gaps = 3/406 (0%)
 Frame = -3

Query: 1548 VIFGKKMRRFQERVLGLNRTGVSISVSEIWLLGVCYSLSNEDSSADPIQSHGFAAFVEDF 1369
            V+ G  MRR QERVLG ++TG+S S S+IWLLG+CY +S E+SS     S+G A F +DF
Sbjct: 81   VVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDF 140

Query: 1368 ESRILLTYRKGFAAIGETKYTSDVNWGCMLRSSQMLVAQALVIQRLGRNWRKSFEKPLSK 1189
             SRIL+TYRKGF AIG++K TSDVNWGCMLRSSQMLVAQAL++ R+GR+WRK+  KP+ +
Sbjct: 141  SSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQ 200

Query: 1188 DYIEILHYFGDSEASAFSIHNLLQAGKL--LSPGSWVGPYAMCRTWETLAQSKLEETEPE 1015
            DYIEILH+FGDS+ASAFSIHN+LQAGK   L+ GSWVGPYAMCR+WETLA+SK EET+ E
Sbjct: 201  DYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLE 260

Query: 1014 NQSLPMAMYVVCGDENGERGGAPVLCIEDASRHCREFSRGQADWTPIXXXXXXXXXXXXL 835
             QSLPMA+Y+V GDE+GERGGAPV+ IE+ASRHC EFS+GQ DWTPI            +
Sbjct: 261  CQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKV 320

Query: 834  NPRYIPLLVATFTFPQSLGILGGRPGVSTYIVAVQDDNAFYLDPHEVKQVVDVARDNLEA 655
            NPRYIP L ATFTFPQSLGILGG+PG STYIV VQD+ AFYLDPHE + VVD+ R+NLEA
Sbjct: 321  NPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEA 380

Query: 654  DTSSYHCNIIRQIPLDSIDPSLAIGFYCRDKDDFDNFCWRASELAAQSNGAPLLTVTQSR 475
            DTSSYHCNIIR I LDSIDPSLAIGFYCRDKDDFD+FC RAS+LA +SNGAPL TV    
Sbjct: 381  DTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIH 440

Query: 474  NSTKPAGEYETSCDTIGV-HXXXXXXXXXXXXXXNSGTQEDEWQLL 340
            +  KP    +   D  G                      ED+WQLL
Sbjct: 441  SLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 486


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  524 bits (1350), Expect = e-146
 Identities = 264/409 (64%), Positives = 309/409 (75%), Gaps = 6/409 (1%)
 Frame = -3

Query: 1548 VIFGKKMRRFQERVLGLNRTGVSISVSEIWLLGVCYSLSNEDSSADPIQSHGFAAFVEDF 1369
            V+ G  MRR QERVLG ++TG+S S S+IWLLG+CY +S E+SS     S+G A F +DF
Sbjct: 81   VVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDF 140

Query: 1368 ESRILLTYRKGFAAIGETKYTSDVNWGCMLRSSQMLVAQALVIQRLGRNWRKSFEKPLSK 1189
             SRIL+TYRKGF AIG++K TSDVNWGCMLRSSQMLVAQAL++ R+GR+WRK+  KP+ +
Sbjct: 141  SSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQ 200

Query: 1188 DYIEILHYFGDSEASAFSIHNLLQAGKL--LSPGSWVGPYAMCRTWETLAQSKLEETEPE 1015
            DYIEILH+FGDS+ASAFSIHN+LQAGK   L+ GSWVGPYAMCR+WETLA+SK EET+ E
Sbjct: 201  DYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLE 260

Query: 1014 NQSLPMAMYVVCGDENGERGGAPVLCIEDASRHCREFSRGQADWTPIXXXXXXXXXXXXL 835
             QSLPMA+Y+V GDE+GERGGAPV+ IE+ASRHC EFS+GQ DWTPI            +
Sbjct: 261  CQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKV 320

Query: 834  NPRYIPLLVATFTFPQSLGILGGRPGVSTYIVAVQDDNAFYLDPHEVKQVVDVARDNLEA 655
            NPRYIP L ATFTFPQSLGILGG+PG STYIV VQD+ AFYLDPHE + VVD+ R+NLEA
Sbjct: 321  NPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEA 380

Query: 654  DTSSYHCN---IIRQIPLDSIDPSLAIGFYCRDKDDFDNFCWRASELAAQSNGAPLLTVT 484
            DTSSYHCN   IIR I LDSIDPSLAIGFYCRDKDDFD+FC RAS+LA +SNGAPL TV 
Sbjct: 381  DTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTVA 440

Query: 483  QSRNSTKPAGEYETSCDTIGV-HXXXXXXXXXXXXXXNSGTQEDEWQLL 340
               +  KP    +   D  G                      ED+WQLL
Sbjct: 441  HIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_002309707.1| predicted protein [Populus trichocarpa] gi|222852610|gb|EEE90157.1|
            predicted protein [Populus trichocarpa]
          Length = 481

 Score =  509 bits (1310), Expect = e-141
 Identities = 256/405 (63%), Positives = 304/405 (75%), Gaps = 2/405 (0%)
 Frame = -3

Query: 1548 VIFGKKMRRFQERVLGLNRTGVSISVSEIWLLGVCYSLSNEDSSADPIQSHGFAAFVEDF 1369
            ++ G  MRR QERVLG ++TG+S + S+IWLLG  Y +S +DSS +   ++  AAF  DF
Sbjct: 86   IVAGGTMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATNALAAFHRDF 145

Query: 1368 ESRILLTYRKGFAAIGETKYTSDVNWGCMLRSSQMLVAQALVIQRLGRNWRKSFEKPLSK 1189
             SRIL+TYRKGF  I ++K TSDVNWGCMLRSSQMLVAQAL+  RLGR+WRK  +KPL +
Sbjct: 146  SSRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPVDKPLDR 205

Query: 1188 DYIEILHYFGDSEASAFSIHNLLQAGKL--LSPGSWVGPYAMCRTWETLAQSKLEETEPE 1015
            DY+EILH FGDSEASAFSIHNLLQAGK   L+ GSWVGPYAMCR+WE+LA+SK EET  E
Sbjct: 206  DYVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARSKREETNLE 265

Query: 1014 NQSLPMAMYVVCGDENGERGGAPVLCIEDASRHCREFSRGQADWTPIXXXXXXXXXXXXL 835
             Q+LPMA+YVV G E+GERGGAPVL IEDA+RHC EFS+G+ DWTPI            +
Sbjct: 266  YQTLPMAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKI 325

Query: 834  NPRYIPLLVATFTFPQSLGILGGRPGVSTYIVAVQDDNAFYLDPHEVKQVVDVARDNLEA 655
            NPRYIP L ATFTFPQSLGILGG+PG STYIV VQD+NAFYLDPHEV+ VV+ +RD++EA
Sbjct: 326  NPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVNFSRDDVEA 385

Query: 654  DTSSYHCNIIRQIPLDSIDPSLAIGFYCRDKDDFDNFCWRASELAAQSNGAPLLTVTQSR 475
            +TSSYHC+++R IPLD IDPSLAIGFYCRDKDDFD+FC  AS+LA +SNGAPL TV  S 
Sbjct: 386  NTSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNGAPLFTVANSY 445

Query: 474  NSTKPAGEYETSCDTIGVHXXXXXXXXXXXXXXNSGTQEDEWQLL 340
             S+K         D +GV                    ED+WQLL
Sbjct: 446  KSSKHDSSEVRDDDPLGV---------MTMNDAEGCLNEDDWQLL 481


>ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1|
            predicted protein [Populus trichocarpa]
          Length = 482

 Score =  506 bits (1303), Expect = e-141
 Identities = 250/405 (61%), Positives = 303/405 (74%), Gaps = 2/405 (0%)
 Frame = -3

Query: 1548 VIFGKKMRRFQERVLGLNRTGVSISVSEIWLLGVCYSLSNEDSSADPIQSHGFAAFVEDF 1369
            ++ G  MRR QE VLG ++TG+S +  +IWLLG CY +S ++SS D   ++  AAF  DF
Sbjct: 87   IVAGGSMRRIQECVLGTSKTGISNTTGDIWLLGACYKISQDNSSGDAAATNALAAFNHDF 146

Query: 1368 ESRILLTYRKGFAAIGETKYTSDVNWGCMLRSSQMLVAQALVIQRLGRNWRKSFEKPLSK 1189
             SRIL+TYRKGF AI ++K TSDV+WGCMLRSSQMLVAQAL+  RLGR+WRK  +KPL +
Sbjct: 147  SSRILITYRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQALLFHRLGRSWRKPLDKPLDR 206

Query: 1188 DYIEILHYFGDSEASAFSIHNLLQAGKL--LSPGSWVGPYAMCRTWETLAQSKLEETEPE 1015
            +Y+EILH FGDSE+SAFSIHNLL+AGK   L+ GSWVGPYA+C +WE+L +S+ EET  E
Sbjct: 207  EYVEILHLFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYAVCHSWESLVRSRREETNLE 266

Query: 1014 NQSLPMAMYVVCGDENGERGGAPVLCIEDASRHCREFSRGQADWTPIXXXXXXXXXXXXL 835
             QSL MA+YVV G E+GERGGAPVLCIE+A+RHC EFS+GQ DWTPI            +
Sbjct: 267  YQSLSMAVYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQEDWTPILLLVPLVLGLDKI 326

Query: 834  NPRYIPLLVATFTFPQSLGILGGRPGVSTYIVAVQDDNAFYLDPHEVKQVVDVARDNLEA 655
            NPRYIP L ATFTFPQSLGILGG+PG STYIV VQD+NAFYLDPHEV+ VV+V+RD++EA
Sbjct: 327  NPRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVNVSRDDVEA 386

Query: 654  DTSSYHCNIIRQIPLDSIDPSLAIGFYCRDKDDFDNFCWRASELAAQSNGAPLLTVTQSR 475
            +TSSYHCN++R +PLD IDPSLAIGFYCRDKDDFD+FC  AS+L  +SNGAPL TV  SR
Sbjct: 387  NTSSYHCNVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLASKLTDESNGAPLFTVAHSR 446

Query: 474  NSTKPAGEYETSCDTIGVHXXXXXXXXXXXXXXNSGTQEDEWQLL 340
               K       S D++GV                    ED+WQLL
Sbjct: 447  KLLKHDSGEVRSDDSLGV---------MTMNDVEGCVHEDDWQLL 482


>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
            gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B,
            putative [Ricinus communis]
          Length = 489

 Score =  503 bits (1294), Expect = e-140
 Identities = 253/408 (62%), Positives = 294/408 (72%), Gaps = 5/408 (1%)
 Frame = -3

Query: 1548 VIFGKKMRRFQERVLGLNRTGVSISVSEIWLLGVCYSLSNEDSSADPIQSHGFAAFVEDF 1369
            ++ G  MRR  ERVLG +RTG+S + S+IWLLGVCY +S ED S +    +  A F  D+
Sbjct: 83   IVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVCYKIS-EDESGNADTGNALAEFTHDY 141

Query: 1368 ESRILLTYRKGFAAIGETKYTSDVNWGCMLRSSQMLVAQALVIQRLGRNWRKSFEKPLSK 1189
             SRIL+TYR+GF AIG++KY SDV WGCMLRSSQMLVAQAL+  +LGR W K F+KP+ +
Sbjct: 142  SSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLVAQALLFHKLGRAWTKPFQKPMDQ 201

Query: 1188 DYIEILHYFGDSEASAFSIHNLLQAGKL--LSPGSWVGPYAMCRTWETLAQSKLEETEPE 1015
             Y+EILH FGDSEA+ FSIHNL+QAGK   L+ GSWVGPYAMCR+WE+LA+SK EE   E
Sbjct: 202  AYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWVGPYAMCRSWESLARSKREENSLE 261

Query: 1014 NQSLPMAMYVVCGDENGERGGAPVLCIEDASRHCREFSRGQADWTPIXXXXXXXXXXXXL 835
             QSLPMA+YVV GDE+GERGGAPV+ IEDASRHC EFSRGQADWTPI            +
Sbjct: 262  YQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEFSRGQADWTPILLLVPLVLGLDKV 321

Query: 834  NPRYIPLLVATFTFPQSLGILGGRPGVSTYIVAVQDDNAFYLDPHEVKQVVDVARDNLEA 655
            NPRYIP L ATFTF QSLGI+GG+PG STYIV VQDDNAFYLDPHEV+ VV++ RD++EA
Sbjct: 322  NPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQDDNAFYLDPHEVQSVVNIGRDDIEA 381

Query: 654  DTSSYHCNIIRQIPLDSIDPSLAIGFYCRDKDDFDNFCWRASELAAQSNGAPLLTVTQSR 475
            DTSSYH +I+R IPL SIDPSLAIGFYCRDKDDFD FC  AS+LA  S GAPL TV    
Sbjct: 382  DTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFDEFCLLASKLADDSQGAPLFTVAHCH 441

Query: 474  NSTKPAGE---YETSCDTIGVHXXXXXXXXXXXXXXNSGTQEDEWQLL 340
               KP           D +                   G QEDEWQLL
Sbjct: 442  KLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDAEGGGAQEDEWQLL 489


Top