BLASTX nr result

ID: Cephaelis21_contig00004167 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00004167
         (2146 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   416   e-137
ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glyc...   409   e-136
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   416   e-136
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   426   e-135
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   420   e-135

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  416 bits (1070), Expect(2) = e-137
 Identities = 216/376 (57%), Positives = 262/376 (69%), Gaps = 8/376 (2%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK FCE+A   S++S ++ + + N    S+ T+       L K   W     SAFS+F+ 
Sbjct: 1    MKGFCEKAVA-SKFSCKTKSDSSNSEPQSSDTK-------LSKVSLWSSVFASAFSVFET 52

Query: 1599 YSDPRG----KNKVSCPKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLL 1444
             S+       K  +   + +GWT  +R+++   SMRR     LG +KTG  SS SDIWLL
Sbjct: 53   NSESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLL 112

Query: 1443 GNCYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRS 1264
            G CYK++   S +  + S G A F +DFSSRIL+TYRKGF  IGDSK TSDVNWGCMLRS
Sbjct: 113  GLCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRS 172

Query: 1263 SQMLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLA 1084
            SQML+AQAL+ H +GRSWRKT  KPMDQ Y EILH FGDS+ S +SIHN+L AGK YGLA
Sbjct: 173  SQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLA 232

Query: 1083 PGSWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIR 904
             GSWVGPYAMCR+WETLAR KR++   E  S  +AIY+VSGDEDGERGGAPVV IE+  R
Sbjct: 233  AGSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASR 292

Query: 903  HFLEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIV 724
            H LE+S+GQ DW          LGL+K+NPRYIP L  TF FPQSLGILGG+PGASTYIV
Sbjct: 293  HCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIV 352

Query: 723  GLQDENAFFLDPHEVR 676
            G+QDE AF+LDPHE +
Sbjct: 353  GVQDEKAFYLDPHEAQ 368



 Score =  100 bits (249), Expect(2) = e-137
 Identities = 55/98 (56%), Positives = 65/98 (66%), Gaps = 4/98 (4%)
 Frame = -1

Query: 595 ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
           I+ H  L SIDPSLAIGFYCRDK DFDDFC RASKL DKSNGAPLFTV         ++ 
Sbjct: 389 IIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIHSLPKPISC 448

Query: 415 HDNINNNVGFPEPGSYDL----APEGEYEHGNNEDEWQ 314
            D +++  GF E  S+D+      EG YEH  +ED+WQ
Sbjct: 449 SDGMDDCSGFREDDSFDVVSNKGAEG-YEH-EHEDDWQ 484


>ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score =  409 bits (1050), Expect(2) = e-136
 Identities = 208/374 (55%), Positives = 260/374 (69%), Gaps = 5/374 (1%)
 Frame = -2

Query: 1782 VMKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFD 1603
            V+K  CE     S+ S +SST T +       ++AG S+    K+  W +   S FS+ +
Sbjct: 2    VLKGLCERIVS-SKCSSKSSTETVDNTQVPVYSKAGSSNSKFPKASLWSNIFTSGFSVVE 60

Query: 1602 KYSDPRGKNKVSC-PKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLLGN 1438
             YS+     K +   ++ GW A +R+++  GSMRRF    LG ++T   SS  DIWLLG 
Sbjct: 61   TYSESSASEKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGV 120

Query: 1437 CYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRSSQ 1258
            C+K++   S      S G ASF +DFSS+IL+TYRKGF  IGD+KYTSDV+WGCMLRSSQ
Sbjct: 121  CHKISQQESSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQ 180

Query: 1257 MLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLAPG 1078
            ML+AQAL+FH LGRSWRK +DKP D++Y ++L +FGDSE S +SIHNLL AGK YGLA G
Sbjct: 181  MLVAQALLFHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVG 240

Query: 1077 SWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIRHF 898
            SWVGPYAMCRTWE LAR+K   N   +L   +AIYVVSGDEDGERGGAPVVCIED  +  
Sbjct: 241  SWVGPYAMCRTWEVLARKK---NDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRC 297

Query: 897  LEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIVGL 718
             E+S G   W          LGLDK+NPRYIPLL  TF+FPQSLGI+GG+PGASTYI+G 
Sbjct: 298  FEFSSGLAAWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGA 357

Query: 717  QDENAFFLDPHEVR 676
            Q+E AF+LDPH+V+
Sbjct: 358  QNEKAFYLDPHDVQ 371



 Score =  107 bits (266), Expect(2) = e-136
 Identities = 55/97 (56%), Positives = 66/97 (68%), Gaps = 3/97 (3%)
 Frame = -1

Query: 595 ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
           I+ H PL SIDPSLAIGFYCRDK DFDDFCS+ASKL ++SNGAPLFTVT+ R  +  V  
Sbjct: 393 IMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGAPLFTVTQSRSFSKQVTS 452

Query: 415 HDNINNNVGFPE---PGSYDLAPEGEYEHGNNEDEWQ 314
           +D   +N GF E   PG      +   + G NED+WQ
Sbjct: 453 NDVSGDNTGFQEEDFPGM-----DRGNDTGTNEDDWQ 484


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  416 bits (1070), Expect(2) = e-136
 Identities = 216/376 (57%), Positives = 262/376 (69%), Gaps = 8/376 (2%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK FCE+A   S++S ++ + + N    S+ T+       L K   W     SAFS+F+ 
Sbjct: 1    MKGFCEKAVA-SKFSCKTKSDSSNSEPQSSDTK-------LSKVSLWSSVFASAFSVFET 52

Query: 1599 YSDPRG----KNKVSCPKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLL 1444
             S+       K  +   + +GWT  +R+++   SMRR     LG +KTG  SS SDIWLL
Sbjct: 53   NSESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLL 112

Query: 1443 GNCYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRS 1264
            G CYK++   S +  + S G A F +DFSSRIL+TYRKGF  IGDSK TSDVNWGCMLRS
Sbjct: 113  GLCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRS 172

Query: 1263 SQMLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLA 1084
            SQML+AQAL+ H +GRSWRKT  KPMDQ Y EILH FGDS+ S +SIHN+L AGK YGLA
Sbjct: 173  SQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLA 232

Query: 1083 PGSWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIR 904
             GSWVGPYAMCR+WETLAR KR++   E  S  +AIY+VSGDEDGERGGAPVV IE+  R
Sbjct: 233  AGSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASR 292

Query: 903  HFLEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIV 724
            H LE+S+GQ DW          LGL+K+NPRYIP L  TF FPQSLGILGG+PGASTYIV
Sbjct: 293  HCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIV 352

Query: 723  GLQDENAFFLDPHEVR 676
            G+QDE AF+LDPHE +
Sbjct: 353  GVQDEKAFYLDPHEAQ 368



 Score = 99.0 bits (245), Expect(2) = e-136
 Identities = 54/98 (55%), Positives = 65/98 (66%), Gaps = 4/98 (4%)
 Frame = -1

Query: 595 ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
           I+ H  L SIDPSLAIGFYCRDK DFDDFC RASKL D+SNGAPLFTV         ++ 
Sbjct: 392 IIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTVAHIHSLPKPISC 451

Query: 415 HDNINNNVGFPEPGSYDL----APEGEYEHGNNEDEWQ 314
            D +++  GF E  S+D+      EG YEH  +ED+WQ
Sbjct: 452 SDGMDDCSGFREDDSFDVVSNKGAEG-YEH-EHEDDWQ 487


>ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|222873995|gb|EEF11126.1|
            predicted protein [Populus trichocarpa]
          Length = 482

 Score =  426 bits (1096), Expect(2) = e-135
 Identities = 217/376 (57%), Positives = 264/376 (70%), Gaps = 7/376 (1%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK F E     S  S  S+  +PNR  TS S+E G +     K   W  F  SAFS+FD 
Sbjct: 1    MKGFRERGFVASSKS-SSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDT 59

Query: 1599 YSDPRGKNKVSCPKT---HGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLLG 1441
            + D    ++   P     +GWT+ +++++  GSMRR     LG +KTG  ++  DIWLLG
Sbjct: 60   HCDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLG 119

Query: 1440 NCYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRSS 1261
             CYK++  +S  D   +   A+F  DFSSRILITYRKGF  I DSK TSDV+WGCMLRSS
Sbjct: 120  ACYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSS 179

Query: 1260 QMLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLAP 1081
            QML+AQAL+FH LGRSWRK LDKP+D++Y EILH+FGDSE S +SIHNLL AGK YGLA 
Sbjct: 180  QMLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAA 239

Query: 1080 GSWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIRH 901
            GSWVGPYA+C +WE+L R +R++   E  S  +A+YVVSG EDGERGGAPV+CIE+  RH
Sbjct: 240  GSWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARH 299

Query: 900  FLEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIVG 721
              E+S+GQ DW          LGLDKINPRYIP L  TF FPQSLGILGG+PGASTYIVG
Sbjct: 300  CSEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVG 359

Query: 720  LQDENAFFLDPHEVRP 673
            +QDENAF+LDPHEV+P
Sbjct: 360  VQDENAFYLDPHEVQP 375



 Score = 85.9 bits (211), Expect(2) = e-135
 Identities = 45/94 (47%), Positives = 58/94 (61%)
 Frame = -1

Query: 595 ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
           ++ H PL  IDPSLAIGFYCRDK DFDDFC+ ASKL D+SNGAPLFTV   RK     + 
Sbjct: 395 VVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLASKLTDESNGAPLFTVAHSRKLLKHDSG 454

Query: 415 HDNINNNVGFPEPGSYDLAPEGEYEHGNNEDEWQ 314
               ++++G        +    + E   +ED+WQ
Sbjct: 455 EVRSDDSLG--------VMTMNDVEGCVHEDDWQ 480


>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
            gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B,
            putative [Ricinus communis]
          Length = 489

 Score =  420 bits (1080), Expect(2) = e-135
 Identities = 218/374 (58%), Positives = 264/374 (70%), Gaps = 6/374 (1%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK F E     S  S +    TPNR  TS   E+G  S+   K   W  F  SAFS+F+ 
Sbjct: 1    MKGFRERV--ASRCSSKCPVDTPNRSLTSDCLESG--SNFSTKGSLWSSFFASAFSVFET 56

Query: 1599 Y--SDPRGKNKVSCPKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLLGN 1438
            Y  S P  + K S  + +GWT+ ++++++ GSMRR     LG ++TG  S+ SDIWLLG 
Sbjct: 57   YRESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGV 116

Query: 1437 CYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRSSQ 1258
            CYK+++  S +  T     A F  D+SSRIL+TYR+GF  IGDSKY SDV WGCMLRSSQ
Sbjct: 117  CYKISEDESGNADT-GNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQ 175

Query: 1257 MLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLAPG 1078
            ML+AQAL+FH LGR+W K   KPMDQ Y EILH+FGDSE +P+SIHNL+ AGK Y LA G
Sbjct: 176  MLVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAG 235

Query: 1077 SWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIRHF 898
            SWVGPYAMCR+WE+LAR KR++N  E  S  +A+YVVSGDEDGERGGAPVV IED  RH 
Sbjct: 236  SWVGPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHC 295

Query: 897  LEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIVGL 718
            LE+SRGQ DW          LGLDK+NPRYIP L  TF F QSLGI+GG+PGASTYIVG+
Sbjct: 296  LEFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGV 355

Query: 717  QDENAFFLDPHEVR 676
            QD+NAF+LDPHEV+
Sbjct: 356  QDDNAFYLDPHEVQ 369



 Score = 90.1 bits (222), Expect(2) = e-135
 Identities = 52/98 (53%), Positives = 60/98 (61%), Gaps = 4/98 (4%)
 Frame = -1

Query: 595 ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
           I+ H PL SIDPSLAIGFYCRDK DFD+FC  ASKL D S GAPLFTV    K    V+H
Sbjct: 390 IVRHIPLHSIDPSLAIGFYCRDKDDFDEFCLLASKLADDSQGAPLFTVAHCHKLPKPVSH 449

Query: 415 HDNINN-NVGFPEPGSYD-LAPEGEYEHGN--NEDEWQ 314
            D +NN +    E  S + + P  +   G    EDEWQ
Sbjct: 450 GDMLNNEDDEVQEDDSVNVMMPVNDDAEGGGAQEDEWQ 487


Top