BLASTX nr result
ID: Cephaelis21_contig00004167
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00004167
         (2146 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   416   e-137
ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glyc...   409   e-136
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   416   e-136
ref|XP_002331599.1| predicted protein [Populus trichocarpa] gi|2...   426   e-135
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   420   e-135

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
 gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera]
          Length = 486

 Score = 416 bits (1070), Expect(2) = e-137
 Identities = 216/376 (57%), Positives = 262/376 (69%), Gaps = 8/376 (2%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK FCE+A S++S ++ + + N S+ T+ L K W SAFS+F+
Sbjct: 1    MKGFCEKAVA-SKFSCKTKSDSSNSEPQSSDTK-------LSKVSLWSSVFASAFSVFET 52

Query: 1599 YSDPRG----KNKVSCPKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLL 1444
            S+ K + + +GWT +R+++ SMRR LG +KTG SS SDIWLL
Sbjct: 53   NSESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLL 112

Query: 1443 GNCYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRS 1264
            G CYK++ S + + S G A F +DFSSRIL+TYRKGF IGDSK TSDVNWGCMLRS
Sbjct: 113  GLCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRS 172

Query: 1263 SQMLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLA 1084
            SQML+AQAL+ H +GRSWRKT KPMDQ Y EILH FGDS+ S +SIHN+L AGK YGLA
Sbjct: 173  SQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLA 232

Query: 1083 PGSWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIR 904
            GSWVGPYAMCR+WETLAR KR++ E S +AIY+VSGDEDGERGGAPVV IE+ R
Sbjct: 233  AGSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASR 292

Query: 903  HFLEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIV 724
            H LE+S+GQ DW LGL+K+NPRYIP L TF FPQSLGILGG+PGASTYIV
Sbjct: 293  HCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIV 352

Query: 723  GLQDENAFFLDPHEVR 676
            G+QDE AF+LDPHE +
Sbjct: 353  GVQDEKAFYLDPHEAQ 368

 Score = 100 bits (249), Expect(2) = e-137
 Identities = 55/98 (56%), Positives = 65/98 (66%), Gaps = 4/98 (4%)
 Frame = -1

Query: 595  ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
            I+ H L SIDPSLAIGFYCRDK DFDDFC RASKL DKSNGAPLFTV ++
Sbjct: 389  IIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIHSLPKPISC 448

Query: 415  HDNINNNVGFPEPGSYDL----APEGEYEHGNNEDEWQ 314
            D +++ GF E S+D+ EG YEH +ED+WQ
Sbjct: 449  SDGMDDCSGFREDDSFDVVSNKGAEG-YEH-EHEDDWQ 484

>ref|XP_003534478.1| PREDICTED: cysteine protease ATG4-like [Glycine max]
          Length = 486

 Score = 409 bits (1050), Expect(2) = e-136
 Identities = 208/374 (55%), Positives = 260/374 (69%), Gaps = 5/374 (1%)
 Frame = -2

Query: 1782 VMKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFD 1603
            V+K CE S+ S +SST T + ++AG S+ K+ W + S FS+ +
Sbjct: 2    VLKGLCERIVS-SKCSSKSSTETVDNTQVPVYSKAGSSNSKFPKASLWSNIFTSGFSVVE 60

Query: 1602 KYSDPRGKNKVSC-PKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLLGN 1438
            YS+ K + ++ GW A +R+++ GSMRRF LG ++T SS DIWLLG
Sbjct: 61   TYSESSASEKKAVHSRSSGWAAAVRKVVTGGSMRRFQERVLGSSRTDISSSDGDIWLLGV 120

Query: 1437 CYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRSSQ 1258
            C+K++ S S G ASF +DFSS+IL+TYRKGF IGD+KYTSDV+WGCMLRSSQ
Sbjct: 121  CHKISQQESSGGVDNSNGLASFEQDFSSKILVTYRKGFDAIGDTKYTSDVHWGCMLRSSQ 180

Query: 1257 MLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLAPG 1078
            ML+AQAL+FH LGRSWRK +DKP D++Y ++L +FGDSE S +SIHNLL AGK YGLA G
Sbjct: 181  MLVAQALLFHKLGRSWRKPIDKPPDKEYIDVLQLFGDSEASAFSIHNLLQAGKGYGLAVG 240

Query: 1077 SWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIRHF 898
            SWVGPYAMCRTWE LAR+K N +L +AIYVVSGDEDGERGGAPVVCIED +
Sbjct: 241  SWVGPYAMCRTWEVLARKK---NDLGELPLPMAIYVVSGDEDGERGGAPVVCIEDASKRC 297

Query: 897  LEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIVGL 718
            E+S G W LGLDK+NPRYIPLL TF+FPQSLGI+GG+PGASTYI+G
Sbjct: 298  FEFSSGLAAWTPLLLLVPLVLGLDKVNPRYIPLLRSTFKFPQSLGIMGGKPGASTYIIGA 357

Query: 717  QDENAFFLDPHEVR 676
            Q+E AF+LDPH+V+
Sbjct: 358  QNEKAFYLDPHDVQ 371

 Score = 107 bits (266), Expect(2) = e-136
 Identities = 55/97 (56%), Positives = 66/97 (68%), Gaps = 3/97 (3%)
 Frame = -1

Query: 595  ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
            I+ H PL SIDPSLAIGFYCRDK DFDDFCS+ASKL ++SNGAPLFTVT+ R + V
Sbjct: 393  IMRHIPLDSIDPSLAIGFYCRDKDDFDDFCSQASKLAEESNGAPLFTVTQSRSFSKQVTS 452

Query: 415  HDNINNNVGFPE---PGSYDLAPEGEYEHGNNEDEWQ 314
            +D +N GF E PG + + G NED+WQ
Sbjct: 453  NDVSGDNTGFQEEDFPGM-----DRGNDTGTNEDDWQ 484

>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score = 416 bits (1070), Expect(2) = e-136
 Identities = 216/376 (57%), Positives = 262/376 (69%), Gaps = 8/376 (2%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK FCE+A S++S ++ + + N S+ T+ L K W SAFS+F+
Sbjct: 1    MKGFCEKAVA-SKFSCKTKSDSSNSEPQSSDTK-------LSKVSLWSSVFASAFSVFET 52

Query: 1599 YSDPRG----KNKVSCPKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLL 1444
            S+ K + + +GWT +R+++ SMRR LG +KTG SS SDIWLL
Sbjct: 53   NSESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLL 112

Query: 1443 GNCYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRS 1264
            G CYK++ S + + S G A F +DFSSRIL+TYRKGF IGDSK TSDVNWGCMLRS
Sbjct: 113  GLCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRS 172

Query: 1263 SQMLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLA 1084
            SQML+AQAL+ H +GRSWRKT KPMDQ Y EILH FGDS+ S +SIHN+L AGK YGLA
Sbjct: 173  SQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLA 232

Query: 1083 PGSWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIR 904
            GSWVGPYAMCR+WETLAR KR++ E S +AIY+VSGDEDGERGGAPVV IE+ R
Sbjct: 233  AGSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVYIEEASR 292

Query: 903  HFLEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIV 724
            H LE+S+GQ DW LGL+K+NPRYIP L TF FPQSLGILGG+PGASTYIV
Sbjct: 293  HCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIV 352

Query: 723  GLQDENAFFLDPHEVR 676
            G+QDE AF+LDPHE +
Sbjct: 353  GVQDEKAFYLDPHEAQ 368

 Score = 99.0 bits (245), Expect(2) = e-136
 Identities = 54/98 (55%), Positives = 65/98 (66%), Gaps = 4/98 (4%)
 Frame = -1

Query: 595  ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
            I+ H L SIDPSLAIGFYCRDK DFDDFC RASKL D+SNGAPLFTV ++
Sbjct: 392  IIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTVAHIHSLPKPISC 451

Query: 415  HDNINNNVGFPEPGSYDL----APEGEYEHGNNEDEWQ 314
            D +++ GF E S+D+ EG YEH +ED+WQ
Sbjct: 452  SDGMDDCSGFREDDSFDVVSNKGAEG-YEH-EHEDDWQ 487

>ref|XP_002331599.1| predicted protein [Populus trichocarpa]
 gi|222873995|gb|EEF11126.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score = 426 bits (1096), Expect(2) = e-135
 Identities = 217/376 (57%), Positives = 264/376 (70%), Gaps = 7/376 (1%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK F E S S S+ +PNR TS S+E G + K W F SAFS+FD
Sbjct: 1    MKGFRERGFVASSKS-SSTAESPNRSFTSDSSELGSADTKFSKPSLWSTFFASAFSVFDT 59

Query: 1599 YSDPRGKNKVSCPKT---HGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLLG 1441
            + D ++ P +GWT+ +++++ GSMRR LG +KTG ++ DIWLLG
Sbjct: 60   HCDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSMRRIQECVLGTSKTGISNTTGDIWLLG 119

Query: 1440 NCYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRSS 1261
            CYK++ +S D + A+F DFSSRILITYRKGF I DSK TSDV+WGCMLRSS
Sbjct: 120  ACYKISQDNSSGDAAATNALAAFNHDFSSRILITYRKGFDAIEDSKLTSDVSWGCMLRSS 179

Query: 1260 QMLIAQALVFHHLGRSWRKTLDKPMDQKYF
            EILH+FGDSE S +SIHNLL AGK YGLA
Sbjct: 180  QMLVAQALLFHRLGRSWRKPLDKPLDREYVEILHLFGDSESSAFSIHNLLRAGKAYGLAA 239

Query: 1080 GSWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIRH 901
            GSWVGPYA+C +WE+L R +R++ E S +A+YVVSG EDGERGGAPV+CIE+ RH
Sbjct: 240  GSWVGPYAVCHSWESLVRSRREETNLEYQSLSMAVYVVSGSEDGERGGAPVLCIEEAARH 299

Query: 900  FLEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIVG 721
            E+S+GQ DW LGLDKINPRYIP L TF FPQSLGILGG+PGASTYIVG
Sbjct: 300  CSEFSKGQEDWTPILLLVPLVLGLDKINPRYIPSLQATFTFPQSLGILGGKPGASTYIVG 359

Query: 720  LQDENAFFLDPHEVRP 673
            +QDENAF+LDPHEV+P
Sbjct: 360  VQDENAFYLDPHEVQP 375

 Score = 85.9 bits (211), Expect(2) = e-135
 Identities = 45/94 (47%), Positives = 58/94 (61%)
 Frame = -1

Query: 595  ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
            ++ H PL IDPSLAIGFYCRDK DFDDFC+ ASKL D+SNGAPLFTV RK +
Sbjct: 395  VVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLASKLTDESNGAPLFTVAHSRKLLKHDSG 454

Query: 415  HDNINNNVGFPEPGSYDLAPEGEYEHGNNEDEWQ 314
            ++++G + + E +ED+WQ
Sbjct: 455  EVRSDDSLG--------VMTMNDVEGCVHEDDWQ 480

>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
 gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B, putative [Ricinus communis]
          Length = 489

 Score = 420 bits (1080), Expect(2) = e-135
 Identities = 218/374 (58%), Positives = 264/374 (70%), Gaps = 6/374 (1%)
 Frame = -2

Query: 1779 MKAFCEEAEGCSEYSWRSSTGTPNRLATSASTEAGQSSHNLKKSPAWLDFVVSAFSIFDK 1600
            MK F E S S + TPNR TS E+G S+ K W F SAFS+F+
Sbjct: 1    MKGFRERV--ASRCSSKCPVDTPNRSLTSDCLESG--SNFSTKGSLWSSFFASAFSVFET 56

Query: 1599 Y--SDPRGKNKVSCPKTHGWTAHLRRMMNSGSMRRF----LGLNKTGSCSSISDIWLLGN 1438
            Y S P + K S + +GWT+ ++++++ GSMRR LG ++TG S+ SDIWLLG
Sbjct: 57   YRESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGV 116

Query: 1437 CYKVADGSSLSDPTQSEGFASFVEDFSSRILITYRKGFAPIGDSKYTSDVNWGCMLRSSQ 1258
            CYK+++ S + T A F D+SSRIL+TYR+GF IGDSKY SDV WGCMLRSSQ
Sbjct: 117  CYKISEDESGNADT-GNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQ 175

Query: 1257 MLIAQALVFHHLGRSWRKTLDKPMDQKYFEILHIFGDSELSPYSIHNLLDAGKTYGLAPG 1078
            ML+AQAL+FH LGR+W K KPMDQ Y EILH+FGDSE +P+SIHNL+ AGK Y LA G
Sbjct: 176  MLVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAG 235

Query: 1077 SWVGPYAMCRTWETLARRKRKDNVDEDLSSMIAIYVVSGDEDGERGGAPVVCIEDIIRHF 898
            SWVGPYAMCR+WE+LAR KR++N E S +A+YVVSGDEDGERGGAPVV IED RH
Sbjct: 236  SWVGPYAMCRSWESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHC 295

Query: 897  LEYSRGQGDWMXXXXXXXXXLGLDKINPRYIPLLGDTFQFPQSLGILGGRPGASTYIVGL 718
            LE+SRGQ DW LGLDK+NPRYIP L TF F QSLGI+GG+PGASTYIVG+
Sbjct: 296  LEFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGV 355

Query: 717  QDENAFFLDPHEVR 676
            QD+NAF+LDPHEV+
Sbjct: 356  QDDNAFYLDPHEVQ 369

 Score = 90.1 bits (222), Expect(2) = e-135
 Identities = 52/98 (53%), Positives = 60/98 (61%), Gaps = 4/98 (4%)
 Frame = -1

Query: 595  ILLHFPLGSIDPSLAIGFYCRDKCDFDDFCSRASKLVDKSNGAPLFTVTEKRKSTYSVNH 416
            I+ H PL SIDPSLAIGFYCRDK DFD+FC ASKL D S GAPLFTV K V+H
Sbjct: 390  IVRHIPLHSIDPSLAIGFYCRDKDDFDEFCLLASKLADDSQGAPLFTVAHCHKLPKPVSH 449

Query: 415  HDNINN-NVGFPEPGSYD-LAPEGEYEHGN--NEDEWQ 314
            D +NN + E S + + P + G EDEWQ
Sbjct: 450  GDMLNNEDDEVQEDDSVNVMMPVNDDAEGGGAQEDEWQ 487