BLASTX nr result

ID: Papaver29_contig00035735 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver29_contig00035735
         (1042 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010277861.1| PREDICTED: cysteine protease ATG4-like [Nelu...   409   e-111
ref|XP_008810459.1| PREDICTED: cysteine protease ATG4B-like isof...   399   e-108
gb|KDO40319.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   399   e-108
gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   399   e-108
ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citr...   399   e-108
ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isofo...   398   e-108
ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1...   398   e-108
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2...   398   e-108
ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theo...   398   e-108
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   398   e-108
ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isofo...   397   e-108
ref|XP_010934835.1| PREDICTED: cysteine protease ATG4B-like isof...   392   e-106
ref|XP_010934834.1| PREDICTED: cysteine protease ATG4B-like isof...   392   e-106
ref|XP_010934833.1| PREDICTED: cysteine protease ATG4B-like isof...   392   e-106
ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1...   390   e-106
gb|KJB46920.1| hypothetical protein B456_008G001300 [Gossypium r...   390   e-106
gb|KJB46918.1| hypothetical protein B456_008G001300 [Gossypium r...   390   e-106
ref|XP_012435802.1| PREDICTED: cysteine protease ATG4 isoform X3...   390   e-106
ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2...   390   e-106
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   390   e-106

>ref|XP_010277861.1| PREDICTED: cysteine protease ATG4-like [Nelumbo nucifera]
           gi|720070813|ref|XP_010277863.1| PREDICTED: cysteine
           protease ATG4-like [Nelumbo nucifera]
           gi|720070816|ref|XP_010277864.1| PREDICTED: cysteine
           protease ATG4-like [Nelumbo nucifera]
           gi|720070819|ref|XP_010277865.1| PREDICTED: cysteine
           protease ATG4-like [Nelumbo nucifera]
          Length = 490

 Score =  409 bits (1051), Expect = e-111
 Identities = 209/318 (65%), Positives = 240/318 (75%)
 Frame = -3

Query: 959 DTTKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRAL 780
           DTT S KG             FE YS++S   K + + K+  GW T +K++++  SMR L
Sbjct: 37  DTTSS-KGSLWSSLFTSPSSLFETYSESSISVKKTFNSKTY-GWTTALKKVLTVGSMRRL 94

Query: 779 RERLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRK 600
           +ER++GPS TGVSSSTSEIW LGVCYK+S ED S +  +GNGLV F +DFSSRIWMTYRK
Sbjct: 95  QERILGPSKTGVSSSTSEIWLLGVCYKVSEEDSSGNLVNGNGLVAFTEDFSSRIWMTYRK 154

Query: 599 GFSAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFG 420
           GF  I DSK TSDV+WGCM RSSQMLVAQALL H LGRSWRKP + P+DP YI+IL  FG
Sbjct: 155 GFDVI-DSKFTSDVNWGCMLRSSQMLVAQALLVHCLGRSWRKPLQPPFDPEYIEILHLFG 213

Query: 419 DSDSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAV 240
           DS++SAFSIHNL+QAGKAYGLAAGSW+GPYAMCRSWETL R  RE  + E E+ SL MAV
Sbjct: 214 DSEASAFSIHNLLQAGKAYGLAAGSWIGPYAMCRSWETLVRSKREQTNLEIENQSLSMAV 273

Query: 239 HIVSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSL 60
           +IVSGDEDGERGGAPV+C+ED AR CSEF  GQ DW+PI         LEKVN RYIP L
Sbjct: 274 YIVSGDEDGERGGAPVVCVEDVARLCSEFSKGQVDWAPILLLVPLVLGLEKVNPRYIPLL 333

Query: 59  RVTFTFPQSLGILGGKPG 6
             TFTFPQSLGILGGK G
Sbjct: 334 WATFTFPQSLGILGGKSG 351


>ref|XP_008810459.1| PREDICTED: cysteine protease ATG4B-like isoform X1 [Phoenix
           dactylifera]
          Length = 494

 Score =  399 bits (1024), Expect = e-108
 Identities = 192/290 (66%), Positives = 225/290 (77%)
 Frame = -3

Query: 875 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 696
           S  G+       + GW T +K++V+G SMR L+ERL+G S+T   S TSEIW LG+ YK+
Sbjct: 63  SHSGEKKATKSRSYGWTTAVKKVVAGGSMRRLQERLLGTSSTDALSLTSEIWLLGMRYKL 122

Query: 695 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 516
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIG SKLTSDV WGCM RSSQMLVA
Sbjct: 123 SPEESSGGADHGNGSAAFLEDFSSRIWITYRKGFDAIGYSKLTSDVRWGCMIRSSQMLVA 182

Query: 515 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 336
           QALLFHHLGR WRKP +KP+DP YI+IL  FGDS++ AFS+HNL++AGK YGLAAGSW+G
Sbjct: 183 QALLFHHLGRYWRKPSQKPHDPKYIEILHLFGDSEACAFSLHNLLEAGKGYGLAAGSWLG 242

Query: 335 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 156
           PYAMCR+WETLAR  RE AD ++E  SLPMAV++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 243 PYAMCRTWETLARAKREQADLDREKESLPMAVYVVSGDEDGERGGAPVVCIDVAARLCSD 302

Query: 155 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPG 6
           F  GQ  W PI         LEKVN RYIP L  TFTFPQSLGILGGKPG
Sbjct: 303 FSKGQISWVPILLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPG 352


>gb|KDO40319.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis]
          Length = 392

 Score =  399 bits (1024), Expect = e-108
 Identities = 199/317 (62%), Positives = 237/317 (74%)
 Frame = -3

Query: 953 TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 774
           +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 414
             IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 234
           ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis]
           gi|641820321|gb|KDO40317.1| hypothetical protein
           CISIN_1g011418mg [Citrus sinensis]
          Length = 486

 Score =  399 bits (1024), Expect = e-108
 Identities = 199/317 (62%), Positives = 237/317 (74%)
 Frame = -3

Query: 953 TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 774
           +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 414
             IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 234
           ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citrus clementina]
           gi|557544235|gb|ESR55213.1| hypothetical protein
           CICLE_v10019906mg [Citrus clementina]
          Length = 486

 Score =  399 bits (1024), Expect = e-108
 Identities = 199/317 (62%), Positives = 237/317 (74%)
 Frame = -3

Query: 953 TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 774
           +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 414
             IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 234
           ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Citrus sinensis]
          Length = 486

 Score =  398 bits (1023), Expect = e-108
 Identities = 199/317 (62%), Positives = 237/317 (74%)
 Frame = -3

Query: 953 TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 774
           +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSANEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 414
             IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 234
           ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1 [Vitis vinifera]
          Length = 489

 Score =  398 bits (1022), Expect = e-108
 Identities = 198/294 (67%), Positives = 228/294 (77%)
 Frame = -3

Query: 884 SDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 705
           S  S   K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 704 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQM 525
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AIGDSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 524 LVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGS 345
           LVAQALL H +GRSWRK   KP D  YI+IL  FGDS +SAFSIHN++QAGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 344 WVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 165
           WVGPYAMCRSWETLAR  RE  D E +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETDLECQ--SLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 164 CSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           C EF  GQ DW+PI         LEKVN RYIPSL  TFTFPQSLGILGGKPGA
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGA 347


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2 [Vitis vinifera]
           gi|296086874|emb|CBI33041.3| unnamed protein product
           [Vitis vinifera]
          Length = 486

 Score =  398 bits (1022), Expect = e-108
 Identities = 198/294 (67%), Positives = 228/294 (77%)
 Frame = -3

Query: 884 SDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 705
           S  S   K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 704 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQM 525
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AIGDSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 524 LVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGS 345
           LVAQALL H +GRSWRK   KP D  YI+IL  FGDS +SAFSIHN++QAGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 344 WVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 165
           WVGPYAMCRSWETLAR  RE  D E +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETDLECQ--SLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 164 CSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           C EF  GQ DW+PI         LEKVN RYIPSL  TFTFPQSLGILGGKPGA
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGA 347


>ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theobroma cacao]
           gi|508702178|gb|EOX94074.1| Peptidase family C54 protein
           isoform 3 [Theobroma cacao]
          Length = 486

 Score =  398 bits (1022), Expect = e-108
 Identities = 196/298 (65%), Positives = 231/298 (77%), Gaps = 1/298 (0%)
 Frame = -3

Query: 893 EAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFL 714
           + YS++S   K +   ++ NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW L
Sbjct: 57  DTYSESSACEKKALHARN-NGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLL 115

Query: 713 GVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRS 534
           GVCYKIS    S D    NGL  F  DFSSRI MTYRKGF AIGD+K+TSD  WGCM RS
Sbjct: 116 GVCYKISQVSSSGDVDASNGLAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRS 175

Query: 533 SQMLVA-QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGL 357
           SQMLVA QALLFH LGRSWRKP +KP++  YI+IL QFGDS+++AFSIHNL++AGK YGL
Sbjct: 176 SQMLVAQQALLFHQLGRSWRKPLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGL 235

Query: 356 AAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIED 177
           AAGSWVGPYAMCRSWE+LAR  RE  D   EH SLPMAV++VSGDEDGERGGAPV+C+ED
Sbjct: 236 AAGSWVGPYAMCRSWESLARFKREEND--LEHQSLPMAVYVVSGDEDGERGGAPVVCVED 293

Query: 176 AARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           A+R C EF   +ADW+PI         L+KVN+RYIPSL+ TFTFPQ LGILGGKPGA
Sbjct: 294 ASRHCFEFSRCRADWTPILLLVPLVLGLDKVNSRYIPSLQATFTFPQCLGILGGKPGA 351


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  398 bits (1022), Expect = e-108
 Identities = 198/294 (67%), Positives = 228/294 (77%)
 Frame = -3

Query: 884 SDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 705
           S  S   K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 704 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQM 525
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AIGDSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 524 LVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGS 345
           LVAQALL H +GRSWRK   KP D  YI+IL  FGDS +SAFSIHN++QAGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 344 WVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 165
           WVGPYAMCRSWETLAR  RE  D E +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETDLECQ--SLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 164 CSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           C EF  GQ DW+PI         LEKVN RYIPSL  TFTFPQSLGILGGKPGA
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGA 347


>ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Jatropha curcas]
           gi|802675786|ref|XP_012081890.1| PREDICTED: cysteine
           protease ATG4-like isoform X1 [Jatropha curcas]
           gi|802675813|ref|XP_012081891.1| PREDICTED: cysteine
           protease ATG4-like isoform X1 [Jatropha curcas]
           gi|643718243|gb|KDP29532.1| hypothetical protein
           JCGZ_19245 [Jatropha curcas]
          Length = 492

 Score =  397 bits (1021), Expect = e-108
 Identities = 193/279 (69%), Positives = 224/279 (80%), Gaps = 1/279 (0%)
 Frame = -3

Query: 836 NGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDS-THG 660
           NGW + +K+IV+G SMR + ER++GPS TG+S++TSEIW LGVCYKIS +  ++D+ T  
Sbjct: 79  NGWTSAVKKIVAGGSMRRIHERVLGPSRTGISNTTSEIWLLGVCYKISQDGSNADAATSN 138

Query: 659 NGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSW 480
           NGL +F  DFSSRI MTYRKGF AIGDSK TSDV WGCM RSSQMLVAQALLFH LGRSW
Sbjct: 139 NGLADFTHDFSSRILMTYRKGFDAIGDSKFTSDVGWGCMLRSSQMLVAQALLFHQLGRSW 198

Query: 479 RKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLA 300
           RKP +KP D  Y++IL  FGDS++S FSIHNLI AGKAYGLAAGSWVGPYAMCRSWE LA
Sbjct: 199 RKPIQKPLDQKYVEILHLFGDSEASPFSIHNLIHAGKAYGLAAGSWVGPYAMCRSWELLA 258

Query: 299 RLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIX 120
           R  RE  +   EH +LPMAV++VSGDEDGERGGAPV+CIEDA+R C +F  GQA+W+PI 
Sbjct: 259 RCKRE--ENNLEHEALPMAVYVVSGDEDGERGGAPVVCIEDASRHCLDFSRGQANWTPIL 316

Query: 119 XXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
                   LEKVN RYIPSL+ T TFPQSLGI+GGKPGA
Sbjct: 317 LLVPLVLGLEKVNLRYIPSLQATLTFPQSLGIMGGKPGA 355


>ref|XP_010934835.1| PREDICTED: cysteine protease ATG4B-like isoform X3 [Elaeis
           guineensis] gi|743831925|ref|XP_010934836.1| PREDICTED:
           cysteine protease ATG4B-like isoform X3 [Elaeis
           guineensis]
          Length = 396

 Score =  392 bits (1008), Expect = e-106
 Identities = 190/291 (65%), Positives = 223/291 (76%)
 Frame = -3

Query: 875 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 696
           S  G+       + GW T +K++V+G SMR L+E L+G S+T   SSTS+IW LG CYK+
Sbjct: 63  SHSGEKKPTKSRSYGWTTAVKKVVTGGSMRRLQE-LLGTSSTDALSSTSDIWLLGKCYKL 121

Query: 695 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 516
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIGDSK TSDV WGCM RSSQMLVA
Sbjct: 122 SPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVA 181

Query: 515 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 336
           QALLFHHLGRSWRKP +KP+D  YI+IL  FGDS++ AFSIHNL++AGKAYGLAA  WVG
Sbjct: 182 QALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVG 241

Query: 335 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 156
           PYAMCR+WET+ R  RE AD +KE   LPM V++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 242 PYAMCRTWETITRAKREQADLDKEKERLPMVVYVVSGDEDGERGGAPVVCIDVAARLCSD 301

Query: 155 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           F  GQ  W+P+         LEKVN RYIP L  TFTFPQSLGILGGKPGA
Sbjct: 302 FTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPGA 352


>ref|XP_010934834.1| PREDICTED: cysteine protease ATG4B-like isoform X2 [Elaeis
           guineensis]
          Length = 484

 Score =  392 bits (1008), Expect = e-106
 Identities = 190/291 (65%), Positives = 223/291 (76%)
 Frame = -3

Query: 875 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 696
           S  G+       + GW T +K++V+G SMR L+E L+G S+T   SSTS+IW LG CYK+
Sbjct: 63  SHSGEKKPTKSRSYGWTTAVKKVVTGGSMRRLQE-LLGTSSTDALSSTSDIWLLGKCYKL 121

Query: 695 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 516
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIGDSK TSDV WGCM RSSQMLVA
Sbjct: 122 SPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVA 181

Query: 515 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 336
           QALLFHHLGRSWRKP +KP+D  YI+IL  FGDS++ AFSIHNL++AGKAYGLAA  WVG
Sbjct: 182 QALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVG 241

Query: 335 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 156
           PYAMCR+WET+ R  RE AD +KE   LPM V++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 242 PYAMCRTWETITRAKREQADLDKEKERLPMVVYVVSGDEDGERGGAPVVCIDVAARLCSD 301

Query: 155 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           F  GQ  W+P+         LEKVN RYIP L  TFTFPQSLGILGGKPGA
Sbjct: 302 FTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPGA 352


>ref|XP_010934833.1| PREDICTED: cysteine protease ATG4B-like isoform X1 [Elaeis
           guineensis]
          Length = 488

 Score =  392 bits (1008), Expect = e-106
 Identities = 190/291 (65%), Positives = 223/291 (76%)
 Frame = -3

Query: 875 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 696
           S  G+       + GW T +K++V+G SMR L+E L+G S+T   SSTS+IW LG CYK+
Sbjct: 63  SHSGEKKPTKSRSYGWTTAVKKVVTGGSMRRLQE-LLGTSSTDALSSTSDIWLLGKCYKL 121

Query: 695 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 516
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIGDSK TSDV WGCM RSSQMLVA
Sbjct: 122 SPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVA 181

Query: 515 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 336
           QALLFHHLGRSWRKP +KP+D  YI+IL  FGDS++ AFSIHNL++AGKAYGLAA  WVG
Sbjct: 182 QALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVG 241

Query: 335 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 156
           PYAMCR+WET+ R  RE AD +KE   LPM V++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 242 PYAMCRTWETITRAKREQADLDKEKERLPMVVYVVSGDEDGERGGAPVVCIDVAARLCSD 301

Query: 155 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           F  GQ  W+P+         LEKVN RYIP L  TFTFPQSLGILGGKPGA
Sbjct: 302 FTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPGA 352


>ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1 [Gossypium raimondii]
           gi|823202390|ref|XP_012435799.1| PREDICTED: cysteine
           protease ATG4 isoform X1 [Gossypium raimondii]
           gi|823202393|ref|XP_012435800.1| PREDICTED: cysteine
           protease ATG4 isoform X1 [Gossypium raimondii]
          Length = 495

 Score =  390 bits (1003), Expect = e-106
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>gb|KJB46920.1| hypothetical protein B456_008G001300 [Gossypium raimondii]
          Length = 429

 Score =  390 bits (1003), Expect = e-106
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>gb|KJB46918.1| hypothetical protein B456_008G001300 [Gossypium raimondii]
          Length = 392

 Score =  390 bits (1003), Expect = e-106
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>ref|XP_012435802.1| PREDICTED: cysteine protease ATG4 isoform X3 [Gossypium raimondii]
           gi|763779846|gb|KJB46917.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
          Length = 433

 Score =  390 bits (1003), Expect = e-106
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2 [Gossypium raimondii]
           gi|763779844|gb|KJB46915.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
           gi|763779845|gb|KJB46916.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
           gi|763779850|gb|KJB46921.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
          Length = 488

 Score =  390 bits (1003), Expect = e-106
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGA 3
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
           gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B,
           putative [Ricinus communis]
          Length = 489

 Score =  390 bits (1003), Expect = e-106
 Identities = 197/315 (62%), Positives = 231/315 (73%)
 Frame = -3

Query: 947 SVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERL 768
           S KG             FE Y ++    +        NGW + +K+IVSG SMR + ER+
Sbjct: 37  STKGSLWSSFFASAFSVFETYRESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERV 96

Query: 767 IGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSA 588
           +GPS TG+SS+TS+IW LGVCYKIS ED S ++  GN L EF  D+SSRI MTYR+GF A
Sbjct: 97  LGPSRTGISSTTSDIWLLGVCYKIS-EDESGNADTGNALAEFTHDYSSRILMTYRRGFDA 155

Query: 587 IGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDS 408
           IGDSK  SDV WGCM RSSQMLVAQALLFH LGR+W KPF+KP D  Y++IL  FGDS++
Sbjct: 156 IGDSKYISDVGWGCMLRSSQMLVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEA 215

Query: 407 SAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVS 228
           + FSIHNLIQAGKAY LAAGSWVGPYAMCRSWE+LAR  RE  +   E+ SLPMAV++VS
Sbjct: 216 APFSIHNLIQAGKAYSLAAGSWVGPYAMCRSWESLARSKRE--ENSLEYQSLPMAVYVVS 273

Query: 227 GDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTF 48
           GDEDGERGGAPV+ IEDA+R C EF  GQADW+PI         L+KVN RYIPSL+ TF
Sbjct: 274 GDEDGERGGAPVVYIEDASRHCLEFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATF 333

Query: 47  TFPQSLGILGGKPGA 3
           TF QSLGI+GGKPGA
Sbjct: 334 TFSQSLGIMGGKPGA 348