BLASTX nr result

ID: Papaver29_contig00035734 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver29_contig00035734
         (1048 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010277861.1| PREDICTED: cysteine protease ATG4-like [Nelu...   406   e-110
ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isofo...   402   e-109
gb|KDO40319.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   402   e-109
gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   402   e-109
ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citr...   402   e-109
ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isofo...   401   e-109
ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theo...   400   e-108
ref|XP_008810459.1| PREDICTED: cysteine protease ATG4B-like isof...   397   e-107
ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1...   396   e-107
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2...   396   e-107
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   396   e-107
ref|XP_010934835.1| PREDICTED: cysteine protease ATG4B-like isof...   394   e-107
ref|XP_010934834.1| PREDICTED: cysteine protease ATG4B-like isof...   394   e-107
ref|XP_010934833.1| PREDICTED: cysteine protease ATG4B-like isof...   394   e-107
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   392   e-106
ref|XP_007049915.1| Peptidase family C54 protein isoform 1 [Theo...   390   e-105
ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1...   389   e-105
gb|KJB46920.1| hypothetical protein B456_008G001300 [Gossypium r...   389   e-105
gb|KJB46918.1| hypothetical protein B456_008G001300 [Gossypium r...   389   e-105
ref|XP_012435802.1| PREDICTED: cysteine protease ATG4 isoform X3...   389   e-105

>ref|XP_010277861.1| PREDICTED: cysteine protease ATG4-like [Nelumbo nucifera]
           gi|720070813|ref|XP_010277863.1| PREDICTED: cysteine
           protease ATG4-like [Nelumbo nucifera]
           gi|720070816|ref|XP_010277864.1| PREDICTED: cysteine
           protease ATG4-like [Nelumbo nucifera]
           gi|720070819|ref|XP_010277865.1| PREDICTED: cysteine
           protease ATG4-like [Nelumbo nucifera]
          Length = 490

 Score =  406 bits (1044), Expect = e-110
 Identities = 205/314 (65%), Positives = 239/314 (76%)
 Frame = -3

Query: 947 SAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRERL 768
           S+KG             FE YS++S S K + ++K+  GW T +K++++  SMR L+ER+
Sbjct: 40  SSKGSLWSSLFTSPSSLFETYSESSISVKKTFNSKTY-GWTTALKKVLTVGSMRRLQERI 98

Query: 767 IGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSA 588
           +GPS TGVSSSTSEIW LGVCYK+S ED S +  +GNGLV F +DFSSRIWMTYRKGF  
Sbjct: 99  LGPSKTGVSSSTSEIWLLGVCYKVSEEDSSGNLVNGNGLVAFTEDFSSRIWMTYRKGFDV 158

Query: 587 IADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSES 408
           I DSK TSDV+WGCM RSSQMLVAQALL H LGRSWRKP + P+D  YI+IL  FGDSE+
Sbjct: 159 I-DSKFTSDVNWGCMLRSSQMLVAQALLVHCLGRSWRKPLQPPFDPEYIEILHLFGDSEA 217

Query: 407 SAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVS 228
           SAFSIHNL++AGKAYGLAAGSW+GPYAMCRSWETL R  RE  N   ++ SL MAV+IVS
Sbjct: 218 SAFSIHNLLQAGKAYGLAAGSWIGPYAMCRSWETLVRSKREQTNLEIENQSLSMAVYIVS 277

Query: 227 GDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATF 48
           GDEDGERGGAPV+C+ED AR CS+F  GQ DW+PI         LEKVN RYIP L ATF
Sbjct: 278 GDEDGERGGAPVVCVEDVARLCSEFSKGQVDWAPILLLVPLVLGLEKVNPRYIPLLWATF 337

Query: 47  TFPQSLGILGGKPG 6
           TFPQSLGILGGK G
Sbjct: 338 TFPQSLGILGGKSG 351


>ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Jatropha curcas]
           gi|802675786|ref|XP_012081890.1| PREDICTED: cysteine
           protease ATG4-like isoform X1 [Jatropha curcas]
           gi|802675813|ref|XP_012081891.1| PREDICTED: cysteine
           protease ATG4-like isoform X1 [Jatropha curcas]
           gi|643718243|gb|KDP29532.1| hypothetical protein
           JCGZ_19245 [Jatropha curcas]
          Length = 492

 Score =  402 bits (1034), Expect = e-109
 Identities = 204/320 (63%), Positives = 241/320 (75%), Gaps = 3/320 (0%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDA--SQSGKNSTDAKSANGWATTMKRIVSGSSMRAL 780
           +K +KG             FE Y ++  + S K  +  ++ NGW + +K+IV+G SMR +
Sbjct: 39  SKFSKGFLWSSFFTAAFSVFETYRESPPTTSEKKGSHTRN-NGWTSAVKKIVAGGSMRRI 97

Query: 779 RERLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDS-THGNGLVEFVDDFSSRIWMTYR 603
            ER++GPS TG+S++TSEIW LGVCYKIS +  ++D+ T  NGL +F  DFSSRI MTYR
Sbjct: 98  HERVLGPSRTGISNTTSEIWLLGVCYKISQDGSNADAATSNNGLADFTHDFSSRILMTYR 157

Query: 602 KGFSAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQF 423
           KGF AI DSK TSDV WGCM RSSQMLVAQALLFH LGRSWRKP +KP DQ Y++IL  F
Sbjct: 158 KGFDAIGDSKFTSDVGWGCMLRSSQMLVAQALLFHQLGRSWRKPIQKPLDQKYVEILHLF 217

Query: 422 GDSESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMA 243
           GDSE+S FSIHNLI AGKAYGLAAGSWVGPYAMCRSWE LAR  RE  N   +H +LPMA
Sbjct: 218 GDSEASPFSIHNLIHAGKAYGLAAGSWVGPYAMCRSWELLARCKREENN--LEHEALPMA 275

Query: 242 VHIVSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPS 63
           V++VSGDEDGERGGAPV+CIEDA+R C DF  GQA+W+PI         LEKVN RYIPS
Sbjct: 276 VYVVSGDEDGERGGAPVVCIEDASRHCLDFSRGQANWTPILLLVPLVLGLEKVNLRYIPS 335

Query: 62  LRATFTFPQSLGILGGKPGA 3
           L+AT TFPQSLGI+GGKPGA
Sbjct: 336 LQATLTFPQSLGIMGGKPGA 355


>gb|KDO40319.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis]
          Length = 392

 Score =  402 bits (1034), Expect = e-109
 Identities = 200/317 (63%), Positives = 238/317 (75%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           +KS+KG             FE YS++S S K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
             I DSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D+ Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+S FSIHNL++AGKAYGLAAGSWVGPYAMCRSWE LAR  R  A  G    SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR 
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis]
           gi|641820321|gb|KDO40317.1| hypothetical protein
           CISIN_1g011418mg [Citrus sinensis]
          Length = 486

 Score =  402 bits (1034), Expect = e-109
 Identities = 200/317 (63%), Positives = 238/317 (75%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           +KS+KG             FE YS++S S K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
             I DSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D+ Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+S FSIHNL++AGKAYGLAAGSWVGPYAMCRSWE LAR  R  A  G    SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR 
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citrus clementina]
           gi|557544235|gb|ESR55213.1| hypothetical protein
           CICLE_v10019906mg [Citrus clementina]
          Length = 486

 Score =  402 bits (1034), Expect = e-109
 Identities = 200/317 (63%), Positives = 238/317 (75%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           +KS+KG             FE YS++S S K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
             I DSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D+ Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+S FSIHNL++AGKAYGLAAGSWVGPYAMCRSWE LAR  R  A  G    SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR 
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Citrus sinensis]
          Length = 486

 Score =  401 bits (1031), Expect = e-109
 Identities = 199/317 (62%), Positives = 238/317 (75%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           +KS+KG             FE YS++S + K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38  SKSSKGSLLSSLFNSAFSVFETYSESSANEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97  RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
             I DSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D+ Y++IL  FGDS
Sbjct: 157 DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+S FSIHNL++AGKAYGLAAGSWVGPYAMCRSWE LAR  R  A  G    SLPMA+++
Sbjct: 217 ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR 
Sbjct: 275 VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGI+GGKPGA
Sbjct: 335 TFTFPQSLGIVGGKPGA 351


>ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theobroma cacao]
           gi|508702178|gb|EOX94074.1| Peptidase family C54 protein
           isoform 3 [Theobroma cacao]
          Length = 486

 Score =  400 bits (1027), Expect = e-108
 Identities = 197/298 (66%), Positives = 233/298 (78%), Gaps = 1/298 (0%)
 Frame = -3

Query: 893 EAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFL 714
           + YS++S   K +  A++ NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW L
Sbjct: 57  DTYSESSACEKKALHARN-NGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLL 115

Query: 713 GVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMHRS 534
           GVCYKIS    S D    NGL  F  DFSSRI MTYRKGF AI D+K+TSD  WGCM RS
Sbjct: 116 GVCYKISQVSSSGDVDASNGLAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRS 175

Query: 533 SQMLVA-QALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYGL 357
           SQMLVA QALLFH LGRSWRKP +KP++Q YI+IL QFGDSE++AFSIHNL+EAGK YGL
Sbjct: 176 SQMLVAQQALLFHQLGRSWRKPLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGL 235

Query: 356 AAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIED 177
           AAGSWVGPYAMCRSWE+LAR  RE  +   +H SLPMAV++VSGDEDGERGGAPV+C+ED
Sbjct: 236 AAGSWVGPYAMCRSWESLARFKREEND--LEHQSLPMAVYVVSGDEDGERGGAPVVCVED 293

Query: 176 AARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           A+R C +F   +ADW+PI         L+KVN+RYIPSL+ATFTFPQ LGILGGKPGA
Sbjct: 294 ASRHCFEFSRCRADWTPILLLVPLVLGLDKVNSRYIPSLQATFTFPQCLGILGGKPGA 351


>ref|XP_008810459.1| PREDICTED: cysteine protease ATG4B-like isoform X1 [Phoenix
           dactylifera]
          Length = 494

 Score =  397 bits (1019), Expect = e-107
 Identities = 198/316 (62%), Positives = 233/316 (73%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           NKS K              FE + ++    K +T ++S  GW T +K++V+G SMR L+E
Sbjct: 38  NKSLKASFLSSFFASTLSIFETHPESHSGEKKATKSRSY-GWTTAVKKVVAGGSMRRLQE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
           RL+G S+T   S TSEIW LG+ YK+SPE+ S  + HGNG   F++DFSSRIW+TYRKGF
Sbjct: 97  RLLGTSSTDALSLTSEIWLLGMRYKLSPEESSGGADHGNGSAAFLEDFSSRIWITYRKGF 156

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
            AI  SKLTSDV WGCM RSSQMLVAQALLFHHLGR WRKP +KP+D  YI+IL  FGDS
Sbjct: 157 DAIGYSKLTSDVRWGCMIRSSQMLVAQALLFHHLGRYWRKPSQKPHDPKYIEILHLFGDS 216

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+ AFS+HNL+EAGK YGLAAGSW+GPYAMCR+WETLAR  RE A+  ++  SLPMAV++
Sbjct: 217 EACAFSLHNLLEAGKGYGLAAGSWLGPYAMCRTWETLARAKREQADLDREKESLPMAVYV 276

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+ AAR CSDF  GQ  W PI         LEKVN RYIP L  
Sbjct: 277 VSGDEDGERGGAPVVCIDVAARLCSDFSKGQISWVPILLLVPLVLGLEKVNPRYIPLLWE 336

Query: 53  TFTFPQSLGILGGKPG 6
           TFTFPQSLGILGGKPG
Sbjct: 337 TFTFPQSLGILGGKPG 352


>ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1 [Vitis vinifera]
          Length = 489

 Score =  396 bits (1018), Expect = e-107
 Identities = 196/294 (66%), Positives = 230/294 (78%)
 Frame = -3

Query: 884 SDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 705
           S  S S K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 704 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMHRSSQM 525
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AI DSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 524 LVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYGLAAGS 345
           LVAQALL H +GRSWRK   KP DQ YI+IL  FGDS++SAFSIHN+++AGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 344 WVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 165
           WVGPYAMCRSWETLAR  RE  +   +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETD--LECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 164 CSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           C +F  GQ DW+PI         LEKVN RYIPSL ATFTFPQSLGILGGKPGA
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGA 347


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2 [Vitis vinifera]
           gi|296086874|emb|CBI33041.3| unnamed protein product
           [Vitis vinifera]
          Length = 486

 Score =  396 bits (1018), Expect = e-107
 Identities = 196/294 (66%), Positives = 230/294 (78%)
 Frame = -3

Query: 884 SDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 705
           S  S S K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 704 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMHRSSQM 525
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AI DSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 524 LVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYGLAAGS 345
           LVAQALL H +GRSWRK   KP DQ YI+IL  FGDS++SAFSIHN+++AGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 344 WVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 165
           WVGPYAMCRSWETLAR  RE  +   +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETD--LECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 164 CSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           C +F  GQ DW+PI         LEKVN RYIPSL ATFTFPQSLGILGGKPGA
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGA 347


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  396 bits (1018), Expect = e-107
 Identities = 196/294 (66%), Positives = 230/294 (78%)
 Frame = -3

Query: 884 SDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 705
           S  S S K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 704 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMHRSSQM 525
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AI DSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 524 LVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYGLAAGS 345
           LVAQALL H +GRSWRK   KP DQ YI+IL  FGDS++SAFSIHN+++AGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 344 WVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 165
           WVGPYAMCRSWETLAR  RE  +   +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETD--LECQSLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 164 CSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           C +F  GQ DW+PI         LEKVN RYIPSL ATFTFPQSLGILGGKPGA
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGA 347


>ref|XP_010934835.1| PREDICTED: cysteine protease ATG4B-like isoform X3 [Elaeis
           guineensis] gi|743831925|ref|XP_010934836.1| PREDICTED:
           cysteine protease ATG4B-like isoform X3 [Elaeis
           guineensis]
          Length = 396

 Score =  394 bits (1011), Expect = e-107
 Identities = 197/317 (62%), Positives = 231/317 (72%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           NKS K              FE + ++    K  T ++S  GW T +K++V+G SMR L+E
Sbjct: 38  NKSLKASFLSSFFVSTLSIFETHPESHSGEKKPTKSRSY-GWTTAVKKVVTGGSMRRLQE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
            L+G S+T   SSTS+IW LG CYK+SPE+ S  + HGNG   F++DFSSRIW+TYRKGF
Sbjct: 97  -LLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGF 155

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
            AI DSK TSDV WGCM RSSQMLVAQALLFHHLGRSWRKP +KP+D  YI+IL  FGDS
Sbjct: 156 DAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDS 215

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+ AFSIHNL+EAGKAYGLAA  WVGPYAMCR+WET+ R  RE A+  K+   LPM V++
Sbjct: 216 EACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQADLDKEKERLPMVVYV 275

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+ AAR CSDF  GQ  W+P+         LEKVN RYIP L  
Sbjct: 276 VSGDEDGERGGAPVVCIDVAARLCSDFTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWE 335

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGILGGKPGA
Sbjct: 336 TFTFPQSLGILGGKPGA 352


>ref|XP_010934834.1| PREDICTED: cysteine protease ATG4B-like isoform X2 [Elaeis
           guineensis]
          Length = 484

 Score =  394 bits (1011), Expect = e-107
 Identities = 197/317 (62%), Positives = 231/317 (72%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           NKS K              FE + ++    K  T ++S  GW T +K++V+G SMR L+E
Sbjct: 38  NKSLKASFLSSFFVSTLSIFETHPESHSGEKKPTKSRSY-GWTTAVKKVVTGGSMRRLQE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
            L+G S+T   SSTS+IW LG CYK+SPE+ S  + HGNG   F++DFSSRIW+TYRKGF
Sbjct: 97  -LLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGF 155

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
            AI DSK TSDV WGCM RSSQMLVAQALLFHHLGRSWRKP +KP+D  YI+IL  FGDS
Sbjct: 156 DAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDS 215

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+ AFSIHNL+EAGKAYGLAA  WVGPYAMCR+WET+ R  RE A+  K+   LPM V++
Sbjct: 216 EACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQADLDKEKERLPMVVYV 275

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+ AAR CSDF  GQ  W+P+         LEKVN RYIP L  
Sbjct: 276 VSGDEDGERGGAPVVCIDVAARLCSDFTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWE 335

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGILGGKPGA
Sbjct: 336 TFTFPQSLGILGGKPGA 352


>ref|XP_010934833.1| PREDICTED: cysteine protease ATG4B-like isoform X1 [Elaeis
           guineensis]
          Length = 488

 Score =  394 bits (1011), Expect = e-107
 Identities = 197/317 (62%), Positives = 231/317 (72%)
 Frame = -3

Query: 953 NKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRE 774
           NKS K              FE + ++    K  T ++S  GW T +K++V+G SMR L+E
Sbjct: 38  NKSLKASFLSSFFVSTLSIFETHPESHSGEKKPTKSRSY-GWTTAVKKVVTGGSMRRLQE 96

Query: 773 RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 594
            L+G S+T   SSTS+IW LG CYK+SPE+ S  + HGNG   F++DFSSRIW+TYRKGF
Sbjct: 97  -LLGTSSTDALSSTSDIWLLGKCYKLSPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGF 155

Query: 593 SAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDS 414
            AI DSK TSDV WGCM RSSQMLVAQALLFHHLGRSWRKP +KP+D  YI+IL  FGDS
Sbjct: 156 DAIGDSKFTSDVRWGCMIRSSQMLVAQALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDS 215

Query: 413 ESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHI 234
           E+ AFSIHNL+EAGKAYGLAA  WVGPYAMCR+WET+ R  RE A+  K+   LPM V++
Sbjct: 216 EACAFSIHNLLEAGKAYGLAAREWVGPYAMCRTWETITRAKREQADLDKEKERLPMVVYV 275

Query: 233 VSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRA 54
           VSGDEDGERGGAPV+CI+ AAR CSDF  GQ  W+P+         LEKVN RYIP L  
Sbjct: 276 VSGDEDGERGGAPVVCIDVAARLCSDFTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWE 335

Query: 53  TFTFPQSLGILGGKPGA 3
           TFTFPQSLGILGGKPGA
Sbjct: 336 TFTFPQSLGILGGKPGA 352


>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
           gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B,
           putative [Ricinus communis]
          Length = 489

 Score =  392 bits (1007), Expect = e-106
 Identities = 197/318 (61%), Positives = 235/318 (73%)
 Frame = -3

Query: 956 TNKSAKGXXXXXXXXXXXXXFEAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALR 777
           +N S KG             FE Y ++  + +        NGW + +K+IVSG SMR + 
Sbjct: 34  SNFSTKGSLWSSFFASAFSVFETYRESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIH 93

Query: 776 ERLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKG 597
           ER++GPS TG+SS+TS+IW LGVCYKIS ED S ++  GN L EF  D+SSRI MTYR+G
Sbjct: 94  ERVLGPSRTGISSTTSDIWLLGVCYKIS-EDESGNADTGNALAEFTHDYSSRILMTYRRG 152

Query: 596 FSAIADSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGD 417
           F AI DSK  SDV WGCM RSSQMLVAQALLFH LGR+W KPF+KP DQ Y++IL  FGD
Sbjct: 153 FDAIGDSKYISDVGWGCMLRSSQMLVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGD 212

Query: 416 SESSAFSIHNLIEAGKAYGLAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVH 237
           SE++ FSIHNLI+AGKAY LAAGSWVGPYAMCRSWE+LAR  RE  +   ++ SLPMAV+
Sbjct: 213 SEAAPFSIHNLIQAGKAYSLAAGSWVGPYAMCRSWESLARSKREENS--LEYQSLPMAVY 270

Query: 236 IVSGDEDGERGGAPVLCIEDAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLR 57
           +VSGDEDGERGGAPV+ IEDA+R C +F  GQADW+PI         L+KVN RYIPSL+
Sbjct: 271 VVSGDEDGERGGAPVVYIEDASRHCLEFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQ 330

Query: 56  ATFTFPQSLGILGGKPGA 3
           ATFTF QSLGI+GGKPGA
Sbjct: 331 ATFTFSQSLGIMGGKPGA 348


>ref|XP_007049915.1| Peptidase family C54 protein isoform 1 [Theobroma cacao]
            gi|508702176|gb|EOX94072.1| Peptidase family C54 protein
            isoform 1 [Theobroma cacao]
          Length = 514

 Score =  390 bits (1001), Expect = e-105
 Identities = 196/319 (61%), Positives = 232/319 (72%), Gaps = 22/319 (6%)
 Frame = -3

Query: 893  EAYSDASQSGKNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFL 714
            + YS++S   K +  A++ NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW L
Sbjct: 57   DTYSESSACEKKALHARN-NGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLL 115

Query: 713  GVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMHRS 534
            GVCYKIS    S D    NGL  F  DFSSRI MTYRKGF AI D+K+TSD  WGCM RS
Sbjct: 116  GVCYKISQVSSSGDVDASNGLAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRS 175

Query: 533  SQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYGLA 354
            SQMLVAQALLFH LGRSWRKP +KP++Q YI+IL QFGDSE++AFSIHNL+EAGK YGLA
Sbjct: 176  SQMLVAQALLFHQLGRSWRKPLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGLA 235

Query: 353  AGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIEDA 174
            AGSWVGPYAMCRSWE+LAR  RE  +   +H SLPMAV++VSGDEDGERGGAPV+C+EDA
Sbjct: 236  AGSWVGPYAMCRSWESLARFKREEND--LEHQSLPMAVYVVSGDEDGERGGAPVVCVEDA 293

Query: 173  ARRCSDFGNGQADWSPIXXXXXXXXXLEKVNT----------------------RYIPSL 60
            +R C +F   +ADW+PI         L+KVN+                       YIPSL
Sbjct: 294  SRHCFEFSRCRADWTPILLLVPLVLGLDKVNSSFCKEDSTFETEGELHLDFAYLEYIPSL 353

Query: 59   RATFTFPQSLGILGGKPGA 3
            +ATFTFPQ LGILGGKPGA
Sbjct: 354  QATFTFPQCLGILGGKPGA 372


>ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1 [Gossypium raimondii]
           gi|823202390|ref|XP_012435799.1| PREDICTED: cysteine
           protease ATG4 isoform X1 [Gossypium raimondii]
           gi|823202393|ref|XP_012435800.1| PREDICTED: cysteine
           protease ATG4 isoform X1 [Gossypium raimondii]
          Length = 495

 Score =  389 bits (999), Expect = e-105
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQSG--KNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S S   +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AI ++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDSE+SAFSIHNL+EAGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  +   +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEID--LECQLLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           DA+R C +F   QADW+PI         L+KVN RYIPSL+ATFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>gb|KJB46920.1| hypothetical protein B456_008G001300 [Gossypium raimondii]
          Length = 429

 Score =  389 bits (999), Expect = e-105
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQSG--KNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S S   +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AI ++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDSE+SAFSIHNL+EAGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  +   +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEID--LECQLLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           DA+R C +F   QADW+PI         L+KVN RYIPSL+ATFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>gb|KJB46918.1| hypothetical protein B456_008G001300 [Gossypium raimondii]
          Length = 392

 Score =  389 bits (999), Expect = e-105
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQSG--KNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S S   +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AI ++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDSE+SAFSIHNL+EAGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  +   +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEID--LECQLLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           DA+R C +F   QADW+PI         L+KVN RYIPSL+ATFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


>ref|XP_012435802.1| PREDICTED: cysteine protease ATG4 isoform X3 [Gossypium raimondii]
           gi|763779846|gb|KJB46917.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
          Length = 433

 Score =  389 bits (999), Expect = e-105
 Identities = 194/299 (64%), Positives = 227/299 (75%), Gaps = 2/299 (0%)
 Frame = -3

Query: 893 EAYSDASQSG--KNSTDAKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 720
           + YS++S S   +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 719 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIADSKLTSDVHWGCMH 540
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AI ++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 539 RSSQMLVAQALLFHHLGRSWRKPFEKPYDQVYIKILDQFGDSESSAFSIHNLIEAGKAYG 360
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDSE+SAFSIHNL+EAGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 359 LAAGSWVGPYAMCRSWETLARLNREHANPGKDHTSLPMAVHIVSGDEDGERGGAPVLCIE 180
           LAAGSWVGPYAMCRSWE+LAR  RE  +   +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEID--LECQLLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 179 DAARRCSDFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRATFTFPQSLGILGGKPGA 3
           DA+R C +F   QADW+PI         L+KVN RYIPSL+ATFTFPQ LGILGGKPGA
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGA 351


Top