BLASTX nr result

ID: Papaver30_contig00023346

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver30_contig00023346
         (1323 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010277861.1| PREDICTED: cysteine protease ATG4-like [Nelu...   436   e-119
ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1...   424   e-116
ref|XP_008810459.1| PREDICTED: cysteine protease ATG4B-like isof...   424   e-116
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2...   424   e-116
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   424   e-116
ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isofo...   424   e-115
gb|KDO40319.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   419   e-114
gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   419   e-114
ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citr...   419   e-114
ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theo...   419   e-114
ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isofo...   419   e-114
ref|XP_010934835.1| PREDICTED: cysteine protease ATG4B-like isof...   418   e-114
ref|XP_010934834.1| PREDICTED: cysteine protease ATG4B-like isof...   418   e-114
ref|XP_010934833.1| PREDICTED: cysteine protease ATG4B-like isof...   418   e-114
ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus c...   417   e-113
ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1...   412   e-112
gb|KJB46920.1| hypothetical protein B456_008G001300 [Gossypium r...   412   e-112
gb|KJB46918.1| hypothetical protein B456_008G001300 [Gossypium r...   412   e-112
ref|XP_012435802.1| PREDICTED: cysteine protease ATG4 isoform X3...   412   e-112
ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2...   412   e-112
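
The summary lines above can be read programmatically. What follows is a minimal Python sketch, not part of the BLAST output. It assumes the legacy layout in which the last two whitespace-separated fields of each line are the bit score and the E-value; the function name parse_hit_line is hypothetical. Legacy BLAST prints E-values such as e-119 without a leading mantissa, which Python's float() rejects, so the sketch prefixes a "1".

```python
def parse_hit_line(line):
    """Split one summary line into (description, bit_score, e_value)."""
    # The description itself contains spaces, so split from the right.
    description, score, evalue = line.rstrip().rsplit(None, 2)
    # Legacy BLAST writes "e-119" for 1e-119; float() needs a mantissa.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return description, float(score), float(evalue)


hit = parse_hit_line(
    "ref|XP_010277861.1| PREDICTED: cysteine protease ATG4-like [Nelu...   436   e-119"
)
# Print the accession (second "|"-separated field), bit score and E-value.
print(hit[0].split("|")[1], hit[1], hit[2])
```

Splitting from the right keeps the truncated description intact whatever spaces it contains.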

>ref|XP_010277861.1| PREDICTED: cysteine protease ATG4-like [Nelumbo nucifera]
            gi|720070813|ref|XP_010277863.1| PREDICTED: cysteine
            protease ATG4-like [Nelumbo nucifera]
            gi|720070816|ref|XP_010277864.1| PREDICTED: cysteine
            protease ATG4-like [Nelumbo nucifera]
            gi|720070819|ref|XP_010277865.1| PREDICTED: cysteine
            protease ATG4-like [Nelumbo nucifera]
          Length = 490

 Score =  436 bits (1121), Expect = e-119
 Identities = 222/333 (66%), Positives = 254/333 (76%)
 Frame = -2

Query: 1001 DTTKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRAL 822
            DTT S KG             FE YS++S   K + + K+  GW T +K++++  SMR L
Sbjct: 37   DTTSS-KGSLWSSLFTSPSSLFETYSESSISVKKTFNSKTY-GWTTALKKVLTVGSMRRL 94

Query: 821  RERLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRK 642
            +ER++GPS TGVSSSTSEIW LGVCYK+S ED S +  +GNGLV F +DFSSRIWMTYRK
Sbjct: 95   QERILGPSKTGVSSSTSEIWLLGVCYKVSEEDSSGNLVNGNGLVAFTEDFSSRIWMTYRK 154

Query: 641  GFSAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFG 462
            GF  I DSK TSDV+WGCM RSSQMLVAQALL H LGRSWRKP + P+DP YI+IL  FG
Sbjct: 155  GFDVI-DSKFTSDVNWGCMLRSSQMLVAQALLVHCLGRSWRKPLQPPFDPEYIEILHLFG 213

Query: 461  DSDSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAV 282
            DS++SAFSIHNL+QAGKAYGLAAGSW+GPYAMCRSWETL R  RE  + E E+ SL MAV
Sbjct: 214  DSEASAFSIHNLLQAGKAYGLAAGSWIGPYAMCRSWETLVRSKREQTNLEIENQSLSMAV 273

Query: 281  HIVSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSL 102
            +IVSGDEDGERGGAPV+C+ED AR CSEF  GQ DW+PI         LEKVN RYIP L
Sbjct: 274  YIVSGDEDGERGGAPVVCVEDVARLCSEFSKGQVDWAPILLLVPLVLGLEKVNPRYIPLL 333

Query: 101  RVTFTFPQSLGILGGKPGASTYIVGVQDDKAFY 3
              TFTFPQSLGILGGK G STYIVG+QDDKAFY
Sbjct: 334  WATFTFPQSLGILGGKSGVSTYIVGMQDDKAFY 366
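
The statistics reported for this first alignment can be checked by hand. The Python sketch below is not part of the BLAST output, and its helper names are hypothetical. It assumes that the displayed percentages are truncated rather than rounded, which is inferred from 222/333 being shown as 66%, and that each translated query residue covers three query nucleotides.

```python
def percent(matches, length):
    """Integer percentage, truncated as the report appears to do."""
    return matches * 100 // length


def codons_spanned(query_start, query_end):
    """Number of codons covered by a query nucleotide range (either strand)."""
    return (abs(query_start - query_end) + 1) // 3


# Values copied from the first alignment (Frame = -2, query 1001 down to 3).
print(percent(222, 333))        # identities
print(percent(254, 333))        # positives
print(codons_spanned(1001, 3))  # translated query residues in the alignment
```

The query range 1001 to 3 spans 999 nucleotides, that is 333 codons, which agrees with the denominator in the Identities line.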


>ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1 [Vitis vinifera]
          Length = 489

 Score =  424 bits (1091), Expect = e-116
 Identities = 211/308 (68%), Positives = 242/308 (78%)
 Frame = -2

Query: 926 SDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 747
           S  S   K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 746 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQM 567
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AIGDSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 566 LVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGS 387
           LVAQALL H +GRSWRK   KP D  YI+IL  FGDS +SAFSIHN++QAGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 386 WVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 207
           WVGPYAMCRSWETLAR  RE  D E +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETDLECQ--SLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 206 CSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVG 27
           C EF  GQ DW+PI         LEKVN RYIPSL  TFTFPQSLGILGGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353

Query: 26  VQDDKAFY 3
           VQD+KAFY
Sbjct: 354 VQDEKAFY 361


>ref|XP_008810459.1| PREDICTED: cysteine protease ATG4B-like isoform X1 [Phoenix
           dactylifera]
          Length = 494

 Score =  424 bits (1091), Expect = e-116
 Identities = 205/305 (67%), Positives = 238/305 (78%)
 Frame = -2

Query: 917 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 738
           S  G+       + GW T +K++V+G SMR L+ERL+G S+T   S TSEIW LG+ YK+
Sbjct: 63  SHSGEKKATKSRSYGWTTAVKKVVAGGSMRRLQERLLGTSSTDALSLTSEIWLLGMRYKL 122

Query: 737 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 558
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIG SKLTSDV WGCM RSSQMLVA
Sbjct: 123 SPEESSGGADHGNGSAAFLEDFSSRIWITYRKGFDAIGYSKLTSDVRWGCMIRSSQMLVA 182

Query: 557 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 378
           QALLFHHLGR WRKP +KP+DP YI+IL  FGDS++ AFS+HNL++AGK YGLAAGSW+G
Sbjct: 183 QALLFHHLGRYWRKPSQKPHDPKYIEILHLFGDSEACAFSLHNLLEAGKGYGLAAGSWLG 242

Query: 377 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 198
           PYAMCR+WETLAR  RE AD ++E  SLPMAV++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 243 PYAMCRTWETLARAKREQADLDREKESLPMAVYVVSGDEDGERGGAPVVCIDVAARLCSD 302

Query: 197 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVGVQD 18
           F  GQ  W PI         LEKVN RYIP L  TFTFPQSLGILGGKPG STYIVGVQD
Sbjct: 303 FSKGQISWVPILLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPGTSTYIVGVQD 362

Query: 17  DKAFY 3
           DKA Y
Sbjct: 363 DKALY 367


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2 [Vitis vinifera]
           gi|296086874|emb|CBI33041.3| unnamed protein product
           [Vitis vinifera]
          Length = 486

 Score =  424 bits (1091), Expect = e-116
 Identities = 211/308 (68%), Positives = 242/308 (78%)
 Frame = -2

Query: 926 SDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 747
           S  S   K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 746 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQM 567
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AIGDSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 566 LVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGS 387
           LVAQALL H +GRSWRK   KP D  YI+IL  FGDS +SAFSIHN++QAGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 386 WVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 207
           WVGPYAMCRSWETLAR  RE  D E +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETDLECQ--SLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 206 CSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVG 27
           C EF  GQ DW+PI         LEKVN RYIPSL  TFTFPQSLGILGGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353

Query: 26  VQDDKAFY 3
           VQD+KAFY
Sbjct: 354 VQDEKAFY 361


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  424 bits (1091), Expect = e-116
 Identities = 211/308 (68%), Positives = 242/308 (78%)
 Frame = -2

Query: 926 SDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVC 747
           S  S   K + D    NGW T ++++V+G SMR ++ER++G S TG+SSSTS+IW LG+C
Sbjct: 56  SSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSSTSDIWLLGLC 115

Query: 746 YKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQM 567
           YKIS E+ S+ ++  NGL EF  DFSSRI MTYRKGF AIGDSKLTSDV+WGCM RSSQM
Sbjct: 116 YKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQM 175

Query: 566 LVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGS 387
           LVAQALL H +GRSWRK   KP D  YI+IL  FGDS +SAFSIHN++QAGKAYGLAAGS
Sbjct: 176 LVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGS 235

Query: 386 WVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARR 207
           WVGPYAMCRSWETLAR  RE  D E +  SLPMA++IVSGDEDGERGGAPV+ IE+A+R 
Sbjct: 236 WVGPYAMCRSWETLARSKREETDLECQ--SLPMAIYIVSGDEDGERGGAPVVYIEEASRH 293

Query: 206 CSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVG 27
           C EF  GQ DW+PI         LEKVN RYIPSL  TFTFPQSLGILGGKPGASTYIVG
Sbjct: 294 CLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPGASTYIVG 353

Query: 26  VQDDKAFY 3
           VQD+KAFY
Sbjct: 354 VQDEKAFY 361


>ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Jatropha curcas]
           gi|802675786|ref|XP_012081890.1| PREDICTED: cysteine
           protease ATG4-like isoform X1 [Jatropha curcas]
           gi|802675813|ref|XP_012081891.1| PREDICTED: cysteine
           protease ATG4-like isoform X1 [Jatropha curcas]
           gi|643718243|gb|KDP29532.1| hypothetical protein
           JCGZ_19245 [Jatropha curcas]
          Length = 492

 Score =  424 bits (1089), Expect = e-115
 Identities = 206/293 (70%), Positives = 237/293 (80%), Gaps = 1/293 (0%)
 Frame = -2

Query: 878 NGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDS-THG 702
           NGW + +K+IV+G SMR + ER++GPS TG+S++TSEIW LGVCYKIS +  ++D+ T  
Sbjct: 79  NGWTSAVKKIVAGGSMRRIHERVLGPSRTGISNTTSEIWLLGVCYKISQDGSNADAATSN 138

Query: 701 NGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSW 522
           NGL +F  DFSSRI MTYRKGF AIGDSK TSDV WGCM RSSQMLVAQALLFH LGRSW
Sbjct: 139 NGLADFTHDFSSRILMTYRKGFDAIGDSKFTSDVGWGCMLRSSQMLVAQALLFHQLGRSW 198

Query: 521 RKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLA 342
           RKP +KP D  Y++IL  FGDS++S FSIHNLI AGKAYGLAAGSWVGPYAMCRSWE LA
Sbjct: 199 RKPIQKPLDQKYVEILHLFGDSEASPFSIHNLIHAGKAYGLAAGSWVGPYAMCRSWELLA 258

Query: 341 RLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIX 162
           R  RE  +   EH +LPMAV++VSGDEDGERGGAPV+CIEDA+R C +F  GQA+W+PI 
Sbjct: 259 RCKRE--ENNLEHEALPMAVYVVSGDEDGERGGAPVVCIEDASRHCLDFSRGQANWTPIL 316

Query: 161 XXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVGVQDDKAFY 3
                   LEKVN RYIPSL+ T TFPQSLGI+GGKPGASTYIVGVQDD AFY
Sbjct: 317 LLVPLVLGLEKVNLRYIPSLQATLTFPQSLGIMGGKPGASTYIVGVQDDNAFY 369


>gb|KDO40319.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis]
          Length = 392

 Score =  419 bits (1078), Expect = e-114
 Identities = 209/331 (63%), Positives = 249/331 (75%)
 Frame = -2

Query: 995  TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 816
            +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38   SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 815  RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 636
            R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97   RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 635  SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 456
              IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157  DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 455  DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 276
            ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217  ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 275  VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 96
            VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275  VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 95   TFTFPQSLGILGGKPGASTYIVGVQDDKAFY 3
            TFTFPQSLGI+GGKPGASTYIVGVQ++ A Y
Sbjct: 335  TFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365


>gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis]
            gi|641820321|gb|KDO40317.1| hypothetical protein
            CISIN_1g011418mg [Citrus sinensis]
          Length = 486

 Score =  419 bits (1078), Expect = e-114
 Identities = 209/331 (63%), Positives = 249/331 (75%)
 Frame = -2

Query: 995  TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 816
            +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38   SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 815  RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 636
            R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97   RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 635  SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 456
              IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157  DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 455  DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 276
            ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217  ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 275  VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 96
            VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275  VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 95   TFTFPQSLGILGGKPGASTYIVGVQDDKAFY 3
            TFTFPQSLGI+GGKPGASTYIVGVQ++ A Y
Sbjct: 335  TFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365


>ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citrus clementina]
            gi|557544235|gb|ESR55213.1| hypothetical protein
            CICLE_v10019906mg [Citrus clementina]
          Length = 486

 Score =  419 bits (1078), Expect = e-114
 Identities = 209/331 (63%), Positives = 249/331 (75%)
 Frame = -2

Query: 995  TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 816
            +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38   SKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 815  RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 636
            R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97   RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 635  SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 456
              IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157  DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 455  DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 276
            ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217  ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 275  VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 96
            VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275  VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 95   TFTFPQSLGILGGKPGASTYIVGVQDDKAFY 3
            TFTFPQSLGI+GGKPGASTYIVGVQ++ A Y
Sbjct: 335  TFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365


>ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theobroma cacao]
           gi|508702178|gb|EOX94074.1| Peptidase family C54 protein
           isoform 3 [Theobroma cacao]
          Length = 486

 Score =  419 bits (1078), Expect = e-114
 Identities = 206/312 (66%), Positives = 243/312 (77%), Gaps = 1/312 (0%)
 Frame = -2

Query: 935 EAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFL 756
           + YS++S   K +   ++ NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW L
Sbjct: 57  DTYSESSACEKKALHARN-NGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIWLL 115

Query: 755 GVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRS 576
           GVCYKIS    S D    NGL  F  DFSSRI MTYRKGF AIGD+K+TSD  WGCM RS
Sbjct: 116 GVCYKISQVSSSGDVDASNGLAAFKRDFSSRILMTYRKGFDAIGDTKITSDFGWGCMLRS 175

Query: 575 SQMLVA-QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGL 399
           SQMLVA QALLFH LGRSWRKP +KP++  YI+IL QFGDS+++AFSIHNL++AGK YGL
Sbjct: 176 SQMLVAQQALLFHQLGRSWRKPLQKPFEQAYIEILHQFGDSEATAFSIHNLVEAGKIYGL 235

Query: 398 AAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIED 219
           AAGSWVGPYAMCRSWE+LAR  RE  D   EH SLPMAV++VSGDEDGERGGAPV+C+ED
Sbjct: 236 AAGSWVGPYAMCRSWESLARFKREEND--LEHQSLPMAVYVVSGDEDGERGGAPVVCVED 293

Query: 218 AARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGAST 39
           A+R C EF   +ADW+PI         L+KVN+RYIPSL+ TFTFPQ LGILGGKPGAST
Sbjct: 294 ASRHCFEFSRCRADWTPILLLVPLVLGLDKVNSRYIPSLQATFTFPQCLGILGGKPGAST 353

Query: 38  YIVGVQDDKAFY 3
           YIVGVQ++  FY
Sbjct: 354 YIVGVQEENVFY 365


>ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Citrus sinensis]
          Length = 486

 Score =  419 bits (1077), Expect = e-114
 Identities = 209/331 (63%), Positives = 249/331 (75%)
 Frame = -2

Query: 995  TKSVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRE 816
            +KS KG             FE YS++S   K +   KS NGW   +KR+V+  SMR + E
Sbjct: 38   SKSSKGSLLSSLFNSAFSVFETYSESSANEKKAVHNKS-NGWTAAVKRLVTAGSMRRIHE 96

Query: 815  RLIGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGF 636
            R++GPS TG+SSSTS+IW LGVC+KI+ ++   D+   NGL EF  DFSSRI ++YRKGF
Sbjct: 97   RVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYRKGF 156

Query: 635  SAIGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDS 456
              IGDSK+TSDV WGCM RSSQMLVAQALLFH LGR WRKP +KP+D  Y++IL  FGDS
Sbjct: 157  DPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLFGDS 216

Query: 455  DSSAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHI 276
            ++S FSIHNL+QAGKAYGLAAGSWVGPYAMCRSWE LAR  R  A+      SLPMA+++
Sbjct: 217  ETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQR--AETGLGCQSLPMAIYV 274

Query: 275  VSGDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRV 96
            VSGDEDGERGGAPV+CI+DA+R CS F  GQADW+PI         LEKVN RYIP+LR+
Sbjct: 275  VSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLRL 334

Query: 95   TFTFPQSLGILGGKPGASTYIVGVQDDKAFY 3
            TFTFPQSLGI+GGKPGASTYIVGVQ++ A Y
Sbjct: 335  TFTFPQSLGIVGGKPGASTYIVGVQEESAIY 365


>ref|XP_010934835.1| PREDICTED: cysteine protease ATG4B-like isoform X3 [Elaeis
           guineensis] gi|743831925|ref|XP_010934836.1| PREDICTED:
           cysteine protease ATG4B-like isoform X3 [Elaeis
           guineensis]
          Length = 396

 Score =  418 bits (1075), Expect = e-114
 Identities = 203/305 (66%), Positives = 236/305 (77%)
 Frame = -2

Query: 917 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 738
           S  G+       + GW T +K++V+G SMR L+E L+G S+T   SSTS+IW LG CYK+
Sbjct: 63  SHSGEKKPTKSRSYGWTTAVKKVVTGGSMRRLQE-LLGTSSTDALSSTSDIWLLGKCYKL 121

Query: 737 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 558
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIGDSK TSDV WGCM RSSQMLVA
Sbjct: 122 SPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVA 181

Query: 557 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 378
           QALLFHHLGRSWRKP +KP+D  YI+IL  FGDS++ AFSIHNL++AGKAYGLAA  WVG
Sbjct: 182 QALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVG 241

Query: 377 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 198
           PYAMCR+WET+ R  RE AD +KE   LPM V++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 242 PYAMCRTWETITRAKREQADLDKEKERLPMVVYVVSGDEDGERGGAPVVCIDVAARLCSD 301

Query: 197 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVGVQD 18
           F  GQ  W+P+         LEKVN RYIP L  TFTFPQSLGILGGKPGASTYIVGVQD
Sbjct: 302 FTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPGASTYIVGVQD 361

Query: 17  DKAFY 3
           DKA Y
Sbjct: 362 DKALY 366


>ref|XP_010934834.1| PREDICTED: cysteine protease ATG4B-like isoform X2 [Elaeis
           guineensis]
          Length = 484

 Score =  418 bits (1075), Expect = e-114
 Identities = 203/305 (66%), Positives = 236/305 (77%)
 Frame = -2

Query: 917 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 738
           S  G+       + GW T +K++V+G SMR L+E L+G S+T   SSTS+IW LG CYK+
Sbjct: 63  SHSGEKKPTKSRSYGWTTAVKKVVTGGSMRRLQE-LLGTSSTDALSSTSDIWLLGKCYKL 121

Query: 737 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 558
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIGDSK TSDV WGCM RSSQMLVA
Sbjct: 122 SPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVA 181

Query: 557 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 378
           QALLFHHLGRSWRKP +KP+D  YI+IL  FGDS++ AFSIHNL++AGKAYGLAA  WVG
Sbjct: 182 QALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVG 241

Query: 377 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 198
           PYAMCR+WET+ R  RE AD +KE   LPM V++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 242 PYAMCRTWETITRAKREQADLDKEKERLPMVVYVVSGDEDGERGGAPVVCIDVAARLCSD 301

Query: 197 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVGVQD 18
           F  GQ  W+P+         LEKVN RYIP L  TFTFPQSLGILGGKPGASTYIVGVQD
Sbjct: 302 FTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPGASTYIVGVQD 361

Query: 17  DKAFY 3
           DKA Y
Sbjct: 362 DKALY 366


>ref|XP_010934833.1| PREDICTED: cysteine protease ATG4B-like isoform X1 [Elaeis
           guineensis]
          Length = 488

 Score =  418 bits (1075), Expect = e-114
 Identities = 203/305 (66%), Positives = 236/305 (77%)
 Frame = -2

Query: 917 SQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIWFLGVCYKI 738
           S  G+       + GW T +K++V+G SMR L+E L+G S+T   SSTS+IW LG CYK+
Sbjct: 63  SHSGEKKPTKSRSYGWTTAVKKVVTGGSMRRLQE-LLGTSSTDALSSTSDIWLLGKCYKL 121

Query: 737 SPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMHRSSQMLVA 558
           SPE+ S  + HGNG   F++DFSSRIW+TYRKGF AIGDSK TSDV WGCM RSSQMLVA
Sbjct: 122 SPEESSGGTDHGNGSAAFLEDFSSRIWITYRKGFDAIGDSKFTSDVRWGCMIRSSQMLVA 181

Query: 557 QALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYGLAAGSWVG 378
           QALLFHHLGRSWRKP +KP+D  YI+IL  FGDS++ AFSIHNL++AGKAYGLAA  WVG
Sbjct: 182 QALLFHHLGRSWRKPSQKPHDSKYIEILHLFGDSEACAFSIHNLLEAGKAYGLAAREWVG 241

Query: 377 PYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIEDAARRCSE 198
           PYAMCR+WET+ R  RE AD +KE   LPM V++VSGDEDGERGGAPV+CI+ AAR CS+
Sbjct: 242 PYAMCRTWETITRAKREQADLDKEKERLPMVVYVVSGDEDGERGGAPVVCIDVAARLCSD 301

Query: 197 FGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGASTYIVGVQD 18
           F  GQ  W+P+         LEKVN RYIP L  TFTFPQSLGILGGKPGASTYIVGVQD
Sbjct: 302 FTKGQISWAPMLLLVPLVLGLEKVNPRYIPLLWETFTFPQSLGILGGKPGASTYIVGVQD 361

Query: 17  DKAFY 3
           DKA Y
Sbjct: 362 DKALY 366


>ref|XP_002529225.1| Cysteine protease ATG4B, putative [Ricinus communis]
            gi|223531343|gb|EEF33181.1| Cysteine protease ATG4B,
            putative [Ricinus communis]
          Length = 489

 Score =  417 bits (1071), Expect = e-113
 Identities = 210/329 (63%), Positives = 244/329 (74%)
 Frame = -2

Query: 989  SVKGXXXXXXXXXXXXXFEAYSDASQPGKNSTDVKSANGWATTMKRIVSGSSMRALRERL 810
            S KG             FE Y ++    +        NGW + +K+IVSG SMR + ER+
Sbjct: 37   STKGSLWSSFFASAFSVFETYRESPPASEKKGSHSRHNGWTSAVKKIVSGGSMRRIHERV 96

Query: 809  IGPSNTGVSSSTSEIWFLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSA 630
            +GPS TG+SS+TS+IW LGVCYKIS ED S ++  GN L EF  D+SSRI MTYR+GF A
Sbjct: 97   LGPSRTGISSTTSDIWLLGVCYKIS-EDESGNADTGNALAEFTHDYSSRILMTYRRGFDA 155

Query: 629  IGDSKLTSDVHWGCMHRSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDS 450
            IGDSK  SDV WGCM RSSQMLVAQALLFH LGR+W KPF+KP D  Y++IL  FGDS++
Sbjct: 156  IGDSKYISDVGWGCMLRSSQMLVAQALLFHKLGRAWTKPFQKPMDQAYVEILHLFGDSEA 215

Query: 449  SAFSIHNLIQAGKAYGLAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVS 270
            + FSIHNLIQAGKAY LAAGSWVGPYAMCRSWE+LAR  RE  +   E+ SLPMAV++VS
Sbjct: 216  APFSIHNLIQAGKAYSLAAGSWVGPYAMCRSWESLARSKRE--ENSLEYQSLPMAVYVVS 273

Query: 269  GDEDGERGGAPVLCIEDAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTF 90
            GDEDGERGGAPV+ IEDA+R C EF  GQADW+PI         L+KVN RYIPSL+ TF
Sbjct: 274  GDEDGERGGAPVVYIEDASRHCLEFSRGQADWTPILLLVPLVLGLDKVNPRYIPSLQATF 333

Query: 89   TFPQSLGILGGKPGASTYIVGVQDDKAFY 3
            TF QSLGI+GGKPGASTYIVGVQDD AFY
Sbjct: 334  TFSQSLGIMGGKPGASTYIVGVQDDNAFY 362


>ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1 [Gossypium raimondii]
           gi|823202390|ref|XP_012435799.1| PREDICTED: cysteine
           protease ATG4 isoform X1 [Gossypium raimondii]
           gi|823202393|ref|XP_012435800.1| PREDICTED: cysteine
           protease ATG4 isoform X1 [Gossypium raimondii]
          Length = 495

 Score =  412 bits (1058), Expect = e-112
 Identities = 203/313 (64%), Positives = 239/313 (76%), Gaps = 2/313 (0%)
 Frame = -2

Query: 935 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 762
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 761 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 582
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 581 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 402
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 401 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 222
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 221 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGAS 42
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGAS
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGAS 352

Query: 41  TYIVGVQDDKAFY 3
           TYIVG+Q++  FY
Sbjct: 353 TYIVGIQEENVFY 365


>gb|KJB46920.1| hypothetical protein B456_008G001300 [Gossypium raimondii]
          Length = 429

 Score =  412 bits (1058), Expect = e-112
 Identities = 203/313 (64%), Positives = 239/313 (76%), Gaps = 2/313 (0%)
 Frame = -2

Query: 935 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 762
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 761 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 582
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 581 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 402
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 401 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 222
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 221 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGAS 42
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGAS
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGAS 352

Query: 41  TYIVGVQDDKAFY 3
           TYIVG+Q++  FY
Sbjct: 353 TYIVGIQEENVFY 365


>gb|KJB46918.1| hypothetical protein B456_008G001300 [Gossypium raimondii]
          Length = 392

 Score =  412 bits (1058), Expect = e-112
 Identities = 203/313 (64%), Positives = 239/313 (76%), Gaps = 2/313 (0%)
 Frame = -2

Query: 935 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 762
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 761 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 582
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 581 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 402
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 401 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 222
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 221 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGAS 42
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGAS
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGAS 352

Query: 41  TYIVGVQDDKAFY 3
           TYIVG+Q++  FY
Sbjct: 353 TYIVGIQEENVFY 365


>ref|XP_012435802.1| PREDICTED: cysteine protease ATG4 isoform X3 [Gossypium raimondii]
           gi|763779846|gb|KJB46917.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
          Length = 433

 Score =  412 bits (1058), Expect = e-112
 Identities = 203/313 (64%), Positives = 239/313 (76%), Gaps = 2/313 (0%)
 Frame = -2

Query: 935 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 762
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 761 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 582
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 581 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 402
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 401 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 222
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 221 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGAS 42
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGAS
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGAS 352

Query: 41  TYIVGVQDDKAFY 3
           TYIVG+Q++  FY
Sbjct: 353 TYIVGIQEENVFY 365


>ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2 [Gossypium raimondii]
           gi|763779844|gb|KJB46915.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
           gi|763779845|gb|KJB46916.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
           gi|763779850|gb|KJB46921.1| hypothetical protein
           B456_008G001300 [Gossypium raimondii]
          Length = 488

 Score =  412 bits (1058), Expect = e-112
 Identities = 203/313 (64%), Positives = 239/313 (76%), Gaps = 2/313 (0%)
 Frame = -2

Query: 935 EAYSDASQPG--KNSTDVKSANGWATTMKRIVSGSSMRALRERLIGPSNTGVSSSTSEIW 762
           + YS++S     +  +     NGW   +KR+VSG SMR + ER++GPS  G+SSSTS+IW
Sbjct: 56  DTYSESSSSSACERKSSFSKTNGWTAAVKRVVSGGSMRRIHERVLGPSKIGISSSTSDIW 115

Query: 761 FLGVCYKISPEDLSSDSTHGNGLVEFVDDFSSRIWMTYRKGFSAIGDSKLTSDVHWGCMH 582
            LG+CYKIS E  S D    + L  F  DFSSRI MTYRKGF AIG++K+TSD  WGCM 
Sbjct: 116 LLGLCYKISQES-SGDVDATSALAAFKQDFSSRILMTYRKGFDAIGETKITSDASWGCML 174

Query: 581 RSSQMLVAQALLFHHLGRSWRKPFEKPYDPVYIKILDQFGDSDSSAFSIHNLIQAGKAYG 402
           RSSQMLVAQALLFH LGRSWRKP +KP+D  YI+IL QFGDS++SAFSIHNL++AGK YG
Sbjct: 175 RSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIEILHQFGDSEASAFSIHNLVEAGKNYG 234

Query: 401 LAAGSWVGPYAMCRSWETLARLNREHADPEKEHTSLPMAVHIVSGDEDGERGGAPVLCIE 222
           LAAGSWVGPYAMCRSWE+LAR  RE  D E +   LPMAV++VSGDEDGERGGAPV+CIE
Sbjct: 235 LAAGSWVGPYAMCRSWESLARSKREEIDLECQ--LLPMAVYVVSGDEDGERGGAPVVCIE 292

Query: 221 DAARRCSEFGNGQADWSPIXXXXXXXXXLEKVNTRYIPSLRVTFTFPQSLGILGGKPGAS 42
           DA+R C EF   QADW+PI         L+KVN RYIPSL+ TFTFPQ LGILGGKPGAS
Sbjct: 293 DASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRYIPSLQATFTFPQCLGILGGKPGAS 352

Query: 41  TYIVGVQDDKAFY 3
           TYIVG+Q++  FY
Sbjct: 353 TYIVGIQEENVFY 365

