BLASTX nr result
ID: Cornus23_contig00005905
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00005905 (1590 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_009354662.1| PREDICTED: cysteine protease ATG4-like [Pyru...   667   0.0
ref|XP_007217926.1| hypothetical protein PRUPE_ppa004885mg [Prun...   657   0.0
ref|XP_008232834.1| PREDICTED: cysteine protease ATG4 [Prunus mume]   656   0.0
ref|XP_008346919.1| PREDICTED: cysteine protease ATG4-like isofo...   654   0.0
ref|XP_008375115.1| PREDICTED: cysteine protease ATG4 [Malus dom...   643   0.0
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2...   625   e-176
ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theo...   625   e-176
gb|KHG08139.1| Cysteine protease ATG4 [Gossypium arboreum]            622   e-175
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   620   e-174
ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2...   619   e-174
ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1...   619   e-174
ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1...   615   e-173
gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sin...   612   e-172
ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citr...   611   e-172
ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isofo...   611   e-172
ref|XP_007049915.1| Peptidase family C54 protein isoform 1 [Theo...   610   e-172
ref|XP_002309707.1| autophagy 4b family protein [Populus trichoc...   608   e-171
ref|XP_006372315.1| autophagy 4b family protein [Populus trichoc...   608   e-171
ref|XP_011005240.1| PREDICTED: cysteine protease ATG4-like [Popu...
607 e-170 ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isofo... 598 e-168 >ref|XP_009354662.1| PREDICTED: cysteine protease ATG4-like [Pyrus x bretschneideri] gi|694327605|ref|XP_009354663.1| PREDICTED: cysteine protease ATG4-like [Pyrus x bretschneideri] Length = 487 Score = 667 bits (1721), Expect = 0.0 Identities = 329/457 (71%), Positives = 374/457 (81%), Gaps = 13/457 (2%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 D GS +SK +K SLW+ FA FSIFET SES KK S SR+N WTAAV++ GSM Sbjct: 32 DSGSRDSKHNKASLWTNFFASAFSIFETHSESSITEKKESHSRNNGWTAAVRKVVTSGSM 91 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ HERVLG +RTGI SS SDIWLLGVCYK+SQ+DS+GD+ +GL AF +DFSS+ILMT Sbjct: 92 RRIHERVLGSSRTGI-SSASDIWLLGVCYKVSQDDSSGDAPINNGLGAFEQDFSSKILMT 150 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILH 1053 YRKGF+AIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWR+ LHKPLD YIEIL+ Sbjct: 151 YRKGFEAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRRPLHKPLDEAYIEILY 210 Query: 1052 LFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMA 876 FGDSE S FSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRC+RE T+LD Q LPMA Sbjct: 211 HFGDSETSTFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCRREVTDLDDQPLPMA 270 Query: 875 VYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSL 696 VY+VSGDEDGERGGAPVVCI+DASRHC EFS+GQ DWTPI LEK+NPRY+ Sbjct: 271 VYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVDWTPILLLVPLVLGLEKVNPRYIPS 330 Query: 695 LRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHC 516 LRATFTFPQSLGI+GG PGASTYI+GVQDE+A YLDPHE QPV+++RRD++EADT SYHC Sbjct: 331 LRATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPVINIRRDDLEADTLSYHC 390 Query: 515 NVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTS 336 NV H+PLD +DPSLAIGFYCRD+ DFNDFC RASKLAD+S+GAPLFTVT + P+ + Sbjct: 391 NVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASKLADESNGAPLFTVTQTHSFPRPVN 450 Query: 335 HHGTLSDSGEIQ-----------DAEGCSQEDDWQLL 258 H L DSG ++ DA+G +QEDDWQLL Sbjct: 451 
HSDALGDSGAVENDDSFSVLPMSDADGSAQEDDWQLL 487 >ref|XP_007217926.1| hypothetical protein PRUPE_ppa004885mg [Prunus persica] gi|462414388|gb|EMJ19125.1| hypothetical protein PRUPE_ppa004885mg [Prunus persica] Length = 487 Score = 657 bits (1696), Expect = 0.0 Identities = 329/457 (71%), Positives = 368/457 (80%), Gaps = 13/457 (2%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 D GS +SK K SLWS FA FSIFET SES KK SR+N WT AV++ GGSM Sbjct: 32 DSGSRDSKHDKASLWSNFFASAFSIFETHSESSITEKKEIHSRNNGWTEAVRKVVTGGSM 91 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ HERVLG +RTGI SS SDIWLLGV YK+SQ++S+GD+ +GL AF +DFSSRILMT Sbjct: 92 RRIHERVLGSSRTGI-SSASDIWLLGVLYKVSQDESSGDAATNNGLRAFEQDFSSRILMT 150 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILH 1053 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWR++LHKPLD YIEILH Sbjct: 151 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRRTLHKPLDEQYIEILH 210 Query: 1052 LFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMA 876 FGDSE SAFSIHNLLQAGKAYDLAAGSWVGPYAMCR+WETLVRCKRE T D Q LPMA Sbjct: 211 HFGDSEGSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWETLVRCKREGTAFDNQPLPMA 270 Query: 875 VYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSL 696 VY+VSGDEDGERGGAPVVCI DASRHC EFS+G+ DWTPI LEK+NPRY+ Sbjct: 271 VYIVSGDEDGERGGAPVVCIQDASRHCLEFSRGRVDWTPILLLVPLVLGLEKVNPRYIPS 330 Query: 695 LRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHC 516 L ATFTFPQSLGI+GG PGASTYI+GVQDE+A YLDPHE QP +++RRD++EADT SYHC Sbjct: 331 LWATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPAINIRRDDLEADTLSYHC 390 Query: 515 NVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTS 336 NV H+PLDS+DPSLAIGFYCRD+ DF+DFC RASKLAD S+GAPLFTVT + N PK + Sbjct: 391 NVIRHIPLDSIDPSLAIGFYCRDRDDFDDFCFRASKLADGSNGAPLFTVTQSHNFPKPVN 450 Query: 335 HHGTLSDSGEIQ-----------DAEGCSQEDDWQLL 258 H L DSG +Q DA+G + EDDWQLL Sbjct: 451 HSDVLDDSGGVQNDDSFVAPPISDADGSAHEDDWQLL 487 >ref|XP_008232834.1| PREDICTED: cysteine protease ATG4 
[Prunus mume] Length = 487 Score = 656 bits (1693), Expect = 0.0 Identities = 328/457 (71%), Positives = 367/457 (80%), Gaps = 13/457 (2%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 D GS +SK K SLWS FA FSIFET SES N KK SR+N WT AV++ GGSM Sbjct: 32 DSGSRDSKHDKASLWSNFFASAFSIFETHSESSNTEKKEIHSRNNGWTEAVRKVVTGGSM 91 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ HERVLG +RTGI SS SDIWLLGV YK+SQ++ +GD+ +GL AF +DFSSRILMT Sbjct: 92 RRIHERVLGSSRTGI-SSASDIWLLGVRYKVSQDEFSGDAATNNGLRAFEQDFSSRILMT 150 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILH 1053 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWR+ LHKPLD YIEILH Sbjct: 151 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRRPLHKPLDEQYIEILH 210 Query: 1052 LFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMA 876 FGDSE SAFSIHNLLQ+GKAYDLAAGSWVGPYAMCR+WETLVRCKRE T D Q LPMA Sbjct: 211 HFGDSEGSAFSIHNLLQSGKAYDLAAGSWVGPYAMCRSWETLVRCKREGTAFDNQPLPMA 270 Query: 875 VYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSL 696 VY+VSGDEDGERGGAPVVCI DASRHC EFS+G+ DWTPI LEK+NPRY+ Sbjct: 271 VYIVSGDEDGERGGAPVVCIQDASRHCLEFSRGRVDWTPILLLVPLVLGLEKVNPRYIPS 330 Query: 695 LRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHC 516 L ATFTFPQSLGI+GG PGASTYI+GVQDE+A YLDPHE QP +++RRD++EADT SYHC Sbjct: 331 LWATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPAINIRRDDLEADTLSYHC 390 Query: 515 NVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTS 336 NV H+PLDS+DPSLAIGFYCRD+ DF+DFC RASKLAD S+GAPLFTVT+ N PK + Sbjct: 391 NVIRHIPLDSIDPSLAIGFYCRDRDDFDDFCFRASKLADGSNGAPLFTVTETHNFPKPVN 450 Query: 335 HHGTLSDSGEIQ-----------DAEGCSQEDDWQLL 258 H L DSG +Q DA+G + EDDWQLL Sbjct: 451 HSDVLDDSGGVQNDDSFVAPPISDADGSAHEDDWQLL 487 >ref|XP_008346919.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Malus domestica] Length = 487 Score = 654 bits (1687), Expect = 0.0 Identities = 323/457 (70%), Positives = 370/457 (80%), Gaps = 13/457 (2%) Frame = -2 Query: 1589 
DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 D GS +SK +K SLWS F FSIFET SES KK S SR+N WTAAV++A + GSM Sbjct: 32 DSGSRDSKHNKASLWSNFFESAFSIFETHSESSITDKKESHSRNNGWTAAVRKAVSSGSM 91 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ E VLG +R GI SS SDIWLLGVCYK+SQ+DS+GD+ +GL AF +DFSSRILMT Sbjct: 92 RRIQEHVLGSSRIGI-SSASDIWLLGVCYKVSQDDSSGDAPINNGLGAFEQDFSSRILMT 150 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILH 1053 YRKGF+AIG+SKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSW + LHKPLD YI IL+ Sbjct: 151 YRKGFEAIGNSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWTRPLHKPLDEAYIGILY 210 Query: 1052 LFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMA 876 FGDSE S FSIHNLLQAG+AYDLAAGSWVGPYAMCRTWETLVRC+RE T+LD Q LPMA Sbjct: 211 HFGDSETSTFSIHNLLQAGRAYDLAAGSWVGPYAMCRTWETLVRCRREATDLDDQPLPMA 270 Query: 875 VYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSL 696 VY+VSGDEDGERGGAPVVCI+DASRHC EFS+GQ DWTPI LEK+NPRY+ Sbjct: 271 VYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVDWTPILLLVPLVLGLEKVNPRYIPS 330 Query: 695 LRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHC 516 LRATFTFPQSLGI+GG PGASTYI+GVQDE+A YLDPHE QPV+++RRD++EADT SYHC Sbjct: 331 LRATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPVINIRRDDLEADTLSYHC 390 Query: 515 NVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTS 336 NV H+PLD +DPSLAIGFYCRD+ DFNDFC RASKLAD+S+GAPLFTVT + P+ + Sbjct: 391 NVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASKLADESNGAPLFTVTQTHSVPRPVN 450 Query: 335 HHGTLSDSGEIQ-----------DAEGCSQEDDWQLL 258 H L DSG ++ DA+G +QED+WQLL Sbjct: 451 HSDALGDSGAVENDDSFSVLPMSDADGSAQEDEWQLL 487 >ref|XP_008375115.1| PREDICTED: cysteine protease ATG4 [Malus domestica] Length = 492 Score = 643 bits (1658), Expect = 0.0 Identities = 321/462 (69%), Positives = 365/462 (79%), Gaps = 18/462 (3%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 D GS +SK +K SLWS F FSIFET SES KK S SR+N WT AV++A GSM Sbjct: 32 
DSGSRDSKHNKASLWSNFFXSAFSIFETHSESSITXKKESHSRNNGWTXAVRKAVXSGSM 91 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ E VLG +R GI SS SDIWLLGVCYK+SQ+DS+GD+ +GL AF +DFSSRILMT Sbjct: 92 RRIXEXVLGSSRXGI-SSASDIWLLGVCYKVSQDDSSGDAPINNGLGAFEQDFSSRILMT 150 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQ-----ALLFHRLGRSWRKSLHKPLDHGY 1068 YRKGF+AIG+SKYTSDVNWGCMLRSSQMLV Q ALLFHRLGRSW + LHKPLD Y Sbjct: 151 YRKGFEAIGBSKYTSDVNWGCMLRSSQMLVXQXIFLQALLFHRLGRSWXRPLHKPLDEAY 210 Query: 1067 IEILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-Q 891 I IL+ FGDSE S FSIHNLLQAG AYDLAAGSWVGPYAMCRTWETLVRC+RE T+LD Q Sbjct: 211 IXILYHFGDSETSTFSIHNLLQAGXAYDLAAGSWVGPYAMCRTWETLVRCRREATDLDDQ 270 Query: 890 SLPMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINP 711 LPMAVY+VSGDEDGERGGAPVVCI+DASRHC EFS+GQ DWTPI LEK+NP Sbjct: 271 PLPMAVYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVDWTPILLLVPLVLGLEKVNP 330 Query: 710 RYVSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADT 531 RY+ LRATFTFPQSLGI+GG PG STYI+GVQDE+A YLDPHE QPV+++RRD++EADT Sbjct: 331 RYIPSLRATFTFPQSLGIMGGKPGVSTYIIGVQDEKALYLDPHEVQPVINIRRDDMEADT 390 Query: 530 SSYHCNVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNT 351 SYHCNV H+PLD +DPSLAIGFYCRD+ DFNDFC RASKLAD+S+GAPLFTVT + Sbjct: 391 LSYHCNVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASKLADESNGAPLFTVTQTHSV 450 Query: 350 PKQTSHHGTLSDSGEIQ-----------DAEGCSQEDDWQLL 258 P+ +H L DSG ++ DA+G +QEDDWQLL Sbjct: 451 PRPVNHSDALGDSGAVENDDSFSVLPMSDADGSAQEDDWQLL 492 >ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2 [Vitis vinifera] gi|296086874|emb|CBI33041.3| unnamed protein product [Vitis vinifera] Length = 486 Score = 625 bits (1611), Expect = e-176 Identities = 318/462 (68%), Positives = 368/462 (79%), Gaps = 18/462 (3%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESK-NCGKKASISR---SNWTAAVKRAWNG 1422 +P SS++KLSK SLWS +FA FS+FET SES + +K +I + WT AV++ G Sbjct: 25 EPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNNGWTTAVRKVVTG 84 Query: 1421 
GSMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRI 1242 SMR+ ERVLG ++TGI SSTSDIWLLG+CYKISQE+S+ ++ +GLA F +DFSSRI Sbjct: 85 VSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSRI 144 Query: 1241 LMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIE 1062 LMTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL HR+GRSWRK+ HKP+D YIE Sbjct: 145 LMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYIE 204 Query: 1061 ILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSL 885 ILH FGDS+ASAFSIHN+LQAGKAY LAAGSWVGPYAMCR+WETL R KREET+L+ QSL Sbjct: 205 ILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQSL 264 Query: 884 PMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRY 705 PMA+Y+VSGDEDGERGGAPVV I++ASRHC EFSKGQ DWTPI LEK+NPRY Sbjct: 265 PMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPRY 324 Query: 704 VSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSS 525 + L ATFTFPQSLGILGG PGASTYIVGVQDE+A+YLDPHEAQ VVD+RR+N+EADTSS Sbjct: 325 IPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTSS 384 Query: 524 YHCNVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPK 345 YHCN+ H+ LDS+DPSLAIGFYCRDK DF+DFC RASKLAD+S+GAPLFTV + PK Sbjct: 385 YHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIHSLPK 444 Query: 344 QTSHHGTLSD-SGEIQD----------AEGC--SQEDDWQLL 258 S + D SG +D AEG EDDWQLL Sbjct: 445 PISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 486 >ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theobroma cacao] gi|508702178|gb|EOX94074.1| Peptidase family C54 protein isoform 3 [Theobroma cacao] Length = 486 Score = 625 bits (1611), Expect = e-176 Identities = 312/456 (68%), Positives = 364/456 (79%), Gaps = 12/456 (2%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 +PG S+ K SK+S+WS LFA FSIF+T SES C KKA +R+N WTAAVKR +GGSM Sbjct: 31 EPGPSDCKFSKSSVWSNLFASAFSIFDTYSESSACEKKALHARNNGWTAAVKRVVSGGSM 90 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ HERVLG ++ GI 
SSTSDIWLLGVCYKISQ S+GD + +GLAAF DFSSRILMT Sbjct: 91 RRIHERVLGPSKIGISSSTSDIWLLGVCYKISQVSSSGDVDASNGLAAFKRDFSSRILMT 150 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQ-ALLFHRLGRSWRKSLHKPLDHGYIEIL 1056 YRKGFDAIGD+K TSD WGCMLRSSQMLVAQ ALLFH+LGRSWRK L KP + YIEIL Sbjct: 151 YRKGFDAIGDTKITSDFGWGCMLRSSQMLVAQQALLFHQLGRSWRKPLQKPFEQAYIEIL 210 Query: 1055 HLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPM 879 H FGDSEA+AFSIHNL++AGK Y LAAGSWVGPYAMCR+WE+L R KREE +L+ QSLPM Sbjct: 211 HQFGDSEATAFSIHNLVEAGKIYGLAAGSWVGPYAMCRSWESLARFKREENDLEHQSLPM 270 Query: 878 AVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVS 699 AVYVVSGDEDGERGGAPVVC++DASRHCFEFS+ + DWTPI L+K+N RY+ Sbjct: 271 AVYVVSGDEDGERGGAPVVCVEDASRHCFEFSRCRADWTPILLLVPLVLGLDKVNSRYIP 330 Query: 698 LLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYH 519 L+ATFTFPQ LGILGG PGASTYIVGVQ+E +YLDPH+ Q VV++ +DN EADTSSYH Sbjct: 331 SLQATFTFPQCLGILGGKPGASTYIVGVQEENVFYLDPHDVQLVVNLSQDNQEADTSSYH 390 Query: 518 CNVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQT 339 C++ H+PLDS+DPSLAIGF+CRDK DF+DFC RASKLAD+S+GAPLFTV ++ K Sbjct: 391 CDIIRHIPLDSIDPSLAIGFFCRDKDDFDDFCLRASKLADESNGAPLFTVAQTHSSFKPI 450 Query: 338 SHHGTLSDSGEIQ---------DAEGCSQEDDWQLL 258 SH L D+GE++ D +G EDDWQLL Sbjct: 451 SHGNALDDTGEVREDDSLGVVPDMDGSIHEDDWQLL 486 >gb|KHG08139.1| Cysteine protease ATG4 [Gossypium arboreum] Length = 530 Score = 622 bits (1603), Expect = e-175 Identities = 315/467 (67%), Positives = 367/467 (78%), Gaps = 23/467 (4%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKN---CGKKASISRSN-WTAAVKRAWNG 1422 +PG ++SK SKTSLWS FA FS+F+T SES + C KK+S S++N WTAAVKR +G Sbjct: 65 EPGPNDSKFSKTSLWSNFFASAFSVFDTYSESSSSSACEKKSSFSKTNGWTAAVKRVVSG 124 Query: 1421 GSMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRI 1242 GSMR+ HERVLG ++ GI SSTSDIWLLG+CYKISQE S+GD + LAAF +DFSSRI Sbjct: 125 GSMRRIHERVLGPSKIGISSSTSDIWLLGLCYKISQE-SSGDVDATSALAAFKQDFSSRI 183 Query: 1241 LMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIE 1062 
LMTYRKGFDAIG++K TSD +WGCMLRSSQMLVAQALLFHRLGRSWRK KP D YIE Sbjct: 184 LMTYRKGFDAIGETKITSDASWGCMLRSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIE 243 Query: 1061 ILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSL 885 ILH FGDSEASAFSIHNL++AGK Y LAAGSWVGPYAMCR+WE+L R KREE +L+ Q L Sbjct: 244 ILHQFGDSEASAFSIHNLVEAGKNYGLAAGSWVGPYAMCRSWESLARSKREENDLECQLL 303 Query: 884 PMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRY 705 PMAVYVVSGDEDGERGGAPVVCI+DASRHCFEFS+ Q DWTPI L+K+NPRY Sbjct: 304 PMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRY 363 Query: 704 VSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSS 525 + L+ATFTFPQ LGILGG PGASTYIVGVQ+E +YLDPH+ QPVV++ DN+EADTSS Sbjct: 364 IPSLQATFTFPQCLGILGGKPGASTYIVGVQEENVFYLDPHDVQPVVNLSSDNLEADTSS 423 Query: 524 YHCNVPCHVPLDSLDPSLAIGFYCRDKG-------DFNDFCSRASKLADQSDGAPLFTVT 366 YHC++ ++PLDSLDPSLAIGF+CRDKG DF+DFC RASKLAD+S+GAPLFTV Sbjct: 424 YHCDIIRYIPLDSLDPSLAIGFFCRDKGFPVNLVDDFDDFCFRASKLADESNGAPLFTVA 483 Query: 365 DARNTPKQTSHHGTLSDSG-----------EIQDAEGCSQEDDWQLL 258 + K +H T++D+G D +G S EDDWQLL Sbjct: 484 QTHSVFKPINHGDTMADAGGDRMDDSIGVLPTGDVDGNSHEDDWQLL 530 >emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera] Length = 489 Score = 620 bits (1598), Expect = e-174 Identities = 318/465 (68%), Positives = 368/465 (79%), Gaps = 21/465 (4%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESK-NCGKKASISR---SNWTAAVKRAWNG 1422 +P SS++KLSK SLWS +FA FS+FET SES + +K +I + WT AV++ G Sbjct: 25 EPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNNGWTTAVRKVVTG 84 Query: 1421 GSMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRI 1242 SMR+ ERVLG ++TGI SSTSDIWLLG+CYKISQE+S+ ++ +GLA F +DFSSRI Sbjct: 85 VSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSRI 144 Query: 1241 LMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIE 1062 LMTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL HR+GRSWRK+ HKP+D YIE Sbjct: 145 LMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYIE 204 Query: 1061 
ILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSL 885 ILH FGDS+ASAFSIHN+LQAGKAY LAAGSWVGPYAMCR+WETL R KREET+L+ QSL Sbjct: 205 ILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQSL 264 Query: 884 PMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRY 705 PMA+Y+VSGDEDGERGGAPVV I++ASRHC EFSKGQ DWTPI LEK+NPRY Sbjct: 265 PMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPRY 324 Query: 704 VSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSS 525 + L ATFTFPQSLGILGG PGASTYIVGVQDE+A+YLDPHEAQ VVD+RR+N+EADTSS Sbjct: 325 IPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTSS 384 Query: 524 YHCN---VPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARN 354 YHCN + H+ LDS+DPSLAIGFYCRDK DF+DFC RASKLAD+S+GAPLFTV + Sbjct: 385 YHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTVAHIHS 444 Query: 353 TPKQTSHHGTLSD-SGEIQD----------AEGC--SQEDDWQLL 258 PK S + D SG +D AEG EDDWQLL Sbjct: 445 LPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489 >ref|XP_012435801.1| PREDICTED: cysteine protease ATG4 isoform X2 [Gossypium raimondii] gi|763779844|gb|KJB46915.1| hypothetical protein B456_008G001300 [Gossypium raimondii] gi|763779845|gb|KJB46916.1| hypothetical protein B456_008G001300 [Gossypium raimondii] gi|763779850|gb|KJB46921.1| hypothetical protein B456_008G001300 [Gossypium raimondii] Length = 488 Score = 619 bits (1597), Expect = e-174 Identities = 309/460 (67%), Positives = 365/460 (79%), Gaps = 16/460 (3%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKN---CGKKASISRSN-WTAAVKRAWNG 1422 +PG ++SK SKTSLWS FA FS+F+T SES + C +K+S S++N WTAAVKR +G Sbjct: 30 EPGPNDSKFSKTSLWSNFFASAFSVFDTYSESSSSSACERKSSFSKTNGWTAAVKRVVSG 89 Query: 1421 GSMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRI 1242 GSMR+ HERVLG ++ GI SSTSDIWLLG+CYKISQE S+GD + LAAF +DFSSRI Sbjct: 90 GSMRRIHERVLGPSKIGISSSTSDIWLLGLCYKISQE-SSGDVDATSALAAFKQDFSSRI 148 Query: 1241 LMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIE 1062 LMTYRKGFDAIG++K TSD 
+WGCMLRSSQMLVAQALLFHRLGRSWRK KP D YIE Sbjct: 149 LMTYRKGFDAIGETKITSDASWGCMLRSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIE 208 Query: 1061 ILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSL 885 ILH FGDSEASAFSIHNL++AGK Y LAAGSWVGPYAMCR+WE+L R KREE +L+ Q L Sbjct: 209 ILHQFGDSEASAFSIHNLVEAGKNYGLAAGSWVGPYAMCRSWESLARSKREEIDLECQLL 268 Query: 884 PMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRY 705 PMAVYVVSGDEDGERGGAPVVCI+DASRHCFEFS+ Q DWTPI L+K+NPRY Sbjct: 269 PMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRY 328 Query: 704 VSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSS 525 + L+ATFTFPQ LGILGG PGASTYIVG+Q+E +YLDPH+ QPVV++ +N+EADTSS Sbjct: 329 IPSLQATFTFPQCLGILGGKPGASTYIVGIQEENVFYLDPHDVQPVVNLSTENLEADTSS 388 Query: 524 YHCNVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPK 345 YHCN+ ++PL+SLDPSLAIGF+CRDK DF+DFC RASKLAD+S+GAPLFTV + K Sbjct: 389 YHCNIIRYIPLESLDPSLAIGFFCRDKDDFDDFCFRASKLADESNGAPLFTVAQTHSVFK 448 Query: 344 QTSHHGTLSDSG-----------EIQDAEGCSQEDDWQLL 258 +H T++++G D +G S EDDWQ L Sbjct: 449 PINHGDTMANAGGDRMDDSVRVLPTGDVDGNSHEDDWQFL 488 >ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1 [Vitis vinifera] Length = 489 Score = 619 bits (1597), Expect = e-174 Identities = 318/465 (68%), Positives = 368/465 (79%), Gaps = 21/465 (4%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESK-NCGKKASISR---SNWTAAVKRAWNG 1422 +P SS++KLSK SLWS +FA FS+FET SES + +K +I + WT AV++ G Sbjct: 25 EPQSSDTKLSKVSLWSSVFASAFSVFETNSESSPSASEKKAIDNGRNNGWTTAVRKVVTG 84 Query: 1421 GSMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRI 1242 SMR+ ERVLG ++TGI SSTSDIWLLG+CYKISQE+S+ ++ +GLA F +DFSSRI Sbjct: 85 VSMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSRI 144 Query: 1241 LMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIE 1062 LMTYRKGF+AIGDSK TSDVNWGCMLRSSQMLVAQALL HR+GRSWRK+ HKP+D YIE Sbjct: 145 LMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYIE 204 Query: 1061 
ILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSL 885 ILH FGDS+ASAFSIHN+LQAGKAY LAAGSWVGPYAMCR+WETL R KREET+L+ QSL Sbjct: 205 ILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQSL 264 Query: 884 PMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRY 705 PMA+Y+VSGDEDGERGGAPVV I++ASRHC EFSKGQ DWTPI LEK+NPRY Sbjct: 265 PMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPRY 324 Query: 704 VSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSS 525 + L ATFTFPQSLGILGG PGASTYIVGVQDE+A+YLDPHEAQ VVD+RR+N+EADTSS Sbjct: 325 IPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTSS 384 Query: 524 YHCN---VPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARN 354 YHCN + H+ LDS+DPSLAIGFYCRDK DF+DFC RASKLAD+S+GAPLFTV + Sbjct: 385 YHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIHS 444 Query: 353 TPKQTSHHGTLSD-SGEIQD----------AEGC--SQEDDWQLL 258 PK S + D SG +D AEG EDDWQLL Sbjct: 445 LPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489 >ref|XP_012435798.1| PREDICTED: cysteine protease ATG4 isoform X1 [Gossypium raimondii] gi|823202390|ref|XP_012435799.1| PREDICTED: cysteine protease ATG4 isoform X1 [Gossypium raimondii] gi|823202393|ref|XP_012435800.1| PREDICTED: cysteine protease ATG4 isoform X1 [Gossypium raimondii] Length = 495 Score = 615 bits (1586), Expect = e-173 Identities = 310/467 (66%), Positives = 366/467 (78%), Gaps = 23/467 (4%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKN---CGKKASISRSN-WTAAVKRAWNG 1422 +PG ++SK SKTSLWS FA FS+F+T SES + C +K+S S++N WTAAVKR +G Sbjct: 30 EPGPNDSKFSKTSLWSNFFASAFSVFDTYSESSSSSACERKSSFSKTNGWTAAVKRVVSG 89 Query: 1421 GSMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRI 1242 GSMR+ HERVLG ++ GI SSTSDIWLLG+CYKISQE S+GD + LAAF +DFSSRI Sbjct: 90 GSMRRIHERVLGPSKIGISSSTSDIWLLGLCYKISQE-SSGDVDATSALAAFKQDFSSRI 148 Query: 1241 LMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIE 1062 LMTYRKGFDAIG++K TSD +WGCMLRSSQMLVAQALLFHRLGRSWRK KP D YIE Sbjct: 149 
LMTYRKGFDAIGETKITSDASWGCMLRSSQMLVAQALLFHRLGRSWRKPSQKPFDLAYIE 208 Query: 1061 ILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSL 885 ILH FGDSEASAFSIHNL++AGK Y LAAGSWVGPYAMCR+WE+L R KREE +L+ Q L Sbjct: 209 ILHQFGDSEASAFSIHNLVEAGKNYGLAAGSWVGPYAMCRSWESLARSKREEIDLECQLL 268 Query: 884 PMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRY 705 PMAVYVVSGDEDGERGGAPVVCI+DASRHCFEFS+ Q DWTPI L+K+NPRY Sbjct: 269 PMAVYVVSGDEDGERGGAPVVCIEDASRHCFEFSRHQADWTPILLLVPLVLGLDKVNPRY 328 Query: 704 VSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSS 525 + L+ATFTFPQ LGILGG PGASTYIVG+Q+E +YLDPH+ QPVV++ +N+EADTSS Sbjct: 329 IPSLQATFTFPQCLGILGGKPGASTYIVGIQEENVFYLDPHDVQPVVNLSTENLEADTSS 388 Query: 524 YHCNVPCHVPLDSLDPSLAIGFYCRDKG-------DFNDFCSRASKLADQSDGAPLFTVT 366 YHCN+ ++PL+SLDPSLAIGF+CRDKG DF+DFC RASKLAD+S+GAPLFTV Sbjct: 389 YHCNIIRYIPLESLDPSLAIGFFCRDKGFLVNLVDDFDDFCFRASKLADESNGAPLFTVA 448 Query: 365 DARNTPKQTSHHGTLSDSG-----------EIQDAEGCSQEDDWQLL 258 + K +H T++++G D +G S EDDWQ L Sbjct: 449 QTHSVFKPINHGDTMANAGGDRMDDSVRVLPTGDVDGNSHEDDWQFL 495 >gb|KDO40316.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis] gi|641820321|gb|KDO40317.1| hypothetical protein CISIN_1g011418mg [Citrus sinensis] Length = 486 Score = 612 bits (1577), Expect = e-172 Identities = 311/455 (68%), Positives = 355/455 (78%), Gaps = 13/455 (2%) Frame = -2 Query: 1583 GSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSMRK 1407 GSS SK SK SL S LF FS+FET SES KKA ++SN WTAAVKR GSMR+ Sbjct: 34 GSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSMRR 93 Query: 1406 FHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMTYR 1227 HERVLG +RTGI SSTSDIWLLGVC+KI+Q+++ GD+ +GLA F +DFSSRIL++YR Sbjct: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 153 Query: 1226 KGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILHLF 1047 KGFD IGDSK TSDV WGCMLRSSQMLVAQALLFHRLGR WRK L KP D Y+EILHLF Sbjct: 154 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213 Query: 1046 
GDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMAVY 870 GDSE S FSIHNLLQAGKAY LAAGSWVGPYAMCR+WE L RC+R ET L QSLPMA+Y Sbjct: 214 GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273 Query: 869 VVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSLLR 690 VVSGDEDGERGGAPVVCIDDASRHC FSKGQ DWTPI LEK+NPRY+ LR Sbjct: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333 Query: 689 ATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHCNV 510 TFTFPQSLGI+GG PGASTYIVGVQ+E A YLDPH+ QPV+++ +D++EADTS+YH +V Sbjct: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393 Query: 509 PCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTSHH 330 H+ LDS+DPSLAIGFYCRDK DF+DFC+RASKLA++S+GAPLFTVT P +H Sbjct: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP--VNHS 451 Query: 329 GTLSDSG-----------EIQDAEGCSQEDDWQLL 258 L ++G + DA G + EDDWQLL Sbjct: 452 DVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486 >ref|XP_006441973.1| hypothetical protein CICLE_v10019906mg [Citrus clementina] gi|557544235|gb|ESR55213.1| hypothetical protein CICLE_v10019906mg [Citrus clementina] Length = 486 Score = 611 bits (1576), Expect = e-172 Identities = 311/457 (68%), Positives = 356/457 (77%), Gaps = 13/457 (2%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 +PGSS SK SK SL S LF FS+FET SES KKA ++SN WTAAVKR GSM Sbjct: 32 EPGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKAVHNKSNGWTAAVKRLVTAGSM 91 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ HERVLG +RTGI SSTSDIWLLGVC+KI+Q+++ GD+ +GLA F +DFSSRIL++ Sbjct: 92 RRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILIS 151 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILH 1053 YRKGFD IGDSK TSDV WGCMLRSSQMLVAQALLFHRLGR WRK L KP D Y+EILH Sbjct: 152 YRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILH 211 Query: 1052 LFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMA 876 LFGDSE S FSIHNLLQAGKAY LAAGSWVGPYAMCR+WE L 
RC+R ET L QSLPMA Sbjct: 212 LFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMA 271 Query: 875 VYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSL 696 +YVVSGDEDGERGGAPVVCIDDASRHC FSKGQ DWTPI LEK+NPRY+ Sbjct: 272 IYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPT 331 Query: 695 LRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHC 516 LR TFTFPQSLGI+GG PGASTYIVGVQ+E A YLDPH+ Q V+++ +D++EADTS+YH Sbjct: 332 LRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQLVINIGKDDLEADTSTYHS 391 Query: 515 NVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTS 336 +V H+ LDS+DPSLAIGFYCRDK DF+DFC+RASKLA++S+GAPLFTVT P + Sbjct: 392 DVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP--VN 449 Query: 335 HHGTLSDSG-----------EIQDAEGCSQEDDWQLL 258 H L ++G + DA G + EDDWQLL Sbjct: 450 HSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486 >ref|XP_006478507.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Citrus sinensis] Length = 486 Score = 611 bits (1575), Expect = e-172 Identities = 311/455 (68%), Positives = 355/455 (78%), Gaps = 13/455 (2%) Frame = -2 Query: 1583 GSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSMRK 1407 GSS SK SK SL S LF FS+FET SES KKA ++SN WTAAVKR GSMR+ Sbjct: 34 GSSESKSSKGSLLSSLFNSAFSVFETYSESSANEKKAVHNKSNGWTAAVKRLVTAGSMRR 93 Query: 1406 FHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMTYR 1227 HERVLG +RTGI SSTSDIWLLGVC+KI+Q+++ GD+ +GLA F +DFSSRIL++YR Sbjct: 94 IHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGDAAGNNGLAEFNQDFSSRILISYR 153 Query: 1226 KGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILHLF 1047 KGFD IGDSK TSDV WGCMLRSSQMLVAQALLFHRLGR WRK L KP D Y+EILHLF Sbjct: 154 KGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRLGRPWRKPLQKPFDREYVEILHLF 213 Query: 1046 GDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMAVY 870 GDSE S FSIHNLLQAGKAY LAAGSWVGPYAMCR+WE L RC+R ET L QSLPMA+Y Sbjct: 214 GDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWEALARCQRAETGLGCQSLPMAIY 273 Query: 869 VVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSLLR 690 
VVSGDEDGERGGAPVVCIDDASRHC FSKGQ DWTPI LEK+NPRY+ LR Sbjct: 274 VVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTPILLLVPLVLGLEKVNPRYIPTLR 333 Query: 689 ATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHCNV 510 TFTFPQSLGI+GG PGASTYIVGVQ+E A YLDPH+ QPV+++ +D++EADTS+YH +V Sbjct: 334 LTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHDVQPVINIGKDDLEADTSTYHSDV 393 Query: 509 PCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTSHH 330 H+ LDS+DPSLAIGFYCRDK DF+DFC+RASKLA++S+GAPLFTVT P +H Sbjct: 394 IRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAEESNGAPLFTVTQTHKKP--VNHS 451 Query: 329 GTLSDSG-----------EIQDAEGCSQEDDWQLL 258 L ++G + DA G + EDDWQLL Sbjct: 452 DVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486 >ref|XP_007049915.1| Peptidase family C54 protein isoform 1 [Theobroma cacao] gi|508702176|gb|EOX94072.1| Peptidase family C54 protein isoform 1 [Theobroma cacao] Length = 514 Score = 610 bits (1574), Expect = e-172 Identities = 312/484 (64%), Positives = 364/484 (75%), Gaps = 40/484 (8%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASISRSN-WTAAVKRAWNGGSM 1413 +PG S+ K SK+S+WS LFA FSIF+T SES C KKA +R+N WTAAVKR +GGSM Sbjct: 31 EPGPSDCKFSKSSVWSNLFASAFSIFDTYSESSACEKKALHARNNGWTAAVKRVVSGGSM 90 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ HERVLG ++ GI SSTSDIWLLGVCYKISQ S+GD + +GLAAF DFSSRILMT Sbjct: 91 RRIHERVLGPSKIGISSSTSDIWLLGVCYKISQVSSSGDVDASNGLAAFKRDFSSRILMT 150 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILH 1053 YRKGFDAIGD+K TSD WGCMLRSSQMLVAQALLFH+LGRSWRK L KP + YIEILH Sbjct: 151 YRKGFDAIGDTKITSDFGWGCMLRSSQMLVAQALLFHQLGRSWRKPLQKPFEQAYIEILH 210 Query: 1052 LFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMA 876 FGDSEA+AFSIHNL++AGK Y LAAGSWVGPYAMCR+WE+L R KREE +L+ QSLPMA Sbjct: 211 QFGDSEATAFSIHNLVEAGKIYGLAAGSWVGPYAMCRSWESLARFKREENDLEHQSLPMA 270 Query: 875 VYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINP----- 711 VYVVSGDEDGERGGAPVVC++DASRHCFEFS+ + DWTPI L+K+N Sbjct: 271 
VYVVSGDEDGERGGAPVVCVEDASRHCFEFSRCRADWTPILLLVPLVLGLDKVNSSFCKE 330 Query: 710 -----------------RYVSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPH 582 Y+ L+ATFTFPQ LGILGG PGASTYIVGVQ+E +YLDPH Sbjct: 331 DSTFETEGELHLDFAYLEYIPSLQATFTFPQCLGILGGKPGASTYIVGVQEENVFYLDPH 390 Query: 581 EAQPVVDVRRDNVEADTSSYHCNVPCHVPLDSLDPSLAIGFYCRDKG-------DFNDFC 423 + Q VV++ +DN EADTSSYHC++ H+PLDS+DPSLAIGF+CRDKG DF+DFC Sbjct: 391 DVQLVVNLSQDNQEADTSSYHCDIIRHIPLDSIDPSLAIGFFCRDKGLPVDLVDDFDDFC 450 Query: 422 SRASKLADQSDGAPLFTVTDARNTPKQTSHHGTLSDSGEIQ---------DAEGCSQEDD 270 RASKLAD+S+GAPLFTV ++ K SH L D+GE++ D +G EDD Sbjct: 451 LRASKLADESNGAPLFTVAQTHSSFKPISHGNALDDTGEVREDDSLGVVPDMDGSIHEDD 510 Query: 269 WQLL 258 WQLL Sbjct: 511 WQLL 514 >ref|XP_002309707.1| autophagy 4b family protein [Populus trichocarpa] gi|222852610|gb|EEE90157.1| autophagy 4b family protein [Populus trichocarpa] Length = 481 Score = 608 bits (1569), Expect = e-171 Identities = 307/451 (68%), Positives = 360/451 (79%), Gaps = 7/451 (1%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKAS--ISRSN-WTAAVKRAWNGG 1419 +PGS+++K+SK SLWS FA FS+F+ +S + + I SN WT++VK+ GG Sbjct: 31 EPGSTDTKVSKPSLWSSFFASAFSVFDIYRDSSSTSHNEAPHIRHSNGWTSSVKKIVAGG 90 Query: 1418 SMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRIL 1239 +MR+ ERVLG ++TGI ++TSDIWLLG YKISQ+DS+G+++ + LAAF DFSSRIL Sbjct: 91 TMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATNALAAFHRDFSSRIL 150 Query: 1238 MTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEI 1059 +TYRKGFD I DSK TSDVNWGCMLRSSQMLVAQALLFHRLGRSWRK + KPLD Y+EI Sbjct: 151 ITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPVDKPLDRDYVEI 210 Query: 1058 LHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLP 882 LHLFGDSEASAFSIHNLLQAGKAY LAAGSWVGPYAMCR+WE+L R KREET L+ Q+LP Sbjct: 211 LHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARSKREETNLEYQTLP 270 Query: 881 MAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYV 702 MAVYVVSG EDGERGGAPV+ I+DA+RHC EFSKG+ DWTPI L+KINPRY+ Sbjct: 271 
MAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKINPRYI 330 Query: 701 SLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSY 522 L+ATFTFPQSLGILGG PGASTYIVGVQDE A+YLDPHE QPVV+ RD+VEA+TSSY Sbjct: 331 PSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVNFSRDDVEANTSSY 390 Query: 521 HCNVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQ 342 HC+V H+PLD +DPSLAIGFYCRDK DF+DFCS ASKLAD+S+GAPLFTV ++ + K Sbjct: 391 HCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNGAPLFTVANSYKSSKH 450 Query: 341 TSHHGTLSDS---GEIQDAEGCSQEDDWQLL 258 S D + DAEGC EDDWQLL Sbjct: 451 DSSEVRDDDPLGVMTMNDAEGCLNEDDWQLL 481 >ref|XP_006372315.1| autophagy 4b family protein [Populus trichocarpa] gi|550318931|gb|ERP50112.1| autophagy 4b family protein [Populus trichocarpa] Length = 482 Score = 608 bits (1568), Expect = e-171 Identities = 307/449 (68%), Positives = 357/449 (79%), Gaps = 7/449 (1%) Frame = -2 Query: 1583 GSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKAS--ISRSN-WTAAVKRAWNGGSM 1413 GS+++K SK SLWS FA FS+F+T +S + +K + I N WT+AVK+ GGSM Sbjct: 34 GSADTKFSKPSLWSTFFASAFSVFDTHCDSSSTSEKKAPHIRHGNGWTSAVKKIVAGGSM 93 Query: 1412 RKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRILMT 1233 R+ E VLG ++TGI ++T DIWLLG CYKISQ++S+GD+ + LAAF DFSSRIL+T Sbjct: 94 RRIQECVLGTSKTGISNTTGDIWLLGACYKISQDNSSGDAAATNALAAFNHDFSSRILIT 153 Query: 1232 YRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEILH 1053 YRKGFDAI DSK TSDV+WGCMLRSSQMLVAQALLFHRLGRSWRK L KPLD Y+EILH Sbjct: 154 YRKGFDAIEDSKLTSDVSWGCMLRSSQMLVAQALLFHRLGRSWRKPLDKPLDREYVEILH 213 Query: 1052 LFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLPMA 876 LFGDSE+SAFSIHNLL+AGKAY LAAGSWVGPYA+C +WE+LVR +REET L+ QSL MA Sbjct: 214 LFGDSESSAFSIHNLLRAGKAYGLAAGSWVGPYAVCHSWESLVRSRREETNLEYQSLSMA 273 Query: 875 VYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYVSL 696 VYVVSG EDGERGGAPV+CI++A+RHC EFSKGQ DWTPI L+KINPRY+ Sbjct: 274 VYVVSGSEDGERGGAPVLCIEEAARHCSEFSKGQEDWTPILLLVPLVLGLDKINPRYIPS 333 Query: 695 
LRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSYHC 516 L+ATFTFPQSLGILGG PGASTYIVGVQDE A+YLDPHE QPVV+V RD+VEA+TSSYHC Sbjct: 334 LQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVNVSRDDVEANTSSYHC 393 Query: 515 NVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQTS 336 NV H+PLD +DPSLAIGFYCRDK DF+DFC+ ASKL D+S+GAPLFTV +R K S Sbjct: 394 NVVRHMPLDLIDPSLAIGFYCRDKDDFDDFCTLASKLTDESNGAPLFTVAHSRKLLKHDS 453 Query: 335 HHGTLSDS---GEIQDAEGCSQEDDWQLL 258 DS + D EGC EDDWQLL Sbjct: 454 GEVRSDDSLGVMTMNDVEGCVHEDDWQLL 482 >ref|XP_011005240.1| PREDICTED: cysteine protease ATG4-like [Populus euphratica] Length = 481 Score = 607 bits (1564), Expect = e-170 Identities = 303/451 (67%), Positives = 355/451 (78%), Gaps = 7/451 (1%) Frame = -2 Query: 1589 DPGSSNSKLSKTSLWSGLFALPFSIFETTSESKNCGKKASIS---RSNWTAAVKRAWNGG 1419 +PGS+++K SK SLWS A FS+F+ +S + + R+ WT++VK+ GG Sbjct: 31 EPGSTDTKFSKPSLWSSFLASAFSVFDIYRDSSSTSHSEGLHIRHRNGWTSSVKKILAGG 90 Query: 1418 SMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDSNYIDGLAAFVEDFSSRIL 1239 +MR+ ERVLG ++TGI ++TSDIW LG CYKISQ DS+G+++ D LAAF DFSSRIL Sbjct: 91 TMRRIQERVLGTSKTGISNTTSDIWFLGACYKISQGDSSGNADATDALAAFHRDFSSRIL 150 Query: 1238 MTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIEI 1059 +TYRKGFD I DSK+TSDVNWGCMLRSSQMLVAQALLFHRLGRSWRK L KPLD Y+EI Sbjct: 151 ITYRKGFDMIEDSKFTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPLDKPLDQDYVEI 210 Query: 1058 LHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSLP 882 LHLFGDSEASAFSI NLLQAGKAY LAAGSWVGPYAMCR+WE+L R KREET L+ Q+LP Sbjct: 211 LHLFGDSEASAFSIRNLLQAGKAYGLAAGSWVGPYAMCRSWESLARSKREETNLEFQTLP 270 Query: 881 MAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRYV 702 MAVYVVSG EDGERGGAPV+ I+DA+RHC EFSKG+ DWTPI L+KINPRY+ Sbjct: 271 MAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKINPRYI 330 Query: 701 SLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSSY 522 L+ATFTFPQSLGILGG PGASTYIVGVQD+ A+YLDPHE QPVV+ RD+VEA+TSSY Sbjct: 331 PSLQATFTFPQSLGILGGKPGASTYIVGVQDKNAFYLDPHEVQPVVNFSRDDVEANTSSY 390 Query: 521 
HCNVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPKQ 342 HC+V H+PLD +DPSLAIGFYCRDK DF+DFCS ASKLAD+S+GAPLFTV ++ + K Sbjct: 391 HCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNGAPLFTVANSYKSSKH 450 Query: 341 TSHHGTLSDS---GEIQDAEGCSQEDDWQLL 258 S D + DAEGC ED WQLL Sbjct: 451 DSSEVRDDDPLGVMTMNDAEGCLNEDGWQLL 481 >ref|XP_012081889.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Jatropha curcas] gi|802675786|ref|XP_012081890.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Jatropha curcas] gi|802675813|ref|XP_012081891.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Jatropha curcas] gi|643718243|gb|KDP29532.1| hypothetical protein JCGZ_19245 [Jatropha curcas] Length = 492 Score = 598 bits (1541), Expect = e-168 Identities = 305/460 (66%), Positives = 351/460 (76%), Gaps = 17/460 (3%) Frame = -2 Query: 1586 PGSS-NSKLSKTSLWSGLFALPFSIFETTSESK--NCGKKASISRSN-WTAAVKRAWNGG 1419 PGSS +SK SK LWS F FS+FET ES KK S +R+N WT+AVK+ GG Sbjct: 33 PGSSGDSKFSKGFLWSSFFTAAFSVFETYRESPPTTSEKKGSHTRNNGWTSAVKKIVAGG 92 Query: 1418 SMRKFHERVLGVNRTGIYSSTSDIWLLGVCYKISQEDSTGDS-NYIDGLAAFVEDFSSRI 1242 SMR+ HERVLG +RTGI ++TS+IWLLGVCYKISQ+ S D+ +GLA F DFSSRI Sbjct: 93 SMRRIHERVLGPSRTGISNTTSEIWLLGVCYKISQDGSNADAATSNNGLADFTHDFSSRI 152 Query: 1241 LMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKSLHKPLDHGYIE 1062 LMTYRKGFDAIGDSK+TSDV WGCMLRSSQMLVAQALLFH+LGRSWRK + KPLD Y+E Sbjct: 153 LMTYRKGFDAIGDSKFTSDVGWGCMLRSSQMLVAQALLFHQLGRSWRKPIQKPLDQKYVE 212 Query: 1061 ILHLFGDSEASAFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCKREETELD-QSL 885 ILHLFGDSEAS FSIHNL+ AGKAY LAAGSWVGPYAMCR+WE L RCKREE L+ ++L Sbjct: 213 ILHLFGDSEASPFSIHNLIHAGKAYGLAAGSWVGPYAMCRSWELLARCKREENNLEHEAL 272 Query: 884 PMAVYVVSGDEDGERGGAPVVCIDDASRHCFEFSKGQFDWTPIXXXXXXXXXLEKINPRY 705 PMAVYVVSGDEDGERGGAPVVCI+DASRHC +FS+GQ +WTPI LEK+N RY Sbjct: 273 PMAVYVVSGDEDGERGGAPVVCIEDASRHCLDFSRGQANWTPILLLVPLVLGLEKVNLRY 332 Query: 704 VSLLRATFTFPQSLGILGGTPGASTYIVGVQDEEAYYLDPHEAQPVVDVRRDNVEADTSS 525 + L+AT TFPQSLGI+GG PGASTYIVGVQD+ A+YLDPH QPVV++ RD + ADTSS 
Sbjct: 333 IPSLQATLTFPQSLGIMGGKPGASTYIVGVQDDNAFYLDPHGVQPVVNISRDGIGADTSS 392

Query: 524 YHCNVPCHVPLDSLDPSLAIGFYCRDKGDFNDFCSRASKLADQSDGAPLFTVTDARNTPK 345
           YH +   H+PL+S+DPSLAIGFYCRDK DF++FC  ASKLAD S GAPLFTV  +   PK
Sbjct: 393 YHSDFIRHIPLESIDPSLAIGFYCRDKDDFDEFCFLASKLADDSHGAPLFTVAHSHKLPK 452

Query: 344 QTSHHGTLSDSGEIQ-----------DAEGCSQEDDWQLL 258
                 T +D+  +Q           DAE    EDDWQLL
Sbjct: 453 SVGRTDTSNDNSGVQEDDSLGVMPMNDAEAPVNEDDWQLL 492