BLASTX nr result

ID: Forsythia22_contig00014246 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00014246
         (2045 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012839481.1| PREDICTED: cysteine protease ATG4-like [Eryt...   643   0.0  
gb|EYU31238.1| hypothetical protein MIMGU_mgv1a004965mg [Erythra...   635   e-179
ref|XP_011100135.1| PREDICTED: cysteine protease ATG4-like [Sesa...   616   e-173
ref|XP_012844842.1| PREDICTED: cysteine protease ATG4-like [Eryt...   592   e-166
ref|XP_009588897.1| PREDICTED: cysteine protease ATG4-like isofo...   581   e-163
ref|XP_009761431.1| PREDICTED: cysteine protease ATG4-like isofo...   579   e-162
ref|XP_006354186.1| PREDICTED: cysteine protease ATG4-like isofo...   572   e-160
ref|XP_004228630.1| PREDICTED: cysteine protease ATG4 isoform X2...   570   e-159
ref|XP_009761430.1| PREDICTED: cysteine protease ATG4-like isofo...   564   e-157
ref|XP_010316044.1| PREDICTED: cysteine protease ATG4 isoform X1...   555   e-155
ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2...   546   e-152
ref|XP_010093156.1| hypothetical protein L484_005165 [Morus nota...   544   e-151
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   541   e-150
ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1...   540   e-150
ref|XP_009354662.1| PREDICTED: cysteine protease ATG4-like [Pyru...   540   e-150
gb|EPS69655.1| hypothetical protein M569_05108, partial [Genlise...   537   e-149
ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theo...   535   e-149
ref|XP_007217926.1| hypothetical protein PRUPE_ppa004885mg [Prun...   535   e-149
ref|XP_008232834.1| PREDICTED: cysteine protease ATG4 [Prunus mume]   532   e-148
ref|XP_008346919.1| PREDICTED: cysteine protease ATG4-like isofo...   528   e-147

>ref|XP_012839481.1| PREDICTED: cysteine protease ATG4-like [Erythranthe guttatus]
          Length = 502

 Score =  643 bits (1658), Expect = 0.0
 Identities = 332/512 (64%), Positives = 378/512 (73%), Gaps = 6/512 (1%)
 Frame = -3

Query: 1881 MKGFQGKNSDPIHNSGKRNCNY--SLNGIXXXXXXXXXXXXXXXXXXXXVAIWSGIWLPA 1708
            MKGF+  ++DP+ NS  +  ++  S                         AIWS IW   
Sbjct: 1    MKGFREIDTDPLRNSAVKKSDFDNSPKNSPGSVSSEAEPSIRHSNHKKSEAIWSVIWPTT 60

Query: 1707 LSIFGTSNHXXXXXXXXXXXXXXXXXXXXSG-RNYYEGWRAAVRRVMMNGGSMRRILGFN 1531
            + IFG SN                     SG +NYYEGWR AVRRVMMNG +MRRI GF+
Sbjct: 61   IPIFGISNSNRTESESKNSGGSVIRKKSSSGSKNYYEGWRGAVRRVMMNGVTMRRIWGFS 120

Query: 1530 KPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDFSSLILITYRKGFD 1351
            +   S S S+IWLLG+CY+V QEG+E   SSSDPTQ EGFAAFVEDFSS +LITYRKGF 
Sbjct: 121  RATTSSSKSNIWLLGVCYQVSQEGEE---SSSDPTQSEGFAAFVEDFSSRVLITYRKGFS 177

Query: 1350 PIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQSYIEVLQLFGDSM 1171
            PIGDSK+ SDV+WGCMLRSSQMLVAQAF+ HK GRSWRKS HKPLDQ+Y+E+L LFGD+ 
Sbjct: 178  PIGDSKYISDVNWGCMLRSSQMLVAQAFLVHKLGRSWRKSPHKPLDQTYVEILHLFGDAE 237

Query: 1170 DLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREE-NENGGLSSMMAIYVVS 994
            D P S+HNLLQAGKAYGLAPGSWVGPYAMCRTWE+LVR+K+EE   NG L+S MA+YVVS
Sbjct: 238  DSPCSVHNLLQAGKAYGLAPGSWVGPYAMCRTWESLVRNKKEEIGNNGVLASTMALYVVS 297

Query: 993  GDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKVNPRYLPLLAATF 814
            GDEDGERGGAPV+C++D+ RHC EF  G+VDW            L+KVNPRYLPLL+ATF
Sbjct: 298  GDEDGERGGAPVVCIEDVLRHCSEFGGGQVDWAPILLMVPLVLGLEKVNPRYLPLLSATF 357

Query: 813  TFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDADTSSYHSNVVRH 634
            TFPQ LGILGGRPGASTYIVGVQDE AFYLDPHEVQQV+DIKRDN+D DTSSYH NVVRH
Sbjct: 358  TFPQSLGILGGRPGASTYIVGVQDEKAFYLDPHEVQQVIDIKRDNLDLDTSSYHCNVVRH 417

Query: 633  IPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETRSSPNPAARHET- 457
            IPLDSID SLAIGFYC+DKSDF+DFCARA ELIDQSNGAPLFTI ETR+S  PA  + T 
Sbjct: 418  IPLDSIDSSLAIGFYCRDKSDFNDFCARASELIDQSNGAPLFTIAETRTSAKPAGNNNTE 477

Query: 456  LSDNATREYDSKDMLPTDGPEDCAQ-DDWQLL 364
            + DN T + D          E+CAQ DDWQLL
Sbjct: 478  IEDNNTADNDL-------ATENCAQEDDWQLL 502


>gb|EYU31238.1| hypothetical protein MIMGU_mgv1a004965mg [Erythranthe guttata]
          Length = 502

 Score =  635 bits (1638), Expect = e-179
 Identities = 330/512 (64%), Positives = 375/512 (73%), Gaps = 6/512 (1%)
 Frame = -3

Query: 1881 MKGFQGKNSDPIHNSGKRNCNY--SLNGIXXXXXXXXXXXXXXXXXXXXVAIWSGIWLPA 1708
            MKGF+  ++DP+ NS  +  ++  S                         AIWS IW   
Sbjct: 1    MKGFREIDTDPLRNSAVKKSDFDNSPKNSPGSVSSEAEPSILLSNHEKSEAIWSVIWPTT 60

Query: 1707 LSIFGTSNHXXXXXXXXXXXXXXXXXXXXSG-RNYYEGWRAAVRRVMMNGGSMRRILGFN 1531
            + IFG SN                     SG +NYYEGWR AVRRVMMNG +MRRI GF+
Sbjct: 61   IPIFGISNGNRSESGSRNSGSSIIRKKSSSGSKNYYEGWRGAVRRVMMNGVTMRRIWGFS 120

Query: 1530 KPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDFSSLILITYRKGFD 1351
            +   S S S+I LLG+CY+V QEG+E   SSSDPTQ EGFAAFVEDFSS +LITYRKGF 
Sbjct: 121  RASTSSSKSNICLLGVCYQVSQEGEE---SSSDPTQSEGFAAFVEDFSSRVLITYRKGFS 177

Query: 1350 PIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQSYIEVLQLFGDSM 1171
            PIGDSK+ SDV+WGCMLRSSQMLVAQAF+ HK GRSWRKS HKPLDQ+Y+E+L LFGD+ 
Sbjct: 178  PIGDSKYISDVNWGCMLRSSQMLVAQAFLVHKLGRSWRKSPHKPLDQTYVEILHLFGDAE 237

Query: 1170 DLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENGG-LSSMMAIYVVS 994
            D P S+HNLLQAGKAYGLAPGSWVGPYAMCRTWE+LVR+K+EE +N G L S MA+YVVS
Sbjct: 238  DSPCSVHNLLQAGKAYGLAPGSWVGPYAMCRTWESLVRNKKEEIDNNGVLLSTMALYVVS 297

Query: 993  GDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKVNPRYLPLLAATF 814
            GDEDGERGGAPV+C++D+ RHC EF  G+VDW            L+KVNPRYLPLL+ATF
Sbjct: 298  GDEDGERGGAPVVCIEDVLRHCSEFGGGQVDWAPILLMVPLVLGLEKVNPRYLPLLSATF 357

Query: 813  TFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDADTSSYHSNVVRH 634
            TFPQ LGILGGRPGASTYIVGVQDE AFYLDPHEVQQV+DIKRDN+D DTSSYH NVVRH
Sbjct: 358  TFPQSLGILGGRPGASTYIVGVQDEKAFYLDPHEVQQVIDIKRDNLDLDTSSYHCNVVRH 417

Query: 633  IPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETRSSPNPAARHETL 454
            IPLDSID SLAIGFYC+DKSDFDDFCARA ELIDQSNGAPLFTI ETR+   PA  + T 
Sbjct: 418  IPLDSIDSSLAIGFYCRDKSDFDDFCARASELIDQSNGAPLFTIAETRTLAKPAGNNNT- 476

Query: 453  SDNATREYDSKDMLPTD-GPEDCAQ-DDWQLL 364
                  E + K+    D   E+CAQ DDWQLL
Sbjct: 477  ------EIEDKNNADNDLATENCAQEDDWQLL 502


>ref|XP_011100135.1| PREDICTED: cysteine protease ATG4-like [Sesamum indicum]
          Length = 528

 Score =  616 bits (1588), Expect = e-173
 Identities = 325/530 (61%), Positives = 378/530 (71%), Gaps = 24/530 (4%)
 Frame = -3

Query: 1881 MKGFQGKNSDPIHNSGKRNCNYSLNGIXXXXXXXXXXXXXXXXXXXXVA-IWSGIWLPAL 1705
            MKGF G++SDPI++S K++   S N                       + IW GI  P L
Sbjct: 1    MKGFLGRSSDPIYSSCKKDSTCSYNQEKCLGSVSSEAGPSVSSSKNQKSVIWWGIRSPGL 60

Query: 1704 SIFGTSN-HXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGGSMRRILGFNK 1528
            SIF   N +                    SG+NYYEGWRAAVRRVMMNGG+MRRILGFN+
Sbjct: 61   SIFENLNKNCEKSDTRNCSNNRSAKKKLASGKNYYEGWRAAVRRVMMNGGTMRRILGFNR 120

Query: 1527 PGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDFSSLILITYRKGFDP 1348
             G+SVS SDIWLLGICY+V QEGD+ NS   DP Q EGFAAFVEDFSS +L TYRKGF P
Sbjct: 121  TGVSVSKSDIWLLGICYRVAQEGDDANSL--DPMQSEGFAAFVEDFSSRLLFTYRKGFSP 178

Query: 1347 IGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQSYIEVLQLFGDSMD 1168
            IGD+K+TSDV+WGCMLRSSQMLVAQAF+FHK GRSWRKS  K LD SY+E+L LFGD  D
Sbjct: 179  IGDTKYTSDVYWGCMLRSSQMLVAQAFLFHKLGRSWRKSPDKQLDPSYLEILHLFGDDED 238

Query: 1167 LPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENGGLSSMMAIYVVSGD 988
             P SIHNLLQAG  YGLAPGSWVGPYAMCRTWE+LVR+KREE  NG LSSM+A+YVVSGD
Sbjct: 239  SPCSIHNLLQAGGTYGLAPGSWVGPYAMCRTWESLVRNKREEIGNGILSSMIAVYVVSGD 298

Query: 987  EDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKVNPRYLPLLAATFTF 808
            +DGERGGAPVLC++DI+R C + S G+ DW            L+K+NPRYLPLL+ATFTF
Sbjct: 299  DDGERGGAPVLCIEDIARLCSKCSGGQCDWAPILLMVPLVLGLEKINPRYLPLLSATFTF 358

Query: 807  PQCLGILGGRPGASTYIVGVQDENAF------------------YLDPHEVQ-QVVDIKR 685
            PQ LG+LGG+PGASTYIVGVQDE                     Y     VQ QV  +K 
Sbjct: 359  PQSLGLLGGKPGASTYIVGVQDEKGTGNVTIASISVYSLVVKCQYAPSLFVQLQVAQVKM 418

Query: 684  DNVDADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFT 505
            D++DADTSSYH NVVRH+PL+SID SLAIGFYC++KSDFDDFC+RA ELIDQSNGAPLFT
Sbjct: 419  DDLDADTSSYHCNVVRHMPLESIDSSLAIGFYCRNKSDFDDFCSRASELIDQSNGAPLFT 478

Query: 504  ITETRSSPNPAARHETLSDNA--TREYDSKDMLPTDGPEDCAQDD-WQLL 364
            I E+RS+PNPA+   T  D+A  T++++  D LPT G E+  QDD WQLL
Sbjct: 479  IAESRSNPNPASYRSTAKDDAICTQDFEPVDKLPTSGSENFTQDDEWQLL 528


>ref|XP_012844842.1| PREDICTED: cysteine protease ATG4-like [Erythranthe guttatus]
          Length = 396

 Score =  592 bits (1526), Expect = e-166
 Identities = 296/406 (72%), Positives = 332/406 (81%), Gaps = 3/406 (0%)
 Frame = -3

Query: 1572 MMNGGSMRRILGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVED 1393
            MMNG +MRRI GF++   S S S+I LLG+CY+V QEG+E   SSSDPTQ EGFAAFVED
Sbjct: 1    MMNGVTMRRIWGFSRASTSSSKSNICLLGVCYQVSQEGEE---SSSDPTQSEGFAAFVED 57

Query: 1392 FSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLD 1213
            FSS +LITYRKGF PIGDSK+ SDV+WGCMLRSSQMLVAQAF+ HK GRSWRKS HKPLD
Sbjct: 58   FSSRVLITYRKGFSPIGDSKYISDVNWGCMLRSSQMLVAQAFLVHKLGRSWRKSPHKPLD 117

Query: 1212 QSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENEN 1033
            Q+Y+E+L LFGD+ D P S+HNLLQAGKAYGLAPGSWVGPYAMCRTWE+LVR+K+EE +N
Sbjct: 118  QTYVEILHLFGDAEDSPCSVHNLLQAGKAYGLAPGSWVGPYAMCRTWESLVRNKKEEIDN 177

Query: 1032 GG-LSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLD 856
             G L S MA+YVVSGDEDGERGGAPV+C++D+ RHC EF  G+VDW            L+
Sbjct: 178  NGVLLSTMALYVVSGDEDGERGGAPVVCIEDVLRHCSEFGGGQVDWAPILLMVPLVLGLE 237

Query: 855  KVNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNV 676
            KVNPRYLPLL+ATFTFPQ LGILGGRPGASTYIVGVQDE AFYLDPHEVQQV+DIKRDN+
Sbjct: 238  KVNPRYLPLLSATFTFPQSLGILGGRPGASTYIVGVQDEKAFYLDPHEVQQVIDIKRDNL 297

Query: 675  DADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITE 496
            D DTSSYH NVVRHIPLDSID SLAIGFYC+DKSDFDDFCARA ELIDQSNGAPLFTI E
Sbjct: 298  DLDTSSYHCNVVRHIPLDSIDSSLAIGFYCRDKSDFDDFCARASELIDQSNGAPLFTIAE 357

Query: 495  TRSSPNPAARHETLSDNATREYDSKDMLPTD-GPEDCAQ-DDWQLL 364
            TR+   PA  + T       E + K+    D   E+CAQ DDWQLL
Sbjct: 358  TRTLAKPAGNNNT-------EIEDKNNADNDLATENCAQEDDWQLL 396


>ref|XP_009588897.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Nicotiana
            tomentosiformis]
          Length = 487

 Score =  581 bits (1498), Expect = e-163
 Identities = 298/465 (64%), Positives = 342/465 (73%), Gaps = 7/465 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEG----WRAAVRRVM 1570
            ++WSG  +   SIF T                       S + YY G    W +AV+R M
Sbjct: 40   SVWSGFLVSPFSIFDTE------------PKGCLKKGDLSSKKYYNGIGINWTSAVKR-M 86

Query: 1569 MNGGSMRRILGFNKPGI-SVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVED 1393
            MN GSMRRI G NK GI + S SDIWLLG+CYKV Q+ D     S +PTQ EGF+AFV+D
Sbjct: 87   MNSGSMRRIFGINKTGIPNGSKSDIWLLGVCYKVVQDDDP----SIEPTQSEGFSAFVDD 142

Query: 1392 FSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLD 1213
            FSS IL+TYRKGF PIGDSK+TSDV+WGCMLRSSQMLVAQA + H+ GRSWRKSL KP D
Sbjct: 143  FSSRILVTYRKGFPPIGDSKYTSDVNWGCMLRSSQMLVAQALLLHRLGRSWRKSLDKPHD 202

Query: 1212 QSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENEN 1033
            Q Y+++L LFGDS +   SIHNLLQAGK YGL+PGSWVGPYAMCRTWETL RSKREE  N
Sbjct: 203  QKYVDILHLFGDSEESACSIHNLLQAGKTYGLSPGSWVGPYAMCRTWETLARSKREETGN 262

Query: 1032 GGLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDK 853
              +SS+M+ YVVSGDEDGERGGAPVLCV+DI RHC + S GEVDW            LDK
Sbjct: 263  ADMSSLMSFYVVSGDEDGERGGAPVLCVEDIVRHCSDLSNGEVDWIPVLFLVPLVLGLDK 322

Query: 852  VNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVD 673
            +N RYLPLLAATF+FPQ LGILGGRPGASTYIVGVQD+ AFYLDPHEVQ VVDIK D +D
Sbjct: 323  INSRYLPLLAATFSFPQSLGILGGRPGASTYIVGVQDDQAFYLDPHEVQPVVDIKTDKLD 382

Query: 672  ADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITET 493
             DTSSYH N +RH PLDSIDPSLAIGFYC+DKSDFDDFC RA +L+DQSNGAPLFTITET
Sbjct: 383  IDTSSYHCNTLRHFPLDSIDPSLAIGFYCRDKSDFDDFCIRASKLVDQSNGAPLFTITET 442

Query: 492  RSSPNPAARHETLSDNA-TREYDSKDML-PTDGPEDCAQDDWQLL 364
            RSSP     ++ ++ N+   E DS D + P +      +D+WQLL
Sbjct: 443  RSSPTAVEYNDRVTSNSGVPELDSFDAVGPGESDGGRPEDEWQLL 487


>ref|XP_009761431.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Nicotiana
            sylvestris]
          Length = 487

 Score =  579 bits (1492), Expect = e-162
 Identities = 298/465 (64%), Positives = 341/465 (73%), Gaps = 7/465 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEG----WRAAVRRVM 1570
            ++WSG  +   SIF T                       S + YY G    W +AV+R M
Sbjct: 40   SVWSGFLVSPFSIFDTE------------PKGCLKKGDLSSKKYYNGIGINWTSAVKR-M 86

Query: 1569 MNGGSMRRILGFNKPGI-SVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVED 1393
            MN GSMRRI G NK GI + S SDIWLLG+CYKV Q+ D     S +PTQ EGF+AFV+D
Sbjct: 87   MNSGSMRRIFGINKTGIPNGSKSDIWLLGVCYKVVQDDDP----SIEPTQSEGFSAFVDD 142

Query: 1392 FSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLD 1213
            FSS IL+TYRKGF PIGDSK+TSDV+WGCMLRSSQMLVAQA + H+ GRSWRKSL KP D
Sbjct: 143  FSSRILVTYRKGFPPIGDSKYTSDVNWGCMLRSSQMLVAQALLLHRLGRSWRKSLDKPHD 202

Query: 1212 QSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENEN 1033
            Q Y+E+L LFGDS +   SIHNLLQAGK YGL+PGSWVGPYAMCRTWETL RSKREE  N
Sbjct: 203  QKYVEILHLFGDSEESACSIHNLLQAGKTYGLSPGSWVGPYAMCRTWETLARSKREEMGN 262

Query: 1032 GGLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDK 853
              +SS+M+IYVVSGDEDGERGGAPVLCV+DI RHC + S+GE DW            LDK
Sbjct: 263  ADMSSLMSIYVVSGDEDGERGGAPVLCVEDIVRHCSDLSKGEADWIPVLFLVPLVLGLDK 322

Query: 852  VNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVD 673
            +N RYLPLLAATF+FPQ LGILGGRPGASTYIVGVQD+ AFYLDPHEVQ VVDIK D +D
Sbjct: 323  INSRYLPLLAATFSFPQSLGILGGRPGASTYIVGVQDDQAFYLDPHEVQPVVDIKTDKLD 382

Query: 672  ADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITET 493
             DTSSYH N VRH PLDSIDPSLAIGFYC+DKSDFDDFC RA +L+DQSNGAPLFTITET
Sbjct: 383  IDTSSYHCNTVRHFPLDSIDPSLAIGFYCRDKSDFDDFCIRASKLVDQSNGAPLFTITET 442

Query: 492  RSSPNPAARHETLSDNA-TREYDSKDML-PTDGPEDCAQDDWQLL 364
            RS       ++ ++ N+   E DS D + P +      +D+WQLL
Sbjct: 443  RSPATSVEYNDRVTSNSGVPELDSFDAVGPGESDGSRPEDEWQLL 487


>ref|XP_006354186.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Solanum tuberosum]
          Length = 496

 Score =  572 bits (1474), Expect = e-160
 Identities = 287/417 (68%), Positives = 327/417 (78%), Gaps = 5/417 (1%)
 Frame = -3

Query: 1599 GWRAAVRRVMMNGGSMRRILGFNKPGI-SVSNSDIWLLGICYKVCQEGDELNSSSSDPTQ 1423
            GW +AV+R M+N GSMRRI G +K GI + S SDIWLLG+CYKV Q+ D    SS +PTQ
Sbjct: 86   GWTSAVKR-MINSGSMRRIFGMDKTGIPNGSKSDIWLLGVCYKVVQDDD----SSIEPTQ 140

Query: 1422 REGFAAFVEDFSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRS 1243
             EGF+AFV+DFSS IL+TYRKGF PIGD+K+TSDV+WGCMLRSSQMLVAQA + H+ GRS
Sbjct: 141  SEGFSAFVDDFSSRILVTYRKGFAPIGDTKYTSDVNWGCMLRSSQMLVAQALLLHRLGRS 200

Query: 1242 WRKSLHKPLDQSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETL 1063
            WRKS+ KPL++ Y+E+L LFGDS    +SIHNLLQAGK YGL+PGSWVGPYAMCRTWETL
Sbjct: 201  WRKSMDKPLEEKYVEILHLFGDSEGSAYSIHNLLQAGKTYGLSPGSWVGPYAMCRTWETL 260

Query: 1062 VRSKREENENGGLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXX 883
             RSKREE  N  +S  MAIYVVSGDEDGERGGAPVLC++DI +HC   S+GEVDWT    
Sbjct: 261  ARSKREETGNADVSPAMAIYVVSGDEDGERGGAPVLCIEDIVKHCSGLSKGEVDWTPVVF 320

Query: 882  XXXXXXXLDKVNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQ 703
                   LDK+N RYLPLLAATF+FPQ LGILGGRPGASTYI+GVQD+ AFYLDPHEVQ 
Sbjct: 321  LVPLVLGLDKINSRYLPLLAATFSFPQSLGILGGRPGASTYIIGVQDDKAFYLDPHEVQP 380

Query: 702  VVDIKRDNVDADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSN 523
            VVDIK D +D DTSSYH N VRH PLDSIDPSLAIGFYC+DKSDFDDFC RA EL+DQSN
Sbjct: 381  VVDIKMDKLDVDTSSYHCNTVRHFPLDSIDPSLAIGFYCRDKSDFDDFCIRASELVDQSN 440

Query: 522  GAPLFTITETRSSPNPAARHETL-SDNATREYDSKDMLPTDGPEDCA---QDDWQLL 364
            GAPLFTIT TRSS      ++ L SD    E DS D     G  D +   +D+WQLL
Sbjct: 441  GAPLFTITATRSSATSVEYNDRLTSDTGVPELDSFD-AGAPGESDGSSRPEDEWQLL 496


>ref|XP_004228630.1| PREDICTED: cysteine protease ATG4 isoform X2 [Solanum lycopersicum]
          Length = 493

 Score =  570 bits (1470), Expect = e-159
 Identities = 287/417 (68%), Positives = 328/417 (78%), Gaps = 5/417 (1%)
 Frame = -3

Query: 1599 GWRAAVRRVMMNGGSMRRILGFNKPGI-SVSNSDIWLLGICYKVCQEGDELNSSSSDPTQ 1423
            GW +AV+R M+N GSMRRI G +K G+ + S SDIWLLG+CYKV Q+ D    SS +PTQ
Sbjct: 83   GWTSAVKR-MINSGSMRRIFGMDKTGMPNGSKSDIWLLGVCYKVVQDDD----SSIEPTQ 137

Query: 1422 REGFAAFVEDFSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRS 1243
             EGFAAFV+DFSS IL+TYRKGF PI D+K+TSDV+WGCMLRSSQMLVAQA + H+ GRS
Sbjct: 138  SEGFAAFVDDFSSRILVTYRKGFAPIEDTKYTSDVNWGCMLRSSQMLVAQALLLHRLGRS 197

Query: 1242 WRKSLHKPLDQSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETL 1063
            WRKS+ KPL+Q Y+E+L LFGDS++  +SIHNLLQAGK YGL+PGSWVGPYAMCRTWETL
Sbjct: 198  WRKSMDKPLEQKYVEILHLFGDSVESAYSIHNLLQAGKTYGLSPGSWVGPYAMCRTWETL 257

Query: 1062 VRSKREENENGGLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXX 883
             R KREE  N  +S  MAIYVVSGDEDGERGGAPVLCV+DI +HC   ++GEVDWT    
Sbjct: 258  ARCKREETGNAVMSPAMAIYVVSGDEDGERGGAPVLCVEDIVKHCSGLAKGEVDWTPVLF 317

Query: 882  XXXXXXXLDKVNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQ 703
                   LDK+N RYLPLLAATF+FPQ LGILGGRPGASTYIVGVQD+ A YLDPHEVQ 
Sbjct: 318  LVPLVLGLDKINSRYLPLLAATFSFPQSLGILGGRPGASTYIVGVQDDKAVYLDPHEVQP 377

Query: 702  VVDIKRDNVDADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSN 523
            VVDIK D +D DTSSYH N VRH PLDSIDPSLAIGFYC+DKSDFDDFC RA EL+DQSN
Sbjct: 378  VVDIKMDKLDVDTSSYHCNTVRHFPLDSIDPSLAIGFYCRDKSDFDDFCIRASELVDQSN 437

Query: 522  GAPLFTITETRSSPNPAARHETL-SDNATREYDSKDML---PTDGPEDCAQDDWQLL 364
            GAPLFTITETRSS      ++ L SD    E DS D +    +DG     +D+WQLL
Sbjct: 438  GAPLFTITETRSSATSVEYNDRLTSDTGVPELDSFDAVAPGESDG-SSRPEDEWQLL 493


>ref|XP_009761430.1| PREDICTED: cysteine protease ATG4-like isoform X1 [Nicotiana
            sylvestris]
          Length = 515

 Score =  564 bits (1453), Expect = e-157
 Identities = 298/493 (60%), Positives = 341/493 (69%), Gaps = 35/493 (7%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEG----WRAAVRRVM 1570
            ++WSG  +   SIF T                       S + YY G    W +AV+R M
Sbjct: 40   SVWSGFLVSPFSIFDTE------------PKGCLKKGDLSSKKYYNGIGINWTSAVKR-M 86

Query: 1569 MNGGSMRRILGFNKPGI-SVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVED 1393
            MN GSMRRI G NK GI + S SDIWLLG+CYKV Q+ D     S +PTQ EGF+AFV+D
Sbjct: 87   MNSGSMRRIFGINKTGIPNGSKSDIWLLGVCYKVVQDDDP----SIEPTQSEGFSAFVDD 142

Query: 1392 FSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHK--- 1222
            FSS IL+TYRKGF PIGDSK+TSDV+WGCMLRSSQMLVAQA + H+ GRSWRKSL K   
Sbjct: 143  FSSRILVTYRKGFPPIGDSKYTSDVNWGCMLRSSQMLVAQALLLHRLGRSWRKSLDKVLE 202

Query: 1221 -------------------------PLDQSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGL 1117
                                     P DQ Y+E+L LFGDS +   SIHNLLQAGK YGL
Sbjct: 203  SQNTAVLSVVKMLNFQFKTTILHSQPHDQKYVEILHLFGDSEESACSIHNLLQAGKTYGL 262

Query: 1116 APGSWVGPYAMCRTWETLVRSKREENENGGLSSMMAIYVVSGDEDGERGGAPVLCVQDIS 937
            +PGSWVGPYAMCRTWETL RSKREE  N  +SS+M+IYVVSGDEDGERGGAPVLCV+DI 
Sbjct: 263  SPGSWVGPYAMCRTWETLARSKREEMGNADMSSLMSIYVVSGDEDGERGGAPVLCVEDIV 322

Query: 936  RHCFEFSRGEVDWTXXXXXXXXXXXLDKVNPRYLPLLAATFTFPQCLGILGGRPGASTYI 757
            RHC + S+GE DW            LDK+N RYLPLLAATF+FPQ LGILGGRPGASTYI
Sbjct: 323  RHCSDLSKGEADWIPVLFLVPLVLGLDKINSRYLPLLAATFSFPQSLGILGGRPGASTYI 382

Query: 756  VGVQDENAFYLDPHEVQQVVDIKRDNVDADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDK 577
            VGVQD+ AFYLDPHEVQ VVDIK D +D DTSSYH N VRH PLDSIDPSLAIGFYC+DK
Sbjct: 383  VGVQDDQAFYLDPHEVQPVVDIKTDKLDIDTSSYHCNTVRHFPLDSIDPSLAIGFYCRDK 442

Query: 576  SDFDDFCARALELIDQSNGAPLFTITETRSSPNPAARHETLSDNA-TREYDSKDML-PTD 403
            SDFDDFC RA +L+DQSNGAPLFTITETRS       ++ ++ N+   E DS D + P +
Sbjct: 443  SDFDDFCIRASKLVDQSNGAPLFTITETRSPATSVEYNDRVTSNSGVPELDSFDAVGPGE 502

Query: 402  GPEDCAQDDWQLL 364
                  +D+WQLL
Sbjct: 503  SDGSRPEDEWQLL 515


>ref|XP_010316044.1| PREDICTED: cysteine protease ATG4 isoform X1 [Solanum lycopersicum]
          Length = 521

 Score =  555 bits (1431), Expect = e-155
 Identities = 287/445 (64%), Positives = 328/445 (73%), Gaps = 33/445 (7%)
 Frame = -3

Query: 1599 GWRAAVRRVMMNGGSMRRILGFNKPGI-SVSNSDIWLLGICYKVCQEGDELNSSSSDPTQ 1423
            GW +AV+R M+N GSMRRI G +K G+ + S SDIWLLG+CYKV Q+ D    SS +PTQ
Sbjct: 83   GWTSAVKR-MINSGSMRRIFGMDKTGMPNGSKSDIWLLGVCYKVVQDDD----SSIEPTQ 137

Query: 1422 REGFAAFVEDFSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRS 1243
             EGFAAFV+DFSS IL+TYRKGF PI D+K+TSDV+WGCMLRSSQMLVAQA + H+ GRS
Sbjct: 138  SEGFAAFVDDFSSRILVTYRKGFAPIEDTKYTSDVNWGCMLRSSQMLVAQALLLHRLGRS 197

Query: 1242 WRKSLHK----------------------------PLDQSYIEVLQLFGDSMDLPFSIHN 1147
            WRKS+ K                            PL+Q Y+E+L LFGDS++  +SIHN
Sbjct: 198  WRKSMDKVLESQNTAVFSVVKMLNFQFRTTIMHNQPLEQKYVEILHLFGDSVESAYSIHN 257

Query: 1146 LLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENGGLSSMMAIYVVSGDEDGERGG 967
            LLQAGK YGL+PGSWVGPYAMCRTWETL R KREE  N  +S  MAIYVVSGDEDGERGG
Sbjct: 258  LLQAGKTYGLSPGSWVGPYAMCRTWETLARCKREETGNAVMSPAMAIYVVSGDEDGERGG 317

Query: 966  APVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKVNPRYLPLLAATFTFPQCLGIL 787
            APVLCV+DI +HC   ++GEVDWT           LDK+N RYLPLLAATF+FPQ LGIL
Sbjct: 318  APVLCVEDIVKHCSGLAKGEVDWTPVLFLVPLVLGLDKINSRYLPLLAATFSFPQSLGIL 377

Query: 786  GGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDADTSSYHSNVVRHIPLDSIDPS 607
            GGRPGASTYIVGVQD+ A YLDPHEVQ VVDIK D +D DTSSYH N VRH PLDSIDPS
Sbjct: 378  GGRPGASTYIVGVQDDKAVYLDPHEVQPVVDIKMDKLDVDTSSYHCNTVRHFPLDSIDPS 437

Query: 606  LAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETRSSPNPAARHETL-SDNATREY 430
            LAIGFYC+DKSDFDDFC RA EL+DQSNGAPLFTITETRSS      ++ L SD    E 
Sbjct: 438  LAIGFYCRDKSDFDDFCIRASELVDQSNGAPLFTITETRSSATSVEYNDRLTSDTGVPEL 497

Query: 429  DSKDML---PTDGPEDCAQDDWQLL 364
            DS D +    +DG     +D+WQLL
Sbjct: 498  DSFDAVAPGESDG-SSRPEDEWQLL 521


>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4 isoform X2 [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  546 bits (1406), Expect = e-152
 Identities = 282/466 (60%), Positives = 336/466 (72%), Gaps = 8/466 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++WS ++  A S+F T++                      GRN   GW  AVR+V+  G 
Sbjct: 37   SLWSSVFASAFSVFETNSESSPSASEKKAIDN--------GRN--NGWTTAVRKVV-TGV 85

Query: 1557 SMRRI----LGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDF 1390
            SMRRI    LG +K GIS S SDIWLLG+CYK+ QE    ++SSS+     G A F +DF
Sbjct: 86   SMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSN-----GLAEFEQDF 140

Query: 1389 SSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQ 1210
            SS IL+TYRKGF+ IGDSK TSDV+WGCMLRSSQMLVAQA + H+ GRSWRK+ HKP+DQ
Sbjct: 141  SSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQ 200

Query: 1209 SYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENG 1030
             YIE+L  FGDS    FSIHN+LQAGKAYGLA GSWVGPYAMCR+WETL RSKREE +  
Sbjct: 201  DYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLE 260

Query: 1029 GLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKV 850
              S  MAIY+VSGDEDGERGGAPV+ +++ SRHC EFS+G+VDWT           L+KV
Sbjct: 261  CQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKV 320

Query: 849  NPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDA 670
            NPRY+P LAATFTFPQ LGILGG+PGASTYIVGVQDE AFYLDPHE Q VVDI+R+N++A
Sbjct: 321  NPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEA 380

Query: 669  DTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETR 490
            DTSSYH N++RHI LDSIDPSLAIGFYC+DK DFDDFC RA +L D+SNGAPLFT+    
Sbjct: 381  DTSSYHCNIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIH 440

Query: 489  SSPNPAARHETLSD-NATREYDSKDMLPTDGPEDCA---QDDWQLL 364
            S P P +  + + D +  RE DS D++   G E      +DDWQLL
Sbjct: 441  SLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 486


>ref|XP_010093156.1| hypothetical protein L484_005165 [Morus notabilis]
            gi|587863878|gb|EXB53615.1| hypothetical protein
            L484_005165 [Morus notabilis]
          Length = 444

 Score =  544 bits (1401), Expect = e-151
 Identities = 266/421 (63%), Positives = 318/421 (75%), Gaps = 4/421 (0%)
 Frame = -3

Query: 1614 RNYYEGWRAAVRRVMMNGGSMR---RILGFNKPGISVSNSDIWLLGICYKVCQEGDELNS 1444
            R+ + GW AAVR+ +  G   R   RILG+ + G+S S SDIWLLG+CYK+ Q+   ++ 
Sbjct: 29   RSRFNGWTAAVRKAVSVGSMRRFHERILGYARTGVSSSTSDIWLLGVCYKISQDEPSVDL 88

Query: 1443 SSSDPTQREGFAAFVEDFSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFI 1264
                P    G A F +DFSS IL+TYRKGF  IGDSK+TSDV+WGCMLRSSQMLVAQA +
Sbjct: 89   ----PAANSGLADFEQDFSSRILMTYRKGFGAIGDSKYTSDVNWGCMLRSSQMLVAQALL 144

Query: 1263 FHKFGRSWRKSLHKPLDQSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAM 1084
            FH+ GR WR+ +  PLDQ YI++L  F DS +  FSIHNLLQAGKAY L  GSW+GPYAM
Sbjct: 145  FHRLGRCWRRPVQSPLDQEYIDILNHFDDSEESAFSIHNLLQAGKAYDLTAGSWMGPYAM 204

Query: 1083 CRTWETLVRSKREENENGGLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEV 904
            CRTWETLVRSKREEN+       MA+Y+VSGDEDGERGGAPV+CV+D  RHC EFSRG+ 
Sbjct: 205  CRTWETLVRSKREENDFENHPLPMAVYIVSGDEDGERGGAPVVCVEDAFRHCLEFSRGQA 264

Query: 903  DWTXXXXXXXXXXXLDKVNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYL 724
            +WT           LD VNPRY+P L  TFTFPQ LGI+GGRPGASTYIVGVQDE AFYL
Sbjct: 265  NWTPMLLLVPLVLGLDTVNPRYIPSLRETFTFPQSLGIMGGRPGASTYIVGVQDEKAFYL 324

Query: 723  DPHEVQQVVDIKRDNVDADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARAL 544
            DPHEVQ V+DI R++V+ADTSSYHSNV+RHI LDSIDPSLAIGFYC+DK+DFDDFC RA 
Sbjct: 325  DPHEVQPVIDISRNSVEADTSSYHSNVIRHIGLDSIDPSLAIGFYCRDKNDFDDFCFRAS 384

Query: 543  ELIDQSNGAPLFTITETRSSPNPAARHETLSDNATREYDSKDMLPTDGPEDCA-QDDWQL 367
            +L D+SNGAPLFT+T T++ P P    + L D++    DS D LP++  EDC+ +DDWQL
Sbjct: 385  KLADESNGAPLFTVTRTKNLPKPVGHADVLGDSSGIS-DSFDALPSNNTEDCSHEDDWQL 443

Query: 366  L 364
            L
Sbjct: 444  L 444


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  541 bits (1393), Expect = e-150
 Identities = 282/469 (60%), Positives = 337/469 (71%), Gaps = 11/469 (2%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++WS ++  A S+F T++                      GRN   GW  AVR+V+  G 
Sbjct: 37   SLWSSVFASAFSVFETNSESSPSASEKKAIDN--------GRN--NGWTTAVRKVV-TGV 85

Query: 1557 SMRRI----LGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDF 1390
            SMRRI    LG +K GIS S SDIWLLG+CYK+ QE    ++SSS+     G A F +DF
Sbjct: 86   SMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSN-----GLAEFEQDF 140

Query: 1389 SSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQ 1210
            SS IL+TYRKGF+ IGDSK TSDV+WGCMLRSSQMLVAQA + H+ GRSWRK+ HKP+DQ
Sbjct: 141  SSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQ 200

Query: 1209 SYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENG 1030
             YIE+L  FGDS    FSIHN+LQAGKAYGLA GSWVGPYAMCR+WETL RSKREE +  
Sbjct: 201  DYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLE 260

Query: 1029 GLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKV 850
              S  MAIY+VSGDEDGERGGAPV+ +++ SRHC EFS+G+VDWT           L+KV
Sbjct: 261  CQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKV 320

Query: 849  NPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDA 670
            NPRY+P LAATFTFPQ LGILGG+PGASTYIVGVQDE AFYLDPHE Q VVDI+R+N++A
Sbjct: 321  NPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEA 380

Query: 669  DTSSYH---SNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTIT 499
            DTSSYH   S+++RHI LDSIDPSLAIGFYC+DK DFDDFC RA +L D+SNGAPLFT+ 
Sbjct: 381  DTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADESNGAPLFTVA 440

Query: 498  ETRSSPNPAARHETLSD-NATREYDSKDMLPTDGPEDCA---QDDWQLL 364
               S P P +  + + D +  RE DS D++   G E      +DDWQLL
Sbjct: 441  HIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_010646415.1| PREDICTED: cysteine protease ATG4 isoform X1 [Vitis vinifera]
          Length = 489

 Score =  540 bits (1392), Expect = e-150
 Identities = 282/469 (60%), Positives = 337/469 (71%), Gaps = 11/469 (2%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++WS ++  A S+F T++                      GRN   GW  AVR+V+  G 
Sbjct: 37   SLWSSVFASAFSVFETNSESSPSASEKKAIDN--------GRN--NGWTTAVRKVV-TGV 85

Query: 1557 SMRRI----LGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDF 1390
            SMRRI    LG +K GIS S SDIWLLG+CYK+ QE    ++SSS+     G A F +DF
Sbjct: 86   SMRRIQERVLGTSKTGISSSTSDIWLLGLCYKISQEESSNHASSSN-----GLAEFEQDF 140

Query: 1389 SSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQ 1210
            SS IL+TYRKGF+ IGDSK TSDV+WGCMLRSSQMLVAQA + H+ GRSWRK+ HKP+DQ
Sbjct: 141  SSRILMTYRKGFEAIGDSKLTSDVNWGCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQ 200

Query: 1209 SYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENG 1030
             YIE+L  FGDS    FSIHN+LQAGKAYGLA GSWVGPYAMCR+WETL RSKREE +  
Sbjct: 201  DYIEILHHFGDSKASAFSIHNILQAGKAYGLAAGSWVGPYAMCRSWETLARSKREETDLE 260

Query: 1029 GLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKV 850
              S  MAIY+VSGDEDGERGGAPV+ +++ SRHC EFS+G+VDWT           L+KV
Sbjct: 261  CQSLPMAIYIVSGDEDGERGGAPVVYIEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKV 320

Query: 849  NPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDA 670
            NPRY+P LAATFTFPQ LGILGG+PGASTYIVGVQDE AFYLDPHE Q VVDI+R+N++A
Sbjct: 321  NPRYIPSLAATFTFPQSLGILGGKPGASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEA 380

Query: 669  DTSSYH---SNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTIT 499
            DTSSYH   S+++RHI LDSIDPSLAIGFYC+DK DFDDFC RA +L D+SNGAPLFT+ 
Sbjct: 381  DTSSYHCNCSSIIRHICLDSIDPSLAIGFYCRDKDDFDDFCIRASKLADKSNGAPLFTVA 440

Query: 498  ETRSSPNPAARHETLSD-NATREYDSKDMLPTDGPEDCA---QDDWQLL 364
               S P P +  + + D +  RE DS D++   G E      +DDWQLL
Sbjct: 441  HIHSLPKPISCSDGMDDCSGFREDDSFDVVSNKGAEGYEHEHEDDWQLL 489


>ref|XP_009354662.1| PREDICTED: cysteine protease ATG4-like [Pyrus x bretschneideri]
            gi|694327605|ref|XP_009354663.1| PREDICTED: cysteine
            protease ATG4-like [Pyrus x bretschneideri]
          Length = 487

 Score =  540 bits (1390), Expect = e-150
 Identities = 275/463 (59%), Positives = 330/463 (71%), Gaps = 5/463 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++W+  +  A SIF T +                       RN   GW AAVR+V+ +G 
Sbjct: 44   SLWTNFFASAFSIFETHSESSITEKKESH-----------SRN--NGWTAAVRKVVTSGS 90

Query: 1557 SMR---RILGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDFS 1387
              R   R+LG ++ GIS S SDIWLLG+CYKV Q+      SS D     G  AF +DFS
Sbjct: 91   MRRIHERVLGSSRTGIS-SASDIWLLGVCYKVSQD-----DSSGDAPINNGLGAFEQDFS 144

Query: 1386 SLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQS 1207
            S IL+TYRKGF+ IGDSK+TSDV+WGCMLRSSQMLVAQA +FH+ GRSWR+ LHKPLD++
Sbjct: 145  SKILMTYRKGFEAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRRPLHKPLDEA 204

Query: 1206 YIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENGG 1027
            YIE+L  FGDS    FSIHNLLQAGKAY LA GSWVGPYAMCRTWETLVR +RE  +   
Sbjct: 205  YIEILYHFGDSETSTFSIHNLLQAGKAYDLAAGSWVGPYAMCRTWETLVRCRREVTDLDD 264

Query: 1026 LSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKVN 847
                MA+Y+VSGDEDGERGGAPV+C++D SRHC EFSRG+VDWT           L+KVN
Sbjct: 265  QPLPMAVYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVDWTPILLLVPLVLGLEKVN 324

Query: 846  PRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDAD 667
            PRY+P L ATFTFPQ LGI+GG+PGASTYI+GVQDE A YLDPHEVQ V++I+RD+++AD
Sbjct: 325  PRYIPSLRATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPVINIRRDDLEAD 384

Query: 666  TSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETRS 487
            T SYH NV+RHIPLD IDPSLAIGFYC+D+ DF+DFC RA +L D+SNGAPLFT+T+T S
Sbjct: 385  TLSYHCNVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASKLADESNGAPLFTVTQTHS 444

Query: 486  SPNPAARHETLSDN-ATREYDSKDMLPTDGPEDCAQ-DDWQLL 364
             P P    + L D+ A    DS  +LP    +  AQ DDWQLL
Sbjct: 445  FPRPVNHSDALGDSGAVENDDSFSVLPMSDADGSAQEDDWQLL 487


>gb|EPS69655.1| hypothetical protein M569_05108, partial [Genlisea aurea]
          Length = 403

 Score =  537 bits (1383), Expect = e-149
 Identities = 272/419 (64%), Positives = 318/419 (75%), Gaps = 1/419 (0%)
 Frame = -3

Query: 1617 GRNYYEGWRAAVRRVMMNGGSMRRILGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSS 1438
            GRNYY+ WRAAVRRV+MNGGSMRRI GF +   S S  DIWLLG+CY+V  EGD+    S
Sbjct: 7    GRNYYDSWRAAVRRVIMNGGSMRRIWGFGRMWASASKGDIWLLGVCYQVFHEGDD----S 62

Query: 1437 SDPTQREGFAAFVEDFSSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFH 1258
            SDPTQ EGFAAFVED SS I ITYR+GF PI +SK+ SD +WGCMLRSSQMLVAQAF+FH
Sbjct: 63   SDPTQSEGFAAFVEDLSSRIWITYRRGFLPIENSKYCSDANWGCMLRSSQMLVAQAFLFH 122

Query: 1257 KFGRSWRKSLHKPLDQSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCR 1078
            K GRSWRK+ ++P    YIE+LQLFGDS + P SIHNLLQ GKAYGLAPGSWVGPYAMCR
Sbjct: 123  KLGRSWRKTSNQP--HEYIEILQLFGDSEESPCSIHNLLQVGKAYGLAPGSWVGPYAMCR 180

Query: 1077 TWETLVRSKREENENGGLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDW 898
             WE L+R      + G L+SMM +YVVSGD DGERGGAPVLCV+D+SR C EF RG+ +W
Sbjct: 181  AWECLMRY----TDCGFLTSMMTLYVVSGDGDGERGGAPVLCVEDVSRRCSEFGRGQDNW 236

Query: 897  TXXXXXXXXXXXLDKVNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDP 718
                        L+KVNPRYLPLL+ATFTFPQ LGILGGRPG STYI+G+QDE A+YLDP
Sbjct: 237  APVLLLVPLVLGLEKVNPRYLPLLSATFTFPQSLGILGGRPGVSTYIIGIQDEKAYYLDP 296

Query: 717  HEVQQVVDIKRDNVDADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALEL 538
            HEVQQVVD+K+ ++D D SSYH NVVR++PLDSID SLAIGFYC+ K +FDDFC RA EL
Sbjct: 297  HEVQQVVDMKKGDIDIDFSSYHCNVVRNMPLDSIDSSLAIGFYCRGKREFDDFCIRASEL 356

Query: 537  IDQSNGAPLFTITETRSSPNPAARHETLSDNATREYDSKDMLPTDGPEDCA-QDDWQLL 364
            I+QSNGAPLFT+++        A  E        + D +  L    PE+C  +D+WQLL
Sbjct: 357  IEQSNGAPLFTVSK--------AGDEKRGIRMDDDDDDEGAL----PENCENEDEWQLL 403


>ref|XP_007049917.1| Peptidase family C54 protein isoform 3 [Theobroma cacao]
            gi|508702178|gb|EOX94074.1| Peptidase family C54 protein
            isoform 3 [Theobroma cacao]
          Length = 486

 Score =  535 bits (1377), Expect = e-149
 Identities = 281/464 (60%), Positives = 328/464 (70%), Gaps = 6/464 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++WS ++  A SIF T +                       RN   GW AAV+RV+ +GG
Sbjct: 43   SVWSNLFASAFSIFDTYSESSACEKKALH-----------ARN--NGWTAAVKRVV-SGG 88

Query: 1557 SMRRI----LGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDF 1390
            SMRRI    LG +K GIS S SDIWLLG+CYK+ Q      SSS D     G AAF  DF
Sbjct: 89   SMRRIHERVLGPSKIGISSSTSDIWLLGVCYKISQV-----SSSGDVDASNGLAAFKRDF 143

Query: 1389 SSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQ-AFIFHKFGRSWRKSLHKPLD 1213
            SS IL+TYRKGFD IGD+K TSD  WGCMLRSSQMLVAQ A +FH+ GRSWRK L KP +
Sbjct: 144  SSRILMTYRKGFDAIGDTKITSDFGWGCMLRSSQMLVAQQALLFHQLGRSWRKPLQKPFE 203

Query: 1212 QSYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENEN 1033
            Q+YIE+L  FGDS    FSIHNL++AGK YGLA GSWVGPYAMCR+WE+L R KREEN+ 
Sbjct: 204  QAYIEILHQFGDSEATAFSIHNLVEAGKIYGLAAGSWVGPYAMCRSWESLARFKREENDL 263

Query: 1032 GGLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDK 853
               S  MA+YVVSGDEDGERGGAPV+CV+D SRHCFEFSR   DWT           LDK
Sbjct: 264  EHQSLPMAVYVVSGDEDGERGGAPVVCVEDASRHCFEFSRCRADWTPILLLVPLVLGLDK 323

Query: 852  VNPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVD 673
            VN RY+P L ATFTFPQCLGILGG+PGASTYIVGVQ+EN FYLDPH+VQ VV++ +DN +
Sbjct: 324  VNSRYIPSLQATFTFPQCLGILGGKPGASTYIVGVQEENVFYLDPHDVQLVVNLSQDNQE 383

Query: 672  ADTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITET 493
            ADTSSYH +++RHIPLDSIDPSLAIGF+C+DK DFDDFC RA +L D+SNGAPLFT+ +T
Sbjct: 384  ADTSSYHCDIIRHIPLDSIDPSLAIGFFCRDKDDFDDFCLRASKLADESNGAPLFTVAQT 443

Query: 492  RSSPNPAARHETLSDNA-TREYDSKDMLPTDGPEDCAQDDWQLL 364
             SS  P +    L D    RE DS  ++P D      +DDWQLL
Sbjct: 444  HSSFKPISHGNALDDTGEVREDDSLGVVP-DMDGSIHEDDWQLL 486


>ref|XP_007217926.1| hypothetical protein PRUPE_ppa004885mg [Prunus persica]
            gi|462414388|gb|EMJ19125.1| hypothetical protein
            PRUPE_ppa004885mg [Prunus persica]
          Length = 487

 Score =  535 bits (1377), Expect = e-149
 Identities = 275/464 (59%), Positives = 325/464 (70%), Gaps = 6/464 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++WS  +  A SIF T +                       RN   GW  AVR+V+  GG
Sbjct: 44   SLWSNFFASAFSIFETHSESSITEKKEIH-----------SRN--NGWTEAVRKVV-TGG 89

Query: 1557 SMRRI----LGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDF 1390
            SMRRI    LG ++ GIS S SDIWLLG+ YKV Q+      SS D     G  AF +DF
Sbjct: 90   SMRRIHERVLGSSRTGIS-SASDIWLLGVLYKVSQD-----ESSGDAATNNGLRAFEQDF 143

Query: 1389 SSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQ 1210
            SS IL+TYRKGFD IGDSK+TSDV+WGCMLRSSQMLVAQA +FH+ GRSWR++LHKPLD+
Sbjct: 144  SSRILMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRRTLHKPLDE 203

Query: 1209 SYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENG 1030
             YIE+L  FGDS    FSIHNLLQAGKAY LA GSWVGPYAMCR+WETLVR KRE     
Sbjct: 204  QYIEILHHFGDSEGSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWETLVRCKREGTAFD 263

Query: 1029 GLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKV 850
                 MA+Y+VSGDEDGERGGAPV+C+QD SRHC EFSRG VDWT           L+KV
Sbjct: 264  NQPLPMAVYIVSGDEDGERGGAPVVCIQDASRHCLEFSRGRVDWTPILLLVPLVLGLEKV 323

Query: 849  NPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDA 670
            NPRY+P L ATFTFPQ LGI+GG+PGASTYI+GVQDE A YLDPHEVQ  ++I+RD+++A
Sbjct: 324  NPRYIPSLWATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPAINIRRDDLEA 383

Query: 669  DTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETR 490
            DT SYH NV+RHIPLDSIDPSLAIGFYC+D+ DFDDFC RA +L D SNGAPLFT+T++ 
Sbjct: 384  DTLSYHCNVIRHIPLDSIDPSLAIGFYCRDRDDFDDFCFRASKLADGSNGAPLFTVTQSH 443

Query: 489  SSPNPAARHETLSDNATREYDSKDMLP--TDGPEDCAQDDWQLL 364
            + P P    + L D+   + D   + P  +D      +DDWQLL
Sbjct: 444  NFPKPVNHSDVLDDSGGVQNDDSFVAPPISDADGSAHEDDWQLL 487


>ref|XP_008232834.1| PREDICTED: cysteine protease ATG4 [Prunus mume]
          Length = 487

 Score =  532 bits (1371), Expect = e-148
 Identities = 275/464 (59%), Positives = 323/464 (69%), Gaps = 6/464 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++WS  +  A SIF T +                       RN   GW  AVR+V+  GG
Sbjct: 44   SLWSNFFASAFSIFETHSESSNTEKKEIH-----------SRN--NGWTEAVRKVV-TGG 89

Query: 1557 SMRRI----LGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDF 1390
            SMRRI    LG ++ GIS S SDIWLLG+ YKV Q+       S D     G  AF +DF
Sbjct: 90   SMRRIHERVLGSSRTGIS-SASDIWLLGVRYKVSQD-----EFSGDAATNNGLRAFEQDF 143

Query: 1389 SSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQ 1210
            SS IL+TYRKGFD IGDSK+TSDV+WGCMLRSSQMLVAQA +FH+ GRSWR+ LHKPLD+
Sbjct: 144  SSRILMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRRPLHKPLDE 203

Query: 1209 SYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENG 1030
             YIE+L  FGDS    FSIHNLLQ+GKAY LA GSWVGPYAMCR+WETLVR KRE     
Sbjct: 204  QYIEILHHFGDSEGSAFSIHNLLQSGKAYDLAAGSWVGPYAMCRSWETLVRCKREGTAFD 263

Query: 1029 GLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKV 850
                 MA+Y+VSGDEDGERGGAPV+C+QD SRHC EFSRG VDWT           L+KV
Sbjct: 264  NQPLPMAVYIVSGDEDGERGGAPVVCIQDASRHCLEFSRGRVDWTPILLLVPLVLGLEKV 323

Query: 849  NPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDA 670
            NPRY+P L ATFTFPQ LGI+GG+PGASTYI+GVQDE A YLDPHEVQ  ++I+RD+++A
Sbjct: 324  NPRYIPSLWATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPAINIRRDDLEA 383

Query: 669  DTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETR 490
            DT SYH NV+RHIPLDSIDPSLAIGFYC+D+ DFDDFC RA +L D SNGAPLFT+TET 
Sbjct: 384  DTLSYHCNVIRHIPLDSIDPSLAIGFYCRDRDDFDDFCFRASKLADGSNGAPLFTVTETH 443

Query: 489  SSPNPAARHETLSDNATREYDSKDMLP--TDGPEDCAQDDWQLL 364
            + P P    + L D+   + D   + P  +D      +DDWQLL
Sbjct: 444  NFPKPVNHSDVLDDSGGVQNDDSFVAPPISDADGSAHEDDWQLL 487


>ref|XP_008346919.1| PREDICTED: cysteine protease ATG4-like isoform X2 [Malus domestica]
          Length = 487

 Score =  528 bits (1361), Expect = e-147
 Identities = 273/464 (58%), Positives = 329/464 (70%), Gaps = 6/464 (1%)
 Frame = -3

Query: 1737 AIWSGIWLPALSIFGTSNHXXXXXXXXXXXXXXXXXXXXSGRNYYEGWRAAVRRVMMNGG 1558
            ++WS  +  A SIF T +                       RN   GW AAVR+ + + G
Sbjct: 44   SLWSNFFESAFSIFETHSESSITDKKESH-----------SRN--NGWTAAVRKAV-SSG 89

Query: 1557 SMRRI----LGFNKPGISVSNSDIWLLGICYKVCQEGDELNSSSSDPTQREGFAAFVEDF 1390
            SMRRI    LG ++ GIS S SDIWLLG+CYKV Q+      SS D     G  AF +DF
Sbjct: 90   SMRRIQEHVLGSSRIGIS-SASDIWLLGVCYKVSQD-----DSSGDAPINNGLGAFEQDF 143

Query: 1389 SSLILITYRKGFDPIGDSKFTSDVHWGCMLRSSQMLVAQAFIFHKFGRSWRKSLHKPLDQ 1210
            SS IL+TYRKGF+ IG+SK+TSDV+WGCMLRSSQMLVAQA +FH+ GRSW + LHKPLD+
Sbjct: 144  SSRILMTYRKGFEAIGNSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWTRPLHKPLDE 203

Query: 1209 SYIEVLQLFGDSMDLPFSIHNLLQAGKAYGLAPGSWVGPYAMCRTWETLVRSKREENENG 1030
            +YI +L  FGDS    FSIHNLLQAG+AY LA GSWVGPYAMCRTWETLVR +RE  +  
Sbjct: 204  AYIGILYHFGDSETSTFSIHNLLQAGRAYDLAAGSWVGPYAMCRTWETLVRCRREATDLD 263

Query: 1029 GLSSMMAIYVVSGDEDGERGGAPVLCVQDISRHCFEFSRGEVDWTXXXXXXXXXXXLDKV 850
                 MA+Y+VSGDEDGERGGAPV+C++D SRHC EFSRG+VDWT           L+KV
Sbjct: 264  DQPLPMAVYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVDWTPILLLVPLVLGLEKV 323

Query: 849  NPRYLPLLAATFTFPQCLGILGGRPGASTYIVGVQDENAFYLDPHEVQQVVDIKRDNVDA 670
            NPRY+P L ATFTFPQ LGI+GG+PGASTYI+GVQDE A YLDPHEVQ V++I+RD+++A
Sbjct: 324  NPRYIPSLRATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHEVQPVINIRRDDLEA 383

Query: 669  DTSSYHSNVVRHIPLDSIDPSLAIGFYCKDKSDFDDFCARALELIDQSNGAPLFTITETR 490
            DT SYH NV+RHIPLD IDPSLAIGFYC+D+ DF+DFC RA +L D+SNGAPLFT+T+T 
Sbjct: 384  DTLSYHCNVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASKLADESNGAPLFTVTQTH 443

Query: 489  SSPNPAARHETLSDN-ATREYDSKDMLPTDGPEDCAQ-DDWQLL 364
            S P P    + L D+ A    DS  +LP    +  AQ D+WQLL
Sbjct: 444  SVPRPVNHSDALGDSGAVENDDSFSVLPMSDADGSAQEDEWQLL 487


Top