BLASTX nr result

ID: Forsythia22_contig00002606 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00002606
         (1856 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum]          664   0.0  
ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus...   650   0.0  
emb|CDP07460.1| unnamed protein product [Coffea canephora]            632   e-178
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   597   e-167
ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosif...   596   e-167
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   594   e-167
ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris]     590   e-165
ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum]     586   e-164
ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii]...   585   e-164
gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum]   584   e-164
ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine ...   583   e-163
ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine ...   581   e-163
ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine ...   579   e-162
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   575   e-161
ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|3...   574   e-161
emb|CDX94938.1| BnaC05g07330D [Brassica napus]                        572   e-160
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   570   e-159
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   567   e-158
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   567   e-158
ref|XP_010110007.1| Oryzain alpha chain [Morus notabilis] gi|587...   566   e-158

>ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum]
          Length = 439

 Score =  664 bits (1714), Expect = 0.0
 Identities = 309/407 (75%), Positives = 349/407 (85%)
 Frame = -3

Query: 1539 VPTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLN 1360
            +PTCKSSSIS LF+SWC++YGKTY S QEK+ R KVFE+NY YV+ HN+  NSSYTLSLN
Sbjct: 17   LPTCKSSSISHLFDSWCKEYGKTYASEQEKQQRFKVFEQNYEYVTLHNARPNSSYTLSLN 76

Query: 1359 AFADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTG 1180
            AFADLT+HEFKAKYLGLS SA +L IRLNS    +EG +LVKESD+PSS+DWRK+GAVT 
Sbjct: 77   AFADLTNHEFKAKYLGLSLSASNLAIRLNSEQVGIEGPDLVKESDLPSSVDWRKQGAVTE 136

Query: 1179 VKDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEF 1000
            VKDQGSCGACWSFS TGA+EGINKIVTGSLISLSEQELIDCD+SYNDGCGGGLMDYAY+F
Sbjct: 137  VKDQGSCGACWSFSTTGAVEGINKIVTGSLISLSEQELIDCDKSYNDGCGGGLMDYAYQF 196

Query: 999  VIKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVG 820
            +IKN+GIDTEKDYPYQG E TC KEKLK+HVVTIDSY DI  KNEK+L QAVATQP+SVG
Sbjct: 197  IIKNKGIDTEKDYPYQGREGTCKKEKLKKHVVTIDSYADITAKNEKKLQQAVATQPVSVG 256

Query: 819  ISGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYI 640
            I GS  SFQLYSGGIFTG CS  LDHAVLIVGYDSKDG+DYWI+KNSWG+YWGMDGYM++
Sbjct: 257  ICGSEKSFQLYSGGIFTGPCSASLDHAVLIVGYDSKDGQDYWIIKNSWGRYWGMDGYMHM 316

Query: 639  IRNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGL 460
             RNTGN EG+CGIN+LASYP+K               RCNLFTYCSSGETCCC     G+
Sbjct: 317  QRNTGNGEGLCGINLLASYPIK-TSPNPPPSPPPGPVRCNLFTYCSSGETCCCTRDIFGI 375

Query: 459  CLKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAK 319
            CLKWKCC A+SAVCC+DR  CCPHDYP+CDT+RN+CLK IGN+T+A+
Sbjct: 376  CLKWKCCGAESAVCCQDRRSCCPHDYPVCDTKRNMCLKWIGNSTVAQ 422


>ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus]
            gi|604331887|gb|EYU36745.1| hypothetical protein
            MIMGU_mgv1a006749mg [Erythranthe guttata]
          Length = 433

 Score =  650 bits (1676), Expect = 0.0
 Identities = 302/411 (73%), Positives = 350/411 (85%), Gaps = 1/411 (0%)
 Frame = -3

Query: 1539 VPTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLN 1360
            +P  KSS ISDLF+SWC++YGKTY S QEK++RL VF ENY YV+QHN+ ANSSYTLS+N
Sbjct: 17   LPISKSSLISDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADANSSYTLSVN 76

Query: 1359 AFADLTHHEFKAKYLGLSPSADDLIIRLNSGSDS-VEGSNLVKESDIPSSLDWRKKGAVT 1183
            AFADLT+HEF+A YLGLSPS  D +IRLNS S S ++G NL+KES+IPSSLDWR KGAVT
Sbjct: 77   AFADLTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSLDWRNKGAVT 136

Query: 1182 GVKDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYE 1003
             VKDQGSCGACWSFSATGA+EGIN+I TGSL+SLSEQELIDCD+SYNDGC GGLMDYAY+
Sbjct: 137  AVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCNGGLMDYAYD 196

Query: 1002 FVIKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISV 823
            F+IKN+GIDTE+DY Y+G   TC+K K+ +HVVTIDSYVDIP K+EK+LLQAVATQPISV
Sbjct: 197  FIIKNKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQAVATQPISV 256

Query: 822  GISGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMY 643
            GI GS +SFQLYSGGIFTG CST LDHAVLIVGYDSKDGKDYWI+KNSWGK WG+ GYM+
Sbjct: 257  GICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGKSWGIKGYMH 316

Query: 642  IIRNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLG 463
            ++RN+G+ EGVCGIN LASYPVK              T+CN+FTYCSSGETCCCA  FLG
Sbjct: 317  MVRNSGSEEGVCGINTLASYPVK-SSTNPPPSPTPGPTKCNIFTYCSSGETCCCARYFLG 375

Query: 462  LCLKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPG 310
            +CL W CCEA+SAVCC D  HCCPHDYP+CDT++NLCLK+ GN T++K  G
Sbjct: 376  VCLSWNCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLG 426


>emb|CDP07460.1| unnamed protein product [Coffea canephora]
          Length = 441

 Score =  632 bits (1631), Expect = e-178
 Identities = 298/434 (68%), Positives = 347/434 (79%)
 Frame = -3

Query: 1536 PTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNA 1357
            P CKSS  +DLF +WC+Q+GKTY S +EK+YRL+VFE+NY YV++HNSLANS+YTLSLNA
Sbjct: 18   PICKSSLTADLFENWCKQHGKTYPSEEEKQYRLRVFEDNYDYVTKHNSLANSTYTLSLNA 77

Query: 1356 FADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGV 1177
            FADLTHHEFKAKYLG S SAD LI RLN GS S+  S  V + DIPSSLDWR KGAVT V
Sbjct: 78   FADLTHHEFKAKYLGFSASADGLI-RLNRGSSSIGASGAVGKYDIPSSLDWRNKGAVTNV 136

Query: 1176 KDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFV 997
            KDQGSCGACW+FSATGA+EGIN+IVTGSL+SLSEQELIDCDRSYN+GC GGLMDY YEFV
Sbjct: 137  KDQGSCGACWAFSATGAIEGINEIVTGSLVSLSEQELIDCDRSYNNGCNGGLMDYTYEFV 196

Query: 996  IKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGI 817
            +KN GIDTE+DYP++G + TCN  KLKR VV+ID Y+D+P  NE+ LLQAVA QP+SVGI
Sbjct: 197  VKNGGIDTEQDYPFKGRDGTCNSNKLKRRVVSIDGYIDVPANNEQELLQAVAAQPVSVGI 256

Query: 816  SGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYII 637
             GS   FQLYSGGIFTG CST LDHAVLIVGYDSK+G DYWIVKNSWG  WG++GY++II
Sbjct: 257  CGSERGFQLYSGGIFTGPCSTSLDHAVLIVGYDSKNGADYWIVKNSWGTSWGINGYIHII 316

Query: 636  RNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLC 457
            RN+GN  GVCGINM+ASYP K              T+C+LF+ C +GETCCC+  FLGLC
Sbjct: 317  RNSGNSAGVCGINMMASYPTK-SSLNPPPSPPPGPTKCSLFSSCPAGETCCCSMEFLGLC 375

Query: 456  LKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFSTG*AIGV 277
            L WKCC+  SAVCCKDR HCCPHDYPICDT+RNLCL+++GN+T+ KQ  N   +G     
Sbjct: 376  LSWKCCDLDSAVCCKDRLHCCPHDYPICDTKRNLCLRRMGNSTLVKQLKNGGRSG----- 430

Query: 276  LLSMIELGGWSSHF 235
                 + G WSS F
Sbjct: 431  -----KFGDWSSLF 439


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  597 bits (1538), Expect = e-167
 Identities = 277/401 (69%), Positives = 326/401 (81%)
 Frame = -3

Query: 1524 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1345
            SSSISDLF+SWCQ++GKTY S +E+E+RL VF ENY +++ HN+ AN SYTLSLNAFADL
Sbjct: 23   SSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSLNAFADL 82

Query: 1344 THHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQG 1165
            T  EF  +YLG SPS  DL+IR N GS S    N    S +PSS+DWRKKGAVTG+KDQG
Sbjct: 83   TRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVTGIKDQG 139

Query: 1164 SCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQ 985
            SCGACWSFSATGA+EGIN+IVTGSL+SLSEQELIDCD SYN GC GGLMDYAYEF++KN+
Sbjct: 140  SCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYEFILKNK 199

Query: 984  GIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 805
            GIDTE+DY Y+G + +C++ KL + VVTIDSYVDIP KNE+ LL+AVA+QP+SVGISG  
Sbjct: 200  GIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSVGISGGD 259

Query: 804  NSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTG 625
              FQ YS GIFTG CST LDHAVLIVGYDSK+GKDYWIVKNSWGK WGMDGYMY+ RNTG
Sbjct: 260  APFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMYVQRNTG 319

Query: 624  NPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKWK 445
            N  G+C INM+ASYPVK              T+C+LF+YCS GETCCCA RFLGLC+++K
Sbjct: 320  NQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLGLCMRYK 379

Query: 444  CCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIA 322
            CC A+SAVCC+D  HCCP DYPICDT +++C K  GN+T+A
Sbjct: 380  CCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMA 420


>ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosiformis]
          Length = 439

 Score =  596 bits (1537), Expect = e-167
 Identities = 282/415 (67%), Positives = 326/415 (78%)
 Frame = -3

Query: 1536 PTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNA 1357
            P C  SSISDLF SWCQQ GKTY+S QE+ YRL+VFEENYAY+ +HNS  NS+YTL LNA
Sbjct: 18   PICTCSSISDLFESWCQQNGKTYSSEQERVYRLEVFEENYAYIIEHNSKGNSTYTLDLNA 77

Query: 1356 FADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGV 1177
            F+DLTHHEFK  +LGLS SA+D I RL +GS S    N V   DIPSSLDWR+KGAVT V
Sbjct: 78   FSDLTHHEFKNSFLGLSSSANDFI-RLKTGSSSAGVFNDVGVVDIPSSLDWREKGAVTKV 136

Query: 1176 KDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFV 997
            K+QGSCGACWSFSATGA+EGINKIVTGSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV
Sbjct: 137  KNQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFV 196

Query: 996  IKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGI 817
             KN GIDTE+DYP+   E TCNK KL+R VVTID Y D+P  +E +LL+AVA QP+SVGI
Sbjct: 197  KKNGGIDTEEDYPFNEREGTCNKNKLQRRVVTIDGYTDVPQYDEDKLLKAVANQPVSVGI 256

Query: 816  SGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYII 637
             GS  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WG++GYM++ 
Sbjct: 257  CGSERAFQSYSKGIFTGPCSTVLDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQ 316

Query: 636  RNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLC 457
            RN+GN EG+CGIN LASYP K              ++C++FT C  GETCCC WR LG+C
Sbjct: 317  RNSGNQEGICGINKLASYPTK-SSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWRLLGVC 375

Query: 456  LKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFSTG 292
            + WKCC   SAVCCKD  HCCPHDYPICDT RNLCLK++ NATI +QP     +G
Sbjct: 376  VSWKCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQPQKEAFSG 430


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  594 bits (1531), Expect = e-167
 Identities = 279/415 (67%), Positives = 326/415 (78%)
 Frame = -3

Query: 1536 PTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNA 1357
            P C  SSISDLF +WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS  NSSYTL LNA
Sbjct: 18   PFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNA 77

Query: 1356 FADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGV 1177
            ++DLTHHEF+  +LGLS SA+D I     GS S E + ++ + D PSSLDWR+KGAVT V
Sbjct: 78   YSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSE-TGVLSDVDAPSSLDWREKGAVTDV 136

Query: 1176 KDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFV 997
            K+QGSCGACWSFSATGAMEGINKI TGSL+SLSEQELIDCDRSYN+GCGGGLMDYA+EFV
Sbjct: 137  KNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFV 196

Query: 996  IKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGI 817
            IKN GIDTEKDYP++  E TCNK KL+RHVVTID Y DIP  +E +LL+AVATQP+SVGI
Sbjct: 197  IKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGI 256

Query: 816  SGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYII 637
             GS  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WG++GY+++ 
Sbjct: 257  CGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQ 316

Query: 636  RNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLC 457
            RN+GN EG+CGIN LASYP K              ++C++FT C  GETCCC  +FLG+C
Sbjct: 317  RNSGNQEGICGINKLASYPTK-TSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGIC 375

Query: 456  LKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFSTG 292
            L WKCC   SAVCCKD  HCCP DYPICDT RNLCLK++ NATI +QP     TG
Sbjct: 376  LSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKEAFTG 430


>ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris]
          Length = 439

 Score =  590 bits (1521), Expect = e-165
 Identities = 277/415 (66%), Positives = 326/415 (78%)
 Frame = -3

Query: 1536 PTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNA 1357
            P C  SSISDLF +WCQQ GK+Y+S QE+ YRLKVFEENYAY+ +HNS  NS+YTL LNA
Sbjct: 18   PICTCSSISDLFETWCQQNGKSYSSEQERVYRLKVFEENYAYIIEHNSKGNSTYTLGLNA 77

Query: 1356 FADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGV 1177
            ++DLTHHEFK  +LGLS SA+D I RL +GS S    + V + D+PSSLDWR+KGAVT V
Sbjct: 78   YSDLTHHEFKNSFLGLSSSANDFI-RLKTGSSSAGVFSDVGDVDVPSSLDWREKGAVTKV 136

Query: 1176 KDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFV 997
            K+QGSCGACWSFSATGA+EGINKIV+GSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV
Sbjct: 137  KNQGSCGACWSFSATGAIEGINKIVSGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFV 196

Query: 996  IKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGI 817
             KN GIDTE+DYP+   E TCNK KL+R VVTID Y D+P  +E +LL+AVA QP+SVGI
Sbjct: 197  KKNGGIDTEEDYPFIEREGTCNKNKLQRRVVTIDGYTDVPQNDEDKLLKAVAKQPVSVGI 256

Query: 816  SGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYII 637
             GS  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WG++GYM++ 
Sbjct: 257  CGSERAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQ 316

Query: 636  RNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLC 457
            RN+GN EG+CGIN LASYP K              ++C++FT C  GETCCC W  LG+C
Sbjct: 317  RNSGNQEGICGINKLASYPTK-SSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWSLLGVC 375

Query: 456  LKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFSTG 292
            L WKCC   SAVCCKD  HCCPHDYPICDT RNLCLK++ NATI +QP     +G
Sbjct: 376  LSWKCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQPQKEAFSG 430


>ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum]
          Length = 439

 Score =  586 bits (1510), Expect = e-164
 Identities = 275/415 (66%), Positives = 321/415 (77%)
 Frame = -3

Query: 1536 PTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNA 1357
            P C  SSISDLF +WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS  NSSYTL LNA
Sbjct: 18   PLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNA 77

Query: 1356 FADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGV 1177
            ++DLTHHEF+  +LGLS SA+D I     GS S   + ++ + D PSSLDWR KGAVT V
Sbjct: 78   YSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGS-SAAGVLSDVDAPSSLDWRDKGAVTNV 136

Query: 1176 KDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFV 997
            K+QGSCGACWSFSATGA+EGINKI TGSL+SLSEQELIDCDRSYN GCGGGLMDYA+EFV
Sbjct: 137  KNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFV 196

Query: 996  IKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGI 817
            IKN GIDTEKDYP++  E TCNK KL+R VVTID Y DIP  +E +LL+AVATQP+SVGI
Sbjct: 197  IKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGI 256

Query: 816  SGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYII 637
             GS  +FQ YS GIFTG C TDLDHAVLIVGY S++G DYWI+KNSWG  WG++GY+++ 
Sbjct: 257  CGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQ 316

Query: 636  RNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLC 457
            RN+GN EG+CG+N LASYP K              ++C+ FT C  GETCCC  +FLG+C
Sbjct: 317  RNSGNQEGICGVNKLASYPTK-TSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGIC 375

Query: 456  LKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFSTG 292
            L WKCC   SAVCCKD  HCCP DYPICDT RNLCLK++ NATI +QP     TG
Sbjct: 376  LSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQPQKEPFTG 430


>ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii]
            gi|763791179|gb|KJB58175.1| hypothetical protein
            B456_009G197900 [Gossypium raimondii]
          Length = 431

 Score =  585 bits (1509), Expect = e-164
 Identities = 273/408 (66%), Positives = 323/408 (79%)
 Frame = -3

Query: 1521 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1342
            S IS +F +WC Q+GK+Y+S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LNAFADLT
Sbjct: 26   SHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAMTNSSYSLALNAFADLT 85

Query: 1341 HHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQGS 1162
            HHEFKA  LGLS +A      +     ++    LV+  DIP+SLDWR+KGAVT VKDQGS
Sbjct: 86   HHEFKASRLGLSGAA------IQFRCSNLREPRLVR--DIPASLDWREKGAVTQVKDQGS 137

Query: 1161 CGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQG 982
            CGACWSFSATGA+EG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G
Sbjct: 138  CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197

Query: 981  IDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 802
            IDTE+DYPYQG E TCNKEKLKRHVVTID Y D+P  NEK+LLQAVATQP+SVGI GS  
Sbjct: 198  IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMTNEKKLLQAVATQPVSVGICGSER 257

Query: 801  SFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTGN 622
            +FQLY  GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG  WGM+GY+++IRNTG 
Sbjct: 258  AFQLYCKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGK 317

Query: 621  PEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKWKC 442
             EG+CGINMLASYP+K              T+C+ FTYCS+GETCCC  R  G+C  WKC
Sbjct: 318  SEGICGINMLASYPIK-TSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKC 376

Query: 441  CEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFS 298
            C   SAVCCKD  HCCPH+YPICDT+ N CLK++GNATI +    N +
Sbjct: 377  CGLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSNTNLA 424


>gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum]
          Length = 431

 Score =  584 bits (1506), Expect = e-164
 Identities = 273/408 (66%), Positives = 322/408 (78%)
 Frame = -3

Query: 1521 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1342
            S IS  F +WCQQ+GK+Y S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LNAFAD T
Sbjct: 26   SHISKKFETWCQQHGKSYLSEEEKSYRLKVFEDNYAFVTQHNAMVNSSYSLALNAFADFT 85

Query: 1341 HHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQGS 1162
            HHEFKA  LGLS +A      +     ++    LV+  DIP SLDWR+KGAVT VKDQGS
Sbjct: 86   HHEFKASRLGLSGAA------IQFRHPNLREPRLVR--DIPDSLDWREKGAVTQVKDQGS 137

Query: 1161 CGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQG 982
            CGACWSFSATGA+EG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G
Sbjct: 138  CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197

Query: 981  IDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 802
            IDTE+DYPYQG E TCNKEKLKRHVVTID Y D+P  NEK+LLQAVATQP+SVGI GS  
Sbjct: 198  IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSER 257

Query: 801  SFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTGN 622
            +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG+ WGM+GY+++IRN+G 
Sbjct: 258  AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGRRWGMNGYIHMIRNSGK 317

Query: 621  PEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKWKC 442
             EG+CGINMLASYP+K              T+C+ FTYCS+GETCCC  R  G+C  WKC
Sbjct: 318  SEGICGINMLASYPIK-TSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFSWKC 376

Query: 441  CEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFS 298
            C   SAVCCKD  HCCPH+YPICDT+ N CLK++GNATI +    N +
Sbjct: 377  CGLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSDTNLA 424


>ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine proteinase [Nelumbo
            nucifera]
          Length = 443

 Score =  583 bits (1504), Expect = e-163
 Identities = 269/426 (63%), Positives = 332/426 (77%)
 Frame = -3

Query: 1521 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1342
            SS SDLF+ WC+++G+TY+S +E+ +RLKVFE+N A+V++HNS+ANS+Y+L+LNAFADLT
Sbjct: 23   SSTSDLFDRWCEEHGRTYSSEEERLFRLKVFEDNLAFVTEHNSMANSTYSLALNAFADLT 82

Query: 1341 HHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQGS 1162
            HHEFK   LGL+ +A D++         +E  ++  +  +PSS+DWR+KGAVT VKDQGS
Sbjct: 83   HHEFKISRLGLAAAATDMVRSSPRAPSLIESPSIAGQ--LPSSIDWREKGAVTNVKDQGS 140

Query: 1161 CGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQG 982
            CGACWSFSATGA+EGINKIVTGS +SLSEQEL+DCDRSYN GCGGGLMDYA+++VIKN+G
Sbjct: 141  CGACWSFSATGAIEGINKIVTGSPLSLSEQELVDCDRSYNSGCGGGLMDYAFQWVIKNKG 200

Query: 981  IDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 802
            IDTE DYPYQGGE+TCNK+KL++HVVTID Y D+P  +EK LLQAVA+QP+SVGI GS  
Sbjct: 201  IDTEDDYPYQGGERTCNKDKLRKHVVTIDGYTDVPSNSEKHLLQAVASQPVSVGICGSER 260

Query: 801  SFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTGN 622
            +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWG  WGM+GYM+++RN+G+
Sbjct: 261  AFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYMHMLRNSGS 320

Query: 621  PEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKWKC 442
            P+GVCGINMLASYP K              TRC+L TYC  GETCCC  R LG+C  WKC
Sbjct: 321  PQGVCGINMLASYPTK-TSPNPPPSPSPGPTRCDLLTYCQEGETCCCTRRILGICFSWKC 379

Query: 441  CEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFSTG*AIGVLLSMI 262
            CE  SAVCCKD  +CCPHDYPICDT R  CLK  GN T  K          ++    S++
Sbjct: 380  CELDSAVCCKDHRYCCPHDYPICDTERKQCLKSTGNFTSVK----------SLDKSSSLV 429

Query: 261  ELGGWS 244
            + GGW+
Sbjct: 430  KFGGWN 435


>ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine proteinase [Jatropha
            curcas] gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha
            curcas] gi|643731232|gb|KDP38570.1| hypothetical protein
            JCGZ_04495 [Jatropha curcas]
          Length = 441

 Score =  581 bits (1497), Expect = e-163
 Identities = 275/414 (66%), Positives = 324/414 (78%), Gaps = 3/414 (0%)
 Frame = -3

Query: 1524 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1345
            SS I+ LF +WCQQ+GKTY S +EK +RLKVF++NY +V++HNS  NSSYTLSLNAFADL
Sbjct: 23   SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82

Query: 1344 THHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKE---SDIPSSLDWRKKGAVTGVK 1174
            THHEFKA  LGLS +A        S S +V+ SN       +D+P+S+DWRK GAVT VK
Sbjct: 83   THHEFKASRLGLSSAA--------SASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVK 134

Query: 1173 DQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVI 994
            DQG+CGACWSFSATGA+EGINKIVTGSL+SLSEQEL+DCD+SYN+GC GG+MDYA++FVI
Sbjct: 135  DQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVI 194

Query: 993  KNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGIS 814
             N GIDTE+DYPYQG +++CNKEKLKRHVVTID YVD+P  NEK LL+AVA QP+SVGI 
Sbjct: 195  DNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGIC 254

Query: 813  GSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIR 634
            GS  +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG YWGMDGYM++ R
Sbjct: 255  GSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQR 314

Query: 633  NTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCL 454
            N+G+  G+CGINMLASYP K              TRC+LFT+C  GETCCC     G+CL
Sbjct: 315  NSGSSRGLCGINMLASYP-KKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICL 373

Query: 453  KWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAKQPGNNFSTG 292
             WKCCE  SAVCCKD  HCCP DYP+CDT RN+CLK  GNAT  ++   N S+G
Sbjct: 374  SWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSG 427


>ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine proteinase [Populus
            euphratica]
          Length = 480

 Score =  579 bits (1493), Expect = e-162
 Identities = 273/431 (63%), Positives = 331/431 (76%)
 Frame = -3

Query: 1620 QTFPLLSPISKMXXXXXXXXXXXXLIDVPTCKSSSISDLFNSWCQQYGKTYNSVQEKEYR 1441
            Q  PL S   KM             +  P+  SS IS LF +WC+++GK+Y S +E+ +R
Sbjct: 34   QKSPLFSLKQKMNFLCIFALTLLISVLSPSTASSGISQLFETWCKEHGKSYTSQEERSHR 93

Query: 1440 LKVFEENYAYVSQHNSLANSSYTLSLNAFADLTHHEFKAKYLGLSPSADDLIIRLNSGSD 1261
            LKVFE+NY +V++HNS  NSSYTLSLNAF+DLTHHEFK   LGLS +       +N G  
Sbjct: 94   LKVFEDNYDFVTKHNSKGNSSYTLSLNAFSDLTHHEFKTSRLGLSAAP------MNLGHR 147

Query: 1260 SVEGSNLVKESDIPSSLDWRKKGAVTGVKDQGSCGACWSFSATGAMEGINKIVTGSLISL 1081
            ++E + +V   DIP+S+DWR KGAVT VKDQGSCGACWSFSATGA+EGINKIVTGSL+SL
Sbjct: 148  NLEITGVV--GDIPASIDWRNKGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSL 205

Query: 1080 SEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVT 901
            SEQELI+CD+S+NDGCGGGLMDYA++FVI N GIDTE+DYPY+  + TCNK+K+KR VVT
Sbjct: 206  SEQELIECDKSFNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDKMKRRVVT 265

Query: 900  IDSYVDIPPKNEKRLLQAVATQPISVGISGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGY 721
            ID YVD+P  NEK+LLQAVA QP+SVGI GS  +FQ+YS GIFTG CST LDHAVLIVGY
Sbjct: 266  IDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGY 325

Query: 720  DSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTGNPEGVCGINMLASYPVKXXXXXXXXXXX 541
             S++G DYWIVKNSWG  WGM GYM++ RN+GN +GVCGINMLASYPVK           
Sbjct: 326  GSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVK-TSPNPPPPPP 384

Query: 540  XXXTRCNLFTYCSSGETCCCAWRFLGLCLKWKCCEAKSAVCCKDRSHCCPHDYPICDTRR 361
               T+C+LF+YC++GETCCCA +F G+C+ WKCC   SAVCCKDR HCCPHDYP+CDT +
Sbjct: 385  PGPTKCDLFSYCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDK 444

Query: 360  NLCLKQIGNAT 328
            N+C K+ GNAT
Sbjct: 445  NMCFKRAGNAT 455


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  575 bits (1481), Expect = e-161
 Identities = 266/403 (66%), Positives = 321/403 (79%)
 Frame = -3

Query: 1536 PTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNA 1357
            P+  SS IS LF +WC+++GK+Y S +E+ +RLKVFE+NY +V++HNS  NSSY+L+LNA
Sbjct: 18   PSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNA 77

Query: 1356 FADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGV 1177
            FADLTHHEFK   LGLS +       LN    ++E + +V   DIP+S+DWR KG VT V
Sbjct: 78   FADLTHHEFKTSRLGLSAAP------LNLAHRNLEITGVV--GDIPASIDWRNKGVVTNV 129

Query: 1176 KDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFV 997
            KDQGSCGACWSFSATGA+EGINKIVTGSL+SLSEQELI+CD+SYNDGCGGGLMDYA++FV
Sbjct: 130  KDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFV 189

Query: 996  IKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGI 817
            I N GIDTE+DYPY+  + TCNK+++KR VVTID YVD+P  NEK+LLQAVA QP+SVGI
Sbjct: 190  INNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGI 249

Query: 816  SGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYII 637
             GS  +FQ+YS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG  WGM GYM++ 
Sbjct: 250  CGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQ 309

Query: 636  RNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLC 457
            RN+GN +GVCGINMLASYPVK              T+CNL TYC++GETCCCA +F G+C
Sbjct: 310  RNSGNSQGVCGINMLASYPVK-TSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGIC 368

Query: 456  LKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNAT 328
            + WKCC   SAVCCKDR HCCPHDYP+CDT +N+C K+ GNAT
Sbjct: 369  ISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 411


>ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|302142569|emb|CBI19772.3|
            unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  574 bits (1479), Expect = e-161
 Identities = 268/432 (62%), Positives = 328/432 (75%), Gaps = 3/432 (0%)
 Frame = -3

Query: 1527 KSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFAD 1348
            ++SS +DLF +WC+QYGKTY+S +EK  RLKVFEEN+A+V+QHNS+AN+SYTL+LNAFAD
Sbjct: 21   EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80

Query: 1347 LTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQ 1168
            LTHHEFKA  LG SP     I  + +          V+E  +P ++DWRK GAVTGVKDQ
Sbjct: 81   LTHHEFKASRLGFSPGRAQSIRSVGTP---------VQELHVPPAVDWRKSGAVTGVKDQ 131

Query: 1167 GSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 988
            G+CG CWSFS TGA+EGINKIVTGSL+SLSEQEL+DCDRSYN GC GGLMDYAY+FVIKN
Sbjct: 132  GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191

Query: 987  QGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 808
            QGID+E DYPY G +K CNKEKLK+H+VTID Y DIPP +EK+LLQ VA QP+SVGI GS
Sbjct: 192  QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251

Query: 807  GNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNT 628
              +FQLYS G++TG CS+ LDHAVLIVGY ++DG D+WIVKNSWG++WGM GY++++RN 
Sbjct: 252  EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311

Query: 627  GNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKW 448
            G  EG+CGINMLASYP K              T+C+ F+ CS GETCCC+WRF+G+CL W
Sbjct: 312  GTAEGICGINMLASYPAK-TSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSW 370

Query: 447  KCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNAT---IAKQPGNNFSTG*AIGV 277
             CC AKSAVCC + ++CCP  +PICDT+RN CLK  GN T   + K+ G           
Sbjct: 371  NCCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRG----------- 419

Query: 276  LLSMIELGGWSS 241
              S ++ GGWSS
Sbjct: 420  --SSVKFGGWSS 429


>emb|CDX94938.1| BnaC05g07330D [Brassica napus]
          Length = 442

 Score =  572 bits (1474), Expect = e-160
 Identities = 266/409 (65%), Positives = 320/409 (78%)
 Frame = -3

Query: 1545 IDVPTCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLS 1366
            +  P+  S  IS+LF++WCQ++GKTY S +E+++R+++F +N+ +V++HN +ANS+Y+LS
Sbjct: 23   LSFPSSSSDDISELFDAWCQRHGKTYASEEERQHRIEIFRDNHDFVTRHNGIANSTYSLS 82

Query: 1365 LNAFADLTHHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAV 1186
            LNAFADLTHHEFKA  LGLS S+  L++      ++V G        +P S+DWRKKGAV
Sbjct: 83   LNAFADLTHHEFKASRLGLSASSAPLLVAKGESVENVGGK-------VPDSVDWRKKGAV 135

Query: 1185 TGVKDQGSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAY 1006
            T VKDQGSCGACWSFSATGAMEGIN+IVTG LISLSEQELIDCD+SYNDGC GGLMDYA+
Sbjct: 136  TNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMDYAF 195

Query: 1005 EFVIKNQGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPIS 826
            +FVIKN GIDTEKDYPYQ  + TC K+KLKR VVTIDSY  +   +EK LL+AVA QP+S
Sbjct: 196  QFVIKNHGIDTEKDYPYQERDGTCKKDKLKRKVVTIDSYAGVKSNDEKALLEAVAAQPVS 255

Query: 825  VGISGSGNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYM 646
            VGI GS  +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WGMDG+M
Sbjct: 256  VGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFM 315

Query: 645  YIIRNTGNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFL 466
            ++ RNTGN EGVCGINMLASYP+K              T+CNLFTYC++ ETCCCA    
Sbjct: 316  HMQRNTGNSEGVCGINMLASYPIK-THPNPPPPSPSGPTKCNLFTYCAADETCCCARNLF 374

Query: 465  GLCLKWKCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAK 319
            GLC  WKCCE +SAVCCKD  HCCP DYP+CDT R+LCLK+ GN T  K
Sbjct: 375  GLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTRSLCLKKTGNFTEIK 423


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  570 bits (1470), Expect = e-159
 Identities = 269/394 (68%), Positives = 309/394 (78%), Gaps = 1/394 (0%)
 Frame = -3

Query: 1524 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1345
            SS IS LF SW +++GKTY S ++K YR K+FEENY +V +HNS  NSSYTLSLNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 1344 THHEFKAKYLGLSP-SADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQ 1168
            THHEFKA  LGLS  S    + R N       G       D+P S+DWRKKGAV+ VKDQ
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLHDFVG-------DVPISIDWRKKGAVSQVKDQ 137

Query: 1167 GSCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 988
            G+CGACWSFSATGA+EGINKIVTGSL+SLSEQEL+DCDRSYN+GC GGLMDYAY+FVI+N
Sbjct: 138  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197

Query: 987  QGIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 808
             GIDTE+DYPYQ  EKTCNKEKLKRHVVTID Y D+P  NEK LL+AVA QP+SVGI GS
Sbjct: 198  NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257

Query: 807  GNSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNT 628
              +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG +WG++GYMY++RN+
Sbjct: 258  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317

Query: 627  GNPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKW 448
            GN +G+CGINMLAS+PVK              T+C+LFT C  GETCCC  R  GLC  W
Sbjct: 318  GNSQGLCGINMLASFPVK-TSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSW 376

Query: 447  KCCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLK 346
            KCCE  SAVCCKD  HCCPHDYP+CDT+RN+CLK
Sbjct: 377  KCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  567 bits (1461), Expect = e-158
 Identities = 265/402 (65%), Positives = 316/402 (78%)
 Frame = -3

Query: 1524 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1345
            S  IS+LF+ WCQ++GKTY S +E++ R+++F++N+ +V+QHN + N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 1344 THHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQG 1165
            THHEFKA  LGLS SA  +I+       + +G +L     +P S+DWRKKGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 1164 SCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQ 985
            SCGACWSFSATGAMEGIN+IVTG LISLSEQELIDCD+SYN GC GGLMDYA+EFVIKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 984  GIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 805
            GIDTEKDYPYQ  + TC K+KLK+ VVTIDSY  +   +EK L++AVA QP+SVGI GS 
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 804  NSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTG 625
             +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WGMDG+M++ RNT 
Sbjct: 258  RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 624  NPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKWK 445
            N +GVCGINMLASYP+K              T+CNLFTYCSSGETCCCA    GLC  WK
Sbjct: 318  NSDGVCGINMLASYPIK-THPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWK 376

Query: 444  CCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNATIAK 319
            CCE +SAVCCKD  HCCPHDYP+CDT R+LCLK+ GN T  K
Sbjct: 377  CCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIK 418


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  567 bits (1461), Expect = e-158
 Identities = 265/398 (66%), Positives = 314/398 (78%)
 Frame = -3

Query: 1521 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1342
            S IS LF +WC Q+GK Y+S +EK YRLKVFEENYA+V+QHN + NSSY+L+LNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 1341 HHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQGS 1162
            HHEFKA  LGLS +A      +     +++   LV+  DIP+S+DWR KGAVT VKDQGS
Sbjct: 84   HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135

Query: 1161 CGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQG 982
            CGACWSFSATGA+EGINKIVTG+L+SLSEQEL+DCDRSYN GC GGLMDYAY+FVI N G
Sbjct: 136  CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195

Query: 981  IDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 802
            ID E+DYPY G EKTCNKEK KR VVTID Y  +P  NE  LLQAVA QP+SVGI GS  
Sbjct: 196  IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255

Query: 801  SFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTGN 622
            +FQLYS GIFTG CS+ LDHAVLIVGY S++G DYWIVKNSWG  WGM+GY++++RN+G+
Sbjct: 256  AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315

Query: 621  PEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKWKC 442
             +G+CGINMLASYP K              T+C+LFTYCS+GETCCC  R  G+C  WKC
Sbjct: 316  SKGLCGINMLASYPTK-TSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKC 374

Query: 441  CEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQIGNAT 328
            CE  SAVCCKD  HCCP+DYP+CDT+++ CLK++GNAT
Sbjct: 375  CELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNAT 412


>ref|XP_010110007.1| Oryzain alpha chain [Morus notabilis] gi|587938276|gb|EXC25025.1|
            Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  567 bits (1460), Expect = e-158
 Identities = 264/394 (67%), Positives = 310/394 (78%)
 Frame = -3

Query: 1524 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1345
            S + S LF +WC+++G++Y+S +E+ YRL VFE+N A+V+QHN++ NSSYTLSLNAFADL
Sbjct: 23   SLNSSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADL 82

Query: 1344 THHEFKAKYLGLSPSADDLIIRLNSGSDSVEGSNLVKESDIPSSLDWRKKGAVTGVKDQG 1165
            THHEFK+  LG S +    + +L        GS L+   D+P+SLDWRKKGAVT VKDQG
Sbjct: 83   THHEFKSSRLGFSSALLSSLPKL--------GSKLLDLRDVPASLDWRKKGAVTNVKDQG 134

Query: 1164 SCGACWSFSATGAMEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNQ 985
            SCGACW+FSATGA+EGINKIVTGSL+SLSEQELIDCD SYN GC GGLMDYAY+FVI N 
Sbjct: 135  SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNH 194

Query: 984  GIDTEKDYPYQGGEKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 805
            GIDTE+DYPYQ  +K+C KEKLKR VVTID Y D+ P N  +LLQAV TQP+SVGI GS 
Sbjct: 195  GIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSE 254

Query: 804  NSFQLYSGGIFTGSCSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGMDGYMYIIRNTG 625
             +FQLYS GIFTG CST LDHAVLIVGYDS++G DYWIVKNSWGK WGMDGY+++ RNTG
Sbjct: 255  RAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTG 314

Query: 624  NPEGVCGINMLASYPVKXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCAWRFLGLCLKWK 445
            N +GVCGINMLASYP K              TRC+ F  C  GETCCC+WRFLGLC  WK
Sbjct: 315  NSQGVCGINMLASYPTK-TSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWK 373

Query: 444  CCEAKSAVCCKDRSHCCPHDYPICDTRRNLCLKQ 343
            CC   SAVCCKD+ HCCP DYP+CDT+RN+CLK+
Sbjct: 374  CCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLKE 407

