BLASTX nr result

ID: Forsythia23_contig00023292 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00023292
         (1555 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum]          664   0.0  
ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus...   663   0.0  
emb|CDP07460.1| unnamed protein product [Coffea canephora]            644   0.0  
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   594   e-167
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   594   e-167
ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosif...   590   e-166
ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum]     590   e-165
ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine ...   590   e-165
gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum]   589   e-165
ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii]...   588   e-165
ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine ...   586   e-164
ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris]     585   e-164
ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine ...   581   e-163
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   580   e-162
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   578   e-162
ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|3...   573   e-160
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   571   e-160
ref|XP_008444761.1| PREDICTED: zingipain-2 [Cucumis melo]             570   e-160
ref|XP_004500967.1| PREDICTED: zingipain-2 [Cicer arietinum]          568   e-159
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   568   e-159

>ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum]
          Length = 439

 Score =  664 bits (1712), Expect = 0.0
 Identities = 305/406 (75%), Positives = 345/406 (84%)
 Frame = -2

Query: 1341 TCKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVF 1162
            TCKSSSIS LF++WC++YGKTY S QEK+ R KVFE+NY YV+ HN+  NSSYTLSLN F
Sbjct: 19   TCKSSSISHLFDSWCKEYGKTYASEQEKQQRFKVFEQNYEYVTLHNARPNSSYTLSLNAF 78

Query: 1161 ADLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIK 982
            ADLT+HEFKAKYLGLS SA +  IRLN     +EG +LVKESD+PSS+DWRK+GAVT +K
Sbjct: 79   ADLTNHEFKAKYLGLSLSASNLAIRLNSEQVGIEGPDLVKESDLPSSVDWRKQGAVTEVK 138

Query: 981  DQGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVI 802
            DQGSCGACWSFS TGA+EGINKIVTGSLISLSEQELIDCD+SYNDGCGGGLMDYAY+F+I
Sbjct: 139  DQGSCGACWSFSTTGAVEGINKIVTGSLISLSEQELIDCDKSYNDGCGGGLMDYAYQFII 198

Query: 801  KNKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGIS 622
            KNKGIDTEKDYPYQGRE TC KEKLK+HVVTIDSY DI  KNEK+L QAVATQP+SVGI 
Sbjct: 199  KNKGIDTEKDYPYQGREGTCKKEKLKKHVVTIDSYADITAKNEKKLQQAVATQPVSVGIC 258

Query: 621  GSGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIR 442
            GS  SFQLYSGGIFTG CS  LDHAVLIVGYDSKDG+DYWI+KNSWG+YWG+DGYM++ R
Sbjct: 259  GSEKSFQLYSGGIFTGPCSASLDHAVLIVGYDSKDGQDYWIIKNSWGRYWGMDGYMHMQR 318

Query: 441  NNGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLK 262
            N GN EG+CGIN+LASYP+K              RCN FTYCSSGETCCC R   G+CLK
Sbjct: 319  NTGNGEGLCGINLLASYPIKTSPNPPPSPPPGPVRCNLFTYCSSGETCCCTRDIFGICLK 378

Query: 261  WKCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQL 124
            WKCC A+SAVCC+DR  CCPHDYP+CDTKRN+CLK IGN+T+A+ L
Sbjct: 379  WKCCGAESAVCCQDRRSCCPHDYPVCDTKRNMCLKWIGNSTVAQPL 424


>ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus]
            gi|604331887|gb|EYU36745.1| hypothetical protein
            MIMGU_mgv1a006749mg [Erythranthe guttata]
          Length = 433

 Score =  663 bits (1710), Expect = 0.0
 Identities = 303/406 (74%), Positives = 350/406 (86%), Gaps = 1/406 (0%)
 Frame = -2

Query: 1335 KSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFAD 1156
            KSS ISDLF++WC++YGKTY S QEK++RL VF ENY YV+QHN+ ANSSYTLS+N FAD
Sbjct: 21   KSSLISDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADANSSYTLSVNAFAD 80

Query: 1155 LTHHEFKAKYLGLSPSADDSIIRLN-RGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979
            LT+HEF+A YLGLSPS  DS+IRLN R + +++G NL+KES+IPSSLDWR KGAVT +KD
Sbjct: 81   LTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSLDWRNKGAVTAVKD 140

Query: 978  QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799
            QGSCGACWSFSATGA+EGIN+I TGSL+SLSEQELIDCD+SYNDGC GGLMDYAY+F+IK
Sbjct: 141  QGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCNGGLMDYAYDFIIK 200

Query: 798  NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619
            NKGIDTE+DY Y+GR  TC+K K+ +HVVTIDSYVDIP K+EK+LLQAVATQPISVGI G
Sbjct: 201  NKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQAVATQPISVGICG 260

Query: 618  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439
            S +SFQLYSGGIFTG CST LDHAVLIVGYDSKDGKDYWI+KNSWGK WGI GYM+++RN
Sbjct: 261  SDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGKSWGIKGYMHMVRN 320

Query: 438  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259
            +G+ EGVCGIN LASYPVK             T+CN FTYCSSGETCCCAR FLG+CL W
Sbjct: 321  SGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETCCCARYFLGVCLSW 380

Query: 258  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLG 121
             CCEA+SAVCC D  HCCPHDYP+CDTK+NLCLK+ GN T++K LG
Sbjct: 381  NCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLG 426


>emb|CDP07460.1| unnamed protein product [Coffea canephora]
          Length = 441

 Score =  644 bits (1662), Expect = 0.0
 Identities = 300/431 (69%), Positives = 347/431 (80%)
 Frame = -2

Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159
            CKSS  +DLF NWC+Q+GKTY S +EK+YRL+VFE+NY YV++HNSLANS+YTLSLN FA
Sbjct: 20   CKSSLTADLFENWCKQHGKTYPSEEEKQYRLRVFEDNYDYVTKHNSLANSTYTLSLNAFA 79

Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979
            DLTHHEFKAKYLG S SAD  +IRLNRGS S+  S  V + DIPSSLDWR KGAVT +KD
Sbjct: 80   DLTHHEFKAKYLGFSASAD-GLIRLNRGSSSIGASGAVGKYDIPSSLDWRNKGAVTNVKD 138

Query: 978  QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799
            QGSCGACW+FSATGAIEGIN+IVTGSL+SLSEQELIDCDRSYN+GC GGLMDY YEFV+K
Sbjct: 139  QGSCGACWAFSATGAIEGINEIVTGSLVSLSEQELIDCDRSYNNGCNGGLMDYTYEFVVK 198

Query: 798  NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619
            N GIDTE+DYP++GR+ TCN  KLKR VV+ID Y+D+P  NE+ LLQAVA QP+SVGI G
Sbjct: 199  NGGIDTEQDYPFKGRDGTCNSNKLKRRVVSIDGYIDVPANNEQELLQAVAAQPVSVGICG 258

Query: 618  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439
            S   FQLYSGGIFTG CST LDHAVLIVGYDSK+G DYWIVKNSWG  WGI+GY++IIRN
Sbjct: 259  SERGFQLYSGGIFTGPCSTSLDHAVLIVGYDSKNGADYWIVKNSWGTSWGINGYIHIIRN 318

Query: 438  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259
            +GN  GVCGINM+ASYP K             T+C+ F+ C +GETCCC+  FLGLCL W
Sbjct: 319  SGNSAGVCGINMMASYPTKSSLNPPPSPPPGPTKCSLFSSCPAGETCCCSMEFLGLCLSW 378

Query: 258  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLS 79
            KCC+  SAVCCKDR HCCPHDYPICDTKRNLCL+++GN+T+ KQL N   +G        
Sbjct: 379  KCCDLDSAVCCKDRLHCCPHDYPICDTKRNLCLRRMGNSTLVKQLKNGGRSG-------- 430

Query: 78   MIELGGWSSYF 46
              + G WSS F
Sbjct: 431  --KFGDWSSLF 439


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  594 bits (1532), Expect = e-167
 Identities = 277/412 (67%), Positives = 323/412 (78%)
 Frame = -2

Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159
            C  SSISDLF  WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS  NSSYTL LN ++
Sbjct: 20   CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79

Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979
            DLTHHEF+  +LGLS SA+D I    RGS S E + ++ + D PSSLDWR+KGAVT +K+
Sbjct: 80   DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSE-TGVLSDVDAPSSLDWREKGAVTDVKN 138

Query: 978  QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799
            QGSCGACWSFSATGA+EGINKI TGSL+SLSEQELIDCDRSYN+GCGGGLMDYA+EFVIK
Sbjct: 139  QGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIK 198

Query: 798  NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619
            N GIDTEKDYP++ RE TCNK KL+RHVVTID Y DIP  +E +LL+AVATQP+SVGI G
Sbjct: 199  NGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258

Query: 618  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439
            S  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WGI+GY+++ RN
Sbjct: 259  SARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRN 318

Query: 438  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259
            +GN EG+CGIN LASYP K             ++C+ FT C  GETCCC  +FLG+CL W
Sbjct: 319  SGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSW 378

Query: 258  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG 103
            KCC   SAVCCKD  HCCP DYPICDT RNLCLK++ NATI +Q      TG
Sbjct: 379  KCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKEAFTG 430


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  594 bits (1532), Expect = e-167
 Identities = 278/405 (68%), Positives = 327/405 (80%), Gaps = 1/405 (0%)
 Frame = -2

Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153
            SSSISDLF++WCQ++GKTY S +E+E+RL VF ENY +++ HN+ AN SYTLSLN FADL
Sbjct: 23   SSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSLNAFADL 82

Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973
            T  EF  +YLG SPS  D +IR NRGS S    N    S +PSS+DWRKKGAVTGIKDQG
Sbjct: 83   TRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVTGIKDQG 139

Query: 972  SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793
            SCGACWSFSATGAIEGIN+IVTGSL+SLSEQELIDCD SYN GC GGLMDYAYEF++KNK
Sbjct: 140  SCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYEFILKNK 199

Query: 792  GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613
            GIDTE+DY Y+GR+ +C++ KL + VVTIDSYVDIP KNE+ LL+AVA+QP+SVGISG  
Sbjct: 200  GIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSVGISGGD 259

Query: 612  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433
              FQ YS GIFTG CST LDHAVLIVGYDSK+GKDYWIVKNSWGK WG+DGYMY+ RN G
Sbjct: 260  APFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMYVQRNTG 319

Query: 432  NPEGVCGINMLASYPVK-XXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 256
            N  G+C INM+ASYPVK              T+C+ F+YCS GETCCCARRFLGLC+++K
Sbjct: 320  NQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLGLCMRYK 379

Query: 255  CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLG 121
            CC A+SAVCC+D  HCCP DYPICDT +++C K  GN+T+A  +G
Sbjct: 380  CCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIPVG 424


>ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosiformis]
          Length = 439

 Score =  590 bits (1522), Expect = e-166
 Identities = 278/404 (68%), Positives = 319/404 (78%)
 Frame = -2

Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159
            C  SSISDLF +WCQQ GKTY+S QE+ YRL+VFEENYAY+ +HNS  NS+YTL LN F+
Sbjct: 20   CTCSSISDLFESWCQQNGKTYSSEQERVYRLEVFEENYAYIIEHNSKGNSTYTLDLNAFS 79

Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979
            DLTHHEFK  +LGLS SA+D  IRL  GS S    N V   DIPSSLDWR+KGAVT +K+
Sbjct: 80   DLTHHEFKNSFLGLSSSAND-FIRLKTGSSSAGVFNDVGVVDIPSSLDWREKGAVTKVKN 138

Query: 978  QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799
            QGSCGACWSFSATGAIEGINKIVTGSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV K
Sbjct: 139  QGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFVKK 198

Query: 798  NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619
            N GIDTE+DYP+  RE TCNK KL+R VVTID Y D+P  +E +LL+AVA QP+SVGI G
Sbjct: 199  NGGIDTEEDYPFNEREGTCNKNKLQRRVVTIDGYTDVPQYDEDKLLKAVANQPVSVGICG 258

Query: 618  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439
            S  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WGI+GYM++ RN
Sbjct: 259  SERAFQSYSKGIFTGPCSTVLDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQRN 318

Query: 438  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259
            +GN EG+CGIN LASYP K             ++C+ FT C  GETCCC  R LG+C+ W
Sbjct: 319  SGNQEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWRLLGVCVSW 378

Query: 258  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 127
            KCC   SAVCCKD  HCCPHDYPICDT RNLCLK++ NATI +Q
Sbjct: 379  KCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQ 422


>ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum]
          Length = 439

 Score =  590 bits (1521), Expect = e-165
 Identities = 273/404 (67%), Positives = 318/404 (78%)
 Frame = -2

Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159
            C  SSISDLF  WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS  NSSYTL LN ++
Sbjct: 20   CTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAYS 79

Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979
            DLTHHEF+  +LGLS SA+D I    RGS S   + ++ + D PSSLDWR KGAVT +K+
Sbjct: 80   DLTHHEFRNSFLGLSSSANDFIRLKGRGSGS-SAAGVLSDVDAPSSLDWRDKGAVTNVKN 138

Query: 978  QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799
            QGSCGACWSFSATGAIEGINKI TGSL+SLSEQELIDCDRSYN GCGGGLMDYA+EFVIK
Sbjct: 139  QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198

Query: 798  NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619
            N GIDTEKDYP++ +E TCNK KL+R VVTID Y DIP  +E +LL+AVATQP+SVGI G
Sbjct: 199  NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258

Query: 618  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439
            S  +FQ YS GIFTG C TDLDHAVLIVGY S++G DYWI+KNSWG  WGI+GY+++ RN
Sbjct: 259  SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318

Query: 438  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259
            +GN EG+CG+N LASYP K             ++C+TFT C  GETCCC  +FLG+CL W
Sbjct: 319  SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378

Query: 258  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 127
            KCC   SAVCCKD  HCCP DYPICDT RNLCLK++ NATI +Q
Sbjct: 379  KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQ 422


>ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine proteinase [Nelumbo
            nucifera]
          Length = 443

 Score =  590 bits (1520), Expect = e-165
 Identities = 269/425 (63%), Positives = 331/425 (77%)
 Frame = -2

Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150
            SS SDLF+ WC+++G+TY+S +E+ +RLKVFE+N A+V++HNS+ANS+Y+L+LN FADLT
Sbjct: 23   SSTSDLFDRWCEEHGRTYSSEEERLFRLKVFEDNLAFVTEHNSMANSTYSLALNAFADLT 82

Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970
            HHEFK   LGL+ +A D +    R    +E  ++  +  +PSS+DWR+KGAVT +KDQGS
Sbjct: 83   HHEFKISRLGLAAAATDMVRSSPRAPSLIESPSIAGQ--LPSSIDWREKGAVTNVKDQGS 140

Query: 969  CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790
            CGACWSFSATGAIEGINKIVTGS +SLSEQEL+DCDRSYN GCGGGLMDYA+++VIKNKG
Sbjct: 141  CGACWSFSATGAIEGINKIVTGSPLSLSEQELVDCDRSYNSGCGGGLMDYAFQWVIKNKG 200

Query: 789  IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610
            IDTE DYPYQG E+TCNK+KL++HVVTID Y D+P  +EK LLQAVA+QP+SVGI GS  
Sbjct: 201  IDTEDDYPYQGGERTCNKDKLRKHVVTIDGYTDVPSNSEKHLLQAVASQPVSVGICGSER 260

Query: 609  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430
            +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWG  WG++GYM+++RN+G+
Sbjct: 261  AFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYMHMLRNSGS 320

Query: 429  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250
            P+GVCGINMLASYP K             TRC+  TYC  GETCCC RR LG+C  WKCC
Sbjct: 321  PQGVCGINMLASYPTKTSPNPPPSPSPGPTRCDLLTYCQEGETCCCTRRILGICFSWKCC 380

Query: 249  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSMIE 70
            E  SAVCCKD  +CCPHDYPICDT+R  CLK  GN T  K L  +           S+++
Sbjct: 381  ELDSAVCCKDHRYCCPHDYPICDTERKQCLKSTGNFTSVKSLDKS----------SSLVK 430

Query: 69   LGGWS 55
             GGW+
Sbjct: 431  FGGWN 435


>gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum]
          Length = 431

 Score =  589 bits (1518), Expect = e-165
 Identities = 273/407 (67%), Positives = 321/407 (78%)
 Frame = -2

Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150
            S IS  F  WCQQ+GK+Y S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LN FAD T
Sbjct: 26   SHISKKFETWCQQHGKSYLSEEEKSYRLKVFEDNYAFVTQHNAMVNSSYSLALNAFADFT 85

Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970
            HHEFKA  LGLS +A      +     ++    LV+  DIP SLDWR+KGAVT +KDQGS
Sbjct: 86   HHEFKASRLGLSGAA------IQFRHPNLREPRLVR--DIPDSLDWREKGAVTQVKDQGS 137

Query: 969  CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790
            CGACWSFSATGAIEG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G
Sbjct: 138  CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197

Query: 789  IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610
            IDTE+DYPYQGRE TCNKEKLKRHVVTID Y D+P  NEK+LLQAVATQP+SVGI GS  
Sbjct: 198  IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSER 257

Query: 609  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430
            +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG+ WG++GY+++IRN+G 
Sbjct: 258  AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGRRWGMNGYIHMIRNSGK 317

Query: 429  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250
             EG+CGINMLASYP+K             T+C+ FTYCS+GETCCC  R  G+C  WKCC
Sbjct: 318  SEGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFSWKCC 377

Query: 249  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFS 109
               SAVCCKD  HCCPH+YPICDTK N CLK++GNATI +    N +
Sbjct: 378  GLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSDTNLA 424


>ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii]
            gi|763791179|gb|KJB58175.1| hypothetical protein
            B456_009G197900 [Gossypium raimondii]
          Length = 431

 Score =  588 bits (1516), Expect = e-165
 Identities = 272/407 (66%), Positives = 321/407 (78%)
 Frame = -2

Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150
            S IS +F  WC Q+GK+Y+S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LN FADLT
Sbjct: 26   SHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAMTNSSYSLALNAFADLT 85

Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970
            HHEFKA  LGLS +A      +     ++    LV+  DIP+SLDWR+KGAVT +KDQGS
Sbjct: 86   HHEFKASRLGLSGAA------IQFRCSNLREPRLVR--DIPASLDWREKGAVTQVKDQGS 137

Query: 969  CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790
            CGACWSFSATGAIEG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G
Sbjct: 138  CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197

Query: 789  IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610
            IDTE+DYPYQGRE TCNKEKLKRHVVTID Y D+P  NEK+LLQAVATQP+SVGI GS  
Sbjct: 198  IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMTNEKKLLQAVATQPVSVGICGSER 257

Query: 609  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430
            +FQLY  GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG  WG++GY+++IRN G 
Sbjct: 258  AFQLYCKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGK 317

Query: 429  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250
             EG+CGINMLASYP+K             T+C+ FTYCS+GETCCC  R  G+C  WKCC
Sbjct: 318  SEGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKCC 377

Query: 249  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFS 109
               SAVCCKD  HCCPH+YPICDTK N CLK++GNATI +    N +
Sbjct: 378  GLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSNTNLA 424


>ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine proteinase [Jatropha
            curcas] gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha
            curcas] gi|643731232|gb|KDP38570.1| hypothetical protein
            JCGZ_04495 [Jatropha curcas]
          Length = 441

 Score =  586 bits (1510), Expect = e-164
 Identities = 270/410 (65%), Positives = 322/410 (78%)
 Frame = -2

Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153
            SS I+ LF  WCQQ+GKTY S +EK +RLKVF++NY +V++HNS  NSSYTLSLN FADL
Sbjct: 23   SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82

Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973
            THHEFKA  LGLS +A  S+  ++R +  +        +D+P+S+DWRK GAVT +KDQG
Sbjct: 83   THHEFKASRLGLSSAASASL-NVDRSNRQIPDF----VADVPASVDWRKNGAVTQVKDQG 137

Query: 972  SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793
            +CGACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCD+SYN+GC GG+MDYA++FVI N 
Sbjct: 138  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNH 197

Query: 792  GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613
            GIDTE+DYPYQGR+++CNKEKLKRHVVTID YVD+P  NEK LL+AVA QP+SVGI GS 
Sbjct: 198  GIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSE 257

Query: 612  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433
             +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG YWG+DGYM++ RN+G
Sbjct: 258  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSG 317

Query: 432  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253
            +  G+CGINMLASYP K             TRC+ FT+C  GETCCC     G+CL WKC
Sbjct: 318  SSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKC 377

Query: 252  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG 103
            CE  SAVCCKD  HCCP DYP+CDT RN+CLK  GNAT  ++   N S+G
Sbjct: 378  CELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSG 427


>ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris]
          Length = 439

 Score =  585 bits (1508), Expect = e-164
 Identities = 274/404 (67%), Positives = 318/404 (78%)
 Frame = -2

Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159
            C  SSISDLF  WCQQ GK+Y+S QE+ YRLKVFEENYAY+ +HNS  NS+YTL LN ++
Sbjct: 20   CTCSSISDLFETWCQQNGKSYSSEQERVYRLKVFEENYAYIIEHNSKGNSTYTLGLNAYS 79

Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979
            DLTHHEFK  +LGLS SA+D  IRL  GS S    + V + D+PSSLDWR+KGAVT +K+
Sbjct: 80   DLTHHEFKNSFLGLSSSAND-FIRLKTGSSSAGVFSDVGDVDVPSSLDWREKGAVTKVKN 138

Query: 978  QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799
            QGSCGACWSFSATGAIEGINKIV+GSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV K
Sbjct: 139  QGSCGACWSFSATGAIEGINKIVSGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFVKK 198

Query: 798  NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619
            N GIDTE+DYP+  RE TCNK KL+R VVTID Y D+P  +E +LL+AVA QP+SVGI G
Sbjct: 199  NGGIDTEEDYPFIEREGTCNKNKLQRRVVTIDGYTDVPQNDEDKLLKAVAKQPVSVGICG 258

Query: 618  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439
            S  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WGI+GYM++ RN
Sbjct: 259  SERAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQRN 318

Query: 438  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259
            +GN EG+CGIN LASYP K             ++C+ FT C  GETCCC    LG+CL W
Sbjct: 319  SGNQEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWSLLGVCLSW 378

Query: 258  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 127
            KCC   SAVCCKD  HCCPHDYPICDT RNLCLK++ NATI +Q
Sbjct: 379  KCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQ 422


>ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine proteinase [Populus
            euphratica]
          Length = 480

 Score =  581 bits (1497), Expect = e-163
 Identities = 265/398 (66%), Positives = 321/398 (80%)
 Frame = -2

Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153
            SS IS LF  WC+++GK+Y S +E+ +RLKVFE+NY +V++HNS  NSSYTLSLN F+DL
Sbjct: 66   SSGISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYTLSLNAFSDL 125

Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973
            THHEFK   LGLS +       +N G  ++E + +V   DIP+S+DWR KGAVT +KDQG
Sbjct: 126  THHEFKTSRLGLSAAP------MNLGHRNLEITGVV--GDIPASIDWRNKGAVTNVKDQG 177

Query: 972  SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793
            SCGACWSFSATGAIEGINKIVTGSL+SLSEQELI+CD+S+NDGCGGGLMDYA++FVI N 
Sbjct: 178  SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSFNDGCGGGLMDYAFQFVINNH 237

Query: 792  GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613
            GIDTE+DYPY+ R+ TCNK+K+KR VVTID YVD+P  NEK+LLQAVA QP+SVGI GS 
Sbjct: 238  GIDTEEDYPYRARDGTCNKDKMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSE 297

Query: 612  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433
             +FQ+YS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG  WG+ GYM++ RN+G
Sbjct: 298  RAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSG 357

Query: 432  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253
            N +GVCGINMLASYPVK             T+C+ F+YC++GETCCCAR+F G+C+ WKC
Sbjct: 358  NSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCDLFSYCAAGETCCCARKFFGICISWKC 417

Query: 252  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNAT 139
            C   SAVCCKDR HCCPHDYP+CDT +N+C K+ GNAT
Sbjct: 418  CGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 455


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  580 bits (1494), Expect = e-162
 Identities = 270/393 (68%), Positives = 309/393 (78%), Gaps = 1/393 (0%)
 Frame = -2

Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153
            SS IS LF +W +++GKTY S ++K YR K+FEENY +V +HNS  NSSYTLSLN FADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 1152 THHEFKAKYLGLSP-SADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQ 976
            THHEFKA  LGLS  S    + R N       G       D+P S+DWRKKGAV+ +KDQ
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLHDFVG-------DVPISIDWRKKGAVSQVKDQ 137

Query: 975  GSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 796
            G+CGACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCDRSYN+GC GGLMDYAY+FVI+N
Sbjct: 138  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197

Query: 795  KGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 616
             GIDTE+DYPYQ REKTCNKEKLKRHVVTID Y D+P  NEK LL+AVA QP+SVGI GS
Sbjct: 198  NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257

Query: 615  GNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNN 436
              +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG +WGI+GYMY++RN+
Sbjct: 258  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317

Query: 435  GNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 256
            GN +G+CGINMLAS+PVK             T+C+ FT C  GETCCC RR  GLC  WK
Sbjct: 318  GNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWK 377

Query: 255  CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLK 157
            CCE  SAVCCKD  HCCPHDYP+CDTKRN+CLK
Sbjct: 378  CCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  578 bits (1489), Expect = e-162
 Identities = 264/398 (66%), Positives = 318/398 (79%)
 Frame = -2

Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153
            SS IS LF  WC+++GK+Y S +E+ +RLKVFE+NY +V++HNS  NSSY+L+LN FADL
Sbjct: 22   SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81

Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973
            THHEFK   LGLS +       LN    ++E + +V   DIP+S+DWR KG VT +KDQG
Sbjct: 82   THHEFKTSRLGLSAAP------LNLAHRNLEITGVV--GDIPASIDWRNKGVVTNVKDQG 133

Query: 972  SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793
            SCGACWSFSATGAIEGINKIVTGSL+SLSEQELI+CD+SYNDGCGGGLMDYA++FVI N 
Sbjct: 134  SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNH 193

Query: 792  GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613
            GIDTE+DYPY+ R+ TCNK+++KR VVTID YVD+P  NEK+LLQAVA QP+SVGI GS 
Sbjct: 194  GIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSE 253

Query: 612  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433
             +FQ+YS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG  WG+ GYM++ RN+G
Sbjct: 254  RAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSG 313

Query: 432  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253
            N +GVCGINMLASYPVK             T+CN  TYC++GETCCCAR+F G+C+ WKC
Sbjct: 314  NSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKC 373

Query: 252  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNAT 139
            C   SAVCCKDR HCCPHDYP+CDT +N+C K+ GNAT
Sbjct: 374  CGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 411


>ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|302142569|emb|CBI19772.3|
            unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  573 bits (1478), Expect = e-160
 Identities = 266/428 (62%), Positives = 325/428 (75%)
 Frame = -2

Query: 1335 KSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFAD 1156
            ++SS +DLF  WC+QYGKTY+S +EK  RLKVFEEN+A+V+QHNS+AN+SYTL+LN FAD
Sbjct: 21   EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80

Query: 1155 LTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQ 976
            LTHHEFKA  LG SP    SI  +            V+E  +P ++DWRK GAVTG+KDQ
Sbjct: 81   LTHHEFKASRLGFSPGRAQSIRSVGTP---------VQELHVPPAVDWRKSGAVTGVKDQ 131

Query: 975  GSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 796
            G+CG CWSFS TGAIEGINKIVTGSL+SLSEQEL+DCDRSYN GC GGLMDYAY+FVIKN
Sbjct: 132  GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191

Query: 795  KGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 616
            +GID+E DYPY G +K CNKEKLK+H+VTID Y DIPP +EK+LLQ VA QP+SVGI GS
Sbjct: 192  QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251

Query: 615  GNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNN 436
              +FQLYS G++TG CS+ LDHAVLIVGY ++DG D+WIVKNSWG++WG+ GY++++RNN
Sbjct: 252  EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311

Query: 435  GNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 256
            G  EG+CGINMLASYP K             T+C+ F+ CS GETCCC+ RF+G+CL W 
Sbjct: 312  GTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSWN 371

Query: 255  CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSM 76
            CC AKSAVCC + ++CCP  +PICDTKRN CLK  GN T  + L    S+          
Sbjct: 372  CCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSS---------- 421

Query: 75   IELGGWSS 52
            ++ GGWSS
Sbjct: 422  VKFGGWSS 429


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  571 bits (1472), Expect = e-160
 Identities = 266/408 (65%), Positives = 315/408 (77%)
 Frame = -2

Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150
            S IS LF  WC Q+GK Y+S +EK YRLKVFEENYA+V+QHN + NSSY+L+LN FADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970
            HHEFKA  LGLS +A      +     +++   LV+  DIP+S+DWR KGAVT +KDQGS
Sbjct: 84   HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135

Query: 969  CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790
            CGACWSFSATGAIEGINKIVTG+L+SLSEQEL+DCDRSYN GC GGLMDYAY+FVI N G
Sbjct: 136  CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195

Query: 789  IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610
            ID E+DYPY GREKTCNKEK KR VVTID Y  +P  NE  LLQAVA QP+SVGI GS  
Sbjct: 196  IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255

Query: 609  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430
            +FQLYS GIFTG CS+ LDHAVLIVGY S++G DYWIVKNSWG  WG++GY++++RN+G+
Sbjct: 256  AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315

Query: 429  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250
             +G+CGINMLASYP K             T+C+ FTYCS+GETCCC  R  G+C  WKCC
Sbjct: 316  SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375

Query: 249  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFST 106
            E  SAVCCKD  HCCP+DYP+CDTK++ CLK++GNAT  +      ST
Sbjct: 376  ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKRHST 423


>ref|XP_008444761.1| PREDICTED: zingipain-2 [Cucumis melo]
          Length = 431

 Score =  570 bits (1470), Expect = e-160
 Identities = 272/428 (63%), Positives = 319/428 (74%)
 Frame = -2

Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153
            +S++S+LF  WC ++GK+Y+S +EK YRL VF +NY +V+ HN+L NSSYTLSLN +ADL
Sbjct: 22   TSNVSELFEIWCTEHGKSYSSAEEKLYRLSVFADNYEFVTHHNNLGNSSYTLSLNSYADL 81

Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973
            THHEFK   LG SP+         R    V         D+P SLDWRKKGAVT +KDQG
Sbjct: 82   THHEFKVSRLGFSPAL--------RNFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQG 133

Query: 972  SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793
            SCGACWSFSATGAIEGIN+I+TGSLIS+SEQELIDCDRSYN GCGGGLMDYAY+FVI N 
Sbjct: 134  SCGACWSFSATGAIEGINQIMTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVISNH 193

Query: 792  GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613
            GIDTE DYPYQGR+ +C K+KL+R+VVTID Y DIPP +E +LLQAVA QP+SVGI GS 
Sbjct: 194  GIDTEDDYPYQGRDGSCRKDKLQRNVVTIDGYTDIPPNDEGKLLQAVAAQPVSVGICGSE 253

Query: 612  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433
             +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WG+DGYM++ RN+G
Sbjct: 254  RAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSG 313

Query: 432  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253
            N EGVCGIN LASYP K             T+C+  T C++GETCCCA++FLGLCL WKC
Sbjct: 314  NSEGVCGINKLASYPTKTSPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKC 373

Query: 252  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSMI 73
            C   SAVCCKD  HCCP DYPICDT RNLCLK+  N T  + L N  S+G          
Sbjct: 374  CGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKRTMNGTRMEVLENRSSSG---------- 423

Query: 72   ELGGWSSY 49
              G WSS+
Sbjct: 424  SSGTWSSF 431


>ref|XP_004500967.1| PREDICTED: zingipain-2 [Cicer arietinum]
          Length = 436

 Score =  568 bits (1464), Expect = e-159
 Identities = 260/397 (65%), Positives = 306/397 (77%), Gaps = 2/397 (0%)
 Frame = -2

Query: 1320 SDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLTHHE 1141
            S LF  WC+Q+GKTY S QEK YR  VFE+NYA+V+QHN + NSSYTLSLN FADLTHHE
Sbjct: 27   SKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHE 86

Query: 1140 FKAKYLGLSPSADDSIIRL--NRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGSC 967
            FKA  LGL PS   S++R   NR  D     + ++   +PS +DWRK GAV+ +KDQGSC
Sbjct: 87   FKATRLGLPPS---SLLRFKFNRFQDQQRSDDFLQ---VPSEIDWRKNGAVSIVKDQGSC 140

Query: 966  GACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKGI 787
            GACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCD +YN GC GGLMDYAY+F+I N GI
Sbjct: 141  GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGI 200

Query: 786  DTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGNS 607
            DTE+DYPYQ R+  C K+KLKR VVTID Y D+PP +EK+LL+AVA QP+SVGI GS  +
Sbjct: 201  DTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARA 260

Query: 606  FQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGNP 427
            FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWGKYWG++GY++++RN  + 
Sbjct: 261  FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSS 320

Query: 426  EGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCCE 247
             G+CGINMLASYP K              +CN FTYCS GETCCCA++FLG+C  WKCC 
Sbjct: 321  AGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCG 380

Query: 246  AKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATI 136
              SAVCCKD+ HCCP DYP+CD     CLK+I N TI
Sbjct: 381  VTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTI 417


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  568 bits (1463), Expect = e-159
 Identities = 266/401 (66%), Positives = 320/401 (79%)
 Frame = -2

Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153
            S  IS+LF++WCQ++GKTY S +E++ R+++F++N+ +V+QHN + N++Y+LSLN FADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973
            THHEFKA  LGLS SA  S+I  ++G  S+ GS  VK   +P S+DWRKKGAVT +KDQG
Sbjct: 85   THHEFKASRLGLSVSAP-SVIMASKGQ-SLGGS--VK---VPDSVDWRKKGAVTNVKDQG 137

Query: 972  SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793
            SCGACWSFSATGA+EGIN+IVTG LISLSEQELIDCD+SYN GC GGLMDYA+EFVIKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 792  GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613
            GIDTEKDYPYQ R+ TC K+KLK+ VVTIDSY  +   +EK L++AVA QP+SVGI GS 
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 612  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433
             +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WG+DG+M++ RN  
Sbjct: 258  RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 432  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253
            N +GVCGINMLASYP+K             T+CN FTYCSSGETCCCAR   GLC  WKC
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 252  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAK 130
            CE +SAVCCKD  HCCPHDYP+CDT R+LCLK+ GN T  K
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIK 418


Top