BLASTX nr result
ID: Forsythia23_contig00023292
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia23_contig00023292 (1555 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum] 664 0.0 ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus... 663 0.0 emb|CDP07460.1| unnamed protein product [Coffea canephora] 644 0.0 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 594 e-167 gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise... 594 e-167 ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosif... 590 e-166 ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum] 590 e-165 ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine ... 590 e-165 gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum] 589 e-165 ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii]... 588 e-165 ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine ... 586 e-164 ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris] 585 e-164 ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine ... 581 e-163 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 580 e-162 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 578 e-162 ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|3... 573 e-160 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 571 e-160 ref|XP_008444761.1| PREDICTED: zingipain-2 [Cucumis melo] 570 e-160 ref|XP_004500967.1| PREDICTED: zingipain-2 [Cicer arietinum] 568 e-159 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 568 e-159 >ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum] Length = 439 Score = 664 bits (1712), Expect = 0.0 Identities = 305/406 (75%), Positives = 345/406 (84%) Frame = -2 Query: 1341 TCKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVF 1162 TCKSSSIS LF++WC++YGKTY S QEK+ R KVFE+NY YV+ HN+ NSSYTLSLN F Sbjct: 19 TCKSSSISHLFDSWCKEYGKTYASEQEKQQRFKVFEQNYEYVTLHNARPNSSYTLSLNAF 78 Query: 1161 ADLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIK 982 ADLT+HEFKAKYLGLS SA + IRLN +EG +LVKESD+PSS+DWRK+GAVT +K Sbjct: 79 ADLTNHEFKAKYLGLSLSASNLAIRLNSEQVGIEGPDLVKESDLPSSVDWRKQGAVTEVK 138 Query: 981 DQGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVI 802 DQGSCGACWSFS TGA+EGINKIVTGSLISLSEQELIDCD+SYNDGCGGGLMDYAY+F+I Sbjct: 139 DQGSCGACWSFSTTGAVEGINKIVTGSLISLSEQELIDCDKSYNDGCGGGLMDYAYQFII 198 Query: 801 KNKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGIS 622 KNKGIDTEKDYPYQGRE TC KEKLK+HVVTIDSY DI KNEK+L QAVATQP+SVGI Sbjct: 199 KNKGIDTEKDYPYQGREGTCKKEKLKKHVVTIDSYADITAKNEKKLQQAVATQPVSVGIC 258 Query: 621 GSGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIR 442 GS SFQLYSGGIFTG CS LDHAVLIVGYDSKDG+DYWI+KNSWG+YWG+DGYM++ R Sbjct: 259 GSEKSFQLYSGGIFTGPCSASLDHAVLIVGYDSKDGQDYWIIKNSWGRYWGMDGYMHMQR 318 Query: 441 NNGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLK 262 N GN EG+CGIN+LASYP+K RCN FTYCSSGETCCC R G+CLK Sbjct: 319 NTGNGEGLCGINLLASYPIKTSPNPPPSPPPGPVRCNLFTYCSSGETCCCTRDIFGICLK 378 Query: 261 WKCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQL 124 WKCC A+SAVCC+DR CCPHDYP+CDTKRN+CLK IGN+T+A+ L Sbjct: 379 WKCCGAESAVCCQDRRSCCPHDYPVCDTKRNMCLKWIGNSTVAQPL 424 >ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus] gi|604331887|gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Erythranthe guttata] Length = 433 Score = 663 bits (1710), Expect = 0.0 Identities = 303/406 (74%), Positives = 350/406 (86%), Gaps = 1/406 (0%) Frame = -2 Query: 1335 KSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFAD 1156 KSS ISDLF++WC++YGKTY S QEK++RL VF ENY YV+QHN+ ANSSYTLS+N FAD Sbjct: 21 KSSLISDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADANSSYTLSVNAFAD 80 Query: 1155 LTHHEFKAKYLGLSPSADDSIIRLN-RGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979 LT+HEF+A YLGLSPS DS+IRLN R + +++G NL+KES+IPSSLDWR KGAVT +KD Sbjct: 81 LTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSLDWRNKGAVTAVKD 140 Query: 978 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799 QGSCGACWSFSATGA+EGIN+I TGSL+SLSEQELIDCD+SYNDGC GGLMDYAY+F+IK Sbjct: 141 QGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCNGGLMDYAYDFIIK 200 Query: 798 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619 NKGIDTE+DY Y+GR TC+K K+ +HVVTIDSYVDIP K+EK+LLQAVATQPISVGI G Sbjct: 201 NKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQAVATQPISVGICG 260 Query: 618 SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439 S +SFQLYSGGIFTG CST LDHAVLIVGYDSKDGKDYWI+KNSWGK WGI GYM+++RN Sbjct: 261 SDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGKSWGIKGYMHMVRN 320 Query: 438 NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259 +G+ EGVCGIN LASYPVK T+CN FTYCSSGETCCCAR FLG+CL W Sbjct: 321 SGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETCCCARYFLGVCLSW 380 Query: 258 KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLG 121 CCEA+SAVCC D HCCPHDYP+CDTK+NLCLK+ GN T++K LG Sbjct: 381 NCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLG 426 >emb|CDP07460.1| unnamed protein product [Coffea canephora] Length = 441 Score = 644 bits (1662), Expect = 0.0 Identities = 300/431 (69%), Positives = 347/431 (80%) Frame = -2 Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159 CKSS +DLF NWC+Q+GKTY S +EK+YRL+VFE+NY YV++HNSLANS+YTLSLN FA Sbjct: 20 CKSSLTADLFENWCKQHGKTYPSEEEKQYRLRVFEDNYDYVTKHNSLANSTYTLSLNAFA 79 Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979 DLTHHEFKAKYLG S SAD +IRLNRGS S+ S V + DIPSSLDWR KGAVT +KD Sbjct: 80 DLTHHEFKAKYLGFSASAD-GLIRLNRGSSSIGASGAVGKYDIPSSLDWRNKGAVTNVKD 138 Query: 978 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799 QGSCGACW+FSATGAIEGIN+IVTGSL+SLSEQELIDCDRSYN+GC GGLMDY YEFV+K Sbjct: 139 QGSCGACWAFSATGAIEGINEIVTGSLVSLSEQELIDCDRSYNNGCNGGLMDYTYEFVVK 198 Query: 798 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619 N GIDTE+DYP++GR+ TCN KLKR VV+ID Y+D+P NE+ LLQAVA QP+SVGI G Sbjct: 199 NGGIDTEQDYPFKGRDGTCNSNKLKRRVVSIDGYIDVPANNEQELLQAVAAQPVSVGICG 258 Query: 618 SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439 S FQLYSGGIFTG CST LDHAVLIVGYDSK+G DYWIVKNSWG WGI+GY++IIRN Sbjct: 259 SERGFQLYSGGIFTGPCSTSLDHAVLIVGYDSKNGADYWIVKNSWGTSWGINGYIHIIRN 318 Query: 438 NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259 +GN GVCGINM+ASYP K T+C+ F+ C +GETCCC+ FLGLCL W Sbjct: 319 SGNSAGVCGINMMASYPTKSSLNPPPSPPPGPTKCSLFSSCPAGETCCCSMEFLGLCLSW 378 Query: 258 KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLS 79 KCC+ SAVCCKDR HCCPHDYPICDTKRNLCL+++GN+T+ KQL N +G Sbjct: 379 KCCDLDSAVCCKDRLHCCPHDYPICDTKRNLCLRRMGNSTLVKQLKNGGRSG-------- 430 Query: 78 MIELGGWSSYF 46 + G WSS F Sbjct: 431 --KFGDWSSLF 439 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 594 bits (1532), Expect = e-167 Identities = 277/412 (67%), Positives = 323/412 (78%) Frame = -2 Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159 C SSISDLF WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS NSSYTL LN ++ Sbjct: 20 CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79 Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979 DLTHHEF+ +LGLS SA+D I RGS S E + ++ + D PSSLDWR+KGAVT +K+ Sbjct: 80 DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSE-TGVLSDVDAPSSLDWREKGAVTDVKN 138 Query: 978 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799 QGSCGACWSFSATGA+EGINKI TGSL+SLSEQELIDCDRSYN+GCGGGLMDYA+EFVIK Sbjct: 139 QGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIK 198 Query: 798 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619 N GIDTEKDYP++ RE TCNK KL+RHVVTID Y DIP +E +LL+AVATQP+SVGI G Sbjct: 199 NGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258 Query: 618 SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439 S +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG WGI+GY+++ RN Sbjct: 259 SARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRN 318 Query: 438 NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259 +GN EG+CGIN LASYP K ++C+ FT C GETCCC +FLG+CL W Sbjct: 319 SGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSW 378 Query: 258 KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG 103 KCC SAVCCKD HCCP DYPICDT RNLCLK++ NATI +Q TG Sbjct: 379 KCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKEAFTG 430 >gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea] Length = 424 Score = 594 bits (1532), Expect = e-167 Identities = 278/405 (68%), Positives = 327/405 (80%), Gaps = 1/405 (0%) Frame = -2 Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153 SSSISDLF++WCQ++GKTY S +E+E+RL VF ENY +++ HN+ AN SYTLSLN FADL Sbjct: 23 SSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSLNAFADL 82 Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973 T EF +YLG SPS D +IR NRGS S N S +PSS+DWRKKGAVTGIKDQG Sbjct: 83 TRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVTGIKDQG 139 Query: 972 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793 SCGACWSFSATGAIEGIN+IVTGSL+SLSEQELIDCD SYN GC GGLMDYAYEF++KNK Sbjct: 140 SCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYEFILKNK 199 Query: 792 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613 GIDTE+DY Y+GR+ +C++ KL + VVTIDSYVDIP KNE+ LL+AVA+QP+SVGISG Sbjct: 200 GIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSVGISGGD 259 Query: 612 NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433 FQ YS GIFTG CST LDHAVLIVGYDSK+GKDYWIVKNSWGK WG+DGYMY+ RN G Sbjct: 260 APFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMYVQRNTG 319 Query: 432 NPEGVCGINMLASYPVK-XXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 256 N G+C INM+ASYPVK T+C+ F+YCS GETCCCARRFLGLC+++K Sbjct: 320 NQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLGLCMRYK 379 Query: 255 CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLG 121 CC A+SAVCC+D HCCP DYPICDT +++C K GN+T+A +G Sbjct: 380 CCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIPVG 424 >ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosiformis] Length = 439 Score = 590 bits (1522), Expect = e-166 Identities = 278/404 (68%), Positives = 319/404 (78%) Frame = -2 Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159 C SSISDLF +WCQQ GKTY+S QE+ YRL+VFEENYAY+ +HNS NS+YTL LN F+ Sbjct: 20 CTCSSISDLFESWCQQNGKTYSSEQERVYRLEVFEENYAYIIEHNSKGNSTYTLDLNAFS 79 Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979 DLTHHEFK +LGLS SA+D IRL GS S N V DIPSSLDWR+KGAVT +K+ Sbjct: 80 DLTHHEFKNSFLGLSSSAND-FIRLKTGSSSAGVFNDVGVVDIPSSLDWREKGAVTKVKN 138 Query: 978 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799 QGSCGACWSFSATGAIEGINKIVTGSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV K Sbjct: 139 QGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFVKK 198 Query: 798 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619 N GIDTE+DYP+ RE TCNK KL+R VVTID Y D+P +E +LL+AVA QP+SVGI G Sbjct: 199 NGGIDTEEDYPFNEREGTCNKNKLQRRVVTIDGYTDVPQYDEDKLLKAVANQPVSVGICG 258 Query: 618 SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439 S +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG WGI+GYM++ RN Sbjct: 259 SERAFQSYSKGIFTGPCSTVLDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQRN 318 Query: 438 NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259 +GN EG+CGIN LASYP K ++C+ FT C GETCCC R LG+C+ W Sbjct: 319 SGNQEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWRLLGVCVSW 378 Query: 258 KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 127 KCC SAVCCKD HCCPHDYPICDT RNLCLK++ NATI +Q Sbjct: 379 KCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQ 422 >ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum] Length = 439 Score = 590 bits (1521), Expect = e-165 Identities = 273/404 (67%), Positives = 318/404 (78%) Frame = -2 Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159 C SSISDLF WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS NSSYTL LN ++ Sbjct: 20 CTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAYS 79 Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979 DLTHHEF+ +LGLS SA+D I RGS S + ++ + D PSSLDWR KGAVT +K+ Sbjct: 80 DLTHHEFRNSFLGLSSSANDFIRLKGRGSGS-SAAGVLSDVDAPSSLDWRDKGAVTNVKN 138 Query: 978 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799 QGSCGACWSFSATGAIEGINKI TGSL+SLSEQELIDCDRSYN GCGGGLMDYA+EFVIK Sbjct: 139 QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198 Query: 798 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619 N GIDTEKDYP++ +E TCNK KL+R VVTID Y DIP +E +LL+AVATQP+SVGI G Sbjct: 199 NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258 Query: 618 SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439 S +FQ YS GIFTG C TDLDHAVLIVGY S++G DYWI+KNSWG WGI+GY+++ RN Sbjct: 259 SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318 Query: 438 NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259 +GN EG+CG+N LASYP K ++C+TFT C GETCCC +FLG+CL W Sbjct: 319 SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378 Query: 258 KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 127 KCC SAVCCKD HCCP DYPICDT RNLCLK++ NATI +Q Sbjct: 379 KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQ 422 >ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine proteinase [Nelumbo nucifera] Length = 443 Score = 590 bits (1520), Expect = e-165 Identities = 269/425 (63%), Positives = 331/425 (77%) Frame = -2 Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150 SS SDLF+ WC+++G+TY+S +E+ +RLKVFE+N A+V++HNS+ANS+Y+L+LN FADLT Sbjct: 23 SSTSDLFDRWCEEHGRTYSSEEERLFRLKVFEDNLAFVTEHNSMANSTYSLALNAFADLT 82 Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970 HHEFK LGL+ +A D + R +E ++ + +PSS+DWR+KGAVT +KDQGS Sbjct: 83 HHEFKISRLGLAAAATDMVRSSPRAPSLIESPSIAGQ--LPSSIDWREKGAVTNVKDQGS 140 Query: 969 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790 CGACWSFSATGAIEGINKIVTGS +SLSEQEL+DCDRSYN GCGGGLMDYA+++VIKNKG Sbjct: 141 CGACWSFSATGAIEGINKIVTGSPLSLSEQELVDCDRSYNSGCGGGLMDYAFQWVIKNKG 200 Query: 789 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610 IDTE DYPYQG E+TCNK+KL++HVVTID Y D+P +EK LLQAVA+QP+SVGI GS Sbjct: 201 IDTEDDYPYQGGERTCNKDKLRKHVVTIDGYTDVPSNSEKHLLQAVASQPVSVGICGSER 260 Query: 609 SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430 +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWG WG++GYM+++RN+G+ Sbjct: 261 AFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYMHMLRNSGS 320 Query: 429 PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250 P+GVCGINMLASYP K TRC+ TYC GETCCC RR LG+C WKCC Sbjct: 321 PQGVCGINMLASYPTKTSPNPPPSPSPGPTRCDLLTYCQEGETCCCTRRILGICFSWKCC 380 Query: 249 EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSMIE 70 E SAVCCKD +CCPHDYPICDT+R CLK GN T K L + S+++ Sbjct: 381 ELDSAVCCKDHRYCCPHDYPICDTERKQCLKSTGNFTSVKSLDKS----------SSLVK 430 Query: 69 LGGWS 55 GGW+ Sbjct: 431 FGGWN 435 >gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum] Length = 431 Score = 589 bits (1518), Expect = e-165 Identities = 273/407 (67%), Positives = 321/407 (78%) Frame = -2 Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150 S IS F WCQQ+GK+Y S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LN FAD T Sbjct: 26 SHISKKFETWCQQHGKSYLSEEEKSYRLKVFEDNYAFVTQHNAMVNSSYSLALNAFADFT 85 Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970 HHEFKA LGLS +A + ++ LV+ DIP SLDWR+KGAVT +KDQGS Sbjct: 86 HHEFKASRLGLSGAA------IQFRHPNLREPRLVR--DIPDSLDWREKGAVTQVKDQGS 137 Query: 969 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790 CGACWSFSATGAIEG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G Sbjct: 138 CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197 Query: 789 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610 IDTE+DYPYQGRE TCNKEKLKRHVVTID Y D+P NEK+LLQAVATQP+SVGI GS Sbjct: 198 IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSER 257 Query: 609 SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430 +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG+ WG++GY+++IRN+G Sbjct: 258 AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGRRWGMNGYIHMIRNSGK 317 Query: 429 PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250 EG+CGINMLASYP+K T+C+ FTYCS+GETCCC R G+C WKCC Sbjct: 318 SEGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFSWKCC 377 Query: 249 EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFS 109 SAVCCKD HCCPH+YPICDTK N CLK++GNATI + N + Sbjct: 378 GLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSDTNLA 424 >ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii] gi|763791179|gb|KJB58175.1| hypothetical protein B456_009G197900 [Gossypium raimondii] Length = 431 Score = 588 bits (1516), Expect = e-165 Identities = 272/407 (66%), Positives = 321/407 (78%) Frame = -2 Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150 S IS +F WC Q+GK+Y+S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LN FADLT Sbjct: 26 SHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAMTNSSYSLALNAFADLT 85 Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970 HHEFKA LGLS +A + ++ LV+ DIP+SLDWR+KGAVT +KDQGS Sbjct: 86 HHEFKASRLGLSGAA------IQFRCSNLREPRLVR--DIPASLDWREKGAVTQVKDQGS 137 Query: 969 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790 CGACWSFSATGAIEG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G Sbjct: 138 CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197 Query: 789 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610 IDTE+DYPYQGRE TCNKEKLKRHVVTID Y D+P NEK+LLQAVATQP+SVGI GS Sbjct: 198 IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMTNEKKLLQAVATQPVSVGICGSER 257 Query: 609 SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430 +FQLY GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG WG++GY+++IRN G Sbjct: 258 AFQLYCKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGK 317 Query: 429 PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250 EG+CGINMLASYP+K T+C+ FTYCS+GETCCC R G+C WKCC Sbjct: 318 SEGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKCC 377 Query: 249 EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFS 109 SAVCCKD HCCPH+YPICDTK N CLK++GNATI + N + Sbjct: 378 GLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSNTNLA 424 >ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine proteinase [Jatropha curcas] gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] gi|643731232|gb|KDP38570.1| hypothetical protein JCGZ_04495 [Jatropha curcas] Length = 441 Score = 586 bits (1510), Expect = e-164 Identities = 270/410 (65%), Positives = 322/410 (78%) Frame = -2 Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153 SS I+ LF WCQQ+GKTY S +EK +RLKVF++NY +V++HNS NSSYTLSLN FADL Sbjct: 23 SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82 Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973 THHEFKA LGLS +A S+ ++R + + +D+P+S+DWRK GAVT +KDQG Sbjct: 83 THHEFKASRLGLSSAASASL-NVDRSNRQIPDF----VADVPASVDWRKNGAVTQVKDQG 137 Query: 972 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793 +CGACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCD+SYN+GC GG+MDYA++FVI N Sbjct: 138 NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNH 197 Query: 792 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613 GIDTE+DYPYQGR+++CNKEKLKRHVVTID YVD+P NEK LL+AVA QP+SVGI GS Sbjct: 198 GIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSE 257 Query: 612 NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433 +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG YWG+DGYM++ RN+G Sbjct: 258 RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSG 317 Query: 432 NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253 + G+CGINMLASYP K TRC+ FT+C GETCCC G+CL WKC Sbjct: 318 SSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKC 377 Query: 252 CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG 103 CE SAVCCKD HCCP DYP+CDT RN+CLK GNAT ++ N S+G Sbjct: 378 CELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSG 427 >ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris] Length = 439 Score = 585 bits (1508), Expect = e-164 Identities = 274/404 (67%), Positives = 318/404 (78%) Frame = -2 Query: 1338 CKSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFA 1159 C SSISDLF WCQQ GK+Y+S QE+ YRLKVFEENYAY+ +HNS NS+YTL LN ++ Sbjct: 20 CTCSSISDLFETWCQQNGKSYSSEQERVYRLKVFEENYAYIIEHNSKGNSTYTLGLNAYS 79 Query: 1158 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 979 DLTHHEFK +LGLS SA+D IRL GS S + V + D+PSSLDWR+KGAVT +K+ Sbjct: 80 DLTHHEFKNSFLGLSSSAND-FIRLKTGSSSAGVFSDVGDVDVPSSLDWREKGAVTKVKN 138 Query: 978 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 799 QGSCGACWSFSATGAIEGINKIV+GSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV K Sbjct: 139 QGSCGACWSFSATGAIEGINKIVSGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFVKK 198 Query: 798 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 619 N GIDTE+DYP+ RE TCNK KL+R VVTID Y D+P +E +LL+AVA QP+SVGI G Sbjct: 199 NGGIDTEEDYPFIEREGTCNKNKLQRRVVTIDGYTDVPQNDEDKLLKAVAKQPVSVGICG 258 Query: 618 SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 439 S +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG WGI+GYM++ RN Sbjct: 259 SERAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQRN 318 Query: 438 NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 259 +GN EG+CGIN LASYP K ++C+ FT C GETCCC LG+CL W Sbjct: 319 SGNQEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWSLLGVCLSW 378 Query: 258 KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 127 KCC SAVCCKD HCCPHDYPICDT RNLCLK++ NATI +Q Sbjct: 379 KCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQ 422 >ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine proteinase [Populus euphratica] Length = 480 Score = 581 bits (1497), Expect = e-163 Identities = 265/398 (66%), Positives = 321/398 (80%) Frame = -2 Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153 SS IS LF WC+++GK+Y S +E+ +RLKVFE+NY +V++HNS NSSYTLSLN F+DL Sbjct: 66 SSGISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYTLSLNAFSDL 125 Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973 THHEFK LGLS + +N G ++E + +V DIP+S+DWR KGAVT +KDQG Sbjct: 126 THHEFKTSRLGLSAAP------MNLGHRNLEITGVV--GDIPASIDWRNKGAVTNVKDQG 177 Query: 972 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793 SCGACWSFSATGAIEGINKIVTGSL+SLSEQELI+CD+S+NDGCGGGLMDYA++FVI N Sbjct: 178 SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSFNDGCGGGLMDYAFQFVINNH 237 Query: 792 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613 GIDTE+DYPY+ R+ TCNK+K+KR VVTID YVD+P NEK+LLQAVA QP+SVGI GS Sbjct: 238 GIDTEEDYPYRARDGTCNKDKMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSE 297 Query: 612 NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433 +FQ+YS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG WG+ GYM++ RN+G Sbjct: 298 RAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSG 357 Query: 432 NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253 N +GVCGINMLASYPVK T+C+ F+YC++GETCCCAR+F G+C+ WKC Sbjct: 358 NSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCDLFSYCAAGETCCCARKFFGICISWKC 417 Query: 252 CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNAT 139 C SAVCCKDR HCCPHDYP+CDT +N+C K+ GNAT Sbjct: 418 CGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 455 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 580 bits (1494), Expect = e-162 Identities = 270/393 (68%), Positives = 309/393 (78%), Gaps = 1/393 (0%) Frame = -2 Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153 SS IS LF +W +++GKTY S ++K YR K+FEENY +V +HNS NSSYTLSLN FADL Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84 Query: 1152 THHEFKAKYLGLSP-SADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQ 976 THHEFKA LGLS S + R N G D+P S+DWRKKGAV+ +KDQ Sbjct: 85 THHEFKASRLGLSAFSTSGKLSRRNFPLHDFVG-------DVPISIDWRKKGAVSQVKDQ 137 Query: 975 GSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 796 G+CGACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCDRSYN+GC GGLMDYAY+FVI+N Sbjct: 138 GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197 Query: 795 KGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 616 GIDTE+DYPYQ REKTCNKEKLKRHVVTID Y D+P NEK LL+AVA QP+SVGI GS Sbjct: 198 NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257 Query: 615 GNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNN 436 +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG +WGI+GYMY++RN+ Sbjct: 258 ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317 Query: 435 GNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 256 GN +G+CGINMLAS+PVK T+C+ FT C GETCCC RR GLC WK Sbjct: 318 GNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWK 377 Query: 255 CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLK 157 CCE SAVCCKD HCCPHDYP+CDTKRN+CLK Sbjct: 378 CCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 578 bits (1489), Expect = e-162 Identities = 264/398 (66%), Positives = 318/398 (79%) Frame = -2 Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153 SS IS LF WC+++GK+Y S +E+ +RLKVFE+NY +V++HNS NSSY+L+LN FADL Sbjct: 22 SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81 Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973 THHEFK LGLS + LN ++E + +V DIP+S+DWR KG VT +KDQG Sbjct: 82 THHEFKTSRLGLSAAP------LNLAHRNLEITGVV--GDIPASIDWRNKGVVTNVKDQG 133 Query: 972 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793 SCGACWSFSATGAIEGINKIVTGSL+SLSEQELI+CD+SYNDGCGGGLMDYA++FVI N Sbjct: 134 SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNH 193 Query: 792 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613 GIDTE+DYPY+ R+ TCNK+++KR VVTID YVD+P NEK+LLQAVA QP+SVGI GS Sbjct: 194 GIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSE 253 Query: 612 NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433 +FQ+YS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG WG+ GYM++ RN+G Sbjct: 254 RAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSG 313 Query: 432 NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253 N +GVCGINMLASYPVK T+CN TYC++GETCCCAR+F G+C+ WKC Sbjct: 314 NSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKC 373 Query: 252 CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNAT 139 C SAVCCKDR HCCPHDYP+CDT +N+C K+ GNAT Sbjct: 374 CGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 411 >ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 573 bits (1478), Expect = e-160 Identities = 266/428 (62%), Positives = 325/428 (75%) Frame = -2 Query: 1335 KSSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFAD 1156 ++SS +DLF WC+QYGKTY+S +EK RLKVFEEN+A+V+QHNS+AN+SYTL+LN FAD Sbjct: 21 EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80 Query: 1155 LTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQ 976 LTHHEFKA LG SP SI + V+E +P ++DWRK GAVTG+KDQ Sbjct: 81 LTHHEFKASRLGFSPGRAQSIRSVGTP---------VQELHVPPAVDWRKSGAVTGVKDQ 131 Query: 975 GSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 796 G+CG CWSFS TGAIEGINKIVTGSL+SLSEQEL+DCDRSYN GC GGLMDYAY+FVIKN Sbjct: 132 GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191 Query: 795 KGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 616 +GID+E DYPY G +K CNKEKLK+H+VTID Y DIPP +EK+LLQ VA QP+SVGI GS Sbjct: 192 QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251 Query: 615 GNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNN 436 +FQLYS G++TG CS+ LDHAVLIVGY ++DG D+WIVKNSWG++WG+ GY++++RNN Sbjct: 252 EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311 Query: 435 GNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 256 G EG+CGINMLASYP K T+C+ F+ CS GETCCC+ RF+G+CL W Sbjct: 312 GTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSWN 371 Query: 255 CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSM 76 CC AKSAVCC + ++CCP +PICDTKRN CLK GN T + L S+ Sbjct: 372 CCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSS---------- 421 Query: 75 IELGGWSS 52 ++ GGWSS Sbjct: 422 VKFGGWSS 429 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 571 bits (1472), Expect = e-160 Identities = 266/408 (65%), Positives = 315/408 (77%) Frame = -2 Query: 1329 SSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLT 1150 S IS LF WC Q+GK Y+S +EK YRLKVFEENYA+V+QHN + NSSY+L+LN FADLT Sbjct: 24 SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83 Query: 1149 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 970 HHEFKA LGLS +A + +++ LV+ DIP+S+DWR KGAVT +KDQGS Sbjct: 84 HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135 Query: 969 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 790 CGACWSFSATGAIEGINKIVTG+L+SLSEQEL+DCDRSYN GC GGLMDYAY+FVI N G Sbjct: 136 CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195 Query: 789 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 610 ID E+DYPY GREKTCNKEK KR VVTID Y +P NE LLQAVA QP+SVGI GS Sbjct: 196 IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255 Query: 609 SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 430 +FQLYS GIFTG CS+ LDHAVLIVGY S++G DYWIVKNSWG WG++GY++++RN+G+ Sbjct: 256 AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315 Query: 429 PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 250 +G+CGINMLASYP K T+C+ FTYCS+GETCCC R G+C WKCC Sbjct: 316 SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375 Query: 249 EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFST 106 E SAVCCKD HCCP+DYP+CDTK++ CLK++GNAT + ST Sbjct: 376 ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKRHST 423 >ref|XP_008444761.1| PREDICTED: zingipain-2 [Cucumis melo] Length = 431 Score = 570 bits (1470), Expect = e-160 Identities = 272/428 (63%), Positives = 319/428 (74%) Frame = -2 Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153 +S++S+LF WC ++GK+Y+S +EK YRL VF +NY +V+ HN+L NSSYTLSLN +ADL Sbjct: 22 TSNVSELFEIWCTEHGKSYSSAEEKLYRLSVFADNYEFVTHHNNLGNSSYTLSLNSYADL 81 Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973 THHEFK LG SP+ R V D+P SLDWRKKGAVT +KDQG Sbjct: 82 THHEFKVSRLGFSPAL--------RNFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQG 133 Query: 972 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793 SCGACWSFSATGAIEGIN+I+TGSLIS+SEQELIDCDRSYN GCGGGLMDYAY+FVI N Sbjct: 134 SCGACWSFSATGAIEGINQIMTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVISNH 193 Query: 792 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613 GIDTE DYPYQGR+ +C K+KL+R+VVTID Y DIPP +E +LLQAVA QP+SVGI GS Sbjct: 194 GIDTEDDYPYQGRDGSCRKDKLQRNVVTIDGYTDIPPNDEGKLLQAVAAQPVSVGICGSE 253 Query: 612 NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433 +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WG+DGYM++ RN+G Sbjct: 254 RAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSG 313 Query: 432 NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253 N EGVCGIN LASYP K T+C+ T C++GETCCCA++FLGLCL WKC Sbjct: 314 NSEGVCGINKLASYPTKTSPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKC 373 Query: 252 CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSMI 73 C SAVCCKD HCCP DYPICDT RNLCLK+ N T + L N S+G Sbjct: 374 CGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKRTMNGTRMEVLENRSSSG---------- 423 Query: 72 ELGGWSSY 49 G WSS+ Sbjct: 424 SSGTWSSF 431 >ref|XP_004500967.1| PREDICTED: zingipain-2 [Cicer arietinum] Length = 436 Score = 568 bits (1464), Expect = e-159 Identities = 260/397 (65%), Positives = 306/397 (77%), Gaps = 2/397 (0%) Frame = -2 Query: 1320 SDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADLTHHE 1141 S LF WC+Q+GKTY S QEK YR VFE+NYA+V+QHN + NSSYTLSLN FADLTHHE Sbjct: 27 SKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHE 86 Query: 1140 FKAKYLGLSPSADDSIIRL--NRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGSC 967 FKA LGL PS S++R NR D + ++ +PS +DWRK GAV+ +KDQGSC Sbjct: 87 FKATRLGLPPS---SLLRFKFNRFQDQQRSDDFLQ---VPSEIDWRKNGAVSIVKDQGSC 140 Query: 966 GACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKGI 787 GACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCD +YN GC GGLMDYAY+F+I N GI Sbjct: 141 GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGI 200 Query: 786 DTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGNS 607 DTE+DYPYQ R+ C K+KLKR VVTID Y D+PP +EK+LL+AVA QP+SVGI GS + Sbjct: 201 DTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARA 260 Query: 606 FQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGNP 427 FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWGKYWG++GY++++RN + Sbjct: 261 FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSS 320 Query: 426 EGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCCE 247 G+CGINMLASYP K +CN FTYCS GETCCCA++FLG+C WKCC Sbjct: 321 AGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCG 380 Query: 246 AKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATI 136 SAVCCKD+ HCCP DYP+CD CLK+I N TI Sbjct: 381 VTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTI 417 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 568 bits (1463), Expect = e-159 Identities = 266/401 (66%), Positives = 320/401 (79%) Frame = -2 Query: 1332 SSSISDLFNNWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNVFADL 1153 S IS+LF++WCQ++GKTY S +E++ R+++F++N+ +V+QHN + N++Y+LSLN FADL Sbjct: 25 SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84 Query: 1152 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 973 THHEFKA LGLS SA S+I ++G S+ GS VK +P S+DWRKKGAVT +KDQG Sbjct: 85 THHEFKASRLGLSVSAP-SVIMASKGQ-SLGGS--VK---VPDSVDWRKKGAVTNVKDQG 137 Query: 972 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 793 SCGACWSFSATGA+EGIN+IVTG LISLSEQELIDCD+SYN GC GGLMDYA+EFVIKN Sbjct: 138 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197 Query: 792 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 613 GIDTEKDYPYQ R+ TC K+KLK+ VVTIDSY + +EK L++AVA QP+SVGI GS Sbjct: 198 GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257 Query: 612 NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 433 +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WG+DG+M++ RN Sbjct: 258 RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317 Query: 432 NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 253 N +GVCGINMLASYP+K T+CN FTYCSSGETCCCAR GLC WKC Sbjct: 318 NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377 Query: 252 CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAK 130 CE +SAVCCKD HCCPHDYP+CDT R+LCLK+ GN T K Sbjct: 378 CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIK 418