BLASTX nr result

ID: Forsythia21_contig00011755 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00011755
         (1915 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum]          666   0.0  
ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus...   665   0.0  
emb|CDP07460.1| unnamed protein product [Coffea canephora]            643   0.0  
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   597   e-167
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   596   e-167
ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosif...   593   e-166
ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum]     592   e-166
gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum]   591   e-166
ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine ...   591   e-166
ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii]...   590   e-165
ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine ...   588   e-165
ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris]     587   e-164
ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine ...   586   e-164
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   582   e-163
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   580   e-162
ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|3...   576   e-161
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   573   e-160
ref|XP_008444761.1| PREDICTED: zingipain-2 [Cucumis melo]             571   e-160
ref|XP_004500967.1| PREDICTED: zingipain-2 [Cicer arietinum]          570   e-159
emb|CDX94938.1| BnaC05g07330D [Brassica napus]                        569   e-159

>ref|XP_011074626.1| PREDICTED: zingipain-2 [Sesamum indicum]
          Length = 439

 Score =  666 bits (1719), Expect = 0.0
 Identities = 307/406 (75%), Positives = 346/406 (85%)
 Frame = -3

Query: 1586 TCKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAF 1407
            TCKSSSIS LF+SWC++YGKTY S QEK+ R KVFE+NY YV+ HN+  NSSYTLSLNAF
Sbjct: 19   TCKSSSISHLFDSWCKEYGKTYASEQEKQQRFKVFEQNYEYVTLHNARPNSSYTLSLNAF 78

Query: 1406 ADLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIK 1227
            ADLT+HEFKAKYLGLS SA +  IRLN     +EG +LVKESD+PSS+DWRK+GAVT +K
Sbjct: 79   ADLTNHEFKAKYLGLSLSASNLAIRLNSEQVGIEGPDLVKESDLPSSVDWRKQGAVTEVK 138

Query: 1226 DQGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVI 1047
            DQGSCGACWSFS TGA+EGINKIVTGSLISLSEQELIDCD+SYNDGCGGGLMDYAY+F+I
Sbjct: 139  DQGSCGACWSFSTTGAVEGINKIVTGSLISLSEQELIDCDKSYNDGCGGGLMDYAYQFII 198

Query: 1046 KNKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGIS 867
            KNKGIDTEKDYPYQGRE TC KEKLK+HVVTIDSY DI  KNEK+L QAVATQP+SVGI 
Sbjct: 199  KNKGIDTEKDYPYQGREGTCKKEKLKKHVVTIDSYADITAKNEKKLQQAVATQPVSVGIC 258

Query: 866  GSGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIR 687
            GS  SFQLYSGGIFTG CS  LDHAVLIVGYDSKDG+DYWI+KNSWG+YWG+DGYM++ R
Sbjct: 259  GSEKSFQLYSGGIFTGPCSASLDHAVLIVGYDSKDGQDYWIIKNSWGRYWGMDGYMHMQR 318

Query: 686  NNGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLK 507
            N GN EG+CGIN+LASYP+K              RCN FTYCSSGETCCC R   G+CLK
Sbjct: 319  NTGNGEGLCGINLLASYPIKTSPNPPPSPPPGPVRCNLFTYCSSGETCCCTRDIFGICLK 378

Query: 506  WKCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQL 369
            WKCC A+SAVCC+DR  CCPHDYP+CDTKRN+CLK IGN+T+A+ L
Sbjct: 379  WKCCGAESAVCCQDRRSCCPHDYPVCDTKRNMCLKWIGNSTVAQPL 424


>ref|XP_012839123.1| PREDICTED: zingipain-2 [Erythranthe guttatus]
            gi|604331887|gb|EYU36745.1| hypothetical protein
            MIMGU_mgv1a006749mg [Erythranthe guttata]
          Length = 433

 Score =  665 bits (1717), Expect = 0.0
 Identities = 305/406 (75%), Positives = 351/406 (86%), Gaps = 1/406 (0%)
 Frame = -3

Query: 1580 KSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFAD 1401
            KSS ISDLF+SWC++YGKTY S QEK++RL VF ENY YV+QHN+ ANSSYTLS+NAFAD
Sbjct: 21   KSSLISDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADANSSYTLSVNAFAD 80

Query: 1400 LTHHEFKAKYLGLSPSADDSIIRLN-RGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 1224
            LT+HEF+A YLGLSPS  DS+IRLN R + +++G NL+KES+IPSSLDWR KGAVT +KD
Sbjct: 81   LTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSLDWRNKGAVTAVKD 140

Query: 1223 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 1044
            QGSCGACWSFSATGA+EGIN+I TGSL+SLSEQELIDCD+SYNDGC GGLMDYAY+F+IK
Sbjct: 141  QGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCNGGLMDYAYDFIIK 200

Query: 1043 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 864
            NKGIDTE+DY Y+GR  TC+K K+ +HVVTIDSYVDIP K+EK+LLQAVATQPISVGI G
Sbjct: 201  NKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQAVATQPISVGICG 260

Query: 863  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 684
            S +SFQLYSGGIFTG CST LDHAVLIVGYDSKDGKDYWI+KNSWGK WGI GYM+++RN
Sbjct: 261  SDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGKSWGIKGYMHMVRN 320

Query: 683  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 504
            +G+ EGVCGIN LASYPVK             T+CN FTYCSSGETCCCAR FLG+CL W
Sbjct: 321  SGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETCCCARYFLGVCLSW 380

Query: 503  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLG 366
             CCEA+SAVCC D  HCCPHDYP+CDTK+NLCLK+ GN T++K LG
Sbjct: 381  NCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLG 426


>emb|CDP07460.1| unnamed protein product [Coffea canephora]
          Length = 441

 Score =  643 bits (1659), Expect = 0.0
 Identities = 300/431 (69%), Positives = 348/431 (80%)
 Frame = -3

Query: 1583 CKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFA 1404
            CKSS  +DLF +WC+Q+GKTY S +EK+YRL+VFE+NY YV++HNSLANS+YTLSLNAFA
Sbjct: 20   CKSSLTADLFENWCKQHGKTYPSEEEKQYRLRVFEDNYDYVTKHNSLANSTYTLSLNAFA 79

Query: 1403 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 1224
            DLTHHEFKAKYLG S SAD  +IRLNRGS S+  S  V + DIPSSLDWR KGAVT +KD
Sbjct: 80   DLTHHEFKAKYLGFSASAD-GLIRLNRGSSSIGASGAVGKYDIPSSLDWRNKGAVTNVKD 138

Query: 1223 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 1044
            QGSCGACW+FSATGAIEGIN+IVTGSL+SLSEQELIDCDRSYN+GC GGLMDY YEFV+K
Sbjct: 139  QGSCGACWAFSATGAIEGINEIVTGSLVSLSEQELIDCDRSYNNGCNGGLMDYTYEFVVK 198

Query: 1043 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 864
            N GIDTE+DYP++GR+ TCN  KLKR VV+ID Y+D+P  NE+ LLQAVA QP+SVGI G
Sbjct: 199  NGGIDTEQDYPFKGRDGTCNSNKLKRRVVSIDGYIDVPANNEQELLQAVAAQPVSVGICG 258

Query: 863  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 684
            S   FQLYSGGIFTG CST LDHAVLIVGYDSK+G DYWIVKNSWG  WGI+GY++IIRN
Sbjct: 259  SERGFQLYSGGIFTGPCSTSLDHAVLIVGYDSKNGADYWIVKNSWGTSWGINGYIHIIRN 318

Query: 683  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 504
            +GN  GVCGINM+ASYP K             T+C+ F+ C +GETCCC+  FLGLCL W
Sbjct: 319  SGNSAGVCGINMMASYPTKSSLNPPPSPPPGPTKCSLFSSCPAGETCCCSMEFLGLCLSW 378

Query: 503  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLS 324
            KCC+  SAVCCKDR HCCPHDYPICDTKRNLCL+++GN+T+ KQL N   +G        
Sbjct: 379  KCCDLDSAVCCKDRLHCCPHDYPICDTKRNLCLRRMGNSTLVKQLKNGGRSG-------- 430

Query: 323  MIELGGWSSHF 291
              + G WSS F
Sbjct: 431  --KFGDWSSLF 439


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  597 bits (1539), Expect = e-167
 Identities = 280/405 (69%), Positives = 328/405 (80%), Gaps = 1/405 (0%)
 Frame = -3

Query: 1577 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1398
            SSSISDLF+SWCQ++GKTY S +E+E+RL VF ENY +++ HN+ AN SYTLSLNAFADL
Sbjct: 23   SSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSLNAFADL 82

Query: 1397 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 1218
            T  EF  +YLG SPS  D +IR NRGS S    N    S +PSS+DWRKKGAVTGIKDQG
Sbjct: 83   TRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVTGIKDQG 139

Query: 1217 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 1038
            SCGACWSFSATGAIEGIN+IVTGSL+SLSEQELIDCD SYN GC GGLMDYAYEF++KNK
Sbjct: 140  SCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYEFILKNK 199

Query: 1037 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 858
            GIDTE+DY Y+GR+ +C++ KL + VVTIDSYVDIP KNE+ LL+AVA+QP+SVGISG  
Sbjct: 200  GIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSVGISGGD 259

Query: 857  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 678
              FQ YS GIFTG CST LDHAVLIVGYDSK+GKDYWIVKNSWGK WG+DGYMY+ RN G
Sbjct: 260  APFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMYVQRNTG 319

Query: 677  NPEGVCGINMLASYPVK-XXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 501
            N  G+C INM+ASYPVK              T+C+ F+YCS GETCCCARRFLGLC+++K
Sbjct: 320  NQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLGLCMRYK 379

Query: 500  CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLG 366
            CC A+SAVCC+D  HCCP DYPICDT +++C K  GN+T+A  +G
Sbjct: 380  CCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIPVG 424


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  596 bits (1537), Expect = e-167
 Identities = 278/412 (67%), Positives = 325/412 (78%)
 Frame = -3

Query: 1583 CKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFA 1404
            C  SSISDLF +WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS  NSSYTL LNA++
Sbjct: 20   CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79

Query: 1403 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 1224
            DLTHHEF+  +LGLS SA+D I    RGS S E + ++ + D PSSLDWR+KGAVT +K+
Sbjct: 80   DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSE-TGVLSDVDAPSSLDWREKGAVTDVKN 138

Query: 1223 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 1044
            QGSCGACWSFSATGA+EGINKI TGSL+SLSEQELIDCDRSYN+GCGGGLMDYA+EFVIK
Sbjct: 139  QGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIK 198

Query: 1043 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 864
            N GIDTEKDYP++ RE TCNK KL+RHVVTID Y DIP  +E +LL+AVATQP+SVGI G
Sbjct: 199  NGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258

Query: 863  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 684
            S  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WGI+GY+++ RN
Sbjct: 259  SARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRN 318

Query: 683  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 504
            +GN EG+CGIN LASYP K             ++C+ FT C  GETCCC  +FLG+CL W
Sbjct: 319  SGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSW 378

Query: 503  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG 348
            KCC   SAVCCKD  HCCP DYPICDT RNLCLK++ NATI +Q      TG
Sbjct: 379  KCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKEAFTG 430


>ref|XP_009597081.1| PREDICTED: zingipain-2 [Nicotiana tomentosiformis]
          Length = 439

 Score =  593 bits (1529), Expect = e-166
 Identities = 280/404 (69%), Positives = 320/404 (79%)
 Frame = -3

Query: 1583 CKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFA 1404
            C  SSISDLF SWCQQ GKTY+S QE+ YRL+VFEENYAY+ +HNS  NS+YTL LNAF+
Sbjct: 20   CTCSSISDLFESWCQQNGKTYSSEQERVYRLEVFEENYAYIIEHNSKGNSTYTLDLNAFS 79

Query: 1403 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 1224
            DLTHHEFK  +LGLS SA+D  IRL  GS S    N V   DIPSSLDWR+KGAVT +K+
Sbjct: 80   DLTHHEFKNSFLGLSSSAND-FIRLKTGSSSAGVFNDVGVVDIPSSLDWREKGAVTKVKN 138

Query: 1223 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 1044
            QGSCGACWSFSATGAIEGINKIVTGSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV K
Sbjct: 139  QGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFVKK 198

Query: 1043 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 864
            N GIDTE+DYP+  RE TCNK KL+R VVTID Y D+P  +E +LL+AVA QP+SVGI G
Sbjct: 199  NGGIDTEEDYPFNEREGTCNKNKLQRRVVTIDGYTDVPQYDEDKLLKAVANQPVSVGICG 258

Query: 863  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 684
            S  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WGI+GYM++ RN
Sbjct: 259  SERAFQSYSKGIFTGPCSTVLDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQRN 318

Query: 683  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 504
            +GN EG+CGIN LASYP K             ++C+ FT C  GETCCC  R LG+C+ W
Sbjct: 319  SGNQEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWRLLGVCVSW 378

Query: 503  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 372
            KCC   SAVCCKD  HCCPHDYPICDT RNLCLK++ NATI +Q
Sbjct: 379  KCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQ 422


>ref|XP_004238304.1| PREDICTED: zingipain-2 [Solanum lycopersicum]
          Length = 439

 Score =  592 bits (1526), Expect = e-166
 Identities = 274/404 (67%), Positives = 320/404 (79%)
 Frame = -3

Query: 1583 CKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFA 1404
            C  SSISDLF +WCQQ GK Y+S QE+ YR KVFEENYAY+++HNS  NSSYTL LNA++
Sbjct: 20   CTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAYS 79

Query: 1403 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 1224
            DLTHHEF+  +LGLS SA+D I    RGS S   + ++ + D PSSLDWR KGAVT +K+
Sbjct: 80   DLTHHEFRNSFLGLSSSANDFIRLKGRGSGS-SAAGVLSDVDAPSSLDWRDKGAVTNVKN 138

Query: 1223 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 1044
            QGSCGACWSFSATGAIEGINKI TGSL+SLSEQELIDCDRSYN GCGGGLMDYA+EFVIK
Sbjct: 139  QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198

Query: 1043 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 864
            N GIDTEKDYP++ +E TCNK KL+R VVTID Y DIP  +E +LL+AVATQP+SVGI G
Sbjct: 199  NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258

Query: 863  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 684
            S  +FQ YS GIFTG C TDLDHAVLIVGY S++G DYWI+KNSWG  WGI+GY+++ RN
Sbjct: 259  SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318

Query: 683  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 504
            +GN EG+CG+N LASYP K             ++C+TFT C  GETCCC  +FLG+CL W
Sbjct: 319  SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378

Query: 503  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 372
            KCC   SAVCCKD  HCCP DYPICDT RNLCLK++ NATI +Q
Sbjct: 379  KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQ 422


>gb|KHG28107.1| Cysteinease RD21a -like protein [Gossypium arboreum]
          Length = 431

 Score =  591 bits (1523), Expect = e-166
 Identities = 274/407 (67%), Positives = 323/407 (79%)
 Frame = -3

Query: 1574 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1395
            S IS  F +WCQQ+GK+Y S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LNAFAD T
Sbjct: 26   SHISKKFETWCQQHGKSYLSEEEKSYRLKVFEDNYAFVTQHNAMVNSSYSLALNAFADFT 85

Query: 1394 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 1215
            HHEFKA  LGLS +A      +     ++    LV+  DIP SLDWR+KGAVT +KDQGS
Sbjct: 86   HHEFKASRLGLSGAA------IQFRHPNLREPRLVR--DIPDSLDWREKGAVTQVKDQGS 137

Query: 1214 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 1035
            CGACWSFSATGAIEG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G
Sbjct: 138  CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197

Query: 1034 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 855
            IDTE+DYPYQGRE TCNKEKLKRHVVTID Y D+P  NEK+LLQAVATQP+SVGI GS  
Sbjct: 198  IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSER 257

Query: 854  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 675
            +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG+ WG++GY+++IRN+G 
Sbjct: 258  AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGRRWGMNGYIHMIRNSGK 317

Query: 674  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 495
             EG+CGINMLASYP+K             T+C+ FTYCS+GETCCC  R  G+C  WKCC
Sbjct: 318  SEGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFSWKCC 377

Query: 494  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFS 354
               SAVCCKD  HCCPH+YPICDTK N CLK++GNATI +    N +
Sbjct: 378  GLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSDTNLA 424


>ref|XP_010271530.1| PREDICTED: low-temperature-induced cysteine proteinase [Nelumbo
            nucifera]
          Length = 443

 Score =  591 bits (1523), Expect = e-166
 Identities = 270/425 (63%), Positives = 332/425 (78%)
 Frame = -3

Query: 1574 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1395
            SS SDLF+ WC+++G+TY+S +E+ +RLKVFE+N A+V++HNS+ANS+Y+L+LNAFADLT
Sbjct: 23   SSTSDLFDRWCEEHGRTYSSEEERLFRLKVFEDNLAFVTEHNSMANSTYSLALNAFADLT 82

Query: 1394 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 1215
            HHEFK   LGL+ +A D +    R    +E  ++  +  +PSS+DWR+KGAVT +KDQGS
Sbjct: 83   HHEFKISRLGLAAAATDMVRSSPRAPSLIESPSIAGQ--LPSSIDWREKGAVTNVKDQGS 140

Query: 1214 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 1035
            CGACWSFSATGAIEGINKIVTGS +SLSEQEL+DCDRSYN GCGGGLMDYA+++VIKNKG
Sbjct: 141  CGACWSFSATGAIEGINKIVTGSPLSLSEQELVDCDRSYNSGCGGGLMDYAFQWVIKNKG 200

Query: 1034 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 855
            IDTE DYPYQG E+TCNK+KL++HVVTID Y D+P  +EK LLQAVA+QP+SVGI GS  
Sbjct: 201  IDTEDDYPYQGGERTCNKDKLRKHVVTIDGYTDVPSNSEKHLLQAVASQPVSVGICGSER 260

Query: 854  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 675
            +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWG  WG++GYM+++RN+G+
Sbjct: 261  AFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYMHMLRNSGS 320

Query: 674  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 495
            P+GVCGINMLASYP K             TRC+  TYC  GETCCC RR LG+C  WKCC
Sbjct: 321  PQGVCGINMLASYPTKTSPNPPPSPSPGPTRCDLLTYCQEGETCCCTRRILGICFSWKCC 380

Query: 494  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSMIE 315
            E  SAVCCKD  +CCPHDYPICDT+R  CLK  GN T  K L  +           S+++
Sbjct: 381  ELDSAVCCKDHRYCCPHDYPICDTERKQCLKSTGNFTSVKSLDKS----------SSLVK 430

Query: 314  LGGWS 300
             GGW+
Sbjct: 431  FGGWN 435


>ref|XP_012444786.1| PREDICTED: zingipain-2 [Gossypium raimondii]
            gi|763791179|gb|KJB58175.1| hypothetical protein
            B456_009G197900 [Gossypium raimondii]
          Length = 431

 Score =  590 bits (1521), Expect = e-165
 Identities = 273/407 (67%), Positives = 323/407 (79%)
 Frame = -3

Query: 1574 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1395
            S IS +F +WC Q+GK+Y+S +EK YRLKVFE+NYA+V+QHN++ NSSY+L+LNAFADLT
Sbjct: 26   SHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAMTNSSYSLALNAFADLT 85

Query: 1394 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 1215
            HHEFKA  LGLS +A      +     ++    LV+  DIP+SLDWR+KGAVT +KDQGS
Sbjct: 86   HHEFKASRLGLSGAA------IQFRCSNLREPRLVR--DIPASLDWREKGAVTQVKDQGS 137

Query: 1214 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 1035
            CGACWSFSATGAIEG+NKIVTGSLISLSEQEL+DCD++YN GC GGLMDYA++FVI N G
Sbjct: 138  CGACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHG 197

Query: 1034 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 855
            IDTE+DYPYQGRE TCNKEKLKRHVVTID Y D+P  NEK+LLQAVATQP+SVGI GS  
Sbjct: 198  IDTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMTNEKKLLQAVATQPVSVGICGSER 257

Query: 854  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 675
            +FQLY  GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG  WG++GY+++IRN G 
Sbjct: 258  AFQLYCKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGK 317

Query: 674  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 495
             EG+CGINMLASYP+K             T+C+ FTYCS+GETCCC  R  G+C  WKCC
Sbjct: 318  SEGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKCC 377

Query: 494  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFS 354
               SAVCCKD  HCCPH+YPICDTK N CLK++GNATI +    N +
Sbjct: 378  GLDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNATIMESSNTNLA 424


>ref|XP_012071947.1| PREDICTED: low-temperature-induced cysteine proteinase [Jatropha
            curcas] gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha
            curcas] gi|643731232|gb|KDP38570.1| hypothetical protein
            JCGZ_04495 [Jatropha curcas]
          Length = 441

 Score =  588 bits (1515), Expect = e-165
 Identities = 271/410 (66%), Positives = 324/410 (79%)
 Frame = -3

Query: 1577 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1398
            SS I+ LF +WCQQ+GKTY S +EK +RLKVF++NY +V++HNS  NSSYTLSLNAFADL
Sbjct: 23   SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82

Query: 1397 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 1218
            THHEFKA  LGLS +A  S+  ++R +  +        +D+P+S+DWRK GAVT +KDQG
Sbjct: 83   THHEFKASRLGLSSAASASL-NVDRSNRQIPDF----VADVPASVDWRKNGAVTQVKDQG 137

Query: 1217 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 1038
            +CGACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCD+SYN+GC GG+MDYA++FVI N 
Sbjct: 138  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNH 197

Query: 1037 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 858
            GIDTE+DYPYQGR+++CNKEKLKRHVVTID YVD+P  NEK LL+AVA QP+SVGI GS 
Sbjct: 198  GIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSE 257

Query: 857  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 678
             +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG YWG+DGYM++ RN+G
Sbjct: 258  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSG 317

Query: 677  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 498
            +  G+CGINMLASYP K             TRC+ FT+C  GETCCC     G+CL WKC
Sbjct: 318  SSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKC 377

Query: 497  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG 348
            CE  SAVCCKD  HCCP DYP+CDT RN+CLK  GNAT  ++   N S+G
Sbjct: 378  CELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSG 427


>ref|XP_009771293.1| PREDICTED: zingipain-2 [Nicotiana sylvestris]
          Length = 439

 Score =  587 bits (1513), Expect = e-164
 Identities = 275/404 (68%), Positives = 320/404 (79%)
 Frame = -3

Query: 1583 CKSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFA 1404
            C  SSISDLF +WCQQ GK+Y+S QE+ YRLKVFEENYAY+ +HNS  NS+YTL LNA++
Sbjct: 20   CTCSSISDLFETWCQQNGKSYSSEQERVYRLKVFEENYAYIIEHNSKGNSTYTLGLNAYS 79

Query: 1403 DLTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKD 1224
            DLTHHEFK  +LGLS SA+D  IRL  GS S    + V + D+PSSLDWR+KGAVT +K+
Sbjct: 80   DLTHHEFKNSFLGLSSSAND-FIRLKTGSSSAGVFSDVGDVDVPSSLDWREKGAVTKVKN 138

Query: 1223 QGSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIK 1044
            QGSCGACWSFSATGAIEGINKIV+GSL+SLSEQELIDCD+SYNDGCGGGLMDYA+EFV K
Sbjct: 139  QGSCGACWSFSATGAIEGINKIVSGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFVKK 198

Query: 1043 NKGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISG 864
            N GIDTE+DYP+  RE TCNK KL+R VVTID Y D+P  +E +LL+AVA QP+SVGI G
Sbjct: 199  NGGIDTEEDYPFIEREGTCNKNKLQRRVVTIDGYTDVPQNDEDKLLKAVAKQPVSVGICG 258

Query: 863  SGNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRN 684
            S  +FQ YS GIFTG CST LDHAVLIVGY S++G DYWI+KNSWG  WGI+GYM++ RN
Sbjct: 259  SERAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQRN 318

Query: 683  NGNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKW 504
            +GN EG+CGIN LASYP K             ++C+ FT C  GETCCC    LG+CL W
Sbjct: 319  SGNQEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWSLLGVCLSW 378

Query: 503  KCCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQ 372
            KCC   SAVCCKD  HCCPHDYPICDT RNLCLK++ NATI +Q
Sbjct: 379  KCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQ 422


>ref|XP_011048840.1| PREDICTED: low-temperature-induced cysteine proteinase [Populus
            euphratica]
          Length = 480

 Score =  586 bits (1511), Expect = e-164
 Identities = 274/438 (62%), Positives = 336/438 (76%)
 Frame = -3

Query: 1697 KSREKKQFQTFPLLSPLSKMHXXXXXXXXXXXXXXXPTCKSSSISDLFNSWCQQYGKTYN 1518
            +S +  + Q  PL S   KM+               P+  SS IS LF +WC+++GK+Y 
Sbjct: 26   QSSKTPESQKSPLFSLKQKMNFLCIFALTLLISVLSPSTASSGISQLFETWCKEHGKSYT 85

Query: 1517 SVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLTHHEFKAKYLGLSPSADDSI 1338
            S +E+ +RLKVFE+NY +V++HNS  NSSYTLSLNAF+DLTHHEFK   LGLS +     
Sbjct: 86   SQEERSHRLKVFEDNYDFVTKHNSKGNSSYTLSLNAFSDLTHHEFKTSRLGLSAAP---- 141

Query: 1337 IRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGSCGACWSFSATGAIEGINKI 1158
              +N G  ++E + +V   DIP+S+DWR KGAVT +KDQGSCGACWSFSATGAIEGINKI
Sbjct: 142  --MNLGHRNLEITGVV--GDIPASIDWRNKGAVTNVKDQGSCGACWSFSATGAIEGINKI 197

Query: 1157 VTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKGIDTEKDYPYQGREKTCNKE 978
            VTGSL+SLSEQELI+CD+S+NDGCGGGLMDYA++FVI N GIDTE+DYPY+ R+ TCNK+
Sbjct: 198  VTGSLVSLSEQELIECDKSFNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKD 257

Query: 977  KLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGNSFQLYSGGIFTGACSTDLD 798
            K+KR VVTID YVD+P  NEK+LLQAVA QP+SVGI GS  +FQ+YS GIFTG CST LD
Sbjct: 258  KMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLD 317

Query: 797  HAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGNPEGVCGINMLASYPVKXXX 618
            HAVLIVGY S++G DYWIVKNSWG  WG+ GYM++ RN+GN +GVCGINMLASYPVK   
Sbjct: 318  HAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSP 377

Query: 617  XXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCCEAKSAVCCKDRSHCCPHDY 438
                      T+C+ F+YC++GETCCCAR+F G+C+ WKCC   SAVCCKDR HCCPHDY
Sbjct: 378  NPPPPPPPGPTKCDLFSYCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDY 437

Query: 437  PICDTKRNLCLKQIGNAT 384
            P+CDT +N+C K+ GNAT
Sbjct: 438  PVCDTDKNMCFKRAGNAT 455


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  582 bits (1501), Expect = e-163
 Identities = 272/393 (69%), Positives = 310/393 (78%), Gaps = 1/393 (0%)
 Frame = -3

Query: 1577 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1398
            SS IS LF SW +++GKTY S ++K YR K+FEENY +V +HNS  NSSYTLSLNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 1397 THHEFKAKYLGLSP-SADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQ 1221
            THHEFKA  LGLS  S    + R N       G       D+P S+DWRKKGAV+ +KDQ
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLHDFVG-------DVPISIDWRKKGAVSQVKDQ 137

Query: 1220 GSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 1041
            G+CGACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCDRSYN+GC GGLMDYAY+FVI+N
Sbjct: 138  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197

Query: 1040 KGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 861
             GIDTE+DYPYQ REKTCNKEKLKRHVVTID Y D+P  NEK LL+AVA QP+SVGI GS
Sbjct: 198  NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257

Query: 860  GNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNN 681
              +FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG +WGI+GYMY++RN+
Sbjct: 258  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317

Query: 680  GNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 501
            GN +G+CGINMLAS+PVK             T+C+ FT C  GETCCC RR  GLC  WK
Sbjct: 318  GNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWK 377

Query: 500  CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLK 402
            CCE  SAVCCKD  HCCPHDYP+CDTKRN+CLK
Sbjct: 378  CCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  580 bits (1494), Expect = e-162
 Identities = 265/398 (66%), Positives = 320/398 (80%)
 Frame = -3

Query: 1577 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1398
            SS IS LF +WC+++GK+Y S +E+ +RLKVFE+NY +V++HNS  NSSY+L+LNAFADL
Sbjct: 22   SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81

Query: 1397 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 1218
            THHEFK   LGLS +       LN    ++E + +V   DIP+S+DWR KG VT +KDQG
Sbjct: 82   THHEFKTSRLGLSAAP------LNLAHRNLEITGVV--GDIPASIDWRNKGVVTNVKDQG 133

Query: 1217 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 1038
            SCGACWSFSATGAIEGINKIVTGSL+SLSEQELI+CD+SYNDGCGGGLMDYA++FVI N 
Sbjct: 134  SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNH 193

Query: 1037 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 858
            GIDTE+DYPY+ R+ TCNK+++KR VVTID YVD+P  NEK+LLQAVA QP+SVGI GS 
Sbjct: 194  GIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSE 253

Query: 857  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 678
             +FQ+YS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWG  WG+ GYM++ RN+G
Sbjct: 254  RAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSG 313

Query: 677  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 498
            N +GVCGINMLASYPVK             T+CN  TYC++GETCCCAR+F G+C+ WKC
Sbjct: 314  NSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKC 373

Query: 497  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNAT 384
            C   SAVCCKDR HCCPHDYP+CDT +N+C K+ GNAT
Sbjct: 374  CGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 411


>ref|XP_002280937.1| PREDICTED: zingipain-2 [Vitis vinifera] gi|302142569|emb|CBI19772.3|
            unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  576 bits (1485), Expect = e-161
 Identities = 267/428 (62%), Positives = 327/428 (76%)
 Frame = -3

Query: 1580 KSSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFAD 1401
            ++SS +DLF +WC+QYGKTY+S +EK  RLKVFEEN+A+V+QHNS+AN+SYTL+LNAFAD
Sbjct: 21   EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80

Query: 1400 LTHHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQ 1221
            LTHHEFKA  LG SP    SI  +            V+E  +P ++DWRK GAVTG+KDQ
Sbjct: 81   LTHHEFKASRLGFSPGRAQSIRSVGTP---------VQELHVPPAVDWRKSGAVTGVKDQ 131

Query: 1220 GSCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKN 1041
            G+CG CWSFS TGAIEGINKIVTGSL+SLSEQEL+DCDRSYN GC GGLMDYAY+FVIKN
Sbjct: 132  GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191

Query: 1040 KGIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGS 861
            +GID+E DYPY G +K CNKEKLK+H+VTID Y DIPP +EK+LLQ VA QP+SVGI GS
Sbjct: 192  QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251

Query: 860  GNSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNN 681
              +FQLYS G++TG CS+ LDHAVLIVGY ++DG D+WIVKNSWG++WG+ GY++++RNN
Sbjct: 252  EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311

Query: 680  GNPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWK 501
            G  EG+CGINMLASYP K             T+C+ F+ CS GETCCC+ RF+G+CL W 
Sbjct: 312  GTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSWN 371

Query: 500  CCEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG*AIGVLLSM 321
            CC AKSAVCC + ++CCP  +PICDTKRN CLK  GN T  + L    S+          
Sbjct: 372  CCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSS---------- 421

Query: 320  IELGGWSS 297
            ++ GGWSS
Sbjct: 422  VKFGGWSS 429


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  573 bits (1477), Expect = e-160
 Identities = 267/408 (65%), Positives = 317/408 (77%)
 Frame = -3

Query: 1574 SSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLT 1395
            S IS LF +WC Q+GK Y+S +EK YRLKVFEENYA+V+QHN + NSSY+L+LNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 1394 HHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGS 1215
            HHEFKA  LGLS +A      +     +++   LV+  DIP+S+DWR KGAVT +KDQGS
Sbjct: 84   HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135

Query: 1214 CGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKG 1035
            CGACWSFSATGAIEGINKIVTG+L+SLSEQEL+DCDRSYN GC GGLMDYAY+FVI N G
Sbjct: 136  CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195

Query: 1034 IDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGN 855
            ID E+DYPY GREKTCNKEK KR VVTID Y  +P  NE  LLQAVA QP+SVGI GS  
Sbjct: 196  IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255

Query: 854  SFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGN 675
            +FQLYS GIFTG CS+ LDHAVLIVGY S++G DYWIVKNSWG  WG++GY++++RN+G+
Sbjct: 256  AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315

Query: 674  PEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCC 495
             +G+CGINMLASYP K             T+C+ FTYCS+GETCCC  R  G+C  WKCC
Sbjct: 316  SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375

Query: 494  EAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFST 351
            E  SAVCCKD  HCCP+DYP+CDTK++ CLK++GNAT  +      ST
Sbjct: 376  ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKRHST 423


>ref|XP_008444761.1| PREDICTED: zingipain-2 [Cucumis melo]
          Length = 431

 Score =  571 bits (1471), Expect = e-160
 Identities = 268/410 (65%), Positives = 315/410 (76%)
 Frame = -3

Query: 1577 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1398
            +S++S+LF  WC ++GK+Y+S +EK YRL VF +NY +V+ HN+L NSSYTLSLN++ADL
Sbjct: 22   TSNVSELFEIWCTEHGKSYSSAEEKLYRLSVFADNYEFVTHHNNLGNSSYTLSLNSYADL 81

Query: 1397 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 1218
            THHEFK   LG SP+         R    V         D+P SLDWRKKGAVT +KDQG
Sbjct: 82   THHEFKVSRLGFSPAL--------RNFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQG 133

Query: 1217 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 1038
            SCGACWSFSATGAIEGIN+I+TGSLIS+SEQELIDCDRSYN GCGGGLMDYAY+FVI N 
Sbjct: 134  SCGACWSFSATGAIEGINQIMTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVISNH 193

Query: 1037 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 858
            GIDTE DYPYQGR+ +C K+KL+R+VVTID Y DIPP +E +LLQAVA QP+SVGI GS 
Sbjct: 194  GIDTEDDYPYQGRDGSCRKDKLQRNVVTIDGYTDIPPNDEGKLLQAVAAQPVSVGICGSE 253

Query: 857  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 678
             +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WG+DGYM++ RN+G
Sbjct: 254  RAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSG 313

Query: 677  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 498
            N EGVCGIN LASYP K             T+C+  T C++GETCCCA++FLGLCL WKC
Sbjct: 314  NSEGVCGINKLASYPTKTSPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKC 373

Query: 497  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAKQLGNNFSTG 348
            C   SAVCCKD  HCCP DYPICDT RNLCLK+  N T  + L N  S+G
Sbjct: 374  CGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKRTMNGTRMEVLENRSSSG 423


>ref|XP_004500967.1| PREDICTED: zingipain-2 [Cicer arietinum]
          Length = 436

 Score =  570 bits (1468), Expect = e-159
 Identities = 261/397 (65%), Positives = 307/397 (77%), Gaps = 2/397 (0%)
 Frame = -3

Query: 1565 SDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADLTHHE 1386
            S LF  WC+Q+GKTY S QEK YR  VFE+NYA+V+QHN + NSSYTLSLNAFADLTHHE
Sbjct: 27   SKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHE 86

Query: 1385 FKAKYLGLSPSADDSIIRL--NRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQGSC 1212
            FKA  LGL PS   S++R   NR  D     + ++   +PS +DWRK GAV+ +KDQGSC
Sbjct: 87   FKATRLGLPPS---SLLRFKFNRFQDQQRSDDFLQ---VPSEIDWRKNGAVSIVKDQGSC 140

Query: 1211 GACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNKGI 1032
            GACWSFSATGAIEGINKIVTGSL+SLSEQEL+DCD +YN GC GGLMDYAY+F+I N GI
Sbjct: 141  GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGI 200

Query: 1031 DTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSGNS 852
            DTE+DYPYQ R+  C K+KLKR VVTID Y D+PP +EK+LL+AVA QP+SVGI GS  +
Sbjct: 201  DTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARA 260

Query: 851  FQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNGNP 672
            FQLYS GIFTG CST LDHAVLIVGY S++G DYWIVKNSWGKYWG++GY++++RN  + 
Sbjct: 261  FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSS 320

Query: 671  EGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKCCE 492
             G+CGINMLASYP K              +CN FTYCS GETCCCA++FLG+C  WKCC 
Sbjct: 321  AGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCG 380

Query: 491  AKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATI 381
              SAVCCKD+ HCCP DYP+CD     CLK+I N TI
Sbjct: 381  VTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTI 417


>emb|CDX94938.1| BnaC05g07330D [Brassica napus]
          Length = 442

 Score =  569 bits (1467), Expect = e-159
 Identities = 261/401 (65%), Positives = 316/401 (78%)
 Frame = -3

Query: 1577 SSSISDLFNSWCQQYGKTYNSVQEKEYRLKVFEENYAYVSQHNSLANSSYTLSLNAFADL 1398
            S  IS+LF++WCQ++GKTY S +E+++R+++F +N+ +V++HN +ANS+Y+LSLNAFADL
Sbjct: 30   SDDISELFDAWCQRHGKTYASEEERQHRIEIFRDNHDFVTRHNGIANSTYSLSLNAFADL 89

Query: 1397 THHEFKAKYLGLSPSADDSIIRLNRGSDSVEGSNLVKESDIPSSLDWRKKGAVTGIKDQG 1218
            THHEFKA  LGLS S+   ++      ++V G        +P S+DWRKKGAVT +KDQG
Sbjct: 90   THHEFKASRLGLSASSAPLLVAKGESVENVGGK-------VPDSVDWRKKGAVTNVKDQG 142

Query: 1217 SCGACWSFSATGAIEGINKIVTGSLISLSEQELIDCDRSYNDGCGGGLMDYAYEFVIKNK 1038
            SCGACWSFSATGA+EGIN+IVTG LISLSEQELIDCD+SYNDGC GGLMDYA++FVIKN 
Sbjct: 143  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMDYAFQFVIKNH 202

Query: 1037 GIDTEKDYPYQGREKTCNKEKLKRHVVTIDSYVDIPPKNEKRLLQAVATQPISVGISGSG 858
            GIDTEKDYPYQ R+ TC K+KLKR VVTIDSY  +   +EK LL+AVA QP+SVGI GS 
Sbjct: 203  GIDTEKDYPYQERDGTCKKDKLKRKVVTIDSYAGVKSNDEKALLEAVAAQPVSVGICGSE 262

Query: 857  NSFQLYSGGIFTGACSTDLDHAVLIVGYDSKDGKDYWIVKNSWGKYWGIDGYMYIIRNNG 678
             +FQLYS GIF+G CST LDHAVLIVGY S++G DYWIVKNSWGK WG+DG+M++ RN G
Sbjct: 263  RAFQLYSKGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTG 322

Query: 677  NPEGVCGINMLASYPVKXXXXXXXXXXXXXTRCNTFTYCSSGETCCCARRFLGLCLKWKC 498
            N EGVCGINMLASYP+K             T+CN FTYC++ ETCCCAR   GLC  WKC
Sbjct: 323  NSEGVCGINMLASYPIKTHPNPPPPSPSGPTKCNLFTYCAADETCCCARNLFGLCFSWKC 382

Query: 497  CEAKSAVCCKDRSHCCPHDYPICDTKRNLCLKQIGNATIAK 375
            CE +SAVCCKD  HCCP DYP+CDT R+LCLK+ GN T  K
Sbjct: 383  CELESAVCCKDGRHCCPRDYPVCDTTRSLCLKKTGNFTEIK 423

