BLASTX nr result

ID: Mentha24_contig00022000 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00022000
         (1359 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus...   654   0.0  
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          577   e-162
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   576   e-162
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   575   e-161
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   574   e-161
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   568   e-159
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   568   e-159
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   565   e-158
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   565   e-158
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   563   e-158
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   562   e-157
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   558   e-156
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   558   e-156
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   558   e-156
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   557   e-156
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   556   e-156
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   553   e-155
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   552   e-154
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   550   e-154
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  548   e-153

>gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus guttatus]
          Length = 433

 Score =  654 bits (1688), Expect = 0.0
 Identities = 303/426 (71%), Positives = 347/426 (81%), Gaps = 1/426 (0%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L+L +L SQ P   SS IS+LFD WCEE+GKTYASE+EKQHRL VF  NY+ V +HN  A
Sbjct: 8    LNLIMLFSQLPISKSSLISDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADA 67

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREF-AIDGPDLVEESDLPASV 1000
            NSS+TLS+NAFADLTN EF+  YLGL PS  D +IRLNSR   AIDG +L++ES++P+S+
Sbjct: 68   NSSYTLSVNAFADLTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSL 127

Query: 999  DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820
            DWR KGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCD +YN GC 
Sbjct: 128  DWRNKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCN 187

Query: 819  GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640
            GGLMDYA++FIIKNKGIDTEEDY Y+GR   C K K+ +HVVTIDSY D+P + EKKLLQ
Sbjct: 188  GGLMDYAYDFIIKNKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQ 247

Query: 639  AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460
            AVATQP+SVGICGSD  FQLYSGGIF+GPCST+LDHAVLIVGYDS+DG DYWI+KNSWGK
Sbjct: 248  AVATQPISVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGK 307

Query: 459  YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280
             WG+ GY+HM+RNS   EGVCGINTLAS+P+K             TKC++FTYC + ETC
Sbjct: 308  SWGIKGYMHMVRNSGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETC 367

Query: 279  CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 100
            CC    LG+C SW CCEAESAVCCDDH HCCP DYP CDT +NLCLK+ GN+T+SKP  K
Sbjct: 368  CCARYFLGVCLSWNCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLGK 427

Query: 99   KGFFTS 82
            K F  S
Sbjct: 428  KSFSAS 433


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  577 bits (1486), Expect = e-162
 Identities = 274/421 (65%), Positives = 325/421 (77%), Gaps = 1/421 (0%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180
            LLS   L S     +SS I+ LF+ WC++HGKTYAS+EEK  RL+VF+ NY+ V EHN++
Sbjct: 12   LLSYLFLFSS----SSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQ 67

Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSAD-DLLIRLNSREFAIDGPDLVEESDLPAS 1003
             NSS+TLSLNAFADLT+ EFK   LGL  +A   L +  ++R+     PD V  +D+PAS
Sbjct: 68   GNSSYTLSLNAFADLTHHEFKASRLGLSSAASASLNVDRSNRQI----PDFV--ADVPAS 121

Query: 1002 VDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGC 823
            VDWRK GAVT VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN+GC
Sbjct: 122  VDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGC 181

Query: 822  GGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLL 643
             GG+MDYAF+F+I N GIDTEEDYPY+GRD  C+KEKLKRHVVTID Y DVP   EK+LL
Sbjct: 182  EGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELL 241

Query: 642  QAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWG 463
            +AVA QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG
Sbjct: 242  KAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG 301

Query: 462  KYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDET 283
             YWGM+GY+HM RNS  + G+CGIN LAS+P K             T+CDLFT+CG  ET
Sbjct: 302  SYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGET 361

Query: 282  CCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFE 103
            CCC   + GIC SW+CCE +SAVCC D  HCCPRDYP CDT RN+CLK  GN+T  + F 
Sbjct: 362  CCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFA 421

Query: 102  K 100
            K
Sbjct: 422  K 422


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  576 bits (1485), Expect = e-162
 Identities = 268/422 (63%), Positives = 326/422 (77%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L L LLI Q P C  SSIS+LF+ WC+++GK Y+SE+E+ +R +VFE NY  + EHN+K 
Sbjct: 8    LVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKE 67

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997
            NSS+TL LNA++DLT+ EF++ +LGL  SA+D  IRL  R        ++ + D P+S+D
Sbjct: 68   NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDVDAPSSLD 126

Query: 996  WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817
            WR+KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG
Sbjct: 127  WREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGG 186

Query: 816  GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637
            GLMDYAFEF+IKN GIDTE+DYP+R R+G C+K KL+RHVVTID Y D+P   E KLL+A
Sbjct: 187  GLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKA 246

Query: 636  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457
            VATQPVSVGICGS   FQ YS GIF+GPCSTALDHAVLIVGY S++GVDYWI+KNSWG  
Sbjct: 247  VATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTS 306

Query: 456  WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277
            WG+NGYIHM RNS + EG+CGIN LAS+P K             +KC +FT CG  ETCC
Sbjct: 307  WGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCC 366

Query: 276  CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97
            C    LGIC SW+CC  +SAVCC D  HCCP+DYP CDT+RNLCLKR+ N+T+ +  +K+
Sbjct: 367  CGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKE 426

Query: 96   GF 91
             F
Sbjct: 427  AF 428


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  575 bits (1481), Expect = e-161
 Identities = 271/420 (64%), Positives = 322/420 (76%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L+ F L+    + +S  ISELFD WC++HGKTY SEEE+Q R+++F+ N++ V +HN   
Sbjct: 11   LTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLIT 70

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997
            N++++LSLNAFADLT+ EFK   LGL  SA  +++       A  G  L     +P SVD
Sbjct: 71   NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123

Query: 996  WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817
            WRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC G
Sbjct: 124  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183

Query: 816  GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637
            GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++A
Sbjct: 184  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243

Query: 636  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457
            VA QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK 
Sbjct: 244  VAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303

Query: 456  WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277
            WGM+G++HM RN+E+++GVCGIN LAS+PIK             TKC+LFTYC + ETCC
Sbjct: 304  WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363

Query: 276  CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97
            C   L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 364  CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  574 bits (1479), Expect = e-161
 Identities = 271/420 (64%), Positives = 322/420 (76%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L+ F L+    + +S  ISELFD WC++HGKTY SEEE+Q R+++F+ N++ V +HN   
Sbjct: 11   LTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLIT 70

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997
            N++++LSLNAFADLT+ EFK   LGL  SA  +++       A  G  L     +P SVD
Sbjct: 71   NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123

Query: 996  WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817
            WRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC G
Sbjct: 124  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183

Query: 816  GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637
            GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++A
Sbjct: 184  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243

Query: 636  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457
            VA QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK 
Sbjct: 244  VAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303

Query: 456  WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277
            WGM+G++HM RN+E+++GVCGIN LAS+PIK             TKC+LFTYC + ETCC
Sbjct: 304  WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363

Query: 276  CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97
            C   L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 364  CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  568 bits (1465), Expect = e-159
 Identities = 273/423 (64%), Positives = 321/423 (75%), Gaps = 2/423 (0%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180
            L   FLL+   P+ +S  ISELFD WC+ HGKTY SEEE+Q R+++F+ N++ V +HN  
Sbjct: 11   LTFFFLLLVSSPS-SSDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLI 69

Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000
             N++++LSLNAFADLT+ EFK   LGL  SA  L++       A  G  L   + +P SV
Sbjct: 70   TNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSV 122

Query: 999  DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820
            DWRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC 
Sbjct: 123  DWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCN 182

Query: 819  GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640
            GGLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L +
Sbjct: 183  GGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALRE 242

Query: 639  AVATQPVSVGICGSDYKFQLYS--GGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSW 466
            AVA QPVSVGICGS+  FQLYS   GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSW
Sbjct: 243  AVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 302

Query: 465  GKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDE 286
            GK WGM+G++HM RN+ ++EG+CGIN LAS+PIK             TKC+LFTYC   E
Sbjct: 303  GKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGE 362

Query: 285  TCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPF 106
            TCCC  +L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF
Sbjct: 363  TCCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 422

Query: 105  EKK 97
             KK
Sbjct: 423  WKK 425


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  568 bits (1463), Expect = e-159
 Identities = 269/421 (63%), Positives = 324/421 (76%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180
            LL +  L     + +S  I+ELFD WC  HGKTY SEEE+QHR+++F  N++ V +HN  
Sbjct: 15   LLLVSSLSFSISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHI 74

Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000
            +NS+++LSLNAFADLT+ EFK   LGL   +  L+    ++E ++   + V    +P SV
Sbjct: 75   SNSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLM----AKEQSLGVSERVRVK-VPDSV 129

Query: 999  DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820
            DWRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC 
Sbjct: 130  DWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCN 189

Query: 819  GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640
            GGLMDYAFEF+IKN GIDTE+DYPY+ +DG C K+KLK+ VVTIDSYA V    EK L++
Sbjct: 190  GGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALME 249

Query: 639  AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460
            AVA+QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK
Sbjct: 250  AVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGK 309

Query: 459  YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280
             WGM+G++HM RN+ ++EGVCGIN LAS+PIK             TKC+LFTYC + ETC
Sbjct: 310  SWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETC 369

Query: 279  CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 100
            CC  +L G+CFSW+CCE ESAVCC D  HCCPRDYP CDT ++LCLK+ GN T  KPF K
Sbjct: 370  CCARTLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWK 429

Query: 99   K 97
            K
Sbjct: 430  K 430


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  565 bits (1457), Expect = e-158
 Identities = 266/422 (63%), Positives = 318/422 (75%), Gaps = 1/422 (0%)
 Frame = -1

Query: 1359 LLSLFLLISQF-PTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNT 1183
            + +L LLIS   P+ +SS IS+LF+ WC+EHGK+Y S+EE+ HRL+VFE NY+ V +HN+
Sbjct: 6    IFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNS 65

Query: 1182 KANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPAS 1003
            K NSS++L+LNAFADLT+ EFK   LGL  +     + L  R   I G       D+PAS
Sbjct: 66   KGNSSYSLALNAFADLTHHEFKTSRLGLSAAP----LNLAHRNLEITGV----VGDIPAS 117

Query: 1002 VDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGC 823
            +DWR KG VT VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CD +YN GC
Sbjct: 118  IDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGC 177

Query: 822  GGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLL 643
            GGGLMDYAF+F+I N GIDTEEDYPYR RDG C+K+++KR VVTID Y DVP   EK+LL
Sbjct: 178  GGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLL 237

Query: 642  QAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWG 463
            QAVA QPVSVGICGS+  FQ+YS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG
Sbjct: 238  QAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG 297

Query: 462  KYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDET 283
              WGM GY+HM RNS +++GVCGIN LAS+P+K             TKC+L TYC   ET
Sbjct: 298  TGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGET 357

Query: 282  CCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFE 103
            CCC     GIC SW+CC  +SAVCC D  HCCP DYP CDT +N+C KR GN+T  +  E
Sbjct: 358  CCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIE 417

Query: 102  KK 97
             K
Sbjct: 418  GK 419


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  565 bits (1455), Expect = e-158
 Identities = 262/422 (62%), Positives = 319/422 (75%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L L LLI Q P C  SSIS+LF+ WC+++GK Y+SE+E+ +R +VFE NY  + EHN+K 
Sbjct: 8    LVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKG 67

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997
            NSS+TL LNA++DLT+ EF++ +LGL  SA+D  IRL  R        ++ + D P+S+D
Sbjct: 68   NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDVDAPSSLD 126

Query: 996  WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817
            WR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG
Sbjct: 127  WRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGG 186

Query: 816  GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637
            GLMDYAFEF+IKN GIDTE+DYP+R ++G C+K KL+R VVTID Y D+P   E KLL+A
Sbjct: 187  GLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKA 246

Query: 636  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457
            VATQPVSVGICGS   FQ YS GIF+GPC T LDHAVLIVGY S++G DYWI+KNSWG  
Sbjct: 247  VATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTS 306

Query: 456  WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277
            WG+NGYIHM RNS + EG+CG+N LAS+P K             +KC  FT CG  ETCC
Sbjct: 307  WGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCC 366

Query: 276  CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97
            C    LGIC SW+CC  +SAVCC D  HCCP DYP CDT+RNLCLKR+ N+T+ +  +K+
Sbjct: 367  CGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQPQKE 426

Query: 96   GF 91
             F
Sbjct: 427  PF 428


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  563 bits (1451), Expect = e-158
 Identities = 265/423 (62%), Positives = 323/423 (76%), Gaps = 1/423 (0%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180
            LLS+ LL+S  P    S I+ELF+ WC++HGK Y+SE+EKQ RL++FE NY  V +HN  
Sbjct: 8    LLSI-LLLSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNM 66

Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000
             NSS TLSLNAFADLT+QEFK  +LG   ++ D   R N+   ++  P  +   D+PAS+
Sbjct: 67   GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA---SVQSPGTLR--DVPASI 121

Query: 999  DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820
            DWRKKGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YNSGCG
Sbjct: 122  DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181

Query: 819  GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640
            GGLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQ
Sbjct: 182  GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241

Query: 639  AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460
            AV  QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWI+KNSWG+
Sbjct: 242  AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301

Query: 459  YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280
             WGMNGY+HM RN+ ++ G+CGIN LAS+P K             T+C L TYC   ETC
Sbjct: 302  SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETC 361

Query: 279  CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFE 103
            CC  S+LGIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL R  GN T ++  E
Sbjct: 362  CCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE 421

Query: 102  KKG 94
             +G
Sbjct: 422  MRG 424


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  562 bits (1449), Expect = e-157
 Identities = 264/423 (62%), Positives = 324/423 (76%), Gaps = 1/423 (0%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180
            LLS+ LL+S  P    S I+ELF+ WC++HGK Y+SE+EKQ RL++FE NY  V +HN  
Sbjct: 8    LLSI-LLLSSLPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM 66

Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000
             NSS TLSLNAFADLT+QEFK  +LG   ++ D   R N+   ++  P  +   D+PAS+
Sbjct: 67   GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA---SVQSPGNLR--DVPASI 121

Query: 999  DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820
            DWRKKGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YNSGCG
Sbjct: 122  DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181

Query: 819  GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640
            GGLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQ
Sbjct: 182  GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241

Query: 639  AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460
            AV  QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLI+GYDS++GVDYWI+KNSWG+
Sbjct: 242  AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGR 301

Query: 459  YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280
             WGMNGY+HM RN+ ++ G+CGIN LAS+P K             T+C L TYC   ETC
Sbjct: 302  SWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETC 361

Query: 279  CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFE 103
            CC  S+LGIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL R+ GN T ++  E
Sbjct: 362  CCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE 421

Query: 102  KKG 94
             +G
Sbjct: 422  MRG 424


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  558 bits (1439), Expect = e-156
 Identities = 266/417 (63%), Positives = 313/417 (75%)
 Frame = -1

Query: 1347 FLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKANSS 1168
            FLL       + S IS LF+ WC++HGK Y+SEEEK +RL+VFE NY  V +HN   NSS
Sbjct: 12   FLLFFDPSFASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSS 71

Query: 1167 HTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRK 988
            ++L+LNAFADLT+ EFK   LGL  +A      +      +  P LV   D+PAS+DWR 
Sbjct: 72   YSLALNAFADLTHHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRT 123

Query: 987  KGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGGGLM 808
            KGAVT VKDQGSCGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD +YNSGC GGLM
Sbjct: 124  KGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLM 183

Query: 807  DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 628
            DYA++F+I N GID EEDYPY GR+  C+KEK KR VVTID YA VP   E  LLQAVA 
Sbjct: 184  DYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAK 243

Query: 627  QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 448
            QPVSVGICGS+  FQLYS GIF+GPCS++LDHAVLIVGY S++GVDYWIVKNSWG  WGM
Sbjct: 244  QPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGM 303

Query: 447  NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 268
            NGYIHMLRNS D++G+CGIN LAS+P K             TKCDLFTYC   ETCCC  
Sbjct: 304  NGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTH 363

Query: 267  SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97
             + GICFSW+CCE +SAVCC D+ HCCP DYP CDT ++ CLKR+GN+T  + FEK+
Sbjct: 364  RIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  558 bits (1438), Expect = e-156
 Identities = 270/444 (60%), Positives = 317/444 (71%), Gaps = 28/444 (6%)
 Frame = -1

Query: 1344 LLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKANSSH 1165
            LL+S   + +S  ISELFD WC+ HGKTYASE EKQHR ++F  N++ V +HN   N+++
Sbjct: 17   LLVSSSSSSSSDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLITNATY 76

Query: 1164 TLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKK 985
            +LSLNAFADL + EFK   LGL  SA  +++       A  G  L     +P S+DWRKK
Sbjct: 77   SLSLNAFADLNHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLDWRKK 129

Query: 984  GAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGGGLMD 805
            GAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN GC GGLMD
Sbjct: 130  GAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMD 189

Query: 804  YAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQ 625
            YAFEF+IKNKGIDTE+DYPY+ RDG C K+KLK+ VV+IDSYA V P  EK LL+AVA Q
Sbjct: 190  YAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAAQ 249

Query: 624  PVSVGICGSDYKFQLYSG----------------------------GIFSGPCSTALDHA 529
            PVSVGICGS+  FQLYS                             GIFSGPCST+LDHA
Sbjct: 250  PVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDHA 309

Query: 528  VLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXX 349
            VLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+ +++G+CGIN LAS+PIK     
Sbjct: 310  VLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPNP 369

Query: 348  XXXXXXXXTKCDLFTYCGTDETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPT 169
                    TKC+LFTYC   ETCCC  +L G+C SW+CCE ESAVCC D  HCCP DYP 
Sbjct: 370  PPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYPV 429

Query: 168  CDTARNLCLKRIGNSTLSKPFEKK 97
            CDT R+LCLK+ GN T  KPF KK
Sbjct: 430  CDTTRSLCLKKTGNFTAIKPFWKK 453


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  558 bits (1437), Expect = e-156
 Identities = 267/423 (63%), Positives = 308/423 (72%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L L L +S      +   S+LF  WC++HGKTY SE+EK++R  VFE NY  V +HN   
Sbjct: 9    LQLLLSLSLLSFVTAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIG 68

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997
            NSS+TLSLNAFADLT+ EFK   LGL PS+   L+R     F  D     +   +P+ +D
Sbjct: 69   NSSYTLSLNAFADLTHHEFKATRLGLPPSS---LLRFKFNRFQ-DQQRSDDFLQVPSEID 124

Query: 996  WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817
            WRK GAV+ VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDTTYNSGC G
Sbjct: 125  WRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDG 184

Query: 816  GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637
            GLMDYA++FII N GIDTEEDYPY+ R   C K+KLKR VVTID Y DVPP  EKKLL+A
Sbjct: 185  GLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKA 244

Query: 636  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457
            VA QPVSVGICGS   FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWGKY
Sbjct: 245  VAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKY 304

Query: 456  WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277
            WGMNGYIHMLRN++ + G+CGIN LAS+P K              KC+LFTYC   ETCC
Sbjct: 305  WGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCC 364

Query: 276  CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97
            C    LGICFSW+CC   SAVCC D  HCCP DYP CD +   CLKRI N T+    +K+
Sbjct: 365  CAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKE 424

Query: 96   GFF 88
              F
Sbjct: 425  DPF 427


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  557 bits (1436), Expect = e-156
 Identities = 264/418 (63%), Positives = 318/418 (76%), Gaps = 1/418 (0%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180
            L+ LFLL  Q     SSSIS+LFD WC+EHGKTY SEEE++HRL VF  NY+ +  HN +
Sbjct: 10   LIQLFLL--QVHPIVSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNAR 67

Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000
            AN S+TLSLNAFADLT  EF  +YLG  PS  DLLIR N    +    +    S +P+S+
Sbjct: 68   ANYSYTLSLNAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSI 124

Query: 999  DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820
            DWRKKGAVT +KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD +YN GC 
Sbjct: 125  DWRKKGAVTGIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCN 184

Query: 819  GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640
            GGLMDYA+EFI+KNKGIDTEEDY Y+GRD  CS+ KL + VVTIDSY D+P + E+ LL+
Sbjct: 185  GGLMDYAYEFILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLE 244

Query: 639  AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460
            AVA+QPVSVGI G D  FQ YS GIF+GPCST+LDHAVLIVGYDS++G DYWIVKNSWGK
Sbjct: 245  AVASQPVSVGISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGK 304

Query: 459  YWGMNGYIHMLRNSEDAEGVCGINTLASFPIK-XXXXXXXXXXXXXTKCDLFTYCGTDET 283
             WGM+GY+++ RN+ +  G+C IN +AS+P+K              TKC LF+YC   ET
Sbjct: 305  SWGMDGYMYVQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGET 364

Query: 282  CCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKP 109
            CCC    LG+C  ++CC AESAVCC+D+ HCCP+DYP CDTA+++C K  GNST++ P
Sbjct: 365  CCCARRFLGLCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIP 422


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  556 bits (1434), Expect = e-156
 Identities = 265/407 (65%), Positives = 319/407 (78%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTK 1180
            LL   L IS F +  SS IS+LF+ W +EHGKTY S+E+K +R ++FE NYE V +HN++
Sbjct: 12   LLFFNLSISSFSS--SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQ 69

Query: 1179 ANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASV 1000
             NSS+TLSLNAFADLT+ EFK   LGL  SA     +L+ R F +   D V   D+P S+
Sbjct: 70   GNSSYTLSLNAFADLTHHEFKASRLGL--SAFSTSGKLSRRNFPLH--DFV--GDVPISI 123

Query: 999  DWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCG 820
            DWRKKGAV+ VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN+GC 
Sbjct: 124  DWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCE 183

Query: 819  GGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 640
            GGLMDYA++F+I+N GIDTEEDYPY+ R+  C+KEKLKRHVVTID Y DVP   EK+LL+
Sbjct: 184  GGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLK 243

Query: 639  AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 460
            AVA QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG 
Sbjct: 244  AVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGT 303

Query: 459  YWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETC 280
            +WG+NGY++MLRNS +++G+CGIN LASFP+K             TKCDLFT CG  ETC
Sbjct: 304  HWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETC 363

Query: 279  CCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 139
            CC   + G+CFSW+CCE +SAVCC D  HCCP DYP CDT RN+CLK
Sbjct: 364  CCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  553 bits (1425), Expect = e-155
 Identities = 263/413 (63%), Positives = 313/413 (75%), Gaps = 7/413 (1%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L+ F L+    + +S  ISELFD WC++HGKTY SEEE+Q R+++F+ N++ V +HN   
Sbjct: 9    LTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLIT 68

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997
            N++++LSLNAFADLT+ EFK   LGL  SA  +++       A  G  L     +P SVD
Sbjct: 69   NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 121

Query: 996  WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817
            WRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN+GC G
Sbjct: 122  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 181

Query: 816  GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637
            GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++A
Sbjct: 182  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 241

Query: 636  VATQPVSVGICGSDYKFQLYSG-------GIFSGPCSTALDHAVLIVGYDSQDGVDYWIV 478
            VA QPVSVGICGS+  FQLYS        GIFSGPCST+LDHAVLIVGY SQ+GVDYWIV
Sbjct: 242  VAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIV 301

Query: 477  KNSWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYC 298
            KNSWGK WGM+G++HM RN+E+++GVCGIN LAS+PIK             TKC+LFTYC
Sbjct: 302  KNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYC 361

Query: 297  GTDETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 139
             + ETCCC   L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK
Sbjct: 362  SSGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  552 bits (1423), Expect = e-154
 Identities = 268/420 (63%), Positives = 312/420 (74%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            L+LFLL+ + P   +S++SELF+ WC EHGK+Y+S EEK +RL VF  NYE V  HN   
Sbjct: 9    LTLFLLLFR-PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLD 67

Query: 1176 NSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVD 997
            NSS+TLSLN++ADLT+ EFK   LG  P+        N R      P L    D+P S+D
Sbjct: 68   NSSYTLSLNSYADLTHHEFKVSRLGFSPALR------NFRPVLPQEPSLPR--DVPDSLD 119

Query: 996  WRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSGCGG 817
            WRKKGAVTAVKDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD +YNSGCGG
Sbjct: 120  WRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGG 179

Query: 816  GLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 637
            GLMDYA++F+I N GIDTE DYPY+ RDG C K+KL+R+VVTID YAD+P   E KLLQA
Sbjct: 180  GLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQA 239

Query: 636  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 457
            VA QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY S++GVDYWIVKNSWGK 
Sbjct: 240  VAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKS 299

Query: 456  WGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCC 277
            WGM+GY+HM RNS ++EGVCGIN LAS+P K             TKC + T C   ETCC
Sbjct: 300  WGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCC 359

Query: 276  CYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 97
            C    LG+C SW+CC   SAVCC D  HCCP DYP CDT RNLCLK+  N T ++  E +
Sbjct: 360  CAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENR 419


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  550 bits (1417), Expect = e-154
 Identities = 266/427 (62%), Positives = 319/427 (74%), Gaps = 5/427 (1%)
 Frame = -1

Query: 1356 LSLFLLISQFPTCNSSSISELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHNTKA 1177
            LSL LL + F   ++S  SELF+ WC+EH KTY+SEEEK +RL+VFE NY  V +HN  A
Sbjct: 13   LSLILLFTLF-FLSASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNA 71

Query: 1176 N-----SSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDL 1012
            N     SS+TLSLNAFADLT+ EFK   LGL      L +    R       DL+    +
Sbjct: 72   NNNNNNSSYTLSLNAFADLTHHEFKTTRLGL-----PLTLLRFKRPQNQQSRDLLH---I 123

Query: 1011 PASVDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYN 832
            P+ +DWR+ GAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YN
Sbjct: 124  PSQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYN 183

Query: 831  SGCGGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEK 652
            SGCGGGLMD+A++F+I NKGIDTE+DYPY+ R   CSK+KLKR  VTI+ Y DVPP  E+
Sbjct: 184  SGCGGGLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPP-SEE 242

Query: 651  KLLQAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKN 472
            ++L+AVA+QPVSVGICGS+ +FQLYS GIF+GPCST LDHAVLIVGY S++GVDYWIVKN
Sbjct: 243  EILKAVASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKN 302

Query: 471  SWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGT 292
            SWGKYWGMNGYIHM+RNS +++G+CGINTLAS+P+K              +C+LFT+C  
Sbjct: 303  SWGKYWGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSE 362

Query: 291  DETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSK 112
             ETCCC  S LGICFSW+CC   SAVCC D  HCCP+DYP CDT R  CLKR  N T + 
Sbjct: 363  GETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTI 422

Query: 111  PFEKKGF 91
              E + F
Sbjct: 423  TSENQDF 429


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  548 bits (1413), Expect = e-153
 Identities = 261/409 (63%), Positives = 309/409 (75%), Gaps = 2/409 (0%)
 Frame = -1

Query: 1359 LLSLFLLISQFPTCNSSSI--SELFDHWCEEHGKTYASEEEKQHRLRVFEHNYEIVVEHN 1186
            L  L LL+S   + +S S+  S+LF+ WCE+HG++Y+SEEE+ +RL VFE N   V +HN
Sbjct: 6    LFLLSLLLSSHLSLSSPSLNSSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHN 65

Query: 1185 TKANSSHTLSLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPA 1006
               NSS+TLSLNAFADLT+ EFK   LG   +    L +L S+        L++  D+PA
Sbjct: 66   NMGNSSYTLSLNAFADLTHHEFKSSRLGFSSALLSSLPKLGSK--------LLDLRDVPA 117

Query: 1005 SVDWRKKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNSG 826
            S+DWRKKGAVT VKDQGSCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YN+G
Sbjct: 118  SLDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAG 177

Query: 825  CGGGLMDYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKL 646
            C GGLMDYA++F+I N GIDTEEDYPY+ RD  C KEKLKR VVTID Y DV P    +L
Sbjct: 178  CDGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQL 237

Query: 645  LQAVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSW 466
            LQAV TQPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWIVKNSW
Sbjct: 238  LQAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSW 297

Query: 465  GKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDE 286
            GK WGM+GYIHM RN+ +++GVCGIN LAS+P K             T+C  F  CG  E
Sbjct: 298  GKQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGE 357

Query: 285  TCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 139
            TCCC W  LG+CFSW+CC   SAVCC D  HCCP+DYP CDT RN+CLK
Sbjct: 358  TCCCSWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406


Top