BLASTX nr result

ID: Mentha28_contig00008692 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00008692
         (1842 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus...   646   0.0  
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          572   e-160
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   570   e-160
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   569   e-159
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   569   e-159
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   566   e-158
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   564   e-158
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   558   e-156
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   558   e-156
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   556   e-155
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   555   e-155
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   554   e-155
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   553   e-155
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   553   e-154
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   552   e-154
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   552   e-154
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   549   e-153
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   547   e-153
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  545   e-152
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   542   e-151

>gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus guttatus]
          Length = 433

 Score =  646 bits (1666), Expect = 0.0
 Identities = 299/418 (71%), Positives = 340/418 (81%), Gaps = 1/418 (0%)
 Frame = +1

Query: 352  QFPTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSL 531
            Q P   SS IS+LFD WCEE+GKTYASEQEKQHRL VF  NY+ V +HN  ANSS+TLS+
Sbjct: 16   QLPISKSSLISDLFDSWCEEYGKTYASEQEKQHRLNVFHENYKYVNQHNADANSSYTLSV 75

Query: 532  NAFADLTNQEFKDKYLGLLPSADDLLIRLNSREF-AIDGPDLVEESDLPASVDWRKKGAV 708
            NAFADLTN EF+  YLGL PS  D +IRLNSR   AIDG +L++ES++P+S+DWR KGAV
Sbjct: 76   NAFADLTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSLDWRNKGAV 135

Query: 709  TAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAF 888
            TAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCD +YN GC GGLMDYA+
Sbjct: 136  TAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCNGGLMDYAY 195

Query: 889  EFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVS 1068
            +FIIKNKGIDTEEDY Y+GR   C K K+ +HVVTIDSY D+P + EKKLLQAVATQP+S
Sbjct: 196  DFIIKNKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQAVATQPIS 255

Query: 1069 VGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYI 1248
            VGICGSD  FQLYSGGIF+GPCST+LDHAVLIVGYDS+DG DYWI+KNSWGK WG+ GY+
Sbjct: 256  VGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGKSWGIKGYM 315

Query: 1249 HMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLG 1428
            HM+RNS   EGVCGINTLAS+P+K              KC++FTYC + ETCCC    LG
Sbjct: 316  HMVRNSGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETCCCARYFLG 375

Query: 1429 ICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKKGFFTS 1602
            +C SW CCEAESAVCCDDH HCCP DYP CDT +NLCLK+ GN+T+SKP  KK F  S
Sbjct: 376  VCLSWNCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLGKKSFSAS 433


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  572 bits (1475), Expect = e-160
 Identities = 267/406 (65%), Positives = 317/406 (78%), Gaps = 1/406 (0%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            SS I+ LF+ WC++HGKTYAS++EK  RL+VF+ NY+ V EHN++ NSS+TLSLNAFADL
Sbjct: 23   SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82

Query: 550  TNQEFKDKYLGLLPSAD-DLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQ 726
            T+ EFK   LGL  +A   L +  ++R+     PD V  +D+PASVDWRK GAVT VKDQ
Sbjct: 83   THHEFKASRLGLSSAASASLNVDRSNRQI----PDFV--ADVPASVDWRKNGAVTQVKDQ 136

Query: 727  GSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKN 906
            G+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC GG+MDYAF+F+I N
Sbjct: 137  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196

Query: 907  KGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGS 1086
             GIDTEEDYPY+GRD  C+KEKLKRHVVTID Y DVP   EK+LL+AVA QPVSVGICGS
Sbjct: 197  HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256

Query: 1087 DYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNS 1266
            +  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG YWGM+GY+HM RNS
Sbjct: 257  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316

Query: 1267 EDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSWR 1446
              + G+CGIN LAS+P K              +CDLFT+CG  ETCCC   + GIC SW+
Sbjct: 317  GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376

Query: 1447 CCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEK 1584
            CCE +SAVCC D  HCCPRDYP CDT RN+CLK  GN+T  + F K
Sbjct: 377  CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAK 422


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  570 bits (1469), Expect = e-160
 Identities = 267/406 (65%), Positives = 314/406 (77%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            S  ISELFD WC++HGKTY SE+E+Q R+++F+ N++ V +HN   N++++LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
            T+ EFK   LGL  SA  +++       A  G  L     +P SVDWRKKGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC GGLMDYAFEF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++AVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 1090 YKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSE 1269
              FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+E
Sbjct: 258  RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 1270 DAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSWRC 1449
            +++GVCGIN LAS+PIK              KC+LFTYC + ETCCC   L G+CFSW+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 1450 CEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKK 1587
            CE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  569 bits (1467), Expect = e-159
 Identities = 263/414 (63%), Positives = 320/414 (77%)
 Frame = +1

Query: 352  QFPTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSL 531
            Q P C  SSIS+LF+ WC+++GK Y+SEQE+ +R +VFE NY  + EHN+K NSS+TL L
Sbjct: 16   QQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGL 75

Query: 532  NAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVT 711
            NA++DLT+ EF++ +LGL  SA+D  IRL  R        ++ + D P+S+DWR+KGAVT
Sbjct: 76   NAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVT 134

Query: 712  AVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFE 891
             VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGGGLMDYAFE
Sbjct: 135  DVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFE 194

Query: 892  FIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSV 1071
            F+IKN GIDTE+DYP+R R+G C+K KL+RHVVTID Y D+P   E KLL+AVATQPVSV
Sbjct: 195  FVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSV 254

Query: 1072 GICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIH 1251
            GICGS   FQ YS GIF+GPCSTALDHAVLIVGY S++GVDYWI+KNSWG  WG+NGYIH
Sbjct: 255  GICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIH 314

Query: 1252 MLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGI 1431
            M RNS + EG+CGIN LAS+P K              KC +FT CG  ETCCC    LGI
Sbjct: 315  MQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGI 374

Query: 1432 CFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKKGF 1593
            C SW+CC  +SAVCC D  HCCP+DYP CDT+RNLCLK++ N+T+ +  +K+ F
Sbjct: 375  CLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKEAF 428


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  569 bits (1467), Expect = e-159
 Identities = 267/406 (65%), Positives = 314/406 (77%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            S  ISELFD WC++HGKTY SE+E+Q R+++F+ N++ V +HN   N++++LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
            T+ EFK   LGL  SA  +++       A  G  L     +P SVDWRKKGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC GGLMDYAFEF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++AVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 1090 YKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSE 1269
              FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+E
Sbjct: 258  RAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 1270 DAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSWRC 1449
            +++GVCGIN LAS+PIK              KC+LFTYC + ETCCC   L G+CFSW+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 1450 CEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKK 1587
            CE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  566 bits (1459), Expect = e-158
 Identities = 265/406 (65%), Positives = 317/406 (78%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            S  I+ELFD WC  HGKTY SE+E+QHR+++F  N++ V +HN  +NS+++LSLNAFADL
Sbjct: 30   SDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADL 89

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
            T+ EFK   LGL   +  L+    ++E ++   + V    +P SVDWRKKGAVT VKDQG
Sbjct: 90   THHEFKASRLGLSAPSPSLM----AKEQSLGVSERVRVK-VPDSVDWRKKGAVTNVKDQG 144

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC GGLMDYAFEF+IKN 
Sbjct: 145  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 204

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTE+DYPY+ +DG C K+KLK+ VVTIDSYA V    EK L++AVA+QPVSVGICGS+
Sbjct: 205  GIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSE 264

Query: 1090 YKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSE 1269
              FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+ 
Sbjct: 265  RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTG 324

Query: 1270 DAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSWRC 1449
            ++EGVCGIN LAS+PIK              KC+LFTYC + ETCCC  +L G+CFSW+C
Sbjct: 325  NSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKC 384

Query: 1450 CEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKK 1587
            CE ESAVCC D  HCCPRDYP CDT ++LCLK+ GN T  KPF KK
Sbjct: 385  CELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKK 430


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  564 bits (1454), Expect = e-158
 Identities = 267/408 (65%), Positives = 312/408 (76%), Gaps = 2/408 (0%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            S  ISELFD WC+ HGKTY SE+E+Q R+++F+ N++ V +HN   N++++LSLNAFADL
Sbjct: 25   SDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
            T+ EFK   LGL  SA  L++       A  G  L   + +P SVDWRKKGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQG 137

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC GGLMDYAFEF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L +AVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSE 257

Query: 1090 YKFQLYS--GGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRN 1263
              FQLYS   GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN
Sbjct: 258  RAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRN 317

Query: 1264 SEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSW 1443
            + ++EG+CGIN LAS+PIK              KC+LFTYC   ETCCC  +L G+CFSW
Sbjct: 318  TGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSW 377

Query: 1444 RCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKK 1587
            +CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 378  KCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 425


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  558 bits (1438), Expect = e-156
 Identities = 258/410 (62%), Positives = 309/410 (75%)
 Frame = +1

Query: 358  PTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNA 537
            P+  SS IS+LF+ WC+EHGK+Y S++E+ HRL+VFE NY+ V +HN+K NSS++L+LNA
Sbjct: 18   PSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNA 77

Query: 538  FADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAV 717
            FADLT+ EFK   LGL  +     + L  R   I G       D+PAS+DWR KG VT V
Sbjct: 78   FADLTHHEFKTSRLGLSAAP----LNLAHRNLEITGV----VGDIPASIDWRNKGVVTNV 129

Query: 718  KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFI 897
            KDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CD +YN GCGGGLMDYAF+F+
Sbjct: 130  KDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFV 189

Query: 898  IKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGI 1077
            I N GIDTEEDYPYR RDG C+K+++KR VVTID Y DVP   EK+LLQAVA QPVSVGI
Sbjct: 190  INNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGI 249

Query: 1078 CGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHML 1257
            CGS+  FQ+YS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG  WGM GY+HM 
Sbjct: 250  CGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQ 309

Query: 1258 RNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICF 1437
            RNS +++GVCGIN LAS+P+K              KC+L TYC   ETCCC     GIC 
Sbjct: 310  RNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICI 369

Query: 1438 SWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKK 1587
            SW+CC  +SAVCC D  HCCP DYP CDT +N+C K+ GN+T  +  E K
Sbjct: 370  SWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK 419


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  558 bits (1437), Expect = e-156
 Identities = 257/414 (62%), Positives = 313/414 (75%)
 Frame = +1

Query: 352  QFPTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSL 531
            Q P C  SSIS+LF+ WC+++GK Y+SEQE+ +R +VFE NY  + EHN+K NSS+TL L
Sbjct: 16   QQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGL 75

Query: 532  NAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVT 711
            NA++DLT+ EF++ +LGL  SA+D  IRL  R        ++ + D P+S+DWR KGAVT
Sbjct: 76   NAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVT 134

Query: 712  AVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFE 891
             VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGGGLMDYAFE
Sbjct: 135  NVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFE 194

Query: 892  FIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSV 1071
            F+IKN GIDTE+DYP+R ++G C+K KL+R VVTID Y D+P   E KLL+AVATQPVSV
Sbjct: 195  FVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSV 254

Query: 1072 GICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIH 1251
            GICGS   FQ YS GIF+GPC T LDHAVLIVGY S++G DYWI+KNSWG  WG+NGYIH
Sbjct: 255  GICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIH 314

Query: 1252 MLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGI 1431
            M RNS + EG+CG+N LAS+P K              KC  FT CG  ETCCC    LGI
Sbjct: 315  MQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGI 374

Query: 1432 CFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKKGF 1593
            C SW+CC  +SAVCC D  HCCP DYP CDT+RNLCLK++ N+T+ +  +K+ F
Sbjct: 375  CLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQPQKEPF 428


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  556 bits (1433), Expect = e-155
 Identities = 257/416 (61%), Positives = 315/416 (75%), Gaps = 1/416 (0%)
 Frame = +1

Query: 346  IPQFPTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTL 525
            +   P    S I+ELF+ WC++HGK Y+SEQEKQ RL++FE NY  V +HN   NSS TL
Sbjct: 14   LSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTL 73

Query: 526  SLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGA 705
            SLNAFADLT+QEFK  +LG   ++ D   R N+   ++  P  +   D+PAS+DWRKKGA
Sbjct: 74   SLNAFADLTHQEFKASFLGFSAASIDHDRRRNA---SVQSPGTLR--DVPASIDWRKKGA 128

Query: 706  VTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYA 885
            VT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YN+GCGGGLMDYA
Sbjct: 129  VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188

Query: 886  FEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPV 1065
            ++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQAV  QPV
Sbjct: 189  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248

Query: 1066 SVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGY 1245
            SVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWI+KNSWG+ WGMNGY
Sbjct: 249  SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308

Query: 1246 IHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLL 1425
            +HM RN+ ++ G+CGIN LAS+P K              +C L TYC   ETCCC  S+L
Sbjct: 309  MHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSIL 368

Query: 1426 GICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQI-GNSTLSKPFEKKG 1590
            GIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL +  GN T ++  E +G
Sbjct: 369  GICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEMRG 424


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  555 bits (1431), Expect = e-155
 Identities = 256/416 (61%), Positives = 316/416 (75%), Gaps = 1/416 (0%)
 Frame = +1

Query: 346  IPQFPTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTL 525
            +   P    S I+ELF+ WC++HGK Y+SEQEKQ RL++FE NY  V +HN   NSS TL
Sbjct: 14   LSSLPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTL 73

Query: 526  SLNAFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGA 705
            SLNAFADLT+QEFK  +LG   ++ D   R N+   ++  P  +   D+PAS+DWRKKGA
Sbjct: 74   SLNAFADLTHQEFKASFLGFSAASIDHDRRRNA---SVQSPGNLR--DVPASIDWRKKGA 128

Query: 706  VTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYA 885
            VT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YN+GCGGGLMDYA
Sbjct: 129  VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188

Query: 886  FEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPV 1065
            ++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQAV  QPV
Sbjct: 189  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248

Query: 1066 SVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGY 1245
            SVGICGS+  FQLYS GIF+GPCST+LDHAVLI+GYDS++GVDYWI+KNSWG+ WGMNGY
Sbjct: 249  SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGY 308

Query: 1246 IHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLL 1425
            +HM RN+ ++ G+CGIN LAS+P K              +C L TYC   ETCCC  S+L
Sbjct: 309  MHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSIL 368

Query: 1426 GICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQI-GNSTLSKPFEKKG 1590
            GIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL ++ GN T ++  E +G
Sbjct: 369  GICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRG 424


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  554 bits (1428), Expect = e-155
 Identities = 256/404 (63%), Positives = 311/404 (76%), Gaps = 1/404 (0%)
 Frame = +1

Query: 367  ISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFAD 546
            +SSSIS+LFD WC+EHGKTY SE+E++HRL VF  NY+ +  HN +AN S+TLSLNAFAD
Sbjct: 22   VSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSLNAFAD 81

Query: 547  LTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQ 726
            LT  EF  +YLG  PS  DLLIR N    +    +    S +P+S+DWRKKGAVT +KDQ
Sbjct: 82   LTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVTGIKDQ 138

Query: 727  GSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKN 906
            GSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD +YN GC GGLMDYA+EFI+KN
Sbjct: 139  GSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYEFILKN 198

Query: 907  KGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGS 1086
            KGIDTEEDY Y+GRD  CS+ KL + VVTIDSY D+P + E+ LL+AVA+QPVSVGI G 
Sbjct: 199  KGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSVGISGG 258

Query: 1087 DYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNS 1266
            D  FQ YS GIF+GPCST+LDHAVLIVGYDS++G DYWIVKNSWGK WGM+GY+++ RN+
Sbjct: 259  DAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMYVQRNT 318

Query: 1267 EDAEGVCGINTLASFPIK-XXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSW 1443
             +  G+C IN +AS+P+K               KC LF+YC   ETCCC    LG+C  +
Sbjct: 319  GNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLGLCMRY 378

Query: 1444 RCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKP 1575
            +CC AESAVCC+D+ HCCP+DYP CDTA+++C K  GNST++ P
Sbjct: 379  KCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIP 422


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  553 bits (1426), Expect = e-155
 Identities = 265/414 (64%), Positives = 306/414 (73%)
 Frame = +1

Query: 355  FPTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLN 534
            F T I +S  +LF  WC++HGKTY SEQEK++R  VFE NY  V +HN   NSS+TLSLN
Sbjct: 20   FVTAIDTS--KLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLN 77

Query: 535  AFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTA 714
            AFADLT+ EFK   LGL PS+   L+R     F  D     +   +P+ +DWRK GAV+ 
Sbjct: 78   AFADLTHHEFKATRLGLPPSS---LLRFKFNRFQ-DQQRSDDFLQVPSEIDWRKNGAVSI 133

Query: 715  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEF 894
            VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDTTYN+GC GGLMDYA++F
Sbjct: 134  VKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQF 193

Query: 895  IIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVG 1074
            II N GIDTEEDYPY+ R   C K+KLKR VVTID Y DVPP  EKKLL+AVA QPVSVG
Sbjct: 194  IIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVG 253

Query: 1075 ICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHM 1254
            ICGS   FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWGKYWGMNGYIHM
Sbjct: 254  ICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHM 313

Query: 1255 LRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGIC 1434
            LRN++ + G+CGIN LAS+P K              KC+LFTYC   ETCCC    LGIC
Sbjct: 314  LRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGIC 373

Query: 1435 FSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKKGFF 1596
            FSW+CC   SAVCC D  HCCP DYP CD +   CLK+I N T+    +K+  F
Sbjct: 374  FSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKEDPF 427


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  553 bits (1424), Expect = e-154
 Identities = 259/405 (63%), Positives = 308/405 (76%)
 Frame = +1

Query: 373  SSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADLT 552
            S IS LF+ WC++HGK Y+SE+EK +RL+VFE NY  V +HN   NSS++L+LNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 553  NQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQGS 732
            + EFK   LGL  +A      +      +  P LV   D+PAS+DWR KGAVT VKDQGS
Sbjct: 84   HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135

Query: 733  CGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNKG 912
            CGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD +YN+GC GGLMDYA++F+I N G
Sbjct: 136  CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195

Query: 913  IDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSDY 1092
            ID EEDYPY GR+  C+KEK KR VVTID YA VP   E  LLQAVA QPVSVGICGS+ 
Sbjct: 196  IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255

Query: 1093 KFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSED 1272
             FQLYS GIF+GPCS++LDHAVLIVGY S++GVDYWIVKNSWG  WGMNGYIHMLRNS D
Sbjct: 256  AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315

Query: 1273 AEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSWRCC 1452
            ++G+CGIN LAS+P K              KCDLFTYC   ETCCC   + GICFSW+CC
Sbjct: 316  SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375

Query: 1453 EAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKK 1587
            E +SAVCC D+ HCCP DYP CDT ++ CLK++GN+T  + FEK+
Sbjct: 376  ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  552 bits (1422), Expect = e-154
 Identities = 266/434 (61%), Positives = 310/434 (71%), Gaps = 28/434 (6%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            S  ISELFD WC+ HGKTYASE EKQHR ++F  N++ V +HN   N++++LSLNAFADL
Sbjct: 27   SDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLITNATYSLSLNAFADL 86

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
             + EFK   LGL  SA  +++       A  G  L     +P S+DWRKKGAVT VKDQG
Sbjct: 87   NHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLDWRKKGAVTNVKDQG 139

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN GC GGLMDYAFEF+IKNK
Sbjct: 140  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMDYAFEFVIKNK 199

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTE+DYPY+ RDG C K+KLK+ VV+IDSYA V P  EK LL+AVA QPVSVGICGS+
Sbjct: 200  GIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAAQPVSVGICGSE 259

Query: 1090 YKFQLYSG----------------------------GIFSGPCSTALDHAVLIVGYDSQD 1185
              FQLYS                             GIFSGPCST+LDHAVLIVGY SQ+
Sbjct: 260  RAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDHAVLIVGYGSQN 319

Query: 1186 GVDYWIVKNSWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXK 1365
            GVDYWIVKNSWGK WGM+G++HM RN+ +++G+CGIN LAS+PIK              K
Sbjct: 320  GVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPNPPPPSPPGPTK 379

Query: 1366 CDLFTYCGTDETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1545
            C+LFTYC   ETCCC  +L G+C SW+CCE ESAVCC D  HCCP DYP CDT R+LCLK
Sbjct: 380  CNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 439

Query: 1546 QIGNSTLSKPFEKK 1587
            + GN T  KPF KK
Sbjct: 440  KTGNFTAIKPFWKK 453


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  552 bits (1422), Expect = e-154
 Identities = 257/392 (65%), Positives = 310/392 (79%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            SS IS+LF+ W +EHGKTY S+++K +R ++FE NYE V +HN++ NSS+TLSLNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
            T+ EFK   LGL  SA     +L+ R F +   D V   D+P S+DWRKKGAV+ VKDQG
Sbjct: 85   THHEFKASRLGL--SAFSTSGKLSRRNFPLH--DFV--GDVPISIDWRKKGAVSQVKDQG 138

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC GGLMDYA++F+I+N 
Sbjct: 139  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENN 198

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTEEDYPY+ R+  C+KEKLKRHVVTID Y DVP   EK+LL+AVA QPVSVGICGS+
Sbjct: 199  GIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSE 258

Query: 1090 YKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSE 1269
              FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG +WG+NGY++MLRNS 
Sbjct: 259  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSG 318

Query: 1270 DAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSWRC 1449
            +++G+CGIN LASFP+K              KCDLFT CG  ETCCC   + G+CFSW+C
Sbjct: 319  NSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKC 378

Query: 1450 CEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1545
            CE +SAVCC D  HCCP DYP CDT RN+CLK
Sbjct: 379  CELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  549 bits (1414), Expect = e-153
 Identities = 259/399 (64%), Positives = 305/399 (76%), Gaps = 7/399 (1%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            S  ISELFD WC++HGKTY SE+E+Q R+++F+ N++ V +HN   N++++LSLNAFADL
Sbjct: 23   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 82

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
            T+ EFK   LGL  SA  +++       A  G  L     +P SVDWRKKGAVT VKDQG
Sbjct: 83   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 135

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC GGLMDYAFEF+IKN 
Sbjct: 136  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 195

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++AVA QPVSVGICGS+
Sbjct: 196  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 255

Query: 1090 YKFQLYSG-------GIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYI 1248
              FQLYS        GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM+G++
Sbjct: 256  RAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFM 315

Query: 1249 HMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLG 1428
            HM RN+E+++GVCGIN LAS+PIK              KC+LFTYC + ETCCC   L G
Sbjct: 316  HMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFG 375

Query: 1429 ICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1545
            +CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK
Sbjct: 376  LCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  547 bits (1410), Expect = e-153
 Identities = 261/410 (63%), Positives = 303/410 (73%)
 Frame = +1

Query: 358  PTCISSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNA 537
            P   +S++SELF+ WC EHGK+Y+S +EK +RL VF  NYE V  HN   NSS+TLSLN+
Sbjct: 18   PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNS 77

Query: 538  FADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAV 717
            +ADLT+ EFK   LG  P+        N R      P L    D+P S+DWRKKGAVTAV
Sbjct: 78   YADLTHHEFKVSRLGFSPALR------NFRPVLPQEPSLPR--DVPDSLDWRKKGAVTAV 129

Query: 718  KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFI 897
            KDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD +YN+GCGGGLMDYA++F+
Sbjct: 130  KDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFV 189

Query: 898  IKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGI 1077
            I N GIDTE DYPY+ RDG C K+KL+R+VVTID YAD+P   E KLLQAVA QPVSVGI
Sbjct: 190  ISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGI 249

Query: 1078 CGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHML 1257
            CGS+  FQLYS GIFSGPCST+LDHAVLIVGY S++GVDYWIVKNSWGK WGM+GY+HM 
Sbjct: 250  CGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQ 309

Query: 1258 RNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICF 1437
            RNS ++EGVCGIN LAS+P K              KC + T C   ETCCC    LG+C 
Sbjct: 310  RNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCL 369

Query: 1438 SWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKK 1587
            SW+CC   SAVCC D  HCCP DYP CDT RNLCLKQ  N T ++  E +
Sbjct: 370  SWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENR 419


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  545 bits (1405), Expect = e-152
 Identities = 254/393 (64%), Positives = 300/393 (76%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKANSSHTLSLNAFADL 549
            S + S+LF+ WCE+HG++Y+SE+E+ +RL VFE N   V +HN   NSS+TLSLNAFADL
Sbjct: 23   SLNSSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADL 82

Query: 550  TNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTAVKDQG 729
            T+ EFK   LG   +    L +L S+        L++  D+PAS+DWRKKGAVT VKDQG
Sbjct: 83   THHEFKSSRLGFSSALLSSLPKLGSK--------LLDLRDVPASLDWRKKGAVTNVKDQG 134

Query: 730  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEFIIKNK 909
            SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YNAGC GGLMDYA++F+I N 
Sbjct: 135  SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNH 194

Query: 910  GIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVGICGSD 1089
            GIDTEEDYPY+ RD  C KEKLKR VVTID Y DV P    +LLQAV TQPVSVGICGS+
Sbjct: 195  GIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSE 254

Query: 1090 YKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSE 1269
              FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWIVKNSWGK WGM+GYIHM RN+ 
Sbjct: 255  RAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTG 314

Query: 1270 DAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGICFSWRC 1449
            +++GVCGIN LAS+P K              +C  F  CG  ETCCC W  LG+CFSW+C
Sbjct: 315  NSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKC 374

Query: 1450 CEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQ 1548
            C   SAVCC D  HCCP+DYP CDT RN+CLK+
Sbjct: 375  CGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLKE 407


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  542 bits (1396), Expect = e-151
 Identities = 257/413 (62%), Positives = 311/413 (75%), Gaps = 5/413 (1%)
 Frame = +1

Query: 370  SSSISELFDHWCEEHGKTYASEQEKQHRLRVFEHNYEIVVEHNTKAN-----SSHTLSLN 534
            +S  SELF+ WC+EH KTY+SE+EK +RL+VFE NY  V +HN  AN     SS+TLSLN
Sbjct: 26   ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85

Query: 535  AFADLTNQEFKDKYLGLLPSADDLLIRLNSREFAIDGPDLVEESDLPASVDWRKKGAVTA 714
            AFADLT+ EFK   LGL      L +    R       DL+    +P+ +DWR+ GAVT 
Sbjct: 86   AFADLTHHEFKTTRLGL-----PLTLLRFKRPQNQQSRDLLH---IPSQIDWRQSGAVTP 137

Query: 715  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGGGLMDYAFEF 894
            VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YN+GCGGGLMD+A++F
Sbjct: 138  VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQF 197

Query: 895  IIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVATQPVSVG 1074
            +I NKGIDTE+DYPY+ R   CSK+KLKR  VTI+ Y DVPP  E+++L+AVA+QPVSVG
Sbjct: 198  VIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPP-SEEEILKAVASQPVSVG 256

Query: 1075 ICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHM 1254
            ICGS+ +FQLYS GIF+GPCST LDHAVLIVGY S++GVDYWIVKNSWGKYWGMNGYIHM
Sbjct: 257  ICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHM 316

Query: 1255 LRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCCCYWSLLGIC 1434
            +RNS +++G+CGINTLAS+P+K              +C+LFT+C   ETCCC  S LGIC
Sbjct: 317  IRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGIC 376

Query: 1435 FSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKQIGNSTLSKPFEKKGF 1593
            FSW+CC   SAVCC D  HCCP+DYP CDT R  CLK+  N T +   E + F
Sbjct: 377  FSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDF 429

