BLASTX nr result

ID: Mentha29_contig00008506 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00008506
         (1333 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus...   580   e-163
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          513   e-143
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   512   e-142
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   511   e-142
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   511   e-142
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   511   e-142
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   510   e-142
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   505   e-140
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   499   e-138
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   499   e-138
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   498   e-138
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  498   e-138
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   497   e-138
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   496   e-138
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   496   e-137
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   493   e-136
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   491   e-136
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   491   e-136
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   489   e-135
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   486   e-135

>gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus guttatus]
          Length = 433

 Score =  580 bits (1494), Expect = e-163
 Identities = 265/366 (72%), Positives = 304/366 (83%), Gaps = 1/366 (0%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRK-SAIEGSDLVEESDLPASV 179
            NSS+TLS+NAFADLTN EF+A YLGL PS  D +IRLNSR  SAI+G +L++ES++P+S+
Sbjct: 68   NSSYTLSVNAFADLTNHEFRANYLGLSPSKSDSVIRLNSRSASAIDGDNLIKESEIPSSL 127

Query: 180  DWRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCG 359
            DWR KGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCD +YN GC 
Sbjct: 128  DWRNKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCN 187

Query: 360  GGLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQ 539
            GGLMDYA++FIIKN+GIDTEEDY Y+GR   C K K+ +HVVTIDSY D+P + EKKLLQ
Sbjct: 188  GGLMDYAYDFIIKNKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQ 247

Query: 540  AVATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGK 719
            AVATQP+SVGICGSD  FQLYSGGIF+GPCST+LDHAVLIVGYDS+DG DYWI+KNSWGK
Sbjct: 248  AVATQPISVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGK 307

Query: 720  YWGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETC 899
             WG+ GY+HM+RNSG  EGVCGINTLAS+P+K              KC++FTYC + ETC
Sbjct: 308  SWGIKGYMHMVRNSGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETC 367

Query: 900  CCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 1079
            CC    LG+C SW CCEAESAVCCDDH HCCP DYP CDT +NLCLK+ GN+T+SKP  K
Sbjct: 368  CCARYFLGVCLSWNCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLGK 427

Query: 1080 KGFFTS 1097
            K F  S
Sbjct: 428  KSFSAS 433


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  513 bits (1322), Expect = e-143
 Identities = 242/359 (67%), Positives = 279/359 (77%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TLSLNAFADLT+ EFKA  LGL  +A      LN  +S  +  D V  +D+PASVD
Sbjct: 69   NSSYTLSLNAFADLTHHEFKASRLGLSSAAS---ASLNVDRSNRQIPDFV--ADVPASVD 123

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR  GAVT VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC G
Sbjct: 124  WRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEG 183

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            G+MDYAF+F+I N GIDTEEDYPY+GRD  C+KEKLKRHVVTID Y DVP   EK+LL+A
Sbjct: 184  GIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKA 243

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG Y
Sbjct: 244  VANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSY 303

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGM+GY+HM RNSG + G+CGIN LAS+P K              +CDLFT+CG  ETCC
Sbjct: 304  WGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCC 363

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 1079
            C   + GIC SW+CCE +SAVCC D  HCCPRDYP CDT RN+CLK  GN+T  + F K
Sbjct: 364  CVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAK 422


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  512 bits (1319), Expect = e-142
 Identities = 242/360 (67%), Positives = 285/360 (79%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NS+++LSLNAFADLT+ EFKA  LGL   +  L+    +++ ++  S+ V    +P SVD
Sbjct: 76   NSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLM----AKEQSLGVSERVRVK-VPDSVD 130

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G
Sbjct: 131  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 190

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN GIDTE+DYPY+ +DG C K+KLK+ VVTIDSYA V    EK L++A
Sbjct: 191  GLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEA 250

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA+QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK 
Sbjct: 251  VASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 310

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGM+G++HM RN+G++EGVCGIN LAS+PIK              KC+LFTYC + ETCC
Sbjct: 311  WGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 370

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C  +L G+CFSW+CCE ESAVCC D  HCCPRDYP CDT ++LCLK+ GN T  KPF KK
Sbjct: 371  CARTLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKK 430


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  511 bits (1317), Expect = e-142
 Identities = 238/362 (65%), Positives = 282/362 (77%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TL LNA++DLT+ EF+  +LGL  SA+D  IRL  R S    + ++ + D P+S+D
Sbjct: 68   NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDVDAPSSLD 126

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG
Sbjct: 127  WREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGG 186

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN GIDTE+DYP+R R+G C+K KL+RHVVTID Y D+P   E KLL+A
Sbjct: 187  GLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKA 246

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VATQPVSVGICGS   FQ YS GIF+GPCSTALDHAVLIVGY S++GVDYWI+KNSWG  
Sbjct: 247  VATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTS 306

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WG+NGYIHM RNSG+ EG+CGIN LAS+P K              KC +FT CG  ETCC
Sbjct: 307  WGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCC 366

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C    LGIC SW+CC  +SAVCC D  HCCP+DYP CDT+RNLCLKR+ N+T+ +  +K+
Sbjct: 367  CGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKE 426

Query: 1083 GF 1088
             F
Sbjct: 427  AF 428


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  511 bits (1316), Expect = e-142
 Identities = 244/362 (67%), Positives = 280/362 (77%), Gaps = 2/362 (0%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            N++++LSLNAFADLT+ EFKA  LGL  SA  L++       A +G  L   + +P SVD
Sbjct: 71   NATYSLSLNAFADLTHHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSVD 123

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G
Sbjct: 124  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L +A
Sbjct: 184  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREA 243

Query: 543  VATQPVSVGICGSDYKFQLYS--GGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWG 716
            VA QPVSVGICGS+  FQLYS   GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWG
Sbjct: 244  VAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWG 303

Query: 717  KYWGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDET 896
            K WGM+G++HM RN+G++EG+CGIN LAS+PIK              KC+LFTYC   ET
Sbjct: 304  KSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGET 363

Query: 897  CCCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFE 1076
            CCC  +L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF 
Sbjct: 364  CCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFW 423

Query: 1077 KK 1082
            KK
Sbjct: 424  KK 425


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  511 bits (1315), Expect = e-142
 Identities = 242/360 (67%), Positives = 279/360 (77%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            N++++LSLNAFADLT+ EFKA  LGL  SA  +++       A +G  L     +P SVD
Sbjct: 71   NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G
Sbjct: 124  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++A
Sbjct: 184  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK 
Sbjct: 244  VAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGM+G++HM RN+ +++GVCGIN LAS+PIK              KC+LFTYC + ETCC
Sbjct: 304  WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C   L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 364  CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  510 bits (1313), Expect = e-142
 Identities = 242/360 (67%), Positives = 279/360 (77%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            N++++LSLNAFADLT+ EFKA  LGL  SA  +++       A +G  L     +P SVD
Sbjct: 71   NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 123

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G
Sbjct: 124  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 183

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++A
Sbjct: 184  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 243

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK 
Sbjct: 244  VAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKS 303

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGM+G++HM RN+ +++GVCGIN LAS+PIK              KC+LFTYC + ETCC
Sbjct: 304  WGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCC 363

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C   L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 364  CARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  505 bits (1300), Expect = e-140
 Identities = 237/360 (65%), Positives = 279/360 (77%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS++L+LNAFADLT+ EFKA  LGL  +A      +   +  ++   LV   D+PAS+D
Sbjct: 69   NSSYSLALNAFADLTHHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMD 120

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WRTKGAVT VKDQGSCGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD +YN+GC G
Sbjct: 121  WRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEG 180

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA++F+I N GID EEDYPY GR+  C+KEK KR VVTID YA VP   E  LLQA
Sbjct: 181  GLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQA 240

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS+  FQLYS GIF+GPCS++LDHAVLIVGY S++GVDYWIVKNSWG  
Sbjct: 241  VAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTR 300

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGMNGYIHMLRNSGD++G+CGIN LAS+P K              KCDLFTYC   ETCC
Sbjct: 301  WGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCC 360

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C   + GICFSW+CCE +SAVCC D+ HCCP DYP CDT ++ CLKR+GN+T  + FEK+
Sbjct: 361  CTHRIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  499 bits (1285), Expect = e-138
 Identities = 238/363 (65%), Positives = 272/363 (74%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TLSLNAFADLT+ EFKA  LGL PS+  L  + N  +      D ++   +P+ +D
Sbjct: 69   NSSYTLSLNAFADLTHHEFKATRLGLPPSSL-LRFKFNRFQDQQRSDDFLQ---VPSEID 124

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR  GAV+ VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDTTYN+GC G
Sbjct: 125  WRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDG 184

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA++FII N GIDTEEDYPY+ R   C K+KLKR VVTID Y DVPP  EKKLL+A
Sbjct: 185  GLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKA 244

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS   FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWGKY
Sbjct: 245  VAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKY 304

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGMNGYIHMLRN+  + G+CGIN LAS+P K              KC+LFTYC   ETCC
Sbjct: 305  WGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCC 364

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C    LGICFSW+CC   SAVCC D  HCCP DYP CD +   CLKRI N T+    +K+
Sbjct: 365  CAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKE 424

Query: 1083 GFF 1091
              F
Sbjct: 425  DPF 427


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  499 bits (1285), Expect = e-138
 Identities = 232/362 (64%), Positives = 276/362 (76%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TL LNA++DLT+ EF+  +LGL  SA+D  IRL  R S    + ++ + D P+S+D
Sbjct: 68   NSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDVDAPSSLD 126

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD +YN GCGG
Sbjct: 127  WRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGG 186

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN GIDTE+DYP+R ++G C+K KL+R VVTID Y D+P   E KLL+A
Sbjct: 187  GLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKA 246

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VATQPVSVGICGS   FQ YS GIF+GPC T LDHAVLIVGY S++G DYWI+KNSWG  
Sbjct: 247  VATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTS 306

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WG+NGYIHM RNSG+ EG+CG+N LAS+P K              KC  FT CG  ETCC
Sbjct: 307  WGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCC 366

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C    LGIC SW+CC  +SAVCC D  HCCP DYP CDT+RNLCLKR+ N+T+ +  +K+
Sbjct: 367  CGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQQPQKE 426

Query: 1083 GF 1088
             F
Sbjct: 427  PF 428


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  498 bits (1282), Expect = e-138
 Identities = 231/362 (63%), Positives = 280/362 (77%), Gaps = 1/362 (0%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS TLSLNAFADLT+QEFKA +LG   ++ D   R   R ++++    +   D+PAS+D
Sbjct: 68   NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR---RNASVQSPGTLR--DVPASID 122

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YN+GCGG
Sbjct: 123  WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQA
Sbjct: 183  GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            V  QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWI+KNSWG+ 
Sbjct: 243  VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRS 302

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGMNGY+HM RN+G++ G+CGIN LAS+P K              +C L TYC   ETCC
Sbjct: 303  WGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCC 362

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFEK 1079
            C  S+LGIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL R  GN T ++  E 
Sbjct: 363  CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEM 422

Query: 1080 KG 1085
            +G
Sbjct: 423  RG 424


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  498 bits (1281), Expect = e-138
 Identities = 233/346 (67%), Positives = 267/346 (77%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TLSLNAFADLT+ EFK+  LG   +    L +L        GS L++  D+PAS+D
Sbjct: 69   NSSYTLSLNAFADLTHHEFKSSRLGFSSALLSSLPKL--------GSKLLDLRDVPASLD 120

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQGSCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YNAGC G
Sbjct: 121  WRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDG 180

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA++F+I N GIDTEEDYPY+ RD  C KEKLKR VVTID Y DV P    +LLQA
Sbjct: 181  GLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQA 240

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            V TQPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWIVKNSWGK 
Sbjct: 241  VVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQ 300

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGM+GYIHM RN+G+++GVCGIN LAS+P K              +C  F  CG  ETCC
Sbjct: 301  WGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCC 360

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1040
            C W  LG+CFSW+CC   SAVCC D  HCCP+DYP CDT RN+CLK
Sbjct: 361  CSWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  497 bits (1280), Expect = e-138
 Identities = 232/362 (64%), Positives = 280/362 (77%), Gaps = 1/362 (0%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS TLSLNAFADLT+QEFKA +LG   ++ D   R N+  S     +L    D+PAS+D
Sbjct: 68   NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA--SVQSPGNL---RDVPASID 122

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD +YN+GCGG
Sbjct: 123  WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQA
Sbjct: 183  GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            V  QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLI+GYDS++GVDYWI+KNSWG+ 
Sbjct: 243  VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRS 302

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGMNGY+HM RN+G++ G+CGIN LAS+P K              +C L TYC   ETCC
Sbjct: 303  WGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCC 362

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFEK 1079
            C  S+LGIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL R+ GN T ++  E 
Sbjct: 363  CGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEM 422

Query: 1080 KG 1085
            +G
Sbjct: 423  RG 424


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  496 bits (1277), Expect = e-138
 Identities = 232/360 (64%), Positives = 272/360 (75%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS++L+LNAFADLT+ EFK   LGL  +  +L  R N   + + G       D+PAS+D
Sbjct: 68   NSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHR-NLEITGVVG-------DIPASID 119

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KG VT VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CD +YN GCGG
Sbjct: 120  WRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGG 179

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAF+F+I N GIDTEEDYPYR RDG C+K+++KR VVTID Y DVP   EK+LLQA
Sbjct: 180  GLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQA 239

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS+  FQ+YS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG  
Sbjct: 240  VAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTG 299

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGM GY+HM RNSG+++GVCGIN LAS+P+K              KC+L TYC   ETCC
Sbjct: 300  WGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCC 359

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C     GIC SW+CC  +SAVCC D  HCCP DYP CDT +N+C KR GN+T  +  E K
Sbjct: 360  CARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK 419


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  496 bits (1276), Expect = e-137
 Identities = 233/346 (67%), Positives = 274/346 (79%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TLSLNAFADLT+ EFKA  LGL  SA     +L+ R   +   D V   D+P S+D
Sbjct: 71   NSSYTLSLNAFADLTHHEFKASRLGL--SAFSTSGKLSRRNFPLH--DFV--GDVPISID 124

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAV+ VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC G
Sbjct: 125  WRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEG 184

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA++F+I+N GIDTEEDYPY+ R+  C+KEKLKRHVVTID Y DVP   EK+LL+A
Sbjct: 185  GLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKA 244

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG +
Sbjct: 245  VAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTH 304

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WG+NGY++MLRNSG+++G+CGIN LASFP+K              KCDLFT CG  ETCC
Sbjct: 305  WGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCC 364

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1040
            C   + G+CFSW+CCE +SAVCC D  HCCP DYP CDT RN+CLK
Sbjct: 365  CTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  493 bits (1268), Expect = e-136
 Identities = 231/362 (63%), Positives = 277/362 (76%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TLSLNAFADLT+ EFK   LGL      L +    R    +  DL+    +P+ +D
Sbjct: 77   NSSYTLSLNAFADLTHHEFKTTRLGL-----PLTLLRFKRPQNQQSRDLLH---IPSQID 128

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR  GAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCDT+YN+GCGG
Sbjct: 129  WRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGG 188

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMD+A++F+I N+GIDTE+DYPY+ R   CSK+KLKR  VTI+ Y DVPP  E+++L+A
Sbjct: 189  GLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPP-SEEEILKA 247

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA+QPVSVGICGS+ +FQLYS GIF+GPCST LDHAVLIVGY S++GVDYWIVKNSWGKY
Sbjct: 248  VASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKY 307

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGMNGYIHM+RNSG+++G+CGINTLAS+P+K              +C+LFT+C   ETCC
Sbjct: 308  WGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCC 367

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C  S LGICFSW+CC   SAVCC D  HCCP+DYP CDT R  CLKR  N T +   E +
Sbjct: 368  CAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQ 427

Query: 1083 GF 1088
             F
Sbjct: 428  DF 429


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  491 bits (1265), Expect = e-136
 Identities = 238/388 (61%), Positives = 278/388 (71%), Gaps = 28/388 (7%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            N++++LSLNAFADL + EFK   LGL  SA  +++       A +G  L     +P S+D
Sbjct: 73   NATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLD 125

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YN GC G
Sbjct: 126  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNG 185

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN+GIDTE+DYPY+ RDG C K+KLK+ VV+IDSYA V P  EK LL+A
Sbjct: 186  GLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEA 245

Query: 543  VATQPVSVGICGSDYKFQLYSG----------------------------GIFSGPCSTA 638
            VA QPVSVGICGS+  FQLYS                             GIFSGPCST+
Sbjct: 246  VAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTS 305

Query: 639  LDHAVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSGDAEGVCGINTLASFPIKX 818
            LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+G+++G+CGIN LAS+PIK 
Sbjct: 306  LDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKT 365

Query: 819  XXXXXXXXXXXXXKCDLFTYCGTDETCCCHWSLLGICFSWRCCEAESAVCCDDHEHCCPR 998
                         KC+LFTYC   ETCCC  +L G+C SW+CCE ESAVCC D  HCCP 
Sbjct: 366  HPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPH 425

Query: 999  DYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 426  DYPVCDTTRSLCLKKTGNFTAIKPFWKK 453


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  491 bits (1263), Expect = e-136
 Identities = 232/360 (64%), Positives = 268/360 (74%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            NSS+TLSLN++ADLT+ EFK   LG  P+  +    L    S           D+P S+D
Sbjct: 68   NSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL--------PRDVPDSLD 119

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVTAVKDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD +YN+GCGG
Sbjct: 120  WRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGG 179

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA++F+I N GIDTE DYPY+ RDG C K+KL+R+VVTID YAD+P   E KLLQA
Sbjct: 180  GLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQA 239

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY S++GVDYWIVKNSWGK 
Sbjct: 240  VAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKS 299

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYCGTDETCC 902
            WGM+GY+HM RNSG++EGVCGIN LAS+P K              KC + T C   ETCC
Sbjct: 300  WGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCC 359

Query: 903  CHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 1082
            C    LG+C SW+CC   SAVCC D  HCCP DYP CDT RNLCLK+  N T ++  E +
Sbjct: 360  CAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENR 419


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  489 bits (1259), Expect = e-135
 Identities = 234/353 (66%), Positives = 270/353 (76%), Gaps = 7/353 (1%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            N++++LSLNAFADLT+ EFKA  LGL  SA  +++       A +G  L     +P SVD
Sbjct: 69   NATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVD 121

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCD +YNAGC G
Sbjct: 122  WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNG 181

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++A
Sbjct: 182  GLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEA 241

Query: 543  VATQPVSVGICGSDYKFQLYSG-------GIFSGPCSTALDHAVLIVGYDSQDGVDYWIV 701
            VA QPVSVGICGS+  FQLYS        GIFSGPCST+LDHAVLIVGY SQ+GVDYWIV
Sbjct: 242  VAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIV 301

Query: 702  KNSWGKYWGMNGYIHMLRNSGDAEGVCGINTLASFPIKXXXXXXXXXXXXXXKCDLFTYC 881
            KNSWGK WGM+G++HM RN+ +++GVCGIN LAS+PIK              KC+LFTYC
Sbjct: 302  KNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYC 361

Query: 882  GTDETCCCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 1040
             + ETCCC   L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK
Sbjct: 362  SSGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  486 bits (1251), Expect = e-135
 Identities = 226/357 (63%), Positives = 273/357 (76%), Gaps = 1/357 (0%)
 Frame = +3

Query: 3    NSSHTLSLNAFADLTNQEFKAKYLGLLPSADDLLIRLNSRKSAIEGSDLVEESDLPASVD 182
            N S+TLSLNAFADLT  EF  +YLG  PS  DLLIR N    +    +    S +P+S+D
Sbjct: 69   NYSYTLSLNAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSID 125

Query: 183  WRTKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDTTYNAGCGG 362
            WR KGAVT +KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD +YN GC G
Sbjct: 126  WRKKGAVTGIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNG 185

Query: 363  GLMDYAFEFIIKNEGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQA 542
            GLMDYA+EFI+KN+GIDTEEDY Y+GRD  CS+ KL + VVTIDSY D+P + E+ LL+A
Sbjct: 186  GLMDYAYEFILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEA 245

Query: 543  VATQPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKY 722
            VA+QPVSVGI G D  FQ YS GIF+GPCST+LDHAVLIVGYDS++G DYWIVKNSWGK 
Sbjct: 246  VASQPVSVGISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKS 305

Query: 723  WGMNGYIHMLRNSGDAEGVCGINTLASFPIK-XXXXXXXXXXXXXXKCDLFTYCGTDETC 899
            WGM+GY+++ RN+G+  G+C IN +AS+P+K               KC LF+YC   ETC
Sbjct: 306  WGMDGYMYVQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETC 365

Query: 900  CCHWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKP 1070
            CC    LG+C  ++CC AESAVCC+D+ HCCP+DYP CDTA+++C K  GNST++ P
Sbjct: 366  CCARRFLGLCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIP 422

