BLASTX nr result

ID: Mentha27_contig00019402 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00019402
         (958 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus...   382   e-103
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   359   1e-96
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   358   2e-96
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   358   2e-96
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          356   7e-96
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   351   3e-94
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   348   2e-93
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   345   1e-92
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   345   2e-92
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   343   8e-92
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   343   8e-92
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   342   1e-91
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   342   1e-91
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   337   3e-90
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   337   4e-90
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   337   4e-90
ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas...   337   5e-90
ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ...   337   5e-90
ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun...   336   9e-90
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  334   3e-89

>gb|EYU36745.1| hypothetical protein MIMGU_mgv1a006749mg [Mimulus guttatus]
          Length = 433

 Score =  382 bits (980), Expect = e-103
 Identities = 168/242 (69%), Positives = 193/242 (79%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++FIIKNKGIDTEEDY Y+GR   C K K+ +HVVTIDSY D+P + EKKLLQAVAT
Sbjct: 192 DYAYDFIIKNKGIDTEEDYSYKGRSATCDKNKMNKHVVTIDSYVDIPEKDEKKLLQAVAT 251

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QP+SVGICGSD  FQLYSGGIF+GPCST+LDHAVLIVGYDS+DG DYWI+KNSWGK WG+
Sbjct: 252 QPISVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGKSWGI 311

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
            GY+HM+RNS   EGVCGINTLAS+P+K             TKC++FTYC + ETCCC  
Sbjct: 312 KGYMHMVRNSGSEEGVCGINTLASYPVKSSTNPPPSPTPGPTKCNIFTYCSSGETCCCAR 371

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKKGFF 237
             LG+C SW CCEAESAVCCDDH HCCP DYP CDT +NLCLK+ GN+T+SKP  KK F 
Sbjct: 372 YFLGVCLSWNCCEAESAVCCDDHRHCCPHDYPVCDTKKNLCLKKSGNTTVSKPLGKKSFS 431

Query: 236 TS 231
            S
Sbjct: 432 AS 433


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
           gi|110741821|dbj|BAE98853.1| papain-like cysteine
           peptidase XBCP3 [Arabidopsis thaliana]
           gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
           thaliana] gi|332190386|gb|AEE28507.1| papain-like
           cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  359 bits (921), Expect = 1e-96
 Identities = 163/237 (68%), Positives = 187/237 (78%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++AVA 
Sbjct: 187 DYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAA 246

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM
Sbjct: 247 QPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGM 306

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           +G++HM RN+E+++GVCGIN LAS+PIK             TKC+LFTYC + ETCCC  
Sbjct: 307 DGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCAR 366

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 246
            L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 367 ELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  358 bits (919), Expect = 2e-96
 Identities = 163/237 (68%), Positives = 187/237 (78%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++AVA 
Sbjct: 187 DYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAA 246

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM
Sbjct: 247 QPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGM 306

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           +G++HM RN+E+++GVCGIN LAS+PIK             TKC+LFTYC + ETCCC  
Sbjct: 307 DGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCAR 366

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 246
            L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 367 ELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 423


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
           gi|557095297|gb|ESQ35879.1| hypothetical protein
           EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  358 bits (918), Expect = 2e-96
 Identities = 162/237 (68%), Positives = 189/237 (79%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAFEF+IKN GIDTE+DYPY+ +DG C K+KLK+ VVTIDSYA V    EK L++AVA+
Sbjct: 194 DYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVAS 253

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK WGM
Sbjct: 254 QPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGM 313

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           +G++HM RN+ ++EGVCGIN LAS+PIK             TKC+LFTYC + ETCCC  
Sbjct: 314 DGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCAR 373

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 246
           +L G+CFSW+CCE ESAVCC D  HCCPRDYP CDT ++LCLK+ GN T  KPF KK
Sbjct: 374 TLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKK 430


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  356 bits (914), Expect = 7e-96
 Identities = 159/236 (67%), Positives = 183/236 (77%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAF+F+I N GIDTEEDYPY+GRD  C+KEKLKRHVVTID Y DVP   EK+LL+AVA 
Sbjct: 187 DYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVAN 246

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG YWGM
Sbjct: 247 QPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGM 306

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           +GY+HM RNS  + G+CGIN LAS+P K             T+CDLFT+CG  ETCCC  
Sbjct: 307 DGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVH 366

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEK 249
            + GIC SW+CCE +SAVCC D  HCCPRDYP CDT RN+CLK  GN+T  + F K
Sbjct: 367 HIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAK 422


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
           ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  351 bits (900), Expect = 3e-94
 Identities = 162/239 (67%), Positives = 185/239 (77%), Gaps = 2/239 (0%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L +AVA 
Sbjct: 187 DYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAA 246

Query: 776 QPVSVGICGSDYKFQLYS--GGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYW 603
           QPVSVGICGS+  FQLYS   GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNSWGK W
Sbjct: 247 QPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSW 306

Query: 602 GMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCC 423
           GM+G++HM RN+ ++EG+CGIN LAS+PIK             TKC+LFTYC   ETCCC
Sbjct: 307 GMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCC 366

Query: 422 YWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 246
             +L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK+ GN T  KPF KK
Sbjct: 367 ARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKK 425


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
           JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  348 bits (893), Expect = 2e-93
 Identities = 157/237 (66%), Positives = 182/237 (76%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++F+I N GID EEDYPY GR+  C+KEK KR VVTID YA VP   E  LLQAVA 
Sbjct: 184 DYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAK 243

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIF+GPCS++LDHAVLIVGY S++GVDYWIVKNSWG  WGM
Sbjct: 244 QPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGM 303

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGYIHMLRNS D++G+CGIN LAS+P K             TKCDLFTYC   ETCCC  
Sbjct: 304 NGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTH 363

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 246
            + GICFSW+CCE +SAVCC D+ HCCP DYP CDT ++ CLKR+GN+T  + FEK+
Sbjct: 364 RIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  345 bits (886), Expect = 1e-92
 Identities = 155/239 (64%), Positives = 183/239 (76%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAFEF+IKN GIDTE+DYP+R R+G C+K KL+RHVVTID Y D+P   E KLL+AVAT
Sbjct: 190 DYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVAT 249

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS   FQ YS GIF+GPCSTALDHAVLIVGY S++GVDYWI+KNSWG  WG+
Sbjct: 250 QPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGI 309

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGYIHM RNS + EG+CGIN LAS+P K             +KC +FT CG  ETCCC  
Sbjct: 310 NGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGS 369

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKKGF 240
             LGIC SW+CC  +SAVCC D  HCCP+DYP CDT+RNLCLKR+ N+T+ +  +K+ F
Sbjct: 370 KFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQQPQKEAF 428


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  345 bits (884), Expect = 2e-92
 Identities = 157/240 (65%), Positives = 177/240 (73%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++FII N GIDTEEDYPY+ R   C K+KLKR VVTID Y DVPP  EKKLL+AVA 
Sbjct: 188 DYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAV 247

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS   FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWGKYWGM
Sbjct: 248 QPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGM 307

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGYIHMLRN++ + G+CGIN LAS+P K              KC+LFTYC   ETCCC  
Sbjct: 308 NGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAK 367

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKKGFF 237
             LGICFSW+CC   SAVCC D  HCCP DYP CD +   CLKRI N T+    +K+  F
Sbjct: 368 KFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKEDPF 427


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
           gi|482576142|gb|EOA40329.1| hypothetical protein
           CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  343 bits (879), Expect = 8e-92
 Identities = 162/265 (61%), Positives = 187/265 (70%), Gaps = 28/265 (10%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAFEF+IKNKGIDTE+DYPY+ RDG C K+KLK+ VV+IDSYA V P  EK LL+AVA 
Sbjct: 189 DYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAA 248

Query: 776 QPVSVGICGSDYKFQLYSG----------------------------GIFSGPCSTALDH 681
           QPVSVGICGS+  FQLYS                             GIFSGPCST+LDH
Sbjct: 249 QPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDH 308

Query: 680 AVLIVGYDSQDGVDYWIVKNSWGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXX 501
           AVLIVGY SQ+GVDYWIVKNSWGK WGM+G++HM RN+ +++G+CGIN LAS+PIK    
Sbjct: 309 AVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPN 368

Query: 500 XXXXXXXXXTKCDLFTYCGTDETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYP 321
                    TKC+LFTYC   ETCCC  +L G+C SW+CCE ESAVCC D  HCCP DYP
Sbjct: 369 PPPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYP 428

Query: 320 TCDTARNLCLKRIGNSTLSKPFEKK 246
            CDT R+LCLK+ GN T  KPF KK
Sbjct: 429 VCDTTRSLCLKKTGNFTAIKPFWKK 453


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
           gi|223551160|gb|EEF52646.1| cysteine protease, putative
           [Ricinus communis]
          Length = 422

 Score =  343 bits (879), Expect = 8e-92
 Identities = 151/223 (67%), Positives = 179/223 (80%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++F+I+N GIDTEEDYPY+ R+  C+KEKLKRHVVTID Y DVP   EK+LL+AVA 
Sbjct: 188 DYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAA 247

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG +WG+
Sbjct: 248 QPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGI 307

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGY++MLRNS +++G+CGIN LASFP+K             TKCDLFT CG  ETCCC  
Sbjct: 308 NGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTR 367

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 288
            + G+CFSW+CCE +SAVCC D  HCCP DYP CDT RN+CLK
Sbjct: 368 RIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
           gi|550339725|gb|EEE94684.2| cysteine protease family
           protein [Populus trichocarpa]
          Length = 436

 Score =  342 bits (877), Expect = 1e-91
 Identities = 152/237 (64%), Positives = 178/237 (75%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAF+F+I N GIDTEEDYPYR RDG C+K+++KR VVTID Y DVP   EK+LLQAVA 
Sbjct: 183 DYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAA 242

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQ+YS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG  WGM
Sbjct: 243 QPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGM 302

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
            GY+HM RNS +++GVCGIN LAS+P+K             TKC+L TYC   ETCCC  
Sbjct: 303 RGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCAR 362

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKK 246
              GIC SW+CC  +SAVCC D  HCCP DYP CDT +N+C KR GN+T  +  E K
Sbjct: 363 KFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK 419


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  342 bits (877), Expect = 1e-91
 Identities = 152/239 (63%), Positives = 184/239 (76%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           D+A++F+I NKGIDTE+DYPY+ R   CSK+KLKR  VTI+ Y DVPP  E+++L+AVA+
Sbjct: 192 DFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPP-SEEEILKAVAS 250

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+ +FQLYS GIF+GPCST LDHAVLIVGY S++GVDYWIVKNSWGKYWGM
Sbjct: 251 QPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGM 310

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGYIHM+RNS +++G+CGINTLAS+P+K              +C+LFT+C   ETCCC  
Sbjct: 311 NGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAK 370

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKKGF 240
           S LGICFSW+CC   SAVCC D  HCCP+DYP CDT R  CLKR  N T +   E + F
Sbjct: 371 SFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDF 429


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score =  337 bits (865), Expect = 3e-90
 Identities = 155/230 (67%), Positives = 178/230 (77%), Gaps = 7/230 (3%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYAFEF+IKN GIDTE+DYPY+ RDG C K+KLK+ VVTIDSYA V    EK L++AVA 
Sbjct: 185 DYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAA 244

Query: 776 QPVSVGICGSDYKFQLYSG-------GIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNS 618
           QPVSVGICGS+  FQLYS        GIFSGPCST+LDHAVLIVGY SQ+GVDYWIVKNS
Sbjct: 245 QPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNS 304

Query: 617 WGKYWGMNGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTD 438
           WGK WGM+G++HM RN+E+++GVCGIN LAS+PIK             TKC+LFTYC + 
Sbjct: 305 WGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSG 364

Query: 437 ETCCCYWSLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 288
           ETCCC   L G+CFSW+CCE ESAVCC D  HCCP DYP CDT R+LCLK
Sbjct: 365 ETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  337 bits (864), Expect = 4e-90
 Identities = 148/239 (61%), Positives = 183/239 (76%), Gaps = 1/239 (0%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQAV  
Sbjct: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA 245

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLI+GYDS++GVDYWI+KNSWG+ WGM
Sbjct: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGM 305

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGY+HM RN+ ++ G+CGIN LAS+P K             T+C L TYC   ETCCC  
Sbjct: 306 NGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGS 365

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFEKKG 243
           S+LGIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL R+ GN T ++  E +G
Sbjct: 366 SILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRG 424


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
           gi|557537201|gb|ESR48319.1| hypothetical protein
           CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  337 bits (864), Expect = 4e-90
 Identities = 149/239 (62%), Positives = 182/239 (76%), Gaps = 1/239 (0%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++F+IKN GIDTE+DYPYRG+ G+C+K+KL RH+VTID Y DVP   EK+LLQAV  
Sbjct: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA 245

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWI+KNSWG+ WGM
Sbjct: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 305

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGY+HM RN+ ++ G+CGIN LAS+P K             T+C L TYC   ETCCC  
Sbjct: 306 NGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGS 365

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRI-GNSTLSKPFEKKG 243
           S+LGIC SW+CC   SAVCC DH +CCP +YP CD+ R+ CL R  GN T ++  E +G
Sbjct: 366 SILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEMRG 424


>ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris]
           gi|561009128|gb|ESW08035.1| hypothetical protein
           PHAVU_009G013000g [Phaseolus vulgaris]
          Length = 428

 Score =  337 bits (863), Expect = 5e-90
 Identities = 151/229 (65%), Positives = 180/229 (78%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++F+I NKGIDTE+DYPY+ R   C+K+KLKRH+VTID Y D+PP +E+ LL+AVA+
Sbjct: 184 DYAYQFVIDNKGIDTEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEE-LLKAVAS 242

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIFSGPCST+LDHAVLIVGY S++GVDYWIVKNSWGKYWGM
Sbjct: 243 QPVSVGICGSERAFQLYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGM 302

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
            GYIHM+RN+ D +G+CGINTLAS+PIK              +C+LFT+C   ETCCC  
Sbjct: 303 EGYIHMIRNTGDPKGICGINTLASYPIK--TKPNPPPPPAPVRCNLFTHCSEGETCCCAK 360

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNST 270
           S LGICFSW+CC   SAVCC D  HCCPRDYP CDT ++ CLK    +T
Sbjct: 361 SFLGICFSWKCCGLTSAVCCKDKRHCCPRDYPICDTEKSQCLKITNGTT 409


>ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
           gi|302142569|emb|CBI19772.3| unnamed protein product
           [Vitis vinifera]
          Length = 436

 Score =  337 bits (863), Expect = 5e-90
 Identities = 145/238 (60%), Positives = 180/238 (75%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++F+IKN+GID+E DYPY G D  C+KEKLK+H+VTID Y D+PP  EK+LLQ VA 
Sbjct: 182 DYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAK 241

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS G+++GPCS+ LDHAVLIVGY ++DGVD+WIVKNSWG++WGM
Sbjct: 242 QPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGM 301

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
            GYIHMLRN+  AEG+CGIN LAS+P K             TKCD F+ C   ETCCC W
Sbjct: 302 RGYIHMLRNNGTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSW 361

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKKG 243
             +G+C SW CC A+SAVCCD++ +CCP  +P CDT RN CLK  GN T  +  +++G
Sbjct: 362 RFIGVCLSWNCCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRG 419


>ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica]
           gi|462420299|gb|EMJ24562.1| hypothetical protein
           PRUPE_ppa005615mg [Prunus persica]
          Length = 451

 Score =  336 bits (861), Expect = 9e-90
 Identities = 158/249 (63%), Positives = 185/249 (74%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           D AF F+I N GIDTEEDYPY+G D  C K+KLKR+ VTID Y DVP   E++LLQAVA+
Sbjct: 188 DDAFRFVIDNNGIDTEEDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVAS 247

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGI GSD  FQLYS GIF+GPCST+LDHAVLIVGY S++GVDYWIVKNSWG +WGM
Sbjct: 248 QPVSVGISGSDMGFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGM 307

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           NGY+HMLR+  + +G+CGINTLAS+PIK             T+CD+FT+C   ETCCC  
Sbjct: 308 NGYMHMLRDHSNPKGICGINTLASYPIK-TGENPPLPPPGPTRCDIFTHCAAGETCCCAK 366

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLKRIGNSTLSKPFEKKGFF 237
            ++GICFSWRCCE +SAVCC D  HCCPRDYP CDT R LCL+   N  LS      G  
Sbjct: 367 RVVGICFSWRCCELDSAVCCKDQRHCCPRDYPICDTERTLCLQ--SNEQLSTQSHATGNL 424

Query: 236 TS*AVNSKG 210
           TS A+ S+G
Sbjct: 425 TSKALESRG 433


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  334 bits (857), Expect = 3e-89
 Identities = 149/223 (66%), Positives = 170/223 (76%)
 Frame = -3

Query: 956 DYAFEFIIKNKGIDTEEDYPYRGRDGKCSKEKLKRHVVTIDSYADVPPRKEKKLLQAVAT 777
           DYA++F+I N GIDTEEDYPY+ RD  C KEKLKR VVTID Y DV P    +LLQAV T
Sbjct: 184 DYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVT 243

Query: 776 QPVSVGICGSDYKFQLYSGGIFSGPCSTALDHAVLIVGYDSQDGVDYWIVKNSWGKYWGM 597
           QPVSVGICGS+  FQLYS GIF+GPCST+LDHAVLIVGYDS++GVDYWIVKNSWGK WGM
Sbjct: 244 QPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGM 303

Query: 596 NGYIHMLRNSEDAEGVCGINTLASFPIKXXXXXXXXXXXXXTKCDLFTYCGTDETCCCYW 417
           +GYIHM RN+ +++GVCGIN LAS+P K             T+C  F  CG  ETCCC W
Sbjct: 304 DGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSW 363

Query: 416 SLLGICFSWRCCEAESAVCCDDHEHCCPRDYPTCDTARNLCLK 288
             LG+CFSW+CC   SAVCC D  HCCP+DYP CDT RN+CLK
Sbjct: 364 RFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406

