BLASTX nr result

ID: Mentha22_contig00033698 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00033698
         (870 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAN20235.1| cathepsin L [Riptortus pedestris]                      76   1e-11
ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tub...    74   6e-11
ref|XP_003559143.1| PREDICTED: thiol protease SEN102-like [Brach...    71   5e-10
ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun...    71   6e-10
ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [S...    70   1e-09
ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578...    69   3e-09
ref|XP_002522572.1| cysteine protease, putative [Ricinus communi...    69   3e-09
gb|AAT07054.1| cathepsin L-like cysteine proteinase [Brugia malayi]    67   7e-09
gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indi...    67   7e-09
ref|XP_005355370.1| PREDICTED: testin-2-like isoform X2 [Microtu...    65   3e-08
ref|XP_005355369.1| PREDICTED: testin-2-like isoform X1 [Microtu...    65   3e-08
ref|XP_004953685.1| PREDICTED: xylem cysteine proteinase 1-like ...    65   3e-08
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...    65   3e-08
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...    65   3e-08
ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group] g...    65   3e-08
ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium ca...    65   3e-08
ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [S...    65   4e-08
gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CA...    65   4e-08
ref|XP_006415025.1| hypothetical protein EUTSA_v10007664mg [Eutr...    64   6e-08
ref|XP_006647791.1| PREDICTED: xylem cysteine proteinase 1-like ...    64   1e-07

>dbj|BAN20235.1| cathepsin L [Riptortus pedestris]
          Length = 334

 Score = 76.3 bits (186), Expect = 1e-11
 Identities = 66/256 (25%), Positives = 104/256 (40%), Gaps = 7/256 (2%)
 Frame = -3

Query: 853 VPDWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPP 677
           + DWR  G +T + +QG    CWA     AL    F   G  +  S Q+L+D        
Sbjct: 117 IVDWRAEGAVTPVKNQGQCGSCWAFSAVGALEGQHFRQTGELVNLSEQNLVDCTR----- 171

Query: 676 DDALYDAAGRMMIDHRGLGNSF-HHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFGDR 500
                     M+  + G    F + A  Y+ + G+  E+  P+    +  ++     G  
Sbjct: 172 ---------NMIFGNNGCNGGFMNRAFRYIKKKGIDTEESYPYRGKEQKCKFSPENIGAN 222

Query: 499 IFLHGYGHVDELDYEEVEWRLRIQPLMAE-----IAXXXXXXXXXXXSGVYLGFQPDEVS 335
             + GY  +          +L +Q  +A+     +A           SGVY  ++P    
Sbjct: 223 --MTGYAKIKRGS------QLALQDAVAKAGPISVAIEAHKSIKYYKSGVY--YEPQ--- 269

Query: 334 LISDTWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIA 155
                      +    HA+LVVGYGVE+  E        Y+L KNSWG+ WG  GY+KI 
Sbjct: 270 -------CGVGFQKLNHAVLVVGYGVEDGRE--------YWLVKNSWGDKWGDGGYIKIV 314

Query: 154 RTFFDGMGGFYFSEQP 107
           + F++G G   ++  P
Sbjct: 315 KNFWNGCGVAEYASYP 330


>ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tuberosum]
          Length = 346

 Score = 74.3 bits (181), Expect = 6e-11
 Identities = 56/231 (24%), Positives = 103/231 (44%), Gaps = 4/231 (1%)
 Frame = -3

Query: 832 CITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLD--FVHLPFPPDDALYD 659
           C++ +  QGD   CWA V  +A+ AL  +   +    S + ++D  F  +  P +   + 
Sbjct: 125 CLSSVEDQGDCPTCWAFVAAEAISALYHIKFNKEKFLSKKHVIDDLFTVIKEPKEIQYHK 184

Query: 658 AAGRMMIDHRGLGNSFHHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFGDRIFLHGYG 479
           A G          + +++   Y +  GV  +   P+L      E ++ P  ++  +  Y 
Sbjct: 185 ATGCFP-------SHYNNYFQYAIEKGVYPDKPYPYL--AERGECLELPNEEKTKIKAYK 235

Query: 478 HVDEL--DYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTWALNG 305
            V++L  D + +E  ++ QP+   +              +Y+G   +E+     + A   
Sbjct: 236 KVNDLGLDKKSIEELIQKQPICGSVKLAKNFQKHKGKD-IYMGQTKEEIY----SEASKN 290

Query: 304 TYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                 HA+L++G+G+EN +E        YYL KNSWG  WG+ GY ++ R
Sbjct: 291 NQSRGRHAVLIIGFGIENGIE--------YYLIKNSWGVNWGYLGYARVER 333


>ref|XP_003559143.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 388

 Score = 71.2 bits (173), Expect = 5e-10
 Identities = 68/235 (28%), Positives = 102/235 (43%), Gaps = 3/235 (1%)
 Frame = -3

Query: 847 DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR +G +T + +Q     CWA     A+ +L FL  G  I  S Q L+D          
Sbjct: 166 DWRVKGAVTEVKNQNPCGGCWAFAAAAAVESLVFLKTGSLIPLSEQQLIDC--------- 216

Query: 670 ALYDAAGRMMIDHRGL-GNSFHHALTYVMRFGVIIEDELPFLWWCRNLEW-IDRPFGDRI 497
              D +      ++G  G S   A  Y+   G+  ++  P+       E  +D P   RI
Sbjct: 217 ---DTSS----PNKGCNGGSVAKAFIYISSTGLTSQEHYPYERSQGPCEAEMDTPVFARI 269

Query: 496 FLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTW 317
              G+  V   D + +  R+ +QP+ A IA            GV+ G             
Sbjct: 270 --SGFASVPPFDGDAMLRRVAVQPVAAVIAVNENVLTGYTG-GVFTG------------- 313

Query: 316 ALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
            L G+ P   HA+ VVGYG  +  ++G   S+ Y++ KNSWG  WG  GY++I+R
Sbjct: 314 -LCGS-PTVWHAVTVVGYGTHD--DDGDGTSLNYWVVKNSWGPSWGEGGYIRISR 364


>ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica]
           gi|462420299|gb|EMJ24562.1| hypothetical protein
           PRUPE_ppa005615mg [Prunus persica]
          Length = 451

 Score = 70.9 bits (172), Expect = 6e-10
 Identities = 62/237 (26%), Positives = 101/237 (42%), Gaps = 5/237 (2%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR+ G +T +  QG    CWA  T  A+  +  ++ G  I  S Q+L+D   + +P + 
Sbjct: 123 DWRKKGAVTNVKDQGSCGACWAFSTTGAIEGINKIVTGSLISLSEQELVDCDRV-YPNNG 181

Query: 670 A---LYDAAGRMMIDHRGLGNSFHHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFG-D 503
               L D A R +ID+ G+                  E++ P+  W      I +    +
Sbjct: 182 CNGGLMDDAFRFVIDNNGIDT----------------EEDYPYKGWDDTC--IKKKLKRN 223

Query: 502 RIFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISD 323
            + +  Y  V   D E++   +  QP+   I+            G  +GFQ     + + 
Sbjct: 224 AVTIDDYTDVPSNDEEQLLQAVASQPVSVGIS------------GSDMGFQLYSKGIFNG 271

Query: 322 TWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
             + +       HA+L+VGYG EN V+        Y++ KNSWG  WG  GY+ + R
Sbjct: 272 PCSTS-----LDHAVLIVGYGSENGVD--------YWIVKNSWGTHWGMNGYMHMLR 315


>ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
           gi|241934223|gb|EES07368.1| hypothetical protein
           SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score = 70.1 bits (170), Expect = 1e-09
 Identities = 63/235 (26%), Positives = 95/235 (40%), Gaps = 5/235 (2%)
 Frame = -3

Query: 847 DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR +G +T + +QG    CWA  +  A+  +  ++ G+ +  S Q+L+D          
Sbjct: 138 DWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELMDC--------- 188

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVMRF-GVIIEDELPFLW---WCRNLEWIDRPFGD 503
                    M+DH   G     A  Y+M   G+  ED+ P+L    +C+  +    P+ +
Sbjct: 189 -------DTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQ----PYAN 237

Query: 502 RIFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISD 323
            + + GY  V E     +   L  QP+   IA            GV+ G   DE+     
Sbjct: 238 VVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKG-GVFDGSCSDELD---- 292

Query: 322 TWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKI 158
                       HA+  VGYG          Y   Y   KNSWG+ WG  GYV+I
Sbjct: 293 ------------HALTAVGYGSS--------YGQNYITMKNSWGKNWGEQGYVRI 327


>ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578529 [Solanum tuberosum]
          Length = 893

 Score = 68.6 bits (166), Expect = 3e-09
 Identities = 63/231 (27%), Positives = 96/231 (41%), Gaps = 15/231 (6%)
 Frame = -3

Query: 796  CCWAVVTRDALYALQFLLEGRAIRG-SVQDLLDFVHLPFPPDDALYDAAGRMMIDHRGLG 620
            CCWA  + +A+ A   L   R I   S Q L+D ++  +       D   +         
Sbjct: 671  CCWAFTSTEAITAAYALKNKREIVPLSKQQLIDCMYTKYKKPSYFADLGEKECFPC---- 726

Query: 619  NSFHHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFGDRIF-LHGYGHVDEL------- 464
             S++ A  + M +G+ +E + PF+      E        RI  ++G+  V EL       
Sbjct: 727  -SYNKAYKFAMDYGITVETKYPFMEERGKCECQSEM---RIIKINGFQRVSELIKELEEK 782

Query: 463  ------DYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTWALNGT 302
                  + E +E  +R QP+    A            GVY+G   +E++ +         
Sbjct: 783  AIEKLDEKEIIEKLIRQQPITCA-ALHVPSLQLHRGKGVYMGPTENEIAQVRQKETEGQV 841

Query: 301  YPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIART 149
                 HA+L+VGYG E  VE        +YL KNSWG  WG+ GY KI R+
Sbjct: 842  VG--KHAMLIVGYGEEEGVE--------FYLVKNSWGTEWGYQGYAKIKRS 882


>ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
           gi|223538263|gb|EEF39872.1| cysteine protease, putative
           [Ricinus communis]
          Length = 340

 Score = 68.6 bits (166), Expect = 3e-09
 Identities = 62/233 (26%), Positives = 93/233 (39%), Gaps = 1/233 (0%)
 Frame = -3

Query: 847 DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR RG +T + +QG   CCWA     A  A++ ++ G  +  S Q LLD V      + 
Sbjct: 134 DWRTRGVVTPVKNQGRCGCCWAF---SAAAAVEGII-GNGVSLSAQQLLDCVPDSNGCNG 189

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFGDRIFL 491
              D A R +I ++GL ++ ++    +                CR       P  +   +
Sbjct: 190 GFMDNAFRYIIQNQGLASATYYPYQLMREM-------------CR-------PSNNAARI 229

Query: 490 HGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTWAL 311
            GY  V   D E ++  +  QP+ A +             G+   F P +          
Sbjct: 230 SGYVDVTPADEETLKSAVARQPVSAAVDATSELNFKYYGGGI---FPPQDCGST------ 280

Query: 310 NGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                  THAI +VGYG              Y+L KNSWGE WG  GY+++ R
Sbjct: 281 ------LTHAITIVGYGTSAE-------GTKYWLIKNSWGEGWGEGGYMRLQR 320


>gb|AAT07054.1| cathepsin L-like cysteine proteinase [Brugia malayi]
          Length = 368

 Score = 67.4 bits (163), Expect = 7e-09
 Identities = 69/248 (27%), Positives = 103/248 (41%), Gaps = 6/248 (2%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR  G +T++  QG    CW      AL    FL  G+ +  S+Q+LLD        DD
Sbjct: 145 DWRTSGAVTKVKDQGYCGSCWTFSAVGALKGQHFLQTGKLVELSMQNLLDC------SDD 198

Query: 670 AL--YDAAGRMMIDHRGLGNSFHHALTYVMRF-GVIIEDELPFLWWCRNLEWIDRPFGDR 500
               Y   G +M++          A  YV++  G+  E   P+  +     + +   G  
Sbjct: 199 TYGNYGCDGGLMME----------AFEYVVKNDGIDTEKSYPYQGYQNTCRYSNSTRGTT 248

Query: 499 IFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDT 320
            +        +L  E  E  L++Q  +A I                + F    +   S  
Sbjct: 249 AY------AGKLLPEGDE--LQLQAAIATIGPISVAVDAKL-----MKFYRRGIFSTSKC 295

Query: 319 WALNGTYPYYTHAILVVGYGVEN-RVENGIPYSIPYYLAKNSWGELWGFWGYVKIARTFF 143
               G      HA+L VGYG E  +++NG   S+ Y+L KNSW + WG  GY+K+AR   
Sbjct: 296 TTRMG------HALLAVGYGTEEVKLQNGTKKSVDYWLLKNSWSKRWGIGGYLKLARNQE 349

Query: 142 DGMG-GFY 122
           +  G GFY
Sbjct: 350 NMCGIGFY 357


>gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score = 67.4 bits (163), Expect = 7e-09
 Identities = 63/240 (26%), Positives = 94/240 (39%), Gaps = 8/240 (3%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR+ G +T + +QG+   CWA  T  A+  +  ++ G+ +  S Q+L+D  +       
Sbjct: 139 DWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDN------- 191

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVM-RFGVIIEDELPFLW---WCRNLEWIDRPFGD 503
                      +H   G     A  Y+M   G+  E++ P+L    +CR      +P   
Sbjct: 192 ---------TFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCRE----KQPHSK 238

Query: 502 RIFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVY---LGFQPDEVSL 332
            I + GY  V E     +   L  QP+   IA            G++    G QPD    
Sbjct: 239 VITITGYEDVPENSETSLLKALAHQPVSVGIA-AGSRDFQFYKGGIFDGECGIQPD---- 293

Query: 331 ISDTWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                          HA+  VGYG          Y   Y + KNSWG+ WG  GY +I R
Sbjct: 294 ---------------HALTAVGYGSY--------YGQDYIIMKNSWGKNWGEQGYFRIRR 330


>ref|XP_005355370.1| PREDICTED: testin-2-like isoform X2 [Microtus ochrogaster]
          Length = 320

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 74/263 (28%), Positives = 99/263 (37%), Gaps = 13/263 (4%)
 Frame = -3

Query: 856 SVP---DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHL 689
           SVP   DWR RG +T + +QG     WA     AL    F   GR I  S Q+LLD +  
Sbjct: 100 SVPKSVDWRERGFVTPVKNQGHCASSWAFSATGALEGQMFRKTGRLIPLSEQNLLDCMDF 159

Query: 688 PFPPDDALYDAAGRMMIDHRGLGNSFHHALTYVMRFG-VIIEDELPFLWWCRNLEWIDRP 512
                           + H   G     A  YV   G +  E   P+           + 
Sbjct: 160 N---------------VTHGCRGGFMQRAFQYVKDSGGLATEKSYPY-----------KG 193

Query: 511 FGDRIFLH-GYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVS 335
            G     H G    +  D+ +V    +   LM  +A                     + S
Sbjct: 194 LGGECRFHAGISAANVSDFVQVPGCEKA--LMKAVAKVGPISVAV------------DAS 239

Query: 334 LISDTWALNGTY-------PYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGF 176
             S  +  NGTY        +  HA+LVVGYG E    NG  Y    +L KNSWGE WG 
Sbjct: 240 HSSFRFYENGTYYEPQCRRVHLNHAVLVVGYGFEGEESNGNSY----WLVKNSWGEEWGM 295

Query: 175 WGYVKIARTFFDGMGGFYFSEQP 107
            GYVK+A+ + +  G   ++  P
Sbjct: 296 RGYVKMAKDWNNNCGIATYATYP 318


>ref|XP_005355369.1| PREDICTED: testin-2-like isoform X1 [Microtus ochrogaster]
          Length = 333

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 74/263 (28%), Positives = 99/263 (37%), Gaps = 13/263 (4%)
 Frame = -3

Query: 856 SVP---DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHL 689
           SVP   DWR RG +T + +QG     WA     AL    F   GR I  S Q+LLD +  
Sbjct: 113 SVPKSVDWRERGFVTPVKNQGHCASSWAFSATGALEGQMFRKTGRLIPLSEQNLLDCMDF 172

Query: 688 PFPPDDALYDAAGRMMIDHRGLGNSFHHALTYVMRFG-VIIEDELPFLWWCRNLEWIDRP 512
                           + H   G     A  YV   G +  E   P+           + 
Sbjct: 173 N---------------VTHGCRGGFMQRAFQYVKDSGGLATEKSYPY-----------KG 206

Query: 511 FGDRIFLH-GYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVS 335
            G     H G    +  D+ +V    +   LM  +A                     + S
Sbjct: 207 LGGECRFHAGISAANVSDFVQVPGCEKA--LMKAVAKVGPISVAV------------DAS 252

Query: 334 LISDTWALNGTY-------PYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGF 176
             S  +  NGTY        +  HA+LVVGYG E    NG  Y    +L KNSWGE WG 
Sbjct: 253 HSSFRFYENGTYYEPQCRRVHLNHAVLVVGYGFEGEESNGNSY----WLVKNSWGEEWGM 308

Query: 175 WGYVKIARTFFDGMGGFYFSEQP 107
            GYVK+A+ + +  G   ++  P
Sbjct: 309 RGYVKMAKDWNNNCGIATYATYP 331


>ref|XP_004953685.1| PREDICTED: xylem cysteine proteinase 1-like [Setaria italica]
          Length = 358

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 62/237 (26%), Positives = 93/237 (39%), Gaps = 5/237 (2%)
 Frame = -3

Query: 847 DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR +G +T + +QG    CWA  T  A+  +  +  G+ +  S Q+L+D          
Sbjct: 138 DWRYKGAVTPVKNQGKCGSCWAFSTVAAVEGINQIDTGKLVSLSEQELMDC--------- 188

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVM-RFGVIIEDELPFLW---WCRNLEWIDRPFGD 503
                     +DH   G     A  ++M   G+  E++ P+L    +C+  +    P   
Sbjct: 189 -------DTTLDHGCGGGIMDFAFAFIMGNQGIHTEEDYPYLMEEGYCKERQ----PHAS 237

Query: 502 RIFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISD 323
            + + GY  V E     +   L  QP+   IA            GV+ G   DE+     
Sbjct: 238 VVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKG-GVFDGACSDELD---- 292

Query: 322 TWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                       HA+  VGYG          Y   Y + KNSWG+ WG  GYV+I R
Sbjct: 293 ------------HALTAVGYGSS--------YGQDYIVMKNSWGKNWGEQGYVRIKR 329


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 61/234 (26%), Positives = 98/234 (41%), Gaps = 2/234 (0%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDF-VHLPFPPD 674
           DWR+ G ++ +  QG    CW+     A+  +  ++ G  +  S Q+L+D         D
Sbjct: 124 DWRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCD 183

Query: 673 DALYDAAGRMMIDHRGLGNSFHHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFGDRIF 494
             L D A + +ID+ G+     +   Y  R  +  +D+L      R +  ID        
Sbjct: 184 GGLMDYAYQFIIDNNGIDTEEDYP--YQARQLLCKKDKLK-----RRVVTID-------- 228

Query: 493 LHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTWA 314
             GY  V   D +++   + +QP+   I             G++ G  P   SL      
Sbjct: 229 --GYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLYSK-GIFTG--PCSTSL------ 277

Query: 313 LNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                    HA+L+VGYG EN V+        Y++ KNSWG+ WG  GY+ + R
Sbjct: 278 --------DHAVLIVGYGSENGVD--------YWIVKNSWGKYWGMNGYIHMLR 315


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
           gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
           proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 62/234 (26%), Positives = 95/234 (40%), Gaps = 2/234 (0%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVH-LPFPPD 674
           DWR+ G +T +  QG    CW+     A+  +  ++ G  I  S Q+L+D          
Sbjct: 119 DWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCG 178

Query: 673 DALYDAAGRMMIDHRGLGNSFHHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFGDRIF 494
             L D A + +I + G+     +   Y  R G   +D+L      RN+  ID        
Sbjct: 179 GGLMDYAYQFVISNHGIDTENDYP--YQARDGSCRKDKLQ-----RNVVTID-------- 223

Query: 493 LHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTWA 314
             GY  +   D  ++   +  QP+   I             G++ G  P   SL      
Sbjct: 224 --GYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSK-GIFSG--PCSTSL------ 272

Query: 313 LNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                    HA+L+VGYG EN V+        Y++ KNSWG+ WG  GY+ + R
Sbjct: 273 --------DHAVLIVGYGSENGVD--------YWIVKNSWGKSWGMDGYMHMQR 310


>ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
           gi|42408029|dbj|BAD09165.1| putative cysteine proteinase
           [Oryza sativa Japonica Group]
           gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa
           Japonica Group] gi|215737450|dbj|BAG96580.1| unnamed
           protein product [Oryza sativa Japonica Group]
           gi|215765786|dbj|BAG87483.1| unnamed protein product
           [Oryza sativa Japonica Group]
           gi|222623551|gb|EEE57683.1| hypothetical protein
           OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 62/240 (25%), Positives = 93/240 (38%), Gaps = 8/240 (3%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR+ G +T + +QG+   CWA  T  A+  +  ++ G+ +  S Q+L+D  +       
Sbjct: 148 DWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDN------- 200

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVM-RFGVIIEDELPFLW---WCRNLEWIDRPFGD 503
                      +H   G     A  Y+M   G+  E++ P+L    +CR      +P   
Sbjct: 201 ---------TFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCRE----KQPHSK 247

Query: 502 RIFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVY---LGFQPDEVSL 332
            I + GY  V       +   L  QP+   IA            G++    G QPD    
Sbjct: 248 VITITGYEDVPANSETSLLKALAHQPVSVGIA-AGSRDFQFYKGGIFDGECGIQPD---- 302

Query: 331 ISDTWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                          HA+  VGYG          Y   Y + KNSWG+ WG  GY +I R
Sbjct: 303 ---------------HALTAVGYGSY--------YGQDYIIMKNSWGKNWGEQGYFRIRR 339


>ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
           gi|270001246|gb|EEZ97693.1| cathepsin L precursor
           [Tribolium castaneum]
          Length = 343

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 67/238 (28%), Positives = 92/238 (38%), Gaps = 6/238 (2%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR  G +T + +QG    CWA     AL    F   GR +  S Q+L+D          
Sbjct: 129 DWREVGAVTPVKNQGSCAGCWAFSAAGALEGHNFRKTGRLVELSPQNLIDC--------S 180

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVMRF-GVIIEDELPFLWWCRNLEWIDRPFGDRIF 494
             Y   G         G   + A  YV    G+  ED  P+    RN     RP     +
Sbjct: 181 TNYGNDGCS-------GGLMNPAYEYVRTNPGIDTEDSYPYE--ARNGPCRFRPETVGAY 231

Query: 493 LHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYL----GFQPDEVSLIS 326
             GY  + E D + +E  +     ++               G+Y     G +PD+V+   
Sbjct: 232 CTGYVDIAEGDEQGLEAAIATLGPVSAAMDAGRQSFQFYSDGIYYDPQCGNRPDDVN--- 288

Query: 325 DTWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                        HA+LVVGYG E       P    Y+L KNS+G  WG  GYVK+A+
Sbjct: 289 -------------HAVLVVGYGTE-------PNGQKYWLVKNSYGPQWGIGGYVKLAK 326


>ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
           gi|241936792|gb|EES09937.1| hypothetical protein
           SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score = 64.7 bits (156), Expect = 4e-08
 Identities = 62/242 (25%), Positives = 95/242 (39%), Gaps = 3/242 (1%)
 Frame = -3

Query: 847 DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR RG +T I  QG    CWA  T  A+  +  +  G+ +  S Q+L+D        DD
Sbjct: 143 DWRQRGAVTGIKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDC-------DD 195

Query: 670 ALYDAAGRMMIDHRGL-GNSFHHALTYVMRFG-VIIEDELPFLWWCRNLEWIDRPFGDRI 497
                     +D++G  G    +A  Y+ R G +  E   P+L   R+         D +
Sbjct: 196 ----------VDNQGCNGGLMDYAFQYIKRNGGITTESNYPYLAEQRSCNKAKERSHD-V 244

Query: 496 FLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTW 317
            + GY  V   + + ++  +  QP+   I             GV+ G    E+       
Sbjct: 245 TIDGYEDVPANNEDALQKAVANQPVSIAIEASGQDFQFYSE-GVFTGSCGTELD------ 297

Query: 316 ALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIARTFFDG 137
                     H +  VGYG+             Y++ KNSWGE WG  GY+++ R   D 
Sbjct: 298 ----------HGVAAVGYGITRD-------GTKYWIVKNSWGEDWGERGYIRMQRGISDS 340

Query: 136 MG 131
            G
Sbjct: 341 QG 342


>gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CAL1 [Diabrotica virgifera
           virgifera]
          Length = 322

 Score = 64.7 bits (156), Expect = 4e-08
 Identities = 64/235 (27%), Positives = 94/235 (40%), Gaps = 3/235 (1%)
 Frame = -3

Query: 847 DWR-RGCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR +G +  +  QG    CWA     +L    +++ G++   S Q+LLD   + +   D
Sbjct: 113 DWRQKGAVLGVKDQGQCGSCWAFSATGSLEGQNYIVNGKSEPLSEQELLD-CSVEYGNGD 171

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVMRFGVIIEDELPFLWWCRNLEWIDRPFGDRIFL 491
              D  G M +           A  +V   G++ E   P+      ++   R   D+  L
Sbjct: 172 C--DEGGLMTL-----------AFEFVEENGIVSEASYPY----EAIQGDCRTTNDKAVL 214

Query: 490 HGYGHVDELDYEEVEWRL--RIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISDTW 317
           H  G+ +    EE   +    + P+ A I            SG+Y            D  
Sbjct: 215 HIQGYNEVYPSEEALRQAVGTVGPISAAI---WAEPIQFFSSGIY-----------DDPN 260

Query: 316 ALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
            LN    Y  H ILVVGYG EN          PY++ KNSWG  WG  GY ++ R
Sbjct: 261 CLNYV-EYLDHGILVVGYGEEN--------GTPYWIVKNSWGATWGEEGYFRLKR 306


>ref|XP_006415025.1| hypothetical protein EUTSA_v10007664mg [Eutrema salsugineum]
           gi|557092796|gb|ESQ33378.1| hypothetical protein
           EUTSA_v10007664mg [Eutrema salsugineum]
          Length = 438

 Score = 64.3 bits (155), Expect = 6e-08
 Identities = 64/246 (26%), Positives = 99/246 (40%), Gaps = 7/246 (2%)
 Frame = -3

Query: 847 DWRRG--CITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPD 674
           DWR G   + R+  QGD   CWA     A+  +  +  G  I  S Q+L+D         
Sbjct: 194 DWREGGAVVPRVKKQGDCGSCWAFAATAAVEGINQITTGELISLSEQELID--------- 244

Query: 673 DALYDAAGRMMIDHRGLGNSFHHALTYVMRFGVIIEDEL-PFLW----WCRNLEWIDRPF 509
                   R   +   +G     A  +++R G I+ DE+ P++      C+ +E     F
Sbjct: 245 ------CDRGNGNFGCVGGGAVWAFEFIVRNGGIVTDEVYPYVGNTSAACKAIEMPTTRF 298

Query: 508 GDRIFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLI 329
              + + G+  V E D   +   +  QP+   I+            GVY G         
Sbjct: 299 ---VTIDGHEVVPEKDEMSLMKAVAHQPVSVMISAANMSDYTS---GVYKG-------PC 345

Query: 328 SDTWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIART 149
           S+ W          H +L+VGYG  +  E G      Y+L +NSWG  WG  GY+++ R 
Sbjct: 346 SNLWG--------DHNVLIVGYGTSS--EEG-----DYWLIRNSWGPEWGEGGYLRLQRN 390

Query: 148 FFDGMG 131
           F +  G
Sbjct: 391 FHESTG 396


>ref|XP_006647791.1| PREDICTED: xylem cysteine proteinase 1-like [Oryza brachyantha]
          Length = 369

 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 63/237 (26%), Positives = 96/237 (40%), Gaps = 5/237 (2%)
 Frame = -3

Query: 847 DWRR-GCITRIMHQGDTNCCWAVVTRDALYALQFLLEGRAIRGSVQDLLDFVHLPFPPDD 671
           DWR+ G +T + +QG+   CWA  T  A+  +  ++ G+ I  S Q+L+D        D+
Sbjct: 151 DWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLISLSEQELMDC-------DN 203

Query: 670 ALYDAAGRMMIDHRGLGNSFHHALTYVM-RFGVIIEDELPFLW---WCRNLEWIDRPFGD 503
                 G  ++D          A  Y+M   G+  E++ P+L    +CR      +P   
Sbjct: 204 TFNHGCGGGLMD---------FAFAYIMGNQGIHTEEDYPYLMEEGYCRE----KQPHSK 250

Query: 502 RIFLHGYGHVDELDYEEVEWRLRIQPLMAEIAXXXXXXXXXXXSGVYLGFQPDEVSLISD 323
            + + GY  V E     +   L  QP+   IA            G++ G    E  +  D
Sbjct: 251 VVTITGYEDVPENSEASLLKALAHQPVSVGIA-AGSRDFQFYKGGIFDG----ECGVRLD 305

Query: 322 TWALNGTYPYYTHAILVVGYGVENRVENGIPYSIPYYLAKNSWGELWGFWGYVKIAR 152
                       HA+  VGYG          Y   Y + KNSWG+ WG  GY +I R
Sbjct: 306 ------------HALTAVGYGSY--------YGQDYIVMKNSWGKKWGEQGYFRIRR 342


Top