BLASTX nr result

ID: Perilla23_contig00018229

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00018229
         (783 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011095837.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   404   e-110
ref|XP_012849177.1| PREDICTED: aspartic proteinase nepenthesin-1...   370   e-100
ref|XP_002265771.3| PREDICTED: aspartic proteinase nepenthesin-2...   369   2e-99
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              369   2e-99
emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   369   2e-99
gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]       327   6e-87
ref|XP_012486822.1| PREDICTED: aspartic proteinase nepenthesin-2...   302   2e-79
gb|KJB10346.1| hypothetical protein B456_001G196900 [Gossypium r...   302   2e-79
ref|XP_007022806.1| Eukaryotic aspartyl protease family protein,...   300   8e-79
ref|XP_012463657.1| PREDICTED: aspartic proteinase nepenthesin-1...   295   3e-77
ref|XP_010064103.1| PREDICTED: aspartic proteinase CDR1 [Eucalyp...   295   3e-77
ref|XP_007049083.1| Eukaryotic aspartyl protease family protein,...   294   4e-77
gb|KHG15209.1| Asparticase nepenthesin-1 [Gossypium arboreum]         294   5e-77
gb|KDO61509.1| hypothetical protein CISIN_1g046757mg [Citrus sin...   293   7e-77
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...   293   7e-77
ref|XP_004293837.1| PREDICTED: aspartic proteinase CDR1-like [Fr...   287   7e-75
ref|XP_010092446.1| Aspartic proteinase nepenthesin-1 [Morus not...   286   9e-75
ref|XP_010260839.1| PREDICTED: aspartic proteinase nepenthesin-1...   281   3e-73
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...   281   3e-73
ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prun...   281   3e-73

>ref|XP_011095837.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Sesamum
           indicum]
          Length = 488

 Score =  404 bits (1038), Expect = e-110
 Identities = 197/261 (75%), Positives = 218/261 (83%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDGSA +GLFA E VTF L+N RK R+ NVLVGCSES+ GQS +  DGV+GLGYS+YS 
Sbjct: 217 YSDGSAALGLFANEMVTFTLTNRRKTRLRNVLVGCSESTRGQSFQGADGVMGLGYSDYSF 276

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A+K A +FGGKFSYCLVDHLSP NVSSYLIFGSHK V I+  RMRYTEL+LGVI PFYAV
Sbjct: 277 AVKAAKRFGGKFSYCLVDHLSPENVSSYLIFGSHKEVGITYRRMRYTELLLGVITPFYAV 336

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
            IKGISIGG ML IP ETWNL G GG I+DSG+SLT LTQ AY PVMAAL  SL  F+ L
Sbjct: 337 KIKGISIGGLMLDIPPETWNLTGQGGAIIDSGSSLTGLTQKAYQPVMAALKLSLLNFKNL 396

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
           +LDIGPLEYCFNSTGFN+++VPRL FHF DGA F+PPVKSYVIDAAP VKCLGF P + P
Sbjct: 397 NLDIGPLEYCFNSTGFNESVVPRLVFHFEDGARFEPPVKSYVIDAAPAVKCLGFVPLSWP 456

Query: 63  GASVIGNIMQQNHFWEFDIAN 1
           GASVIGNIMQQNH WEFD+AN
Sbjct: 457 GASVIGNIMQQNHLWEFDLAN 477


>ref|XP_012849177.1| PREDICTED: aspartic proteinase nepenthesin-1 [Erythranthe guttatus]
            gi|604314897|gb|EYU27603.1| hypothetical protein
            MIMGU_mgv1a004950mg [Erythranthe guttata]
          Length = 503

 Score =  370 bits (951), Expect = e-100
 Identities = 180/261 (68%), Positives = 209/261 (80%)
 Frame = -1

Query: 783  YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
            YSDGSA  GLF  ETVT  L+NGRK R+ NVL+GCS SS G + ++ DGVIGLGYSNYSL
Sbjct: 235  YSDGSAAQGLFGNETVTLSLTNGRKTRLHNVLIGCSISSSGPTFQSADGVIGLGYSNYSL 294

Query: 603  ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
            A+K ++ F G FSYCLVDHLSP+N+SSYL FGS K    +   M YT L+L VINPFYAV
Sbjct: 295  AVKASNLFRGIFSYCLVDHLSPKNISSYLTFGSAKQQTDT---MHYTALILDVINPFYAV 351

Query: 423  NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
            ++ GISIGGSML IPAE W++ G+GGVI+DSGTSLT L  PAY PVMAAL  SL  F++L
Sbjct: 352  SMNGISIGGSMLDIPAEVWDVKGSGGVILDSGTSLTSLVGPAYRPVMAALTASLSGFEKL 411

Query: 243  DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
             LD+GPLEYCFNSTGF +++VPRL FHF DGA F+PPVKSYVIDAAPGVKCLGF   A P
Sbjct: 412  GLDVGPLEYCFNSTGFVESVVPRLVFHFGDGARFEPPVKSYVIDAAPGVKCLGFVGGAWP 471

Query: 63   GASVIGNIMQQNHFWEFDIAN 1
            G SV+GNIMQQN+FWEFD+ N
Sbjct: 472  GVSVVGNIMQQNYFWEFDLVN 492


>ref|XP_002265771.3| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 489

 Score =  369 bits (946), Expect = 2e-99
 Identities = 174/259 (67%), Positives = 207/259 (79%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDGS  +G FA ETVT  L  GRK ++ NVL+GCSES  GQS +A DGV+GLGYS YS 
Sbjct: 218 YSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSF 277

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A+K A KFGGKFSYCLVDHLS +NVS+YL FGS +     +  M YTELVLG++N FYAV
Sbjct: 278 AIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAV 337

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GISIGG+ML+IP+E W++ G GG I+DSG+SLT LT+PAY PVMAAL  SL  F+++
Sbjct: 338 NMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV 397

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
           ++DIGPLEYCFNSTGF ++LVPRL FHFADGA F+PPVKSYVI AA GV+CLGF   A P
Sbjct: 398 EMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 457

Query: 63  GASVIGNIMQQNHFWEFDI 7
           G SV+GNIMQQNH WEFD+
Sbjct: 458 GTSVVGNIMQQNHLWEFDL 476


>emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  369 bits (946), Expect = 2e-99
 Identities = 174/259 (67%), Positives = 207/259 (79%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDGS  +G FA ETVT  L  GRK ++ NVL+GCSES  GQS +A DGV+GLGYS YS 
Sbjct: 107 YSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSF 166

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A+K A KFGGKFSYCLVDHLS +NVS+YL FGS +     +  M YTELVLG++N FYAV
Sbjct: 167 AIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAV 226

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GISIGG+ML+IP+E W++ G GG I+DSG+SLT LT+PAY PVMAAL  SL  F+++
Sbjct: 227 NMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV 286

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
           ++DIGPLEYCFNSTGF ++LVPRL FHFADGA F+PPVKSYVI AA GV+CLGF   A P
Sbjct: 287 EMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 346

Query: 63  GASVIGNIMQQNHFWEFDI 7
           G SV+GNIMQQNH WEFD+
Sbjct: 347 GTSVVGNIMQQNHLWEFDL 365


>emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  369 bits (946), Expect = 2e-99
 Identities = 174/259 (67%), Positives = 207/259 (79%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDGS  +G FA ETVT  L  GRK ++ NVL+GCSES  GQS +A DGV+GLGYS YS 
Sbjct: 178 YSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSF 237

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A+K A KFGGKFSYCLVDHLS +NVS+YL FGS +     +  M YTELVLG++N FYAV
Sbjct: 238 AIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAV 297

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GISIGG+ML+IP+E W++ G GG I+DSG+SLT LT+PAY PVMAAL  SL  F+++
Sbjct: 298 NMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV 357

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
           ++DIGPLEYCFNSTGF ++LVPRL FHFADGA F+PPVKSYVI AA GV+CLGF   A P
Sbjct: 358 EMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 417

Query: 63  GASVIGNIMQQNHFWEFDI 7
           G SV+GNIMQQNH WEFD+
Sbjct: 418 GTSVVGNIMQQNHLWEFDL 436


>gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]
          Length = 449

 Score =  327 bits (838), Expect = 6e-87
 Identities = 160/265 (60%), Positives = 206/265 (77%), Gaps = 4/265 (1%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGR-KRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYS 607
           Y+DGS+  G+FA ETV  +L+ GR K R++NVL+GC+++  G S +  DGV+GLGYSN+S
Sbjct: 174 YADGSSAEGIFAGETVELKLAKGRGKARLQNVLIGCTKNFSGSSFQTSDGVLGLGYSNFS 233

Query: 606 LALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMR--MRYTELVLGVINPF 433
            A   A +FG KFSYCL+DHL+ +N SSY+ F S + ++ S+    +RYT+LVLGVI   
Sbjct: 234 FAHAAAARFGDKFSYCLLDHLAAKNKSSYITFSSGRSISASISAGPIRYTDLVLGVIGSN 293

Query: 432 YAVNIKGISIGGSMLQIPAETWN-LNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQA 256
           YAVN++GISIGGS L+IP++TWN L+G+GGVI+DSG+SLT L  PAY PV+AALN SL  
Sbjct: 294 YAVNVRGISIGGSWLRIPSDTWNNLSGSGGVIIDSGSSLTALAPPAYAPVIAALNRSLAR 353

Query: 255 FQRLDLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAP 76
           F    + IGP+E CFNSTGF++++VP+L  HFA G  F+PPVKSYVIDAAPGV CLGF  
Sbjct: 354 FGDPHVKIGPMECCFNSTGFHESVVPKLAIHFAGGTRFEPPVKSYVIDAAPGVVCLGFVQ 413

Query: 75  AASPGASVIGNIMQQNHFWEFDIAN 1
           AASPG SVIGNI+QQNH+WEFD+ N
Sbjct: 414 AASPGVSVIGNILQQNHWWEFDLGN 438


>ref|XP_012486822.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Gossypium
           raimondii]
          Length = 490

 Score =  302 bits (774), Expect = 2e-79
 Identities = 143/263 (54%), Positives = 192/263 (73%), Gaps = 2/263 (0%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDG+  +G+F  +TV  RL+NG+K +V +V++GCSE+  G   +  DGV+GLG+  +S 
Sbjct: 217 YSDGTKVLGIFGNDTVIVRLTNGKKIKVPDVMIGCSETIFGNFHDI-DGVMGLGFDQHSF 275

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A+K A KFG KFSYCLVDHLSP ++ ++L+FG     + ++ +M+YTEL+LG++NP+YAV
Sbjct: 276 AVKAAEKFGNKFSYCLVDHLSPSDLVNFLVFGEVD--DSTLPKMQYTELLLGIVNPYYAV 333

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GISI G ML IP+  W+L   GG IVDSG+SLT L +P ++ V+AA    +  F++L
Sbjct: 334 NVSGISIDGEMLAIPSYAWDLKSGGGFIVDSGSSLTHLVEPVFNQVIAAFQAPISKFKKL 393

Query: 243 DLDIGPLE--YCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAA 70
            L +GP E  YCF   G+ ++L+P+LE HFADGA   PPVKSYVIDAA GVKCLGF P  
Sbjct: 394 SLSVGPSEPEYCFGDVGYKESLMPKLEVHFADGAKLTPPVKSYVIDAAEGVKCLGFVPTR 453

Query: 69  SPGASVIGNIMQQNHFWEFDIAN 1
            PG SVIGNI+QQNH WEFD+ N
Sbjct: 454 WPGPSVIGNILQQNHLWEFDLLN 476


>gb|KJB10346.1| hypothetical protein B456_001G196900 [Gossypium raimondii]
          Length = 480

 Score =  302 bits (774), Expect = 2e-79
 Identities = 143/263 (54%), Positives = 192/263 (73%), Gaps = 2/263 (0%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDG+  +G+F  +TV  RL+NG+K +V +V++GCSE+  G   +  DGV+GLG+  +S 
Sbjct: 207 YSDGTKVLGIFGNDTVIVRLTNGKKIKVPDVMIGCSETIFGNFHDI-DGVMGLGFDQHSF 265

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A+K A KFG KFSYCLVDHLSP ++ ++L+FG     + ++ +M+YTEL+LG++NP+YAV
Sbjct: 266 AVKAAEKFGNKFSYCLVDHLSPSDLVNFLVFGEVD--DSTLPKMQYTELLLGIVNPYYAV 323

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GISI G ML IP+  W+L   GG IVDSG+SLT L +P ++ V+AA    +  F++L
Sbjct: 324 NVSGISIDGEMLAIPSYAWDLKSGGGFIVDSGSSLTHLVEPVFNQVIAAFQAPISKFKKL 383

Query: 243 DLDIGPLE--YCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAA 70
            L +GP E  YCF   G+ ++L+P+LE HFADGA   PPVKSYVIDAA GVKCLGF P  
Sbjct: 384 SLSVGPSEPEYCFGDVGYKESLMPKLEVHFADGAKLTPPVKSYVIDAAEGVKCLGFVPTR 443

Query: 69  SPGASVIGNIMQQNHFWEFDIAN 1
            PG SVIGNI+QQNH WEFD+ N
Sbjct: 444 WPGPSVIGNILQQNHLWEFDLLN 466


>ref|XP_007022806.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
           cacao] gi|508722434|gb|EOY14331.1| Eukaryotic aspartyl
           protease family protein, putative [Theobroma cacao]
          Length = 473

 Score =  300 bits (768), Expect = 8e-79
 Identities = 147/259 (56%), Positives = 189/259 (72%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           Y DGS  MG+FAKE+VT  L+N R  R+ +VL+GCS+SS G++++  DGV+GL  S YS 
Sbjct: 200 YIDGSDAMGVFAKESVTVGLTNSRMARLHDVLIGCSDSSQGRTVKNVDGVLGLANSKYSF 259

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
             K A ++GGKFSYCLVDHLS  N S+YLIFG++ +    +   RYT L L +++  YAV
Sbjct: 260 VTKAAERWGGKFSYCLVDHLSHINASNYLIFGANNNQLTVLGNTRYTRLELNLVSFSYAV 319

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N++GISIGG ML IP + W+    GG I+DSGTSL+ LT PAY PVMAA+  S+  + ++
Sbjct: 320 NVQGISIGGKMLDIPLQVWDTRKGGGTILDSGTSLSFLTDPAYQPVMAAIKMSVSKYPQV 379

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
            L   P+EYCFNSTGF++TLVP+L  HFADGA F+P  +SYVI AA GV+CLGF PA  P
Sbjct: 380 KLHGVPMEYCFNSTGFDETLVPKLIIHFADGARFEPHWRSYVISAADGVRCLGFLPARFP 439

Query: 63  GASVIGNIMQQNHFWEFDI 7
             SVIGNIMQQN+ WEFD+
Sbjct: 440 SVSVIGNIMQQNYLWEFDL 458


>ref|XP_012463657.1| PREDICTED: aspartic proteinase nepenthesin-1 [Gossypium raimondii]
           gi|763814626|gb|KJB81478.1| hypothetical protein
           B456_013G147300 [Gossypium raimondii]
          Length = 473

 Score =  295 bits (754), Expect = 3e-77
 Identities = 146/259 (56%), Positives = 184/259 (71%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDGSA MG+FA ETV+  L+NGRK R+ NVL+GC++S  G +L+  DG++GL  + YS 
Sbjct: 200 YSDGSAAMGVFANETVSAGLTNGRKTRLHNVLIGCTDSFQGPTLQNVDGIMGLANTKYSF 259

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A   A  FGGKFSYCLVDHLS  N ++Y+IFG++++        R+T+L L  I  FYAV
Sbjct: 260 ATNAAATFGGKFSYCLVDHLSHLNATNYIIFGTNRNQVKVSGNTRHTQLELDAIPSFYAV 319

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GIS+G  ML+IP + W+ +  GG I+DSGTSLT L  PAY  VM AL  S+  +QR+
Sbjct: 320 NVIGISVGNKMLEIPMQVWDASVGGGTIIDSGTSLTFLADPAYQAVMEALKVSVSKYQRV 379

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
            LD  P+EYCFNS GFN +LVP+L  HF DGA F+P   SYVI AA GV+CLGF PA  P
Sbjct: 380 KLDGVPMEYCFNSEGFNGSLVPKLIIHFNDGARFEPHWNSYVIAAAAGVRCLGFLPARFP 439

Query: 63  GASVIGNIMQQNHFWEFDI 7
             SVIGNIMQQN+ WEFD+
Sbjct: 440 ALSVIGNIMQQNYLWEFDL 458


>ref|XP_010064103.1| PREDICTED: aspartic proteinase CDR1 [Eucalyptus grandis]
            gi|629105951|gb|KCW71420.1| hypothetical protein
            EUGRSUZ_F04481 [Eucalyptus grandis]
          Length = 477

 Score =  295 bits (754), Expect = 3e-77
 Identities = 152/269 (56%), Positives = 194/269 (72%), Gaps = 10/269 (3%)
 Frame = -1

Query: 783  YSDGSATMGLFAKETVTFRLSN--GRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNY 610
            YSDGS  +G+FA+ETVT  ++N  GR  +VE+V+VGC+ +  GQ  +  DGV+GL YSNY
Sbjct: 194  YSDGSGALGIFARETVTAEITNEKGRATKVEDVVVGCTLTLQGQGFQGADGVLGLAYSNY 253

Query: 609  SLALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNIS--VMRMRYTELVLGVINP 436
            S A + +H FGG FSYCLVDHLS + +S+YL FG     + S  + RM YT+L L  + P
Sbjct: 254  SFATRASHTFGGTFSYCLVDHLSHKYLSNYLTFGYAGATSHSGLLSRMHYTKLDLASLIP 313

Query: 435  FYAVNIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSL-Q 259
            FYAVN++GISI G +L+IP+  W  +  GG I+DSGTSLT+LT+ AY PV+ AL  +L +
Sbjct: 314  FYAVNVEGISINGELLKIPSLIWAADRGGGTIIDSGTSLTILTELAYRPVVGALGEALSK 373

Query: 258  AFQRLDLDIG-PLEYCFNSTG----FNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVK 94
              +R  LD G PLEYC+NST     F D+ VPRL FHF+DGA F+PPV+SYVIDAAPGVK
Sbjct: 374  RLERTKLDGGGPLEYCYNSTNPWSRFEDSWVPRLAFHFSDGARFEPPVRSYVIDAAPGVK 433

Query: 93   CLGFAPAASPGASVIGNIMQQNHFWEFDI 7
            CLGF  A  PG SVIGNI+QQ H WEF++
Sbjct: 434  CLGFLSATWPGVSVIGNIIQQKHVWEFNL 462


>ref|XP_007049083.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
           cacao] gi|508701344|gb|EOX93240.1| Eukaryotic aspartyl
           protease family protein, putative [Theobroma cacao]
          Length = 478

 Score =  294 bits (753), Expect = 4e-77
 Identities = 138/261 (52%), Positives = 193/261 (73%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           Y+DG+  +G+F  +TV  RLS G+K +V +V+VGCSE+  G   +  DGV+GLG+  +S 
Sbjct: 209 YADGTRVVGIFGNDTVKVRLSGGQKIKVTDVMVGCSEAIRGNFHDI-DGVMGLGFDQHSF 267

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A+K A +FG KFSYCLVDHLSP N+ ++L+FG     +  +  M++T+L+LG++NP+YAV
Sbjct: 268 AVKAAKEFGDKFSYCLVDHLSPSNLVNFLVFGGV--TSSPLPNMQFTQLILGIVNPYYAV 325

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GIS+ G ML IP+  W++ G GGVI+DSG+SLT L +P +  V+AA    L  F++L
Sbjct: 326 NVSGISVNGKMLDIPSYIWDVKGDGGVIMDSGSSLTYLVKPLFDKVIAAFQAPLSKFKKL 385

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
           +L++GP +YCF++ GF ++L+P+L FHFADGA   PPVKSYVIDA   VKCLGF+  + P
Sbjct: 386 ELNLGP-DYCFSAAGFEESLMPKLAFHFADGAKLVPPVKSYVIDAEEAVKCLGFSSTSWP 444

Query: 63  GASVIGNIMQQNHFWEFDIAN 1
           G SVIGNI+QQNH WEFD+ N
Sbjct: 445 GPSVIGNILQQNHLWEFDLLN 465


>gb|KHG15209.1| Asparticase nepenthesin-1 [Gossypium arboreum]
          Length = 473

 Score =  294 bits (752), Expect = 5e-77
 Identities = 146/259 (56%), Positives = 184/259 (71%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           YSDGSA MG+FA ETV+  L+NGRK R+ NVL+GC++S  G +L+  DG++GL  + YS 
Sbjct: 200 YSDGSAAMGVFANETVSAGLTNGRKTRLHNVLIGCTDSFQGPTLQNVDGIMGLANTKYSF 259

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
           A   A  FGGKFSYCLVDHLS  N ++Y+IFG++++        R+T+L L  I  FYAV
Sbjct: 260 ATNAAATFGGKFSYCLVDHLSHLNATNYIIFGTNRNQVKVSGNTRHTKLELDAIPSFYAV 319

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           N+ GIS+G  ML+IP + W+ +  GG I+DSGTSLT L  PAY  VM AL  S+  +QR+
Sbjct: 320 NVIGISVGNKMLEIPMQVWDASEGGGTIIDSGTSLTFLADPAYQAVMEALKVSVSKYQRV 379

Query: 243 DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAASP 64
            LD  P+EYCFNSTGFN +LVP+L  HF DGA F+P   SYVI AA  V+CLGF PA  P
Sbjct: 380 KLDGVPMEYCFNSTGFNGSLVPKLIIHFDDGARFEPHWNSYVIAAAAEVRCLGFLPARFP 439

Query: 63  GASVIGNIMQQNHFWEFDI 7
             SVIGNIMQQN+ WEFD+
Sbjct: 440 ALSVIGNIMQQNYLWEFDL 458


>gb|KDO61509.1| hypothetical protein CISIN_1g046757mg [Citrus sinensis]
          Length = 445

 Score =  293 bits (751), Expect = 7e-77
 Identities = 150/262 (57%), Positives = 183/262 (69%), Gaps = 3/262 (1%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           Y+DGSA  G+F KE VT  L NG K R+E V++GCS++  GQ     DGV+GL Y  YS 
Sbjct: 175 YADGSAAKGIFGKERVTIGLENGGKTRIEEVVMGCSDTIQGQIFAEADGVLGLSYDKYSF 234

Query: 603 ALKTAHKFG---GKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPF 433
           A K  +      GKF+YCLVDHLS +NVS+YLIFG         MRMRYT  +LG+I P 
Sbjct: 235 AQKVTNGSTFARGKFAYCLVDHLSHKNVSNYLIFGEES--KRMRMRMRYT--LLGLIGPD 290

Query: 432 YAVNIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAF 253
           Y V++KGISIGG ML IP++ W+ N  GG   DSGT+LT L +PAY PV+AAL  SL  +
Sbjct: 291 YGVSVKGISIGGVMLNIPSQVWDFNRGGGTAFDSGTTLTFLAEPAYKPVVAALEMSLSRY 350

Query: 252 QRLDLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPA 73
           QRL  D  P EYCFNSTGF+++ VP+L FHFADGA F+P  KSY+I  A G++CLGF  A
Sbjct: 351 QRLKRD-APFEYCFNSTGFDESSVPKLVFHFADGARFEPHTKSYIIRVAHGIRCLGFVSA 409

Query: 72  ASPGASVIGNIMQQNHFWEFDI 7
             PGAS IGNIMQQN+FWEFD+
Sbjct: 410 TWPGASAIGNIMQQNYFWEFDL 431


>ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina]
           gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic
           proteinase nepenthesin-1-like [Citrus sinensis]
           gi|557524190|gb|ESR35557.1| hypothetical protein
           CICLE_v10004908mg [Citrus clementina]
          Length = 470

 Score =  293 bits (751), Expect = 7e-77
 Identities = 150/262 (57%), Positives = 183/262 (69%), Gaps = 3/262 (1%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           Y+DGSA  G+F KE VT  L NG K R+E V++GCS++  GQ     DGV+GL Y  YS 
Sbjct: 200 YADGSAAKGIFGKERVTIGLENGGKTRIEEVVMGCSDTIQGQIFAEADGVLGLSYDKYSF 259

Query: 603 ALKTAHKFG---GKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPF 433
           A K  +      GKF+YCLVDHLS +NVS+YLIFG         MRMRYT  +LG+I P 
Sbjct: 260 AQKVTNGSTFARGKFAYCLVDHLSHKNVSNYLIFGEES--KRMRMRMRYT--LLGLIGPD 315

Query: 432 YAVNIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAF 253
           Y V++KGISIGG ML IP++ W+ N  GG   DSGT+LT L +PAY PV+AAL  SL  +
Sbjct: 316 YGVSVKGISIGGVMLNIPSQVWDFNRGGGTAFDSGTTLTFLAEPAYKPVVAALEMSLSRY 375

Query: 252 QRLDLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPA 73
           QRL  D  P EYCFNSTGF+++ VP+L FHFADGA F+P  KSY+I  A G++CLGF  A
Sbjct: 376 QRLKRD-APFEYCFNSTGFDESSVPKLVFHFADGARFEPHTKSYIIRVAHGIRCLGFVSA 434

Query: 72  ASPGASVIGNIMQQNHFWEFDI 7
             PGAS IGNIMQQN+FWEFD+
Sbjct: 435 TWPGASAIGNIMQQNYFWEFDL 456


>ref|XP_004293837.1| PREDICTED: aspartic proteinase CDR1-like [Fragaria vesca subsp.
           vesca]
          Length = 482

 Score =  287 bits (734), Expect = 7e-75
 Identities = 147/264 (55%), Positives = 179/264 (67%), Gaps = 5/264 (1%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSES---SLGQSLEAGDGVIGLGYSN 613
           Y++ S  +G FA ETV   L+NGR+ R+ +VL+GC+ES     G S+ AGDG++GLG+  
Sbjct: 208 YAESSGALGFFANETVRVPLTNGRRARLNDVLIGCTESIEGPKGASIRAGDGILGLGFGK 267

Query: 612 YSLALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLG--VIN 439
           +S   K A   G KFSYCLVDH+S +NVSSYL FG +        RMRYT+L LG   I 
Sbjct: 268 HSFVAKAASNLGDKFSYCLVDHMSNKNVSSYLTFGRNAETAQQNSRMRYTKLALGGPKIG 327

Query: 438 PFYAVNIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQ 259
           PFYAVN+ GIS G  ML+IP E WN N  GG IVDSGTSLT LT PAY  VM  L  +L 
Sbjct: 328 PFYAVNLVGISAGSKMLKIPNEVWNENLGGGTIVDSGTSLTFLTSPAYIHVMDELTMALS 387

Query: 258 AFQRLDLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFA 79
            ++++  D    E+CFNSTG++ +LVPR   HFADGA F+PPVKSYVID A   KCLGF 
Sbjct: 388 KYKKIPSDA--FEFCFNSTGYDQSLVPRFAIHFADGAKFEPPVKSYVIDVAIQTKCLGFQ 445

Query: 78  PAASPGASVIGNIMQQNHFWEFDI 7
            A  PG  VIGNIMQQN+ WEFD+
Sbjct: 446 SAPFPGTIVIGNIMQQNYLWEFDL 469


>ref|XP_010092446.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]
           gi|587861358|gb|EXB51212.1| Aspartic proteinase
           nepenthesin-1 [Morus notabilis]
          Length = 464

 Score =  286 bits (733), Expect = 9e-75
 Identities = 142/267 (53%), Positives = 188/267 (70%), Gaps = 6/267 (2%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLG---QSLEAGDGVIGLGYSN 613
           Y +GS+ +G FA ET++ RL+NG+KR++ +VLVGC+ES  G      +  DGV+GLG+ N
Sbjct: 187 YLEGSSAIGFFANETISVRLANGKKRKLRDVLVGCTESVQGAEESGFKGADGVLGLGFGN 246

Query: 612 YSLALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMR-MRYTELVLGV-IN 439
           ++   K A  FGGKFSYCLVDHLSP+N+S+Y+IFG  K    S    +++T+LVLG    
Sbjct: 247 HTFTRKAAQYFGGKFSYCLVDHLSPKNLSNYIIFGHDKADKASCSSSLQHTDLVLGGDYG 306

Query: 438 PFYAVNIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQ 259
           PFY VN+ GISIGG +L+IP+  WN +  GG I++SGTSLT LT P Y PV + LN    
Sbjct: 307 PFYGVNLSGISIGGVLLRIPSVAWNASLGGGAILESGTSLTFLTDPVYGPVTSELNKFTS 366

Query: 258 AFQRL-DLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGF 82
            F  L     GP E+CFNSTG++++ +P L  HF++GA F+PPVKSY++D AP  KCLGF
Sbjct: 367 RFGTLLPPGGGPFEFCFNSTGYDESKMPPLRIHFSNGAIFEPPVKSYILDIAPEKKCLGF 426

Query: 81  APAASPGASVIGNIMQQNHFWEFDIAN 1
             A+ PG S+IGNIMQQNH WEFD+ N
Sbjct: 427 VSASWPGTSIIGNIMQQNHLWEFDLEN 453


>ref|XP_010260839.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Nelumbo
           nucifera]
          Length = 481

 Score =  281 bits (720), Expect = 3e-73
 Identities = 141/261 (54%), Positives = 182/261 (69%), Gaps = 2/261 (0%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLE-AGDGVIGLGYSNYS 607
           YS G +  G FA E+VT RL+NGRK ++ +VLVGC++++ GQ      DG++GLGYS  S
Sbjct: 210 YSSGQSAQGFFANESVTVRLTNGRKMKIHHVLVGCTQTTQGQKFSNVVDGILGLGYSPNS 269

Query: 606 LALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYA 427
            A K    FG KFSYCLVDHLSPRNVS+YL+FG +  VN++   M+YTEL++G + P+YA
Sbjct: 270 FATKVLQVFGSKFSYCLVDHLSPRNVSNYLVFGRNHIVNVNPPEMQYTELMVGKVLPYYA 329

Query: 426 VNIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQR 247
           VN+ GISIGG +L+IP   WNL+  GG I+DSGTSLT+L +PAY  V+ AL  +L  +++
Sbjct: 330 VNVIGISIGGVLLRIPLSVWNLDKNGGTILDSGTSLTLLVEPAYRLVIDALKVALIMYKK 389

Query: 246 LDLDIGPLEYCFN-STGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAA 70
              ++   E C      F++ LVPRL  HFA GA   PPVKSY+ID A G+KCLGF    
Sbjct: 390 --AEVPEFEVCIKVDKAFDEGLVPRLGIHFAGGARLLPPVKSYLIDVADGIKCLGFRSVF 447

Query: 69  SPGASVIGNIMQQNHFWEFDI 7
            PG SVIGNIMQQN FWE D+
Sbjct: 448 WPGISVIGNIMQQNFFWELDL 468


>ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella]
           gi|482566377|gb|EOA30566.1| hypothetical protein
           CARUB_v10013693mg [Capsella rubella]
          Length = 448

 Score =  281 bits (720), Expect = 3e-73
 Identities = 139/260 (53%), Positives = 181/260 (69%), Gaps = 1/260 (0%)
 Frame = -1

Query: 783 YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAGDGVIGLGYSNYSL 604
           Y+DGSA  G+FAKETVT  L+NGRK R+  +L+GCS S  GQS    DGV+GL +S++S 
Sbjct: 177 YADGSAAQGIFAKETVTVGLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGLAFSDFSF 236

Query: 603 ALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINPFYAV 424
                  FG KFSYCLVDHLSP+NVS+YLIFGS      +    R T L L +I PFYA+
Sbjct: 237 TSTATSLFGAKFSYCLVDHLSPKNVSNYLIFGSSSSATKNAPG-RTTPLDLTLIPPFYAI 295

Query: 423 NIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAFQRL 244
           ++ GIS+G  ML IPA+ W+    GG ++DSGTSLT+L++ AY PV+  L   L   +R+
Sbjct: 296 SVIGISLGEDMLDIPAQVWDATTGGGTVLDSGTSLTLLSEAAYKPVVTGLARYLDELERV 355

Query: 243 DLDIGPLEYCFNST-GFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPAAS 67
             +  P+EYCF+ST GFN++ +P+L FH   GA F+P  KSY+ID APGVKCLGF  A +
Sbjct: 356 KPEGVPIEYCFSSTSGFNESKLPQLTFHMKGGARFEPHRKSYLIDTAPGVKCLGFMSAGT 415

Query: 66  PGASVIGNIMQQNHFWEFDI 7
           P  +V+GNIMQQN+ WEFD+
Sbjct: 416 PATNVVGNIMQQNYLWEFDL 435


>ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prunus persica]
            gi|462407712|gb|EMJ13046.1| hypothetical protein
            PRUPE_ppa004710mg [Prunus persica]
          Length = 495

 Score =  281 bits (720), Expect = 3e-73
 Identities = 139/262 (53%), Positives = 182/262 (69%), Gaps = 3/262 (1%)
 Frame = -1

Query: 783  YSDGSATMGLFAKETVTFRLSNGRKRRVENVLVGCSESSLGQSLEAG-DGVIGLGYSNYS 607
            Y +GS+ +G F  + V   LSNGR+ R+++VL+GC+ES +G+    G DG++GLG+  YS
Sbjct: 223  YVEGSSALGTFGTDIVRASLSNGRRNRMKDVLIGCTESIIGKGTAKGSDGILGLGFGKYS 282

Query: 606  LALKTAHKFGGKFSYCLVDHLSPRNVSSYLIFGSHKHVNISVMRMRYTELVLGVINP--F 433
               K A K+GGK SYCL+DH+SP+NV+SYL FG +K   +   +MRYT+LV G  N   F
Sbjct: 283  FTTKAALKYGGKVSYCLLDHMSPKNVTSYLTFGDNKKAVLQG-KMRYTQLVFGNPNKGSF 341

Query: 432  YAVNIKGISIGGSMLQIPAETWNLNGTGGVIVDSGTSLTVLTQPAYHPVMAALNGSLQAF 253
            Y VN++GIS+GG ML IP   WN    GG +VDSG SLT LT+PAY PVM AL   L  F
Sbjct: 342  YGVNLQGISVGGKMLNIPLHIWNPKLGGGALVDSGMSLTFLTKPAYKPVMTALTMPLTKF 401

Query: 252  QRLDLDIGPLEYCFNSTGFNDTLVPRLEFHFADGASFQPPVKSYVIDAAPGVKCLGFAPA 73
            +RL  +    ++CF+  G+ D LVP+L FHFA GA F PPVKSYVID +PG+KC+G  P 
Sbjct: 402  RRLRSEEDDFDFCFDPRGYRDRLVPKLVFHFAGGAKFAPPVKSYVIDVSPGMKCIGILPL 461

Query: 72   ASPGASVIGNIMQQNHFWEFDI 7
            A  GA +IGNI+QQNH WEF++
Sbjct: 462  AE-GACIIGNIIQQNHLWEFNL 482

