BLASTX nr result

ID: Mentha28_contig00004441 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00004441
         (1214 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU27603.1| hypothetical protein MIMGU_mgv1a004950mg [Mimulus...   447   e-123
emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   435   e-119
ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2...   434   e-119
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              434   e-119
ref|XP_007049083.1| Eukaryotic aspartyl protease family protein,...   378   e-102
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...   375   e-101
gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]       370   e-100
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...   368   2e-99
gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]    367   7e-99
ref|XP_007022806.1| Eukaryotic aspartyl protease family protein,...   362   2e-97
ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arab...   356   1e-95
ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutr...   350   9e-94
ref|NP_187876.2| aspartyl protease family protein [Arabidopsis t...   345   2e-92
gb|AAL49921.1| unknown protein [Arabidopsis thaliana]                 345   2e-92
ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1...   345   3e-92
ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative...   340   7e-91
ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citr...   334   5e-89
ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prun...   328   4e-87
ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group] g...   319   1e-84
ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2...   315   2e-83

>gb|EYU27603.1| hypothetical protein MIMGU_mgv1a004950mg [Mimulus guttatus]
          Length = 503

 Score =  447 bits (1150), Expect = e-123
 Identities = 236/374 (63%), Positives = 267/374 (71%), Gaps = 5/374 (1%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXG-TXXXXXXXX 1037
            I S AD+G G Y V+ RVGSPAQK+ LIADTGSDLTW N          G          
Sbjct: 130  ISSGADFGTGQYFVQFRVGSPAQKVVLIADTGSDLTWMNCKYRCRGGGGGGCRRNSNKRR 189

Query: 1036 XXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANET 857
                      RTVPCSST C  DLANLF           CAYDYRYSDGSA  GLF NET
Sbjct: 190  LFWADRSSSFRTVPCSSTTCTNDLANLFSLTRCPSPISPCAYDYRYSDGSAAQGLFGNET 249

Query: 856  VTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677
            VT  L+NGRK R+H+VL+GCS SS G +F +ADGV+GLGYSNYS AV+A++ F G FSYC
Sbjct: 250  VTLSLTNGRKTRLHNVLIGCSISSSGPTFQSADGVIGLGYSNYSLAVKASNLFRGIFSYC 309

Query: 676  LVDHLSPNNLSSYLIFGSQPQHT-RMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDT 500
            LVDHLSP N+SSYL FGS  Q T  M YT L+L V+NPFYAV++ GISIGG+MLDIP + 
Sbjct: 310  LVDHLSPKNISSYLTFGSAKQQTDTMHYTALILDVINPFYAVSMNGISIGGSMLDIPAEV 369

Query: 499  WDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSSAGFN 329
            WD+ G GG I+DSGTSLT L  PAY+ V+AAL   L   E++ L +GPLEYCFNS+ GF 
Sbjct: 370  WDVKGSGGVILDSGTSLTSLVGPAYRPVMAALTASLSGFEKLGLDVGPLEYCFNST-GFV 428

Query: 328  ETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEF 149
            E+ VPRLV HF DGARFEPPVKSYVIDAAPGVKCLGF   AWPGVSVVGNIMQQN+ WEF
Sbjct: 429  ESVVPRLVFHFGDGARFEPPVKSYVIDAAPGVKCLGFVGGAWPGVSVVGNIMQQNYFWEF 488

Query: 148  DIVKSRLGFAPSSC 107
            D+V  RLGF  SSC
Sbjct: 489  DLVNKRLGFGSSSC 502


>emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  435 bits (1118), Expect = e-119
 Identities = 223/378 (58%), Positives = 265/378 (70%), Gaps = 9/378 (2%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            +H AADYG G Y V  +VG+P+QK  L+ADTGSDLTW +                     
Sbjct: 72   MHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131

Query: 1033 XXXXXXXXXR--TVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860
                        T+PC + +CK++L +LF           C YDYRYSDGS  +G FANE
Sbjct: 132  RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191

Query: 859  TVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSY 680
            TVT  L  GRK ++H+VL+GCSES +GQSF AADGVMGLGYS YSFA++AA+KFGGKFSY
Sbjct: 192  TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 251

Query: 679  CLVDHLSPNNLSSYLIFGSQPQH----TRMRYTELVLGVVNPFYAVAIKGISIGGAMLDI 512
            CLVDHLS  N+S+YL FGS          M YTELVLG+VN FYAV + GISIGGAML I
Sbjct: 252  CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 311

Query: 511  PPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSS 341
            P + WD+ G GG I+DSG+SLT LT PAY+ V+AAL++ L    +V++ IGPLEYCFNS+
Sbjct: 312  PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 371

Query: 340  AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161
             GF E+ VPRLV HFADGA FEPPVKSYVI AA GV+CLGF + AWPG SVVGNIMQQNH
Sbjct: 372  -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430

Query: 160  LWEFDIVKSRLGFAPSSC 107
            LWEFD+   +LGFAPSSC
Sbjct: 431  LWEFDLGLKKLGFAPSSC 448


>ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  434 bits (1116), Expect = e-119
 Identities = 223/378 (58%), Positives = 265/378 (70%), Gaps = 9/378 (2%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            +H AADYG G Y V  +VG+P+QK  L+ADTGSDLTW +                     
Sbjct: 72   MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 131

Query: 1033 XXXXXXXXXR--TVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860
                        T+PC + +CK++L +LF           C YDYRYSDGS  +G FANE
Sbjct: 132  RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 191

Query: 859  TVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSY 680
            TVT  L  GRK ++H+VL+GCSES +GQSF AADGVMGLGYS YSFA++AA+KFGGKFSY
Sbjct: 192  TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 251

Query: 679  CLVDHLSPNNLSSYLIFGSQPQH----TRMRYTELVLGVVNPFYAVAIKGISIGGAMLDI 512
            CLVDHLS  N+S+YL FGS          M YTELVLG+VN FYAV + GISIGGAML I
Sbjct: 252  CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 311

Query: 511  PPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSS 341
            P + WD+ G GG I+DSG+SLT LT PAY+ V+AAL++ L    +V++ IGPLEYCFNS+
Sbjct: 312  PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 371

Query: 340  AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161
             GF E+ VPRLV HFADGA FEPPVKSYVI AA GV+CLGF + AWPG SVVGNIMQQNH
Sbjct: 372  -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430

Query: 160  LWEFDIVKSRLGFAPSSC 107
            LWEFD+   +LGFAPSSC
Sbjct: 431  LWEFDLGLKKLGFAPSSC 448


>emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  434 bits (1116), Expect = e-119
 Identities = 223/378 (58%), Positives = 265/378 (70%), Gaps = 9/378 (2%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            +H AADYG G Y V  +VG+P+QK  L+ADTGSDLTW +                     
Sbjct: 1    MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60

Query: 1033 XXXXXXXXXR--TVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860
                        T+PC + +CK++L +LF           C YDYRYSDGS  +G FANE
Sbjct: 61   RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120

Query: 859  TVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSY 680
            TVT  L  GRK ++H+VL+GCSES +GQSF AADGVMGLGYS YSFA++AA+KFGGKFSY
Sbjct: 121  TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180

Query: 679  CLVDHLSPNNLSSYLIFGSQPQH----TRMRYTELVLGVVNPFYAVAIKGISIGGAMLDI 512
            CLVDHLS  N+S+YL FGS          M YTELVLG+VN FYAV + GISIGGAML I
Sbjct: 181  CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 240

Query: 511  PPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSS 341
            P + WD+ G GG I+DSG+SLT LT PAY+ V+AAL++ L    +V++ IGPLEYCFNS+
Sbjct: 241  PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 300

Query: 340  AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161
             GF E+ VPRLV HFADGA FEPPVKSYVI AA GV+CLGF + AWPG SVVGNIMQQNH
Sbjct: 301  -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 359

Query: 160  LWEFDIVKSRLGFAPSSC 107
            LWEFD+   +LGFAPSSC
Sbjct: 360  LWEFDLGLKKLGFAPSSC 377


>ref|XP_007049083.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508701344|gb|EOX93240.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 478

 Score =  378 bits (970), Expect = e-102
 Identities = 195/374 (52%), Positives = 250/374 (66%), Gaps = 5/374 (1%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            + SAAD G G Y V  RVGSP +K  +IADTGS LTW                       
Sbjct: 107  MRSAADIGTGQYFVSFRVGSPPKKFIMIADTGSSLTWMRCSYKCKNFSMDRTKLHERIFY 166

Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854
                       +PCSS +CKV+L+  F           CAYDYRY+DG+  VG+F N+TV
Sbjct: 167  ANQSRTFKP--IPCSSDVCKVELSQSFSLALCPTPMAPCAYDYRYADGTRVVGIFGNDTV 224

Query: 853  TFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCL 674
               LS G+K +V DV+VGCSE+ RG +F   DGVMGLG+  +SFAV+AA +FG KFSYCL
Sbjct: 225  KVRLSGGQKIKVTDVMVGCSEAIRG-NFHDIDGVMGLGFDQHSFAVKAAKEFGDKFSYCL 283

Query: 673  VDHLSPNNLSSYLIFGSQPQHT--RMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDT 500
            VDHLSP+NL ++L+FG         M++T+L+LG+VNP+YAV + GIS+ G MLDIP   
Sbjct: 284  VDHLSPSNLVNFLVFGGVTSSPLPNMQFTQLILGIVNPYYAVNVSGISVNGKMLDIPSYI 343

Query: 499  WDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSSAGFN 329
            WD+ G GG I+DSG+SLT L  P +  V+AA Q PL   ++++L +GP +YCF S+AGF 
Sbjct: 344  WDVKGDGGVIMDSGSSLTYLVKPLFDKVIAAFQAPLSKFKKLELNLGP-DYCF-SAAGFE 401

Query: 328  ETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEF 149
            E+ +P+L  HFADGA+  PPVKSYVIDA   VKCLGF++ +WPG SV+GNI+QQNHLWEF
Sbjct: 402  ESLMPKLAFHFADGAKLVPPVKSYVIDAEEAVKCLGFSSTSWPGPSVIGNILQQNHLWEF 461

Query: 148  DIVKSRLGFAPSSC 107
            D++ SRLGFA SSC
Sbjct: 462  DLLNSRLGFAASSC 475


>ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina]
            gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic
            proteinase nepenthesin-1-like [Citrus sinensis]
            gi|557524190|gb|ESR35557.1| hypothetical protein
            CICLE_v10004908mg [Citrus clementina]
          Length = 470

 Score =  375 bits (963), Expect = e-101
 Identities = 189/375 (50%), Positives = 244/375 (65%), Gaps = 6/375 (1%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTW-SNXXXXXXXXXXGTXXXXXXXX 1037
            + +  DYG G+Y V+++VG+P+QKL LI DTGS+ +W S                     
Sbjct: 95   LQAGRDYGTGMYFVEIKVGTPSQKLRLIVDTGSEFSWISCRYHCGPSCTKKGTIAGSRRR 154

Query: 1036 XXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANET 857
                      +T+PCSS +CK + A LF           CAYDYRY+DGSA  G+F  E 
Sbjct: 155  VFKADLSSSFKTIPCSSDMCKSEFARLFSLTFCPTPTSPCAYDYRYADGSAAKGIFGKER 214

Query: 856  VTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFG---GKF 686
            VT GL NG K R+ +V++GCS++ +GQ F  ADGV+GL Y  YSFA +  +      GKF
Sbjct: 215  VTIGLENGGKTRIEEVVMGCSDTIQGQIFAEADGVLGLSYDKYSFAQKVTNGSTFARGKF 274

Query: 685  SYCLVDHLSPNNLSSYLIFGSQPQHTRMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPP 506
            +YCLVDHLS  N+S+YLIFG + +  RMR    +LG++ P Y V++KGISIGG ML+IP 
Sbjct: 275  AYCLVDHLSHKNVSNYLIFGEESKRMRMRMRYTLLGLIGPDYGVSVKGISIGGVMLNIPS 334

Query: 505  DTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD--LGIGPLEYCFNSSAGF 332
              WD + GGG   DSGT+LT L  PAYK VVAAL++ L R        P EYCFNS+ GF
Sbjct: 335  QVWDFNRGGGTAFDSGTTLTFLAEPAYKPVVAALEMSLSRYQRLKRDAPFEYCFNST-GF 393

Query: 331  NETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWE 152
            +E++VP+LV HFADGARFEP  KSY+I  A G++CLGF +A WPG S +GNIMQQN+ WE
Sbjct: 394  DESSVPKLVFHFADGARFEPHTKSYIIRVAHGIRCLGFVSATWPGASAIGNIMQQNYFWE 453

Query: 151  FDIVKSRLGFAPSSC 107
            FD++K RLGFAPS+C
Sbjct: 454  FDLLKDRLGFAPSTC 468


>gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]
          Length = 449

 Score =  370 bits (951), Expect = e-100
 Identities = 203/380 (53%), Positives = 248/380 (65%), Gaps = 11/380 (2%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            +++ AD G   YLV  RVGSPAQ + LIADTGSDLTW+            +         
Sbjct: 75   MYAGADLGIAQYLVAFRVGSPAQSVALIADTGSDLTWTKCSYGCGGGCRRS-----SGRL 129

Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854
                     +TV CSST C VDLA  F           CAYDYRY+DGS+  G+FA ETV
Sbjct: 130  FDADRSTSFKTVECSSTTCTVDLAGAFSLSRCSPPSDPCAYDYRYADGSSAEGIFAGETV 189

Query: 853  TFGLSNGR-KRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677
               L+ GR K R+ +VL+GC+++  G SF  +DGV+GLGYSN+SFA  AA +FG KFSYC
Sbjct: 190  ELKLAKGRGKARLQNVLIGCTKNFSGSSFQTSDGVLGLGYSNFSFAHAAAARFGDKFSYC 249

Query: 676  LVDHLSPNNLSSYLIFGSQPQHTR------MRYTELVLGVVNPFYAVAIKGISIGGAMLD 515
            L+DHL+  N SSY+ F S    +       +RYT+LVLGV+   YAV ++GISIGG+ L 
Sbjct: 250  LLDHLAAKNKSSYITFSSGRSISASISAGPIRYTDLVLGVIGSNYAVNVRGISIGGSWLR 309

Query: 514  IPPDTW-DLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERV---DLGIGPLEYCFN 347
            IP DTW +L G GG I+DSG+SLT L  PAY  V+AAL   L R     + IGP+E CFN
Sbjct: 310  IPSDTWNNLSGSGGVIIDSGSSLTALAPPAYAPVIAALNRSLARFGDPHVKIGPMECCFN 369

Query: 346  SSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQ 167
            S+ GF+E+ VP+L +HFA G RFEPPVKSYVIDAAPGV CLGF  AA PGVSV+GNI+QQ
Sbjct: 370  ST-GFHESVVPKLAIHFAGGTRFEPPVKSYVIDAAPGVVCLGFVQAASPGVSVIGNILQQ 428

Query: 166  NHLWEFDIVKSRLGFAPSSC 107
            NH WEFD+   RLGFA S C
Sbjct: 429  NHWWEFDLGNRRLGFAASDC 448


>ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella]
            gi|482566377|gb|EOA30566.1| hypothetical protein
            CARUB_v10013693mg [Capsella rubella]
          Length = 448

 Score =  368 bits (945), Expect = 2e-99
 Identities = 195/373 (52%), Positives = 240/373 (64%), Gaps = 6/373 (1%)
 Frame = -2

Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028
            S  DYG   Y  ++RVG+PA+K  ++ DTGS+LTW N                       
Sbjct: 80   SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCKYRGRG-----KGRVENRRVFR 134

Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848
                   RTV C +  CKVDL NLF           C+YDYRY+DGSA  G+FA ETVT 
Sbjct: 135  AEESKSFRTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGIFAKETVTV 194

Query: 847  GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668
            GL+NGRK R+H +L+GCS S  GQSF  ADGV+GL +S++SF   A   FG KFSYCLVD
Sbjct: 195  GLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGLAFSDFSFTSTATSLFGAKFSYCLVD 254

Query: 667  HLSPNNLSSYLIFGSQPQHTRM---RYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDTW 497
            HLSP N+S+YLIFGS    T+    R T L L ++ PFYA+++ GIS+G  MLDIP   W
Sbjct: 255  HLSPKNVSNYLIFGSSSSATKNAPGRTTPLDLTLIPPFYAISVIGISLGEDMLDIPAQVW 314

Query: 496  DLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQL---PLERVDLGIGPLEYCFNSSAGFNE 326
            D   GGG ++DSGTSLT+L+  AYK VV  L      LERV     P+EYCF+S++GFNE
Sbjct: 315  DATTGGGTVLDSGTSLTLLSEAAYKPVVTGLARYLDELERVKPEGVPIEYCFSSTSGFNE 374

Query: 325  TAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFD 146
            + +P+L  H   GARFEP  KSY+ID APGVKCLGF +A  P  +VVGNIMQQN+LWEFD
Sbjct: 375  SKLPQLTFHMKGGARFEPHRKSYLIDTAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFD 434

Query: 145  IVKSRLGFAPSSC 107
            ++ S L FAPSSC
Sbjct: 435  LMASTLSFAPSSC 447


>gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]
          Length = 464

 Score =  367 bits (941), Expect = 7e-99
 Identities = 191/382 (50%), Positives = 250/382 (65%), Gaps = 13/382 (3%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            +++ ADYG G Y V + VG+P Q+  L+ADTGSDLTW +                     
Sbjct: 85   MNAGADYGVGEYFVHVTVGTPGQRFMLVADTGSDLTWMHCRCGRRCGTHK--GRLNNRRV 142

Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854
                     +T+PC S +CKV+LANLF           CAYDYRY +GS+ +G FANET+
Sbjct: 143  FHADRSSSFKTIPCLSEMCKVELANLFSLSKCPTPLTPCAYDYRYLEGSSAIGFFANETI 202

Query: 853  TFGLSNGRKRRVHDVLVGCSESSRG---QSFVAADGVMGLGYSNYSFAVRAADKFGGKFS 683
            +  L+NG+KR++ DVLVGC+ES +G     F  ADGV+GLG+ N++F  +AA  FGGKFS
Sbjct: 203  SVRLANGKKRKLRDVLVGCTESVQGAEESGFKGADGVLGLGFGNHTFTRKAAQYFGGKFS 262

Query: 682  YCLVDHLSPNNLSSYLIFGSQPQH-----TRMRYTELVLGV-VNPFYAVAIKGISIGGAM 521
            YCLVDHLSP NLS+Y+IFG          + +++T+LVLG    PFY V + GISIGG +
Sbjct: 263  YCLVDHLSPKNLSNYIIFGHDKADKASCSSSLQHTDLVLGGDYGPFYGVNLSGISIGGVL 322

Query: 520  LDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVDL----GIGPLEYC 353
            L IP   W+   GGGAI++SGTSLT LT P Y  V + L     R       G GP E+C
Sbjct: 323  LRIPSVAWNASLGGGAILESGTSLTFLTDPVYGPVTSELNKFTSRFGTLLPPGGGPFEFC 382

Query: 352  FNSSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIM 173
            FNS+ G++E+ +P L +HF++GA FEPPVKSY++D AP  KCLGF +A+WPG S++GNIM
Sbjct: 383  FNST-GYDESKMPPLRIHFSNGAIFEPPVKSYILDIAPEKKCLGFVSASWPGTSIIGNIM 441

Query: 172  QQNHLWEFDIVKSRLGFAPSSC 107
            QQNHLWEFD+  +RLGFAPS+C
Sbjct: 442  QQNHLWEFDLENTRLGFAPSTC 463


>ref|XP_007022806.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508722434|gb|EOY14331.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 473

 Score =  362 bits (928), Expect = 2e-97
 Identities = 196/388 (50%), Positives = 251/388 (64%), Gaps = 19/388 (4%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGT--XXXXXXX 1040
            + +  D+G G Y+   +VG+P+QK  LI DTGSDLTW N           T         
Sbjct: 84   LSAGRDFGIGQYVTTFKVGTPSQKFRLIVDTGSDLTWINCRYRCARGDNCTTQERGIKRG 143

Query: 1039 XXXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDY----------RYSDG 890
                       R +PC S +CKV+L NLF           CAYDY          RY DG
Sbjct: 144  RVFRAHLSSSFRPIPCFSQMCKVELRNLFSLTICPTPLTPCAYDYRFNSLKLVLNRYIDG 203

Query: 889  SATVGLFANETVTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRA 710
            S  +G+FA E+VT GL+N R  R+HDVL+GCS+SS+G++    DGV+GL  S YSF  +A
Sbjct: 204  SDAMGVFAKESVTVGLTNSRMARLHDVLIGCSDSSQGRTVKNVDGVLGLANSKYSFVTKA 263

Query: 709  ADKFGGKFSYCLVDHLSPNNLSSYLIFGSQPQHTRM----RYTELVLGVVNPFYAVAIKG 542
            A+++GGKFSYCLVDHLS  N S+YLIFG+      +    RYT L L +V+  YAV ++G
Sbjct: 264  AERWGGKFSYCLVDHLSHINASNYLIFGANNNQLTVLGNTRYTRLELNLVSFSYAVNVQG 323

Query: 541  ISIGGAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLER---VDLGI 371
            ISIGG MLDIP   WD   GGG I+DSGTSL+ LT PAY+ V+AA+++ + +   V L  
Sbjct: 324  ISIGGKMLDIPLQVWDTRKGGGTILDSGTSLSFLTDPAYQPVMAAIKMSVSKYPQVKLHG 383

Query: 370  GPLEYCFNSSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVS 191
             P+EYCFNS+ GF+ET VP+L++HFADGARFEP  +SYVI AA GV+CLGF  A +P VS
Sbjct: 384  VPMEYCFNST-GFDETLVPKLIIHFADGARFEPHWRSYVISAADGVRCLGFLPARFPSVS 442

Query: 190  VVGNIMQQNHLWEFDIVKSRLGFAPSSC 107
            V+GNIMQQN+LWEFD+  ++L FAPSSC
Sbjct: 443  VIGNIMQQNYLWEFDLEGNKLRFAPSSC 470


>ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
            lyrata] gi|297328626|gb|EFH59045.1| hypothetical protein
            ARALYDRAFT_478632 [Arabidopsis lyrata subsp. lyrata]
          Length = 449

 Score =  356 bits (913), Expect = 1e-95
 Identities = 191/375 (50%), Positives = 235/375 (62%), Gaps = 8/375 (2%)
 Frame = -2

Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028
            S  DYG   Y  ++RVG+PA+K  ++ DTGS+LTW N                       
Sbjct: 79   SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRG-----KGKVKNRRVFR 133

Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848
                   +TV C +  CKVDL NLF           C+YDYRY+DGSA  G+FA ET+T 
Sbjct: 134  AEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 193

Query: 847  GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668
            GL+NGRK R+  +LVGCS S  GQSF  ADGV+GL +S++SF   A   FG K SYCLVD
Sbjct: 194  GLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVD 253

Query: 667  HLSPNNLSSYLIFGSQPQHTRM-----RYTELVLGVVNPFYAVAIKGISIGGAMLDIPPD 503
            HLS  N+S+YLIFG     T       R T L L ++ PFYA+ I GISIG  MLDIP  
Sbjct: 254  HLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQ 313

Query: 502  TWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGF 332
             WD   GGG I+DSGTSLT+L   AYK VV  L    + L+RV     P+EYCF+S++GF
Sbjct: 314  VWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGF 373

Query: 331  NETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWE 152
            NE+ +P+L  H   GARFEP  KSY++DAAPGVKCLGF +A  P  +VVGNIMQQN+LWE
Sbjct: 374  NESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWE 433

Query: 151  FDIVKSRLGFAPSSC 107
            FD++ S L FAPS+C
Sbjct: 434  FDLMASTLSFAPSTC 448


>ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum]
            gi|557108450|gb|ESQ48757.1| hypothetical protein
            EUTSA_v10020732mg [Eutrema salsugineum]
          Length = 444

 Score =  350 bits (897), Expect = 9e-94
 Identities = 185/375 (49%), Positives = 235/375 (62%), Gaps = 7/375 (1%)
 Frame = -2

Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028
            S  DYGA  Y  ++RVG+PA++  ++ DTGS+LTW N                       
Sbjct: 78   SGFDYGAAQYFAEVRVGTPAKRFRVVVDTGSELTWVNCRFHGKGKENRRVFRAEESSSFR 137

Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848
                     V C +  CKVDL NLF           C+YDYRY+DGSA  G+FA ET T 
Sbjct: 138  K--------VGCLTQTCKVDLMNLFSLSNCPTPSTPCSYDYRYADGSAAQGVFAKETFTV 189

Query: 847  GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668
            GL+NGRK ++  +L+GCS S  G SF  ADGV+GL  S+YSF  +A + FGGKFSYCLVD
Sbjct: 190  GLTNGRKAKLRGLLIGCSSSFSGDSFRGADGVLGLALSDYSFTSKATNIFGGKFSYCLVD 249

Query: 667  HLSPNNLSSYLIFGSQPQHTR----MRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDT 500
            HLS  N+S+YL FGS    T+    +R T L L ++ PFYA+ I GISIG  MLDIP   
Sbjct: 250  HLSNKNVSNYLTFGSSSSTTKTAASIRTTPLDLKLIPPFYAINIIGISIGDDMLDIPTQV 309

Query: 499  WDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGFN 329
            WD   GGG I+DSGTSLT L   AYK VV+ L+   +  +RV     P+EYCF++++GFN
Sbjct: 310  WDATAGGGTILDSGTSLTFLADAAYKAVVSGLERYLVGFKRVKPEGVPIEYCFDTTSGFN 369

Query: 328  ETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEF 149
            E+ +P+L  HF  GARFEP  +SYV+D   GV+CLGF +   P  +VVGNIMQQN+LWEF
Sbjct: 370  ESKLPQLTFHFKGGARFEPHRRSYVVDTLEGVRCLGFVSTGSPATNVVGNIMQQNYLWEF 429

Query: 148  DIVKSRLGFAPSSCV 104
            D+V S L FAPS+C+
Sbjct: 430  DLVASTLSFAPSTCL 444


>ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA
            binding protein-like [Arabidopsis thaliana]
            gi|332641715|gb|AEE75236.1| aspartyl protease family
            protein [Arabidopsis thaliana]
          Length = 461

 Score =  345 bits (886), Expect = 2e-92
 Identities = 184/372 (49%), Positives = 231/372 (62%), Gaps = 5/372 (1%)
 Frame = -2

Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028
            S  DYG   Y  ++RVG+PA+K  ++ DTGS+LTW N                       
Sbjct: 97   SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFK 156

Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848
                    TV C +  CKVDL NLF           C+YDYRY+DGSA  G+FA ET+T 
Sbjct: 157  --------TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 208

Query: 847  GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668
            GL+NGR  R+   L+GCS S  GQSF  ADGV+GL +S++SF   A   +G KFSYCLVD
Sbjct: 209  GLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVD 268

Query: 667  HLSPNNLSSYLIFGS--QPQHTRMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDTWD 494
            HLS  N+S+YLIFGS    +    R T L L  + PFYA+ + GIS+G  MLDIP   WD
Sbjct: 269  HLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD 328

Query: 493  LDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGFNET 323
               GGG I+DSGTSLT+L   AYK VV  L    + L+RV     P+EYCF+ ++GFN +
Sbjct: 329  ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS 388

Query: 322  AVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFDI 143
             +P+L  H   GARFEP  KSY++DAAPGVKCLGF +A  P  +V+GNIMQQN+LWEFD+
Sbjct: 389  KLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDL 448

Query: 142  VKSRLGFAPSSC 107
            + S L FAPS+C
Sbjct: 449  MASTLSFAPSAC 460


>gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  345 bits (886), Expect = 2e-92
 Identities = 184/372 (49%), Positives = 231/372 (62%), Gaps = 5/372 (1%)
 Frame = -2

Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXX 1028
            S  DYG   Y  ++RVG+PA+K  ++ DTGS+LTW N                       
Sbjct: 75   SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFK 134

Query: 1027 XXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTF 848
                    TV C +  CKVDL NLF           C+YDYRY+DGSA  G+FA ET+T 
Sbjct: 135  --------TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 186

Query: 847  GLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYCLVD 668
            GL+NGR  R+   L+GCS S  GQSF  ADGV+GL +S++SF   A   +G KFSYCLVD
Sbjct: 187  GLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVD 246

Query: 667  HLSPNNLSSYLIFGS--QPQHTRMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPDTWD 494
            HLS  N+S+YLIFGS    +    R T L L  + PFYA+ + GIS+G  MLDIP   WD
Sbjct: 247  HLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD 306

Query: 493  LDGGGGAIVDSGTSLTVLTLPAYKLVVAALQ---LPLERVDLGIGPLEYCFNSSAGFNET 323
               GGG I+DSGTSLT+L   AYK VV  L    + L+RV     P+EYCF+ ++GFN +
Sbjct: 307  ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS 366

Query: 322  AVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFDI 143
             +P+L  H   GARFEP  KSY++DAAPGVKCLGF +A  P  +V+GNIMQQN+LWEFD+
Sbjct: 367  KLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDL 426

Query: 142  VKSRLGFAPSSC 107
            + S L FAPS+C
Sbjct: 427  MASTLSFAPSAC 438


>ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Fragaria vesca
            subsp. vesca]
          Length = 482

 Score =  345 bits (884), Expect = 3e-92
 Identities = 188/382 (49%), Positives = 243/382 (63%), Gaps = 13/382 (3%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            + SA D+GAG Y V+++VG+P+Q+  LIADTGSDLTW            G          
Sbjct: 103  LSSAWDFGAGQYFVQIKVGTPSQRFLLIADTGSDLTWMKCKYRCVADKCGLKRATMKKNK 162

Query: 1033 XXXXXXXXXRT---VPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFAN 863
                      T   +PCSS +CK +L   F           C YDYRY++ S  +G FAN
Sbjct: 163  KKVFRPAQSSTFKIIPCSSEMCKFELE--FSRQECPTPLSPCKYDYRYAESSGALGFFAN 220

Query: 862  ETVTFGLSNGRKRRVHDVLVGCSES---SRGQSFVAADGVMGLGYSNYSFAVRAADKFGG 692
            ETV   L+NGR+ R++DVL+GC+ES    +G S  A DG++GLG+  +SF  +AA   G 
Sbjct: 221  ETVRVPLTNGRRARLNDVLIGCTESIEGPKGASIRAGDGILGLGFGKHSFVAKAASNLGD 280

Query: 691  KFSYCLVDHLSPNNLSSYLIFG----SQPQHTRMRYTELVLG--VVNPFYAVAIKGISIG 530
            KFSYCLVDH+S  N+SSYL FG    +  Q++RMRYT+L LG   + PFYAV + GIS G
Sbjct: 281  KFSYCLVDHMSNKNVSSYLTFGRNAETAQQNSRMRYTKLALGGPKIGPFYAVNLVGISAG 340

Query: 529  GAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD-LGIGPLEYC 353
              ML IP + W+ + GGG IVDSGTSLT LT PAY  V+  L + L +   +     E+C
Sbjct: 341  SKMLKIPNEVWNENLGGGTIVDSGTSLTFLTSPAYIHVMDELTMALSKYKKIPSDAFEFC 400

Query: 352  FNSSAGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIM 173
            FNS+ G++++ VPR  +HFADGA+FEPPVKSYVID A   KCLGF +A +PG  V+GNIM
Sbjct: 401  FNST-GYDQSLVPRFAIHFADGAKFEPPVKSYVIDVAIQTKCLGFQSAPFPGTIVIGNIM 459

Query: 172  QQNHLWEFDIVKSRLGFAPSSC 107
            QQN+LWEFD+   RLG+APSSC
Sbjct: 460  QQNYLWEFDLRGGRLGYAPSSC 481


>ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
            gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1
            precursor, putative [Ricinus communis]
          Length = 489

 Score =  340 bits (872), Expect = 7e-91
 Identities = 184/387 (47%), Positives = 243/387 (62%), Gaps = 9/387 (2%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPA-QKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXX 1037
            IHS AD G   Y V +R+G+P  QK  L+ DTGSDLTW N                    
Sbjct: 108  IHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFR 167

Query: 1036 XXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANET 857
                       T+PCSS  CK++L + F           C +DYRY +G   +G+FANET
Sbjct: 168  ANDSSSFR---TIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANET 224

Query: 856  VTFGLSNGRKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677
            VT GL++ +K R+ DVL+GC+ES   ++    DGVMGLGY  +S A+R A+ FG KFSYC
Sbjct: 225  VTVGLNDHKKIRLFDVLIGCTESFN-ETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYC 283

Query: 676  LVDHLSPNNLSSYLIFGSQPQHT--RMRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPD 503
            LVDHLS +N  ++L FG  P+    +M++TEL+LG +N FY V + GIS+GG+ML I  D
Sbjct: 284  LVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSD 343

Query: 502  TWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVDLGIGPLE------YCFNSS 341
             W++ G GG IVDSGTSLT+L   AY  VV AL+ P+      + P+E      +CF   
Sbjct: 344  IWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALK-PIFDKHKKVVPIELPELNNFCFEDK 402

Query: 340  AGFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNH 161
             GF+  AVPRL++HFADGA F+PPVKSY+ID A G+KCLG   A +PG S++GN+MQQNH
Sbjct: 403  -GFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNH 461

Query: 160  LWEFDIVKSRLGFAPSSCV*FWTNSSH 80
            LWE+D+ + +LGF PSSC+   +NS H
Sbjct: 462  LWEYDLGRGKLGFGPSSCIMSNSNSKH 488


>ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citrus clementina]
            gi|557531861|gb|ESR43044.1| hypothetical protein
            CICLE_v10013820mg [Citrus clementina]
          Length = 475

 Score =  334 bits (856), Expect = 5e-89
 Identities = 184/359 (51%), Positives = 230/359 (64%), Gaps = 6/359 (1%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXX 1034
            + S AD G G Y V  RVGSP QK  LIADTGSDLTW +                     
Sbjct: 117  LRSGADRGLGQYFVSFRVGSPPQKFVLIADTGSDLTWMHCNHKGENCPKD--GLTPPNRM 174

Query: 1033 XXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETV 854
                     +T+PCSS  CKVDL + F           CAYDY Y DGS   G FANETV
Sbjct: 175  FQADASSTFKTIPCSSRTCKVDLQDTFSLSMCPTPVTPCAYDYSYFDGSKVRGFFANETV 234

Query: 853  TFGLSNGRKR-RVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAADKFGGKFSYC 677
            T G  + RK+ R+ +V VGC++ + G +F  ADGV+GLG+   SFA  AA  F  KFSYC
Sbjct: 235  TAGSIDRRKKVRLKEVTVGCTDWANG-NFHNADGVLGLGFGKNSFAATAAKLFDNKFSYC 293

Query: 676  LVDHLSPNNLSSYLIFGS-QPQHTR-MRYTELVLGVVNPFYAVAIKGISIGGAMLDIPPD 503
            LVDHLSP+N +++L FG+   QH + M++T+L+LG +NPFYAV + GISI G ML++PP+
Sbjct: 294  LVDHLSPSNFANFLNFGNTSKQHIQNMQHTQLILGELNPFYAVNVSGISIAGKMLNVPPE 353

Query: 502  TWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD-LG--IGPLEYCFNSSAGF 332
             W + G GG I+DSGT+LT L  PAY   VAAL+ PLE+   LG  +GPL +C+N    F
Sbjct: 354  MWHIHGAGGVILDSGTTLTFLGEPAYAAAVAALRAPLEKYKKLGHVLGPLRFCYNDPR-F 412

Query: 331  NETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLW 155
            +   VP+ V+HFADGA+F PP KSYVIDA  GVKC+GFA+A WP  +V+GNIMQQNHLW
Sbjct: 413  DMADVPQFVLHFADGAKFVPPKKSYVIDADVGVKCIGFASAGWPANTVIGNIMQQNHLW 471


>ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prunus persica]
            gi|462407712|gb|EMJ13046.1| hypothetical protein
            PRUPE_ppa004710mg [Prunus persica]
          Length = 495

 Score =  328 bits (840), Expect = 4e-87
 Identities = 175/373 (46%), Positives = 231/373 (61%), Gaps = 9/373 (2%)
 Frame = -2

Query: 1198 DYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXGTXXXXXXXXXXXXXX 1019
            DYG G YLVKL++G+PAQK  +I  TGSDLTW                            
Sbjct: 124  DYGIGQYLVKLKLGTPAQKFTVIPSTGSDLTWVRCGSHCGKSCGIRKGRIDHSRVFNTDR 183

Query: 1018 XXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANETVTFGLS 839
                ++V CSS +C+ DLAN             C YDY Y +GS+ +G F  + V   LS
Sbjct: 184  SSTFKSVTCSSKMCEFDLANFNSLNKCPRPLSPCRYDYSYVEGSSALGTFGTDIVRASLS 243

Query: 838  NGRKRRVHDVLVGCSESSRGQSFV-AADGVMGLGYSNYSFAVRAADKFGGKFSYCLVDHL 662
            NGR+ R+ DVL+GC+ES  G+     +DG++GLG+  YSF  +AA K+GGK SYCL+DH+
Sbjct: 244  NGRRNRMKDVLIGCTESIIGKGTAKGSDGILGLGFGKYSFTTKAALKYGGKVSYCLLDHM 303

Query: 661  SPNNLSSYLIFGSQPQHT---RMRYTELVLGVVNP--FYAVAIKGISIGGAMLDIPPDTW 497
            SP N++SYL FG   +     +MRYT+LV G  N   FY V ++GIS+GG ML+IP   W
Sbjct: 304  SPKNVTSYLTFGDNKKAVLQGKMRYTQLVFGNPNKGSFYGVNLQGISVGGKMLNIPLHIW 363

Query: 496  DLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPL---ERVDLGIGPLEYCFNSSAGFNE 326
            +   GGGA+VDSG SLT LT PAYK V+ AL +PL    R+       ++CF+   G+ +
Sbjct: 364  NPKLGGGALVDSGMSLTFLTKPAYKPVMTALTMPLTKFRRLRSEEDDFDFCFD-PRGYRD 422

Query: 325  TAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAAWPGVSVVGNIMQQNHLWEFD 146
              VP+LV HFA GA+F PPVKSYVID +PG+KC+G    A  G  ++GNI+QQNHLWEF+
Sbjct: 423  RLVPKLVFHFAGGAKFAPPVKSYVIDVSPGMKCIGILPLA-EGACIIGNIIQQNHLWEFN 481

Query: 145  IVKSRLGFAPSSC 107
            +V+  LGFAPS+C
Sbjct: 482  LVRKTLGFAPSTC 494


>ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
            gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa
            Japonica Group] gi|125553268|gb|EAY98977.1| hypothetical
            protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  319 bits (818), Expect = 1e-84
 Identities = 181/396 (45%), Positives = 225/396 (56%), Gaps = 27/396 (6%)
 Frame = -2

Query: 1213 IHSAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTW--------SNXXXXXXXXXXGTX 1058
            + S A  G G Y V+ RVG+PAQ   LIADTGSDLTW         +             
Sbjct: 99   LSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPS 158

Query: 1057 XXXXXXXXXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATV 878
                               +PCSS  CK  +   F           C+YDYRY+D SA  
Sbjct: 159  PAVAPPRVFRPGDSKTWSPIPCSSETCKSTIP--FSLANCSSSTAACSYDYRYNDNSAAR 216

Query: 877  GLFANETVTFGLSNGR--------KRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSF 722
            G+   ++ T  LS GR        K ++  V++GC+ +  GQ F A+DGV+ LGYSN SF
Sbjct: 217  GVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISF 276

Query: 721  AVRAADKFGGKFSYCLVDHLSPNNLSSYLIFGSQPQHTRMRY------TELVLGV-VNPF 563
            A RAA +FGG+FSYCLVDHL+P N +SYL FG+ P             T L+L   V PF
Sbjct: 277  ASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPF 336

Query: 562  YAVAIKGISIGGAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERV 383
            YAVA+  +S+ G  LDIP + WD+   GG I+DSGTSLTVL  PAYK VVAAL   L  +
Sbjct: 337  YAVAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGL 396

Query: 382  D-LGIGPLEYCFNSSA---GFNETAVPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFA 215
              + + P +YC+N +A   G  + AVP+L V FA  AR EPP KSYVIDAAPGVKC+G  
Sbjct: 397  PRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQ 456

Query: 214  AAAWPGVSVVGNIMQQNHLWEFDIVKSRLGFAPSSC 107
              AWPGVSV+GNI+QQ HLWEFD+    L F  +SC
Sbjct: 457  EGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492


>ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
            distachyon]
          Length = 479

 Score =  315 bits (808), Expect = 2e-83
 Identities = 182/393 (46%), Positives = 223/393 (56%), Gaps = 26/393 (6%)
 Frame = -2

Query: 1207 SAADYGAGLYLVKLRVGSPAQKLELIADTGSDLTWSNXXXXXXXXXXG----TXXXXXXX 1040
            SAA  G G Y V+ RVG+PAQ   L+ADTGSDLTW                 +       
Sbjct: 86   SAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPR 145

Query: 1039 XXXXXXXXXXXRTVPCSSTICKVDLANLFXXXXXXXXXXXCAYDYRYSDGSATVGLFANE 860
                         +PC+S  C   L   F           CAYDYRY DGSA  G    E
Sbjct: 146  RAFRPEKSKTWAPIPCASDTCSKSLP--FSLSTCPTPGSPCAYDYRYKDGSAARGTVGTE 203

Query: 859  TVTFGLSNG--------RKRRVHDVLVGCSESSRGQSFVAADGVMGLGYSNYSFAVRAAD 704
            + T  LS+         +K ++  +++GC+ S  G SF A+DGV+ LGYSN SFA  AA 
Sbjct: 204  SATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAAS 263

Query: 703  KFGGKFSYCLVDHLSPNNLSSYLIFGSQPQHTR---------MRYTELVLGV-VNPFYAV 554
            +FGG+FSYCLVDHLSP N +SYL FG     +           R T LVL   + PFY V
Sbjct: 264  RFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDV 323

Query: 553  AIKGISIGGAMLDIPPDTWDLDGGGGAIVDSGTSLTVLTLPAYKLVVAALQLPLERVD-L 377
            +IK IS+ G +L IP D W++DGGGG IVDSGTSLTVL  PAY+ VVAAL   L R   +
Sbjct: 324  SIKAISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRV 383

Query: 376  GIGPLEYCFNSSAGFNETA---VPRLVVHFADGARFEPPVKSYVIDAAPGVKCLGFAAAA 206
             + P EYC+N ++   +     +P+L VHFA  AR EPP KSYVIDAAPGVKC+G     
Sbjct: 384  AMDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGP 443

Query: 205  WPGVSVVGNIMQQNHLWEFDIVKSRLGFAPSSC 107
            WPG+SV+GNI+QQ HLWEFD+   RL F  S C
Sbjct: 444  WPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476

