BLASTX nr result

ID: Catharanthus22_contig00017723 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00017723
         (1436 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAD80835.1| nucellin-like protein [Daucus carota]                 616   e-174
ref|XP_004240685.1| PREDICTED: aspartic proteinase Asp1-like [So...   608   e-171
ref|XP_006346815.1| PREDICTED: aspartic proteinase Asp1-like [So...   602   e-170
ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis v...   575   e-161
gb|EPS67652.1| hypothetical protein M569_07122 [Genlisea aurea]       555   e-155
gb|EOX92687.1| Eukaryotic aspartyl protease family protein isofo...   553   e-155
emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]   552   e-154
gb|EMJ12886.1| hypothetical protein PRUPE_ppa005961mg [Prunus pe...   551   e-154
gb|EXC30733.1| Aspartic proteinase Asp1 [Morus notabilis]             547   e-153
ref|XP_006412352.1| hypothetical protein EUTSA_v10025289mg [Eutr...   540   e-151
ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arab...   539   e-150
ref|XP_002310541.2| hypothetical protein POPTR_0007s04800g [Popu...   538   e-150
ref|NP_001190905.1| aspartyl protease family protein [Arabidopsi...   536   e-150
ref|XP_006432088.1| hypothetical protein CICLE_v10001122mg [Citr...   536   e-149
ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   536   e-149
ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cu...   536   e-149
ref|XP_006464925.1| PREDICTED: aspartic proteinase Asp1-like [Ci...   535   e-149
dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]              535   e-149
ref|XP_006282959.1| hypothetical protein CARUB_v10007649mg [Caps...   534   e-149
ref|XP_006382886.1| hypothetical protein POPTR_0005s07070g [Popu...   533   e-149

>dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score =  616 bits (1588), Expect = e-174
 Identities = 286/416 (68%), Positives = 336/416 (80%), Gaps = 1/416 (0%)
 Frame = -3

Query: 1341 ILIVFLGIALAGGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYGNVYPDGYYFAQ 1162
            I+ VFL + + G  SSD QQQ WWK  S+G  SS      SS+VLPLYGNVYP GYY  Q
Sbjct: 12   IMSVFLVLMIVG-VSSDDQQQSWWKWFSSGASSSVVSSVGSSVVLPLYGNVYPSGYYHVQ 70

Query: 1161 VNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLHSG 982
             N+GQPPKPYFLDPDTGSDLTWLQCDAPC+ CT APHPLY+PTNDLVVC+DP+CASLH  
Sbjct: 71   FNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKDPICASLHPD 130

Query: 981  DYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGASHH 802
            +Y+CD P+QCDYEVEYADGGSS+GVLVND+F  N T G R  PRL  GCGYDQ+PG ++H
Sbjct: 131  NYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCGYDQLPGIAYH 190

Query: 801  PLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWTPMSN 622
            PLDGVLGLG+G SSIV QL +QGL+RNVVGHC S R       GDD+Y SS V+WTPMS 
Sbjct: 191  PLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYDSSKVIWTPMSR 250

Query: 621  DYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELNGKPLR 442
            DY KHY+ G AEL   GR+ GLKNLLVVFDSGSSY+Y N+Q Y  LLS +KK+L+GKPL+
Sbjct: 251  DYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLSFIKKDLHGKPLK 310

Query: 441  EAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLILSTRGS 262
            EA++D TLPVCW+G+KPF+SI D +KYFKPL LSF  GW++K +FEI  ESYLI+S++GS
Sbjct: 311  EAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWKTKSQFEIQQESYLIISSKGS 370

Query: 261  VCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPKFNTFLM 97
            VCLGILNGTE+GLQ YNIIGDISM +K+VIYDNE++ IGW  +NCDRPPK +TF M
Sbjct: 371  VCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCDRPPKGDTFSM 426


>ref|XP_004240685.1| PREDICTED: aspartic proteinase Asp1-like [Solanum lycopersicum]
          Length = 427

 Score =  608 bits (1569), Expect = e-171
 Identities = 281/427 (65%), Positives = 343/427 (80%), Gaps = 5/427 (1%)
 Frame = -3

Query: 1362 MGGEKVIILIVFLGIALA----GGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYG 1195
            MGG K++ +++F+ + ++    GG +   QQQ WWK  S+   +      +SSIVLPLYG
Sbjct: 1    MGGGKIVGILIFVVVVVSAAGGGGENHHHQQQKWWKWMSSTSAAMVNPVVSSSIVLPLYG 60

Query: 1194 NVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVC 1015
            NVYP GYY+ Q+N+GQP +P+FLDPDTGSDLTWLQCDAPCV CT APHP Y+P NDLV C
Sbjct: 61   NVYPLGYYYVQLNIGQPSRPFFLDPDTGSDLTWLQCDAPCVRCTTAPHPFYKPNNDLVPC 120

Query: 1014 RDPLCASLHSGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGC 835
            +DPLCASLH   Y+C+SPEQCDY+V+YADGGSSLGVL+NDVF FN T GAR+ PRL+ GC
Sbjct: 121  KDPLCASLHPAGYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNMTSGARMIPRLSLGC 180

Query: 834  GYDQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYA 655
            GYDQ+PG S+HPLDGVLGLG+GK+SIV+QLH++G ++NVVGHCLS R       GD+VY 
Sbjct: 181  GYDQLPGQSYHPLDGVLGLGRGKTSIVSQLHSKGAVQNVVGHCLSGRGGGFLFFGDEVYD 240

Query: 654  SSPVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSL 475
            SS +VWTPM++D  KHYSAGS EL FGG+  GLKNL VVFDSGSS+SYLN+  Y   +SL
Sbjct: 241  SSRIVWTPMAHDRMKHYSAGSGELIFGGKGTGLKNLFVVFDSGSSFSYLNAHTYEGFISL 300

Query: 474  VKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILP 295
            +KKELNGKPLRE  DD+TLP+CWKGR+PF++I D +KYFK   LSF  GW+SK  FEI P
Sbjct: 301  LKKELNGKPLRETKDDYTLPLCWKGRRPFKTINDAKKYFKQFALSFGNGWKSKAHFEIPP 360

Query: 294  ESYLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPP 118
            ESYLI+S++GSVCLG+LNGTE GLQ  N+IGDISM DKMVIYDNE++AIGW +ANCDRPP
Sbjct: 361  ESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWMSANCDRPP 420

Query: 117  KFNTFLM 97
            K +  +M
Sbjct: 421  KSSNMIM 427


>ref|XP_006346815.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum]
          Length = 437

 Score =  602 bits (1553), Expect = e-170
 Identities = 280/428 (65%), Positives = 345/428 (80%), Gaps = 6/428 (1%)
 Frame = -3

Query: 1362 MGGEKVIILIVFLGIALA-----GGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLY 1198
            MGG K++ +++F+ + ++     GG +  +QQQ   K  S+   ++     +SSIVLPLY
Sbjct: 10   MGGGKIVGILIFVVVVVSAAGGGGGENHQQQQQQQQKWMSSTSAAAVNPVVSSSIVLPLY 69

Query: 1197 GNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVV 1018
            GNVYP GYY+ Q+N+GQP +P+FLDPDTGSDLTWLQCDAPCV CT APHP Y+P NDLV 
Sbjct: 70   GNVYPLGYYYVQLNIGQPSRPFFLDPDTGSDLTWLQCDAPCVRCTTAPHPFYKPNNDLVP 129

Query: 1017 CRDPLCASLHSGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFG 838
            C+DPLCASLH   Y+C+SPEQCDY+V+YADGGSSLGVL+NDVF FN T GAR+ PRL+ G
Sbjct: 130  CKDPLCASLHPAGYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNMTSGARMIPRLSLG 189

Query: 837  CGYDQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVY 658
            CGYDQ+PG S+HPLDGVLGLG+GK+SIV+QLH++G+++NVVGHCLS R       GD+VY
Sbjct: 190  CGYDQLPGQSYHPLDGVLGLGRGKTSIVSQLHSKGVVQNVVGHCLSGRGGGFLFFGDEVY 249

Query: 657  ASSPVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLS 478
             SS +VWTPM++D  KHYSAGS EL FGG+  GLKNL VVFDSGSS+SYLN+  Y   +S
Sbjct: 250  DSSRIVWTPMAHDRMKHYSAGSGELIFGGKGTGLKNLFVVFDSGSSFSYLNAHTYEGFIS 309

Query: 477  LVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEIL 298
            L+KKELNGKPLRE  DD+TLP+CWKGR+PF++I DV+KYFK   LSF  GW+SK  FEI 
Sbjct: 310  LLKKELNGKPLRETKDDYTLPLCWKGRRPFKTINDVKKYFKQFALSFGNGWKSKAHFEIP 369

Query: 297  PESYLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRP 121
            PESYLI+S++GSVCLG+LNGTE GLQ  N+IGDISM DKMVIYDNE++AIGW +ANCDRP
Sbjct: 370  PESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWTSANCDRP 429

Query: 120  PKFNTFLM 97
            PK +  +M
Sbjct: 430  PKSSNMIM 437


>ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
            gi|296082608|emb|CBI21613.3| unnamed protein product
            [Vitis vinifera]
          Length = 426

 Score =  575 bits (1481), Expect = e-161
 Identities = 270/418 (64%), Positives = 330/418 (78%), Gaps = 1/418 (0%)
 Frame = -3

Query: 1347 VIILIVFLGIALAGGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYGNVYPDGYYF 1168
            +++L+V +G++    AS  + ++           SS      SS+V PLYGNVYP GYY+
Sbjct: 9    LVVLVVLVGLSGWSSASDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYYY 68

Query: 1167 AQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLH 988
              +++GQPPKPYFLDPDTGSDL+WLQCDAPCV CT+APHPLYRP N+LV+C+DP+CASLH
Sbjct: 69   VSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPMCASLH 128

Query: 987  SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGAS 808
               Y+C+ PEQCDYEVEYADGGSSLGVLV DVF  N T G R++PRLA GCGYDQIPG S
Sbjct: 129  PPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGYDQIPGQS 188

Query: 807  HHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWTPM 628
            +HPLDGVLGLGKGKSSIV+QLH+QG+IRNVVGHC+SSR       GDD+Y SS VVWTPM
Sbjct: 189  YHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYDSSRVVWTPM 248

Query: 627  SNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELNGKP 448
              D   HYS+G AEL  GG+    KNLLV FDSGSSY+YLNS AY AL+ LV+KEL+ KP
Sbjct: 249  LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEKP 308

Query: 447  LREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLILSTR 268
            +REA+DD TLP+CW+G++PF+S+ DV+K+FKPL LSFPGG R+K +++I  ESYLI+S +
Sbjct: 309  VREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKTQYDIPLESYLIISLK 368

Query: 267  GSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPKFNTFLM 97
            G+VCLGILNGTE GLQ +N+IGDISM DKMV+YDNE+  IGWA  NCDR PKF   ++
Sbjct: 369  GNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKAAIL 426


>gb|EPS67652.1| hypothetical protein M569_07122 [Genlisea aurea]
          Length = 401

 Score =  555 bits (1430), Expect = e-155
 Identities = 255/375 (68%), Positives = 303/375 (80%), Gaps = 2/375 (0%)
 Frame = -3

Query: 1242 SKAKPFASSIVLPLYGNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCT 1063
            S    F SSI+LP+YGNVYPDG+YF QV LG PP+PYFLDPDTGSDLTWLQCDAPCV CT
Sbjct: 17   SATNTFGSSIMLPVYGNVYPDGFYFVQVYLGYPPRPYFLDPDTGSDLTWLQCDAPCVRCT 76

Query: 1062 RAPHPLYRPTNDLVVCRDPLCASLHSGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFF 883
               HPLYRP+NDLVVC+DPLCASLHS DY CD+PEQCDYEVEYADGGSSLGVLVND F  
Sbjct: 77   EGFHPLYRPSNDLVVCKDPLCASLHSSDYTCDNPEQCDYEVEYADGGSSLGVLVNDFFTL 136

Query: 882  NCTGGARVSPRLAFGCGYDQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCL 703
            N T G R+SPRL  GCGYDQ+ G+S HPLDGVLGLGKGKSSIV+QL +QG+++NV+GHCL
Sbjct: 137  NLTAGVRMSPRLTIGCGYDQLAGSSDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVIGHCL 196

Query: 702  SS-RXXXXXXXGDDVYASSPVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSG 526
            S          GDD+Y SS V WTPMS+++  HY+AG AEL FGGR+ G KNL VVFDSG
Sbjct: 197  SRVGKGGFVFFGDDLYDSSRVTWTPMSHEHNNHYAAGLAELRFGGRSTGFKNLNVVFDSG 256

Query: 525  SSYSYLNSQAYLALLSLVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLG 346
            SSY+Y  S  Y A++S++ K+LNGKPL    +D TLP+CWKG+KPFR+  DV+KYFK L 
Sbjct: 257  SSYTYFTSHIYQAVVSMITKDLNGKPLTAEPEDQTLPMCWKGKKPFRTTRDVKKYFKTLA 316

Query: 345  LSFPGGWRSKPKFEILPESYLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYD 169
             +FP GWRSK  F++ PE YL++S++G+ CLGILNGT +GL+ +N+IGDISM DKMVIYD
Sbjct: 317  FAFPNGWRSKASFDVTPEGYLVVSSKGNACLGILNGTSVGLENFNVIGDISMQDKMVIYD 376

Query: 168  NERKAIGWAAANCDR 124
            NE++ IGW AANCD+
Sbjct: 377  NEKQMIGWTAANCDQ 391


>gb|EOX92687.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma
            cacao]
          Length = 421

 Score =  553 bits (1424), Expect = e-155
 Identities = 269/425 (63%), Positives = 327/425 (76%), Gaps = 4/425 (0%)
 Frame = -3

Query: 1359 GGEKVIILIVFLGIALAGGASSDRQQQGWWK-LRSAGVGSSKA-KPFASSIVLPLYGNVY 1186
            G   V++L++F     A         Q W K + S   GSS       SSI+ P++GNVY
Sbjct: 4    GRMSVLLLLLFFSFCSAS-------DQKWRKAMISTDKGSSMMMNRVGSSILFPIHGNVY 56

Query: 1185 PDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDP 1006
            P GYY   +++GQPPKPYFLD DTGSDLTWLQCDAPCVHC  APHPLYRPTNDLV C+DP
Sbjct: 57   PTGYYNVTISIGQPPKPYFLDLDTGSDLTWLQCDAPCVHCVEAPHPLYRPTNDLVPCKDP 116

Query: 1005 LCASLHS-GDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGY 829
            LCA+LH  GDY+C++PEQCDYEVEYADGGSSLGVLV DVF  N T G R+SPRLA GCGY
Sbjct: 117  LCAALHPPGDYKCENPEQCDYEVEYADGGSSLGVLVRDVFSLNYTNGIRLSPRLALGCGY 176

Query: 828  DQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASS 649
            DQIPG+S+HPLDG+LGLG+GK+SIV+QL +QGL+RNVVGHCLS R       GD +Y SS
Sbjct: 177  DQIPGSSYHPLDGILGLGRGKASIVSQLQSQGLVRNVVGHCLSGRGGGFLFFGDGLYDSS 236

Query: 648  PVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVK 469
             V WT MS + TK+YS G AEL FGG+   +KNL+VVFDSGSSY+YLNSQAY  L  L+K
Sbjct: 237  RVTWTSMSQELTKYYSPGIAELQFGGKATSVKNLIVVFDSGSSYTYLNSQAYQTLTVLLK 296

Query: 468  KELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPES 289
            KEL+G+ L+EA +D TLP+CWKGRKPF+++ DV+KYFK L L+F    R+K +FE+ PE+
Sbjct: 297  KELSGRSLKEAPEDQTLPLCWKGRKPFKNVRDVKKYFKTLALAFASSSRTKTQFELPPEA 356

Query: 288  YLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPKF 112
            YLI+S +G+VCLGILNGT++GLQ  N+IGDISM D+MVIYDNE++ IGWA ANCD+ P+ 
Sbjct: 357  YLIISNKGNVCLGILNGTQVGLQNLNVIGDISMQDRMVIYDNEKQVIGWAPANCDQLPRS 416

Query: 111  NTFLM 97
             T  M
Sbjct: 417  TTGYM 421


>emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
          Length = 424

 Score =  552 bits (1423), Expect = e-154
 Identities = 264/418 (63%), Positives = 322/418 (77%), Gaps = 1/418 (0%)
 Frame = -3

Query: 1347 VIILIVFLGIALAGGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYGNVYPDGYYF 1168
            +++L+V +G++    AS  + ++           SS      SS+V PLYGNVYP GYY+
Sbjct: 9    LVVLVVLVGLSGWSSASDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYYY 68

Query: 1167 AQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLH 988
              +++GQPP PYFLDP TGSDL+WLQCDAPCV CT+A H LYRP N+LV+C+DP+CA LH
Sbjct: 69   VSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHXLYRPNNNLVICKDPMCAXLH 128

Query: 987  SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGAS 808
               Y+C+ PEQCDYEVEYADGGSSLGVLV DVF  N T G R++PRLA GCGYDQIPG S
Sbjct: 129  PPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGYDQIPGXS 188

Query: 807  HHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWTPM 628
            +HPLDGVLGLGKGKSSIV+QLH+QG+IRNVVGHC+SS        GDD+Y SS VVWTPM
Sbjct: 189  YHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYDSSRVVWTPM 248

Query: 627  SNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELNGKP 448
              D   HYS+G AEL  GG+    KNLLV FDSGSSY+YLNS AY AL+ LV+KEL+ KP
Sbjct: 249  LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEKP 308

Query: 447  LREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLILSTR 268
            +REA+DD TLP+CW+G++PF+S+ DVRK+FKPL LSF GG R+K +++I  ESYLI+S  
Sbjct: 309  VREALDDQTLPLCWRGKRPFKSVRDVRKFFKPLALSFAGGGRTKTQYDIPLESYLIIS-- 366

Query: 267  GSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPKFNTFLM 97
            G+VCLGILNGTE GLQ +N+IGDISM DKMV+YDNE+  IGWA  NCDR PKF   ++
Sbjct: 367  GNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKAAIL 424


>gb|EMJ12886.1| hypothetical protein PRUPE_ppa005961mg [Prunus persica]
          Length = 435

 Score =  551 bits (1421), Expect = e-154
 Identities = 264/417 (63%), Positives = 321/417 (76%), Gaps = 4/417 (0%)
 Frame = -3

Query: 1341 ILIVFLGIALAGGASSDRQQQGWWK--LRSAGVGSSKAKPFASSIVLPLYGNVYPDGYYF 1168
            +L++ L   ++  +  D+  +G  K  L      S      ASSIVLP++GNVYP G Y 
Sbjct: 16   LLVMGLSATMSSASFGDQYHRGRRKTMLPDEATSSLGLNRAASSIVLPVHGNVYPIGSYN 75

Query: 1167 AQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLH 988
              +N+GQPPKPYFLDPDTGSDLTWLQCDAPCV CT APHP YRP NDLVVC+DPLC +LH
Sbjct: 76   VTLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVRCTEAPHPFYRPNNDLVVCKDPLCEALH 135

Query: 987  S-GDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGA 811
            + G ++CD+PEQCDYEVEYADGGSSLGVLV D F  N T G + +  LA GCGYDQ+PG+
Sbjct: 136  APGSHKCDNPEQCDYEVEYADGGSSLGVLVRDAFLLNFTNGNQRTTHLALGCGYDQLPGS 195

Query: 810  SHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWTP 631
            S+HP+DGVLGLGKGKSSIV+QL NQGL+R+V+GHCLS R       GD +Y SS +VWTP
Sbjct: 196  SYHPIDGVLGLGKGKSSIVSQLSNQGLVRHVIGHCLSGRGGGFFFLGDGLYDSSRIVWTP 255

Query: 630  MSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELNGK 451
            MS DY KHYS G AEL  GG++ G +NL++VFDSGSSY+YLNSQAY  L S +K+EL GK
Sbjct: 256  MSPDYAKHYSPGLAELIVGGKSTGFRNLVMVFDSGSSYTYLNSQAYQFLTSWLKRELTGK 315

Query: 450  PLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLILST 271
            PL+EA+DD TLP+CWKGRKPFR+I DV+ YFKPL L F  G +   +FE+ PE+YLI+S+
Sbjct: 316  PLKEALDDRTLPLCWKGRKPFRNIRDVKTYFKPLALRFASGRKDTTQFELPPEAYLIISS 375

Query: 270  RGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPKFNTF 103
            +G+VCLGILNG+E+GLQ  NIIGDISM DKMVIYDNE++ IGW   NCD+ PK  +F
Sbjct: 376  KGNVCLGILNGSEVGLQNSNIIGDISMQDKMVIYDNEKQMIGWGPGNCDKLPKSRSF 432


>gb|EXC30733.1| Aspartic proteinase Asp1 [Morus notabilis]
          Length = 432

 Score =  547 bits (1410), Expect = e-153
 Identities = 268/417 (64%), Positives = 325/417 (77%), Gaps = 5/417 (1%)
 Frame = -3

Query: 1338 LIVFLGIA--LAGGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYGNVYPDGYYFA 1165
            L++F+G+   ++  A  + + +        G  S +     SS+V P++GNVYP G+Y  
Sbjct: 13   LVLFMGLCTTISSAAFLENRHRRKSTHPVPGTSSFELNRVGSSVVFPIHGNVYPIGFYNV 72

Query: 1164 QVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLH- 988
             +N+GQPPKPYFLDPDTGSDLTWLQCDAPCV CT  PHPLYRP+NDLV CRDPLC +LH 
Sbjct: 73   TLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVQCTETPHPLYRPSNDLVGCRDPLCIALHL 132

Query: 987  SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGAS 808
             G  +CD+PEQCDYEVEYADGGSSLGVLV D F+FN T G ++ PRLA GCGYDQ+PG+S
Sbjct: 133  PGTPKCDNPEQCDYEVEYADGGSSLGVLVKDAFYFNSTKGDQLKPRLALGCGYDQVPGSS 192

Query: 807  HH-PLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWTP 631
            H  PLDGVLGLG+GK+SIV+QLH+QGL+RNVVGHCLS R       GD+VY SS V WTP
Sbjct: 193  HPLPLDGVLGLGRGKTSIVSQLHSQGLMRNVVGHCLSGRGGGFLFFGDNVYDSSRVDWTP 252

Query: 630  MSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELNGK 451
            MS+DY KHYS GSAEL F G+  GLKNLL VFDSGSSY+YL SQAY  L  L+K+EL  K
Sbjct: 253  MSSDYLKHYSPGSAELRFDGKPTGLKNLLTVFDSGSSYTYLTSQAYQTLTFLIKRELPRK 312

Query: 450  PLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLILST 271
             LREA DD TLP+CWKG++PF+ + DVRKYFKPL L F  G ++K  +E+ PE+YLI+S+
Sbjct: 313  VLREATDDQTLPLCWKGKRPFKRVSDVRKYFKPLALDFTTGGKTK-TYELPPEAYLIVSS 371

Query: 270  RGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPKFNTF 103
            +G+VCLGILNG+EIGLQ  NIIGDISM DKMVIYDNE++ IGWA+ANCD+ PK ++F
Sbjct: 372  KGNVCLGILNGSEIGLQNSNIIGDISMQDKMVIYDNEKQMIGWASANCDKLPKTSSF 428


>ref|XP_006412352.1| hypothetical protein EUTSA_v10025289mg [Eutrema salsugineum]
            gi|312282457|dbj|BAJ34094.1| unnamed protein product
            [Thellungiella halophila] gi|557113522|gb|ESQ53805.1|
            hypothetical protein EUTSA_v10025289mg [Eutrema
            salsugineum]
          Length = 424

 Score =  540 bits (1391), Expect = e-151
 Identities = 255/370 (68%), Positives = 308/370 (83%), Gaps = 4/370 (1%)
 Frame = -3

Query: 1224 ASSIVLPLYGNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPL 1045
            ASS+V P++GNVYP GYY   +N+GQPP+PY+LD DTGSDLTWLQCDAPCVHC  APHPL
Sbjct: 40   ASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHCLEAPHPL 99

Query: 1044 YRPTNDLVVCRDPLCASLH-SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGG 868
            Y+P+NDL+ C DPLC +LH +G+++C++PEQCDYEVEYADGGSSLGVLV DVF  N T G
Sbjct: 100  YQPSNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSLNYTKG 159

Query: 867  ARVSPRLAFGCGYDQIPGAS-HHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRX 691
             R++PRLA GCGYDQIPGAS HHPLDGVLGLG+GK SI++QLH+QG ++NVVGHCLSS  
Sbjct: 160  LRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLG 219

Query: 690  XXXXXXGDDVYASSPVVWTPMSNDYTKHYS-AGSAELTFGGRNVGLKNLLVVFDSGSSYS 514
                  G+D+Y SS V WTPM+ + +KHYS A   EL FGGR  GLKNLL VFDSGSSY+
Sbjct: 220  GGILFFGNDLYDSSRVSWTPMARENSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYT 279

Query: 513  YLNSQAYLALLSLVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFP 334
            Y NS+AY A+  L+K+EL+GKPL+EA DDHTLP+CW+GR+PF SI +V+KYFKPL LSF 
Sbjct: 280  YFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFK 339

Query: 333  GGWRSKPKFEILPESYLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERK 157
             GWRSK  FEI PE+YLI+S +G+VCLGILNGTEIGLQ  N+IGDISM D+M+IYDNE++
Sbjct: 340  TGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQ 399

Query: 156  AIGWAAANCD 127
            +IGW  A+CD
Sbjct: 400  SIGWIPADCD 409


>ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
            lyrata] gi|297313011|gb|EFH43434.1| hypothetical protein
            ARALYDRAFT_328390 [Arabidopsis lyrata subsp. lyrata]
          Length = 425

 Score =  539 bits (1389), Expect = e-150
 Identities = 261/412 (63%), Positives = 326/412 (79%), Gaps = 4/412 (0%)
 Frame = -3

Query: 1350 KVIILIVFLGIALAGGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYGNVYPDGYY 1171
            + +IL++ + + L   ++ D +    W+ ++AG  S +     SS+V P++GNVYP GYY
Sbjct: 7    RFMILLIVMSLVLGFSSAVDFR----WR-KTAGF-SDRFTRAVSSVVFPVHGNVYPLGYY 60

Query: 1170 FAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASL 991
               +N+GQPP+PY+LD DTGSDLTWLQCDAPCV C  APHPLY+P++DL+ C DPLC +L
Sbjct: 61   NVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCKAL 120

Query: 990  H-SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPG 814
            H + + +C++PEQCDYEVEYADGGSSLGVLV DVF  N T G R++PRLA GCGYDQIPG
Sbjct: 121  HLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTKGLRLTPRLALGCGYDQIPG 180

Query: 813  AS-HHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVW 637
            AS HHPLDGVLGLG+GK SI++QLH+QG ++NV+GHCLSS        GDD+Y SS V W
Sbjct: 181  ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSRVSW 240

Query: 636  TPMSNDYTKHYS-AGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKEL 460
            TPMS +Y+KHYS A   EL FGGR  GLKNLL VFDSGSSY+Y NS+AY A+  L+K+EL
Sbjct: 241  TPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKREL 300

Query: 459  NGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLI 280
            +GKPL+EA DDHTLP+CW+GR+PF SI +V+KYFKPL LSF  GWRSK  FEI PE+YLI
Sbjct: 301  SGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLI 360

Query: 279  LSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCD 127
            +S +G+VCLGILNGTEIGLQ  N+IGDISM D+M+IYDNE+++IGW  A+CD
Sbjct: 361  ISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPADCD 412


>ref|XP_002310541.2| hypothetical protein POPTR_0007s04800g [Populus trichocarpa]
            gi|550334146|gb|EEE90991.2| hypothetical protein
            POPTR_0007s04800g [Populus trichocarpa]
          Length = 430

 Score =  538 bits (1385), Expect = e-150
 Identities = 263/431 (61%), Positives = 325/431 (75%), Gaps = 9/431 (2%)
 Frame = -3

Query: 1362 MGGEKV---IILIVFLGIALAGGASSDRQQQGWWKLRSAG--VGSSKA-KPFASSIVLPL 1201
            MG EKV   ++ ++ L + L   A+SD +QQ W K   +G  +GSS       SSIVLPL
Sbjct: 1    MGNEKVGFWVVGVLVLVLILGSSAASDDRQQRWRKAMMSGETMGSSMLMNRVPSSIVLPL 60

Query: 1200 YGNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLV 1021
            +GNVYP G+Y   +N+GQP KPYFLD DTGSDLTWLQCDAPCVHCT APHP Y+P+N+LV
Sbjct: 61   HGNVYPTGFYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVHCTEAPHPYYKPSNNLV 120

Query: 1020 VCRDPLCASLHSG-DYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLA 844
             C+DP+C SLH+G D +C++P QCDYEVEYADGGSSLGVLV D F  N T   R SP LA
Sbjct: 121  ACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNLNFTSEKRQSPLLA 180

Query: 843  FG-CGYDQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGD 667
             G CGYDQ+PG ++HP+DGVLGLG+GK SIV+QL   GL+RNV+GHCLS R       GD
Sbjct: 181  LGLCGYDQLPGGTYHPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSGRGGGFLFFGD 240

Query: 666  DVYASSPVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLA 487
            D+Y SS V WTPMS +  KHYS G AELTF G+  G KNL+V FDSG+SY+YLNSQ Y  
Sbjct: 241  DLYDSSRVAWTPMSPN-AKHYSPGFAELTFDGKTTGFKNLIVAFDSGASYTYLNSQVYQG 299

Query: 486  LLSLVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKF 307
            L+SL+K+EL+ KPLREA+DD TLP+CWKGRKPF+S+ DV+KYFK   LSF    +SK + 
Sbjct: 300  LISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVRDVKKYFKTFALSFANDGKSKTQL 359

Query: 306  EILPESYLILSTRGSVCLGILNGTEIGL-QYNIIGDISMLDKMVIYDNERKAIGWAAANC 130
            E  PE+YLI+S++G+ CLG+LNGTE+GL   N+IGDISM D++VIYDNE++ IGWA  NC
Sbjct: 360  EFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDISMQDRVVIYDNEKQLIGWAPGNC 419

Query: 129  DRPPKFNTFLM 97
            DR PK  + ++
Sbjct: 420  DRLPKSRSIII 430


>ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|21592493|gb|AAM64443.1| nucellin-like protein
            [Arabidopsis thaliana] gi|332660834|gb|AEE86234.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  536 bits (1381), Expect = e-150
 Identities = 263/411 (63%), Positives = 324/411 (78%), Gaps = 4/411 (0%)
 Frame = -3

Query: 1347 VIILIVFLGIALAGGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYGNVYPDGYYF 1168
            V  +IV + ++L  G SS    +  W+ ++AG  S +     SS+V P++GNVYP GYY 
Sbjct: 6    VRFMIVLMVMSLVLGFSSAVDFR--WR-KTAGF-SDRFTRAVSSVVFPVHGNVYPLGYYN 61

Query: 1167 AQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLH 988
              +N+GQPP+PY+LD DTGSDLTWLQCDAPCV C  APHPLY+P++DL+ C DPLC +LH
Sbjct: 62   VTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCKALH 121

Query: 987  -SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGA 811
             + + +C++PEQCDYEVEYADGGSSLGVLV DVF  N T G R++PRLA GCGYDQIPGA
Sbjct: 122  LNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQIPGA 181

Query: 810  S-HHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWT 634
            S HHPLDGVLGLG+GK SI++QLH+QG ++NV+GHCLSS        GDD+Y SS V WT
Sbjct: 182  SSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSRVSWT 241

Query: 633  PMSNDYTKHYS-AGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELN 457
            PMS +Y+KHYS A   EL FGGR  GLKNLL VFDSGSSY+Y NS+AY A+  L+K+EL+
Sbjct: 242  PMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELS 301

Query: 456  GKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLIL 277
            GKPL+EA DDHTLP+CW+GR+PF SI +V+KYFKPL LSF  GWRSK  FEI PE+YLI+
Sbjct: 302  GKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLII 361

Query: 276  STRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCD 127
            S +G+VCLGILNGTEIGLQ  N+IGDISM D+M+IYDNE+++IGW   +CD
Sbjct: 362  SMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCD 412


>ref|XP_006432088.1| hypothetical protein CICLE_v10001122mg [Citrus clementina]
            gi|557534210|gb|ESR45328.1| hypothetical protein
            CICLE_v10001122mg [Citrus clementina]
          Length = 451

 Score =  536 bits (1380), Expect = e-149
 Identities = 266/432 (61%), Positives = 325/432 (75%), Gaps = 15/432 (3%)
 Frame = -3

Query: 1365 QMGGEKV-IILIVFLGIALAGGASSDRQQQGWWKL---------RSAGVGSSKAKPF--- 1225
            +MG E+V ++L + L   +   +SSD  Q  W K           S+   SS +  F   
Sbjct: 15   KMGKERVGLVLALVLMSFVISTSSSDEHQLRWRKSLFSTATTSSSSSSSSSSSSLLFNRV 74

Query: 1224 ASSIVLPLYGNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPL 1045
             SS++  + GNVYP GYY   V +GQPPKPYFLD DTGSDL WLQCDAPCV C  APHPL
Sbjct: 75   GSSLLFRVQGNVYPTGYYNVTVYVGQPPKPYFLDLDTGSDLIWLQCDAPCVQCVEAPHPL 134

Query: 1044 YRPTNDLVVCRDPLCASLHS-GDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGG 868
            YRP+NDLV C DP+CASLH+ G ++C+ P QCDYEVEYADGGSSLGVLV D F FN T G
Sbjct: 135  YRPSNDLVPCEDPICASLHAPGQHKCEDPTQCDYEVEYADGGSSLGVLVKDAFAFNYTNG 194

Query: 867  ARVSPRLAFGCGYDQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXX 688
             R++PRLA GCGYDQ+PGASHHPLDG+LGLGKGKSSIV+QLH+Q LIRNVVGHCLS R  
Sbjct: 195  QRLNPRLALGCGYDQVPGASHHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGRGG 254

Query: 687  XXXXXGDDVYASSPVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYL 508
                 GDD+Y SS VVWT MS+DYTK+YS G AEL FGG+  GLKNL +VFDSGSSY+YL
Sbjct: 255  GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELLFGGKTTGLKNLPLVFDSGSSYTYL 314

Query: 507  NSQAYLALLSLVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGG 328
            +  AY  L S++K+E++ K L+EA +D TLP+CWKG++PF+++ DV+KYFK L LSF  G
Sbjct: 315  SHVAYQTLTSMMKREISAKSLKEAPEDRTLPLCWKGKRPFKNVRDVKKYFKALALSFTDG 374

Query: 327  WRSKPKFEILPESYLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAI 151
             +++  FE+ PE+YLI+S RG+VCLGILNG E+GLQ  N+IGDISM D++VIYDNE++ I
Sbjct: 375  -KTRTLFELTPEAYLIISNRGNVCLGILNGAEVGLQDLNVIGDISMQDRVVIYDNEKQRI 433

Query: 150  GWAAANCDRPPK 115
            GW  ANCDR PK
Sbjct: 434  GWMPANCDRIPK 445


>ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
            [Cucumis sativus]
          Length = 418

 Score =  536 bits (1380), Expect = e-149
 Identities = 261/400 (65%), Positives = 312/400 (78%), Gaps = 4/400 (1%)
 Frame = -3

Query: 1302 ASSDRQQQGWWKLRSAGVGSSKAKPFASS-IVLPLYGNVYPDGYYFAQVNLGQPPKPYFL 1126
            ASS  + + W + R      + +  FASS IVLPL GNVYP+G+Y   + +GQPPKPYFL
Sbjct: 13   ASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFL 72

Query: 1125 DPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLHSG-DYQCDSPEQCD 949
            DPDTGSDLTWLQCDAPC  CT   HPLY+P+NDLV C+DPLC SLHS  D++C++P+QCD
Sbjct: 73   DPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCD 132

Query: 948  YEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGAS-HHPLDGVLGLGK 772
            YEVEYADGGSSLGVLV DVF  N T G  + PRLA GCGYDQ PG+S +HP+DG+LGLG+
Sbjct: 133  YEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGR 192

Query: 771  GKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWTPMSNDYTKHYSAGS 592
            G  SIV+QLHNQG++RNVVGHC +S+       GD +Y    +VWTPMS DY KHYS G 
Sbjct: 193  GAVSIVSQLHNQGIVRNVVGHCFNSKGGGYXFFGDGIYDPYRLVWTPMSRDYPKHYSPGF 252

Query: 591  AELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELNGKPLREAMDDHTLPV 412
             EL F GR+ GL+NL VVFDSGSSY+Y N+QAY  L SL+ +EL GKPLREAMDD TLP+
Sbjct: 253  GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPL 312

Query: 411  CWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLILSTRGSVCLGILNGTE 232
            CW+GRKP +S+ DVRKYFKPL LSF  G RSK  FEI  E Y+I+S+ G+VCLGILNGT+
Sbjct: 313  CWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCLGILNGTD 372

Query: 231  IGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPK 115
            +GL+  NIIGDISM DKMV+Y+NE++AIGWA ANCDR PK
Sbjct: 373  VGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPK 412


>ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 418

 Score =  536 bits (1380), Expect = e-149
 Identities = 260/400 (65%), Positives = 311/400 (77%), Gaps = 4/400 (1%)
 Frame = -3

Query: 1302 ASSDRQQQGWWKLRSAGVGSSKAKPFASS-IVLPLYGNVYPDGYYFAQVNLGQPPKPYFL 1126
            ASS  + + W + R      + +  FASS IVLPL GNVYP+G+Y   + +GQPPKPYFL
Sbjct: 13   ASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFL 72

Query: 1125 DPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLHSG-DYQCDSPEQCD 949
            DPDTGSDLTWLQCDAPC  CT   HPLY+P+NDLV C+DPLC SLHS  D++C++P+QCD
Sbjct: 73   DPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCD 132

Query: 948  YEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGAS-HHPLDGVLGLGK 772
            YEVEYADGGSSLGVLV DVF  N T G  + PRLA GCGYDQ PG+S +HP+DG+LGLG+
Sbjct: 133  YEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGR 192

Query: 771  GKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWTPMSNDYTKHYSAGS 592
            G  SIV+QLHNQG++RNVVGHC +S+       GD +Y    +VWTPMS DY KHYS G 
Sbjct: 193  GAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGF 252

Query: 591  AELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELNGKPLREAMDDHTLPV 412
             EL F GR+ GL+NL VVFDSGSSY+Y N+QAY  L SL+ +EL GKPLREAMDD TLP+
Sbjct: 253  GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPL 312

Query: 411  CWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLILSTRGSVCLGILNGTE 232
            CW+GRKP +S+ DVRKYFKPL LSF  G RSK  FEI  E Y+I+S+ G+VCLGILNGT+
Sbjct: 313  CWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCLGILNGTD 372

Query: 231  IGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCDRPPK 115
            +GL+  NIIGDISM DKMV+Y+NE++AIGWA ANCDR PK
Sbjct: 373  VGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPK 412


>ref|XP_006464925.1| PREDICTED: aspartic proteinase Asp1-like [Citrus sinensis]
          Length = 436

 Score =  535 bits (1379), Expect = e-149
 Identities = 266/431 (61%), Positives = 324/431 (75%), Gaps = 15/431 (3%)
 Frame = -3

Query: 1362 MGGEKV-IILIVFLGIALAGGASSDRQQQGWWKL---------RSAGVGSSKAKPF---A 1222
            MG E+V ++L + L   +   +SSD  Q  W K           S+   SS +  F    
Sbjct: 1    MGKERVGLVLALVLMSFVISTSSSDEHQLRWRKSLFSTATTSSSSSSSSSSSSLLFNRVG 60

Query: 1221 SSIVLPLYGNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLY 1042
            SS++  + GNVYP GYY   V +GQPPKPYFLD DTGSDL WLQCDAPCV C  APHPLY
Sbjct: 61   SSLLFRVQGNVYPTGYYNVTVYVGQPPKPYFLDLDTGSDLIWLQCDAPCVQCVEAPHPLY 120

Query: 1041 RPTNDLVVCRDPLCASLHS-GDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGA 865
            RP+NDLV C DP+CASLH+ G ++C+ P QCDYEVEYADGGSSLGVLV D F FN T G 
Sbjct: 121  RPSNDLVPCEDPICASLHAPGQHKCEDPTQCDYEVEYADGGSSLGVLVKDAFAFNYTNGQ 180

Query: 864  RVSPRLAFGCGYDQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXX 685
            R++PRLA GCGYDQ+PGASHHPLDG+LGLGKGKSSIV+QLH+Q LIRNVVGHCLS R   
Sbjct: 181  RLNPRLALGCGYDQVPGASHHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGRGGG 240

Query: 684  XXXXGDDVYASSPVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLN 505
                GDD+Y SS VVWT MS+DYTK+YS G AEL FGG+  GLKNL +VFDSGSSY+YL+
Sbjct: 241  FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELLFGGKTTGLKNLPLVFDSGSSYTYLS 300

Query: 504  SQAYLALLSLVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGW 325
              AY  L S++K+E++ K L+EA +D TLP+CWKG++PF+++ DV+KYFK L LSF  G 
Sbjct: 301  HVAYQTLTSMMKREISAKSLKEAPEDRTLPLCWKGKRPFKNVRDVKKYFKALALSFTDG- 359

Query: 324  RSKPKFEILPESYLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIG 148
            +++  FE+ PE+YLI+S RG+VCLGILNG E+GLQ  N+IGDISM D++VIYDNE++ IG
Sbjct: 360  KTRTLFELTPEAYLIISNRGNVCLGILNGAEVGLQDLNVIGDISMQDRVVIYDNEKQRIG 419

Query: 147  WAAANCDRPPK 115
            W  ANCDR PK
Sbjct: 420  WMPANCDRIPK 430


>dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score =  535 bits (1377), Expect = e-149
 Identities = 252/369 (68%), Positives = 304/369 (82%), Gaps = 4/369 (1%)
 Frame = -3

Query: 1221 SSIVLPLYGNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLY 1042
            SS+V P++GNVYP GYY   +N+GQPP+PY+LD DTGSDLTWLQCDAPCV C  APHPLY
Sbjct: 32   SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 91

Query: 1041 RPTNDLVVCRDPLCASLH-SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGA 865
            +P++DL+ C DPLC +LH + + +C++PEQCDYEVEYADGGSSLGVLV DVF  N T G 
Sbjct: 92   QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGL 151

Query: 864  RVSPRLAFGCGYDQIPGAS-HHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXX 688
            R++PRLA GCGYDQIPGAS HHPLDGVLGLG+GK SI++QLH+QG ++NV+GHCLSS   
Sbjct: 152  RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 211

Query: 687  XXXXXGDDVYASSPVVWTPMSNDYTKHYS-AGSAELTFGGRNVGLKNLLVVFDSGSSYSY 511
                 GDD+Y SS V WTPMS +Y+KHYS A   EL FGGR  GLKNLL VFDSGSSY+Y
Sbjct: 212  GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTY 271

Query: 510  LNSQAYLALLSLVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPG 331
             NS+AY A+  L+K+EL+GKPL+EA DDHTLP+CW+GR+PF SI +V+KYFKPL LSF  
Sbjct: 272  FNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKT 331

Query: 330  GWRSKPKFEILPESYLILSTRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKA 154
            GWRSK  FEI PE+YLI+S +G+VCLGILNGTEIGLQ  N+IGDISM D+M+IYDNE+++
Sbjct: 332  GWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQS 391

Query: 153  IGWAAANCD 127
            IGW   +CD
Sbjct: 392  IGWMPVDCD 400


>ref|XP_006282959.1| hypothetical protein CARUB_v10007649mg [Capsella rubella]
            gi|482551664|gb|EOA15857.1| hypothetical protein
            CARUB_v10007649mg [Capsella rubella]
          Length = 425

 Score =  534 bits (1375), Expect = e-149
 Identities = 264/411 (64%), Positives = 324/411 (78%), Gaps = 4/411 (0%)
 Frame = -3

Query: 1347 VIILIVFLGIALAGGASSDRQQQGWWKLRSAGVGSSKAKPFASSIVLPLYGNVYPDGYYF 1168
            V  +IV + + LA G SS    +  W+ R+AG  S +     SS+V P+ GNVYP GYY 
Sbjct: 6    VRFMIVLMVMCLALGYSSAVDFR--WR-RTAGF-SDRFTRAVSSVVFPVNGNVYPLGYYN 61

Query: 1167 AQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLYRPTNDLVVCRDPLCASLH 988
              +++GQPP+PY+LD DTGSDLTWLQCDAPCV C  APHPLY+P++DL+ C DPLC +LH
Sbjct: 62   VTIHIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCKALH 121

Query: 987  -SGDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGARVSPRLAFGCGYDQIPGA 811
             +G+ +C++PEQCDYEVEYADGGSSLGVLV DVF  N T G R++PRLA GCGYDQIPGA
Sbjct: 122  LNGNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSLNYTKGLRLTPRLALGCGYDQIPGA 181

Query: 810  S-HHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXXXXXXGDDVYASSPVVWT 634
            S HHPLDGVLGLG+GK SI++QLH+QG ++NV+GHCLSS        G+D+Y SS V WT
Sbjct: 182  SSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGNDLYDSSRVSWT 241

Query: 633  PMSNDYTKHYS-AGSAELTFGGRNVGLKNLLVVFDSGSSYSYLNSQAYLALLSLVKKELN 457
            PMS +Y+KHYS A   EL FGGR  GLKNLL VFDSGSSY+Y NS+AY A+  L+K+EL+
Sbjct: 242  PMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELS 301

Query: 456  GKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGWRSKPKFEILPESYLIL 277
            GK L+EA DDHTLP+CW+GR+PF SI +V+KYFKPL LSF  GWRSK  FEI PE+YLI+
Sbjct: 302  GKALKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLII 361

Query: 276  STRGSVCLGILNGTEIGLQ-YNIIGDISMLDKMVIYDNERKAIGWAAANCD 127
            S +G+VCLGILNGTEIGLQ  N+IGDISM D+M+IYDNE+++IGW  A+CD
Sbjct: 362  SMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPADCD 412


>ref|XP_006382886.1| hypothetical protein POPTR_0005s07070g [Populus trichocarpa]
            gi|550338300|gb|ERP60683.1| hypothetical protein
            POPTR_0005s07070g [Populus trichocarpa]
          Length = 393

 Score =  533 bits (1374), Expect = e-149
 Identities = 255/377 (67%), Positives = 298/377 (79%), Gaps = 2/377 (0%)
 Frame = -3

Query: 1221 SSIVLPLYGNVYPDGYYFAQVNLGQPPKPYFLDPDTGSDLTWLQCDAPCVHCTRAPHPLY 1042
            SSIVLPL+GNVYP+GYY   +N+GQP KPYFLD DTGSDLTWLQCDAPCV CT APHP Y
Sbjct: 18   SSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYY 77

Query: 1041 RPTNDLVVCRDPLCASLHS-GDYQCDSPEQCDYEVEYADGGSSLGVLVNDVFFFNCTGGA 865
            RP N+LV C DP+C SLHS GD++C++P QCDYEVEYADGGSS GVLV D F  N T   
Sbjct: 78   RPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNLNFTSEK 137

Query: 864  RVSPRLAFGCGYDQIPGASHHPLDGVLGLGKGKSSIVTQLHNQGLIRNVVGHCLSSRXXX 685
            R SP LA GCGYDQ PG SHHP+DGVLGLGKGKSSIV+QL + GL+RNV+GHCLS     
Sbjct: 138  RHSPLLALGCGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGG 197

Query: 684  XXXXGDDVYASSPVVWTPMSNDYTKHYSAGSAELTFGGRNVGLKNLLVVFDSGSSYSYLN 505
                GDD+Y SS V WTPMS D  KHYS G AELTF G+  G KNLL  FDSG+SY+YLN
Sbjct: 198  FLFFGDDLYDSSRVAWTPMSPD-AKHYSPGLAELTFDGKTTGFKNLLTTFDSGASYTYLN 256

Query: 504  SQAYLALLSLVKKELNGKPLREAMDDHTLPVCWKGRKPFRSIYDVRKYFKPLGLSFPGGW 325
            SQAY  L+SL+KKEL+GKPLREA+DD TLP+CWKGRKPF+SI DV+KYFK   LSF    
Sbjct: 257  SQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKTFALSFTNER 316

Query: 324  RSKPKFEILPESYLILSTRGSVCLGILNGTEIGL-QYNIIGDISMLDKMVIYDNERKAIG 148
            +SK + E  PE+YLI+S++G+ CLGILNGTE+GL   N+IGDISM D++VIYDNE++ IG
Sbjct: 317  KSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVIYDNEKERIG 376

Query: 147  WAAANCDRPPKFNTFLM 97
            WA  NC+R PK  +F++
Sbjct: 377  WAPGNCNRLPKSKSFII 393

