BLASTX nr result

ID: Forsythia22_contig00031892 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00031892
         (1956 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011099684.1| PREDICTED: aspartic proteinase Asp1 [Sesamum...   706   0.0  
ref|XP_012857588.1| PREDICTED: aspartic proteinase Asp1 [Erythra...   678   0.0  
gb|EPS67652.1| hypothetical protein M569_07122 [Genlisea aurea]       624   e-175
dbj|BAD80835.1| nucellin-like protein [Daucus carota]                 610   e-171
ref|XP_009631909.1| PREDICTED: aspartic proteinase Asp1 [Nicotia...   604   e-170
ref|XP_009766881.1| PREDICTED: aspartic proteinase Asp1 [Nicotia...   594   e-167
ref|XP_006346815.1| PREDICTED: aspartic proteinase Asp1-like [So...   588   e-165
ref|NP_001297397.1| aspartic proteinase Asp1 precursor [Solanum ...   587   e-165
emb|CDP04854.1| unnamed protein product [Coffea canephora]            583   e-163
ref|XP_007048530.1| Eukaryotic aspartyl protease family protein ...   573   e-160
ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis v...   572   e-160
ref|XP_012489372.1| PREDICTED: aspartic proteinase Asp1 [Gossypi...   565   e-158
ref|XP_002522918.1| nucellin, putative [Ricinus communis] gi|223...   555   e-155
emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]   554   e-155
ref|XP_012087626.1| PREDICTED: aspartic proteinase Asp1 [Jatroph...   550   e-153
ref|XP_004147327.2| PREDICTED: aspartic proteinase Asp1 isoform ...   546   e-152
ref|XP_009355861.1| PREDICTED: aspartic proteinase Asp1-like [Py...   545   e-152
ref|XP_006432088.1| hypothetical protein CICLE_v10001122mg [Citr...   545   e-152
ref|XP_010111255.1| Aspartic proteinase Asp1 [Morus notabilis] g...   544   e-152
ref|XP_007211687.1| hypothetical protein PRUPE_ppa005961mg [Prun...   543   e-151

>ref|XP_011099684.1| PREDICTED: aspartic proteinase Asp1 [Sesamum indicum]
          Length = 422

 Score =  706 bits (1823), Expect = 0.0
 Identities = 332/420 (79%), Positives = 363/420 (86%), Gaps = 2/420 (0%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVSGASSI-DQQQKWRKWMSPGASSSEANRIGASVIFPLYGNVYP 588
            M K K + ++ F VLA S A S  DQQ KWRKW    +S S  NR+G+S+IFPLYGNVYP
Sbjct: 1    MKKEKLVFMIAFAVLAASCAGSCNDQQLKWRKWGPSSSSPSSINRLGSSIIFPLYGNVYP 60

Query: 589  NGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPL 768
            NGFYFVQV++GYPP+PYFLDPDTGSDLTWLQCDAPCV CT GFHPLYRPSN+LV+CKDPL
Sbjct: 61   NGFYFVQVYLGYPPKPYFLDPDTGSDLTWLQCDAPCVRCTTGFHPLYRPSNELVICKDPL 120

Query: 769  CASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQ 948
            CASLHS DY CDNPEQCDYEVEYADGGSSLGVLVND F+LNLTSG+R++PRLT+GCGYDQ
Sbjct: 121  CASLHSQDYNCDNPEQCDYEVEYADGGSSLGVLVNDFFTLNLTSGVRMSPRLTIGCGYDQ 180

Query: 949  LPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGGFLFFGEDVYDSSRIT 1128
            LPG SDHPLDGVLGLGKGKSSIVSQLR+QGV+KNV+GHCLSGQGGFLFFGEDVYDSSR+T
Sbjct: 181  LPGASDHPLDGVLGLGKGKSSIVSQLREQGVMKNVIGHCLSGQGGFLFFGEDVYDSSRVT 240

Query: 1129 WTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXX 1305
            WT M RDYTKHY+AG+AEL FGGKSTG KNLNVIFDSGSSYTYF+S IY+          
Sbjct: 241  WTSMARDYTKHYAAGSAELRFGGKSTGFKNLNVIFDSGSSYTYFSSHIYHTLLSLINKEL 300

Query: 1306 XXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLI 1485
                 REATDD TLPFCWKGKKPFKSTRDV+KYFKPL+L FPNGW+ K QFEI PE YLI
Sbjct: 301  RRTSLREATDDHTLPFCWKGKKPFKSTRDVRKYFKPLSLSFPNGWRAKAQFEIPPEGYLI 360

Query: 1486 ISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRV 1665
            ISSKGNACLGILNGTD+GL+NFNMIGDISMQDK+LIYDNEK  IGWT ANCDQ P SN V
Sbjct: 361  ISSKGNACLGILNGTDVGLSNFNMIGDISMQDKMLIYDNEKHVIGWTPANCDQRPTSNSV 420


>ref|XP_012857588.1| PREDICTED: aspartic proteinase Asp1 [Erythranthe guttatus]
            gi|604300852|gb|EYU20602.1| hypothetical protein
            MIMGU_mgv1a007047mg [Erythranthe guttata]
          Length = 422

 Score =  678 bits (1749), Expect = 0.0
 Identities = 320/420 (76%), Positives = 360/420 (85%), Gaps = 4/420 (0%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVS---GASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYGNV 582
            M K K I I+I VV+ V+   G +   QQ KWRKW  P  S S+  + G+S+IFPLYGNV
Sbjct: 1    MKKEKLIFIIIVVVVLVASCAGCNKDQQQLKWRKW-GPSCSPSKFTKFGSSIIFPLYGNV 59

Query: 583  YPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKD 762
            YPNGFYFVQV++GYPP+PYFLDPDTGSDLTWLQCDAPCV CT GFHPLYRPSNDLV+CKD
Sbjct: 60   YPNGFYFVQVYLGYPPKPYFLDPDTGSDLTWLQCDAPCVRCTTGFHPLYRPSNDLVICKD 119

Query: 763  PLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGY 942
            PLCASLHSSDY+C+N EQCDYEVEYADGGSSLGVLVND F+LNLT+G+R++PRLTLGCGY
Sbjct: 120  PLCASLHSSDYKCENTEQCDYEVEYADGGSSLGVLVNDFFTLNLTTGVRMSPRLTLGCGY 179

Query: 943  DQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGGFLFFGEDVYDSSR 1122
            DQLPG SDHPLDGV GLG+GKSSIVSQLR+QG+VKN+VGHCLS QGG+LF GED YDSSR
Sbjct: 180  DQLPGPSDHPLDGVFGLGRGKSSIVSQLREQGIVKNIVGHCLSEQGGYLFLGEDAYDSSR 239

Query: 1123 ITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXX 1299
            +TWTPM RD TKHY+AG+AEL FGGKSTG KNLNVIFDSGSSYTYFNSQIY+        
Sbjct: 240  MTWTPMSRDDTKHYTAGSAELRFGGKSTGFKNLNVIFDSGSSYTYFNSQIYHTLISLIKK 299

Query: 1300 XXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAY 1479
                   +EA+DDRTLPFCWKGKKPFK+TRDV+KYFK L+L F NGW+ K QF+I PE Y
Sbjct: 300  ELSGKSLKEASDDRTLPFCWKGKKPFKTTRDVRKYFKSLSLSFVNGWRTKAQFDIPPEGY 359

Query: 1480 LIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSN 1659
            LIISSKGNACLGILNGTD+GL+NFNMIGDISMQDKLL+YDNEKQ IGWT ANCDQ P+S+
Sbjct: 360  LIISSKGNACLGILNGTDVGLSNFNMIGDISMQDKLLVYDNEKQVIGWTPANCDQIPKSS 419


>gb|EPS67652.1| hypothetical protein M569_07122 [Genlisea aurea]
          Length = 401

 Score =  624 bits (1608), Expect = e-175
 Identities = 286/375 (76%), Positives = 325/375 (86%), Gaps = 3/375 (0%)
 Frame = +1

Query: 529  SEANRIGASVIFPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCT 708
            S  N  G+S++ P+YGNVYP+GFYFVQV++GYPPRPYFLDPDTGSDLTWLQCDAPCV CT
Sbjct: 17   SATNTFGSSIMLPVYGNVYPDGFYFVQVYLGYPPRPYFLDPDTGSDLTWLQCDAPCVRCT 76

Query: 709  KGFHPLYRPSNDLVVCKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSL 888
            +GFHPLYRPSNDLVVCKDPLCASLHSSDY CDNPEQCDYEVEYADGGSSLGVLVND F+L
Sbjct: 77   EGFHPLYRPSNDLVVCKDPLCASLHSSDYTCDNPEQCDYEVEYADGGSSLGVLVNDFFTL 136

Query: 889  NLTSGIRINPRLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCL 1068
            NLT+G+R++PRLT+GCGYDQL G SDHPLDGVLGLGKGKSSIVSQLRDQGVVKNV+GHCL
Sbjct: 137  NLTAGVRMSPRLTIGCGYDQLAGSSDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVIGHCL 196

Query: 1069 S--GQGGFLFFGEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSG 1242
            S  G+GGF+FFG+D+YDSSR+TWTPM  ++  HY+AG AEL FGG+STG KNLNV+FDSG
Sbjct: 197  SRVGKGGFVFFGDDLYDSSRVTWTPMSHEHNNHYAAGLAELRFGGRSTGFKNLNVVFDSG 256

Query: 1243 SSYTYFNSQIYYAXXXXXXXXXXXXXREA-TDDRTLPFCWKGKKPFKSTRDVKKYFKPLA 1419
            SSYTYF S IY A               A  +D+TLP CWKGKKPF++TRDVKKYFK LA
Sbjct: 257  SSYTYFTSHIYQAVVSMITKDLNGKPLTAEPEDQTLPMCWKGKKPFRTTRDVKKYFKTLA 316

Query: 1420 LRFPNGWKGKPQFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYD 1599
              FPNGW+ K  F+++PE YL++SSKGNACLGILNGT +GL NFN+IGDISMQDK++IYD
Sbjct: 317  FAFPNGWRSKASFDVTPEGYLVVSSKGNACLGILNGTSVGLENFNVIGDISMQDKMVIYD 376

Query: 1600 NEKQAIGWTAANCDQ 1644
            NEKQ IGWTAANCDQ
Sbjct: 377  NEKQMIGWTAANCDQ 391


>dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score =  610 bits (1574), Expect = e-171
 Identities = 281/418 (67%), Positives = 339/418 (81%), Gaps = 2/418 (0%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYGNVYPN 591
            M K  + ++ +F+VL + G SS DQQQ W KW S GASSS  + +G+SV+ PLYGNVYP+
Sbjct: 5    MAKICKQIMSVFLVLMIVGVSSDDQQQSWWKWFSSGASSSVVSSVGSSVVLPLYGNVYPS 64

Query: 592  GFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLC 771
            G+Y VQ  +G PP+PYFLDPDTGSDLTWLQCDAPC+ CT   HPLY+P+NDLVVCKDP+C
Sbjct: 65   GYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKDPIC 124

Query: 772  ASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQL 951
            ASLH  +YRCD+P+QCDYEVEYADGGSS+GVLVNDLF +NLTSG+R  PRLT+GCGYDQL
Sbjct: 125  ASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCGYDQL 184

Query: 952  PGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRIT 1128
            PGI+ HPLDGVLGLG+G SSIV+QL  QG+V+NVVGHC S + GG+LFFG+D+YDSS++ 
Sbjct: 185  PGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYDSSKVI 244

Query: 1129 WTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXX 1305
            WTPM RDY KHY+ G AEL+  G+S+GLKNL V+FDSGSSYTYFN+Q Y           
Sbjct: 245  WTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLSFIKKDL 304

Query: 1306 XXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLI 1485
                 +EA +D TLP CW+GKKPFKS RD KKYFKPLAL F +GWK K QFEI  E+YLI
Sbjct: 305  HGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWKTKSQFEIQQESYLI 364

Query: 1486 ISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSN 1659
            ISSKG+ CLGILNGT++GL N+N+IGDISMQ+KL+IYDNEKQ IGW  +NCD+PP+ +
Sbjct: 365  ISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCDRPPKGD 422


>ref|XP_009631909.1| PREDICTED: aspartic proteinase Asp1 [Nicotiana tomentosiformis]
          Length = 431

 Score =  604 bits (1558), Expect = e-170
 Identities = 288/431 (66%), Positives = 343/431 (79%), Gaps = 11/431 (2%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVSGASS------IDQQQKWRKWMSPG--ASSSEANRIGAS-VIF 564
            MG GK I +++FVV+AVS A+       + QQQKW KWMS G  ASSS    + +S ++ 
Sbjct: 1    MGGGKIIGMVMFVVIAVSAAAGSGDNQQLQQQQKWWKWMSSGSAASSSVVKPVASSSIVL 60

Query: 565  PLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSND 744
            PLYGNVYP G+Y+VQ+ +G P +PYFLDPDTGSDLTWLQCDAPCV CT+  HP Y+P+ND
Sbjct: 61   PLYGNVYPIGYYYVQLNIGQPSKPYFLDPDTGSDLTWLQCDAPCVRCTRAPHPFYKPNND 120

Query: 745  LVVCKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRL 924
            LV CKDPLCASLH  DY+C++PEQCDY+V+YADGGSSLGVL+ND+F+ N TSG RI PRL
Sbjct: 121  LVPCKDPLCASLHHVDYKCESPEQCDYQVDYADGGSSLGVLLNDVFNFNATSGARIIPRL 180

Query: 925  TLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGE 1101
             LGCGYDQLPG S HPLDGVLGLGKGK+SIVSQL  +G+V+NVVGHCLSG+ GGFLFFG+
Sbjct: 181  ALGCGYDQLPGQSHHPLDGVLGLGKGKASIVSQLHSKGLVRNVVGHCLSGRGGGFLFFGD 240

Query: 1102 DVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA 1281
            +VYDSSRI WTPM  D  KHYSAG+ EL+FGGK+TG KNL V+FDSGSS++Y NSQ Y  
Sbjct: 241  EVYDSSRIVWTPMAHDRMKHYSAGSGELIFGGKATGFKNLFVVFDSGSSFSYLNSQTYQG 300

Query: 1282 -XXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQF 1458
                          REA DD TLP CWKG++PFK+  DVKKYFK  AL FP+GWK K  F
Sbjct: 301  FISLLKKELNGKPLREAKDDYTLPLCWKGRRPFKTINDVKKYFKNFALSFPHGWKSKAHF 360

Query: 1459 EISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANC 1638
            EI PE+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGW+ ANC
Sbjct: 361  EIPPESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWSPANC 420

Query: 1639 DQPPRSNRVIL 1671
            D+PP+SN +I+
Sbjct: 421  DRPPKSNNMIM 431


>ref|XP_009766881.1| PREDICTED: aspartic proteinase Asp1 [Nicotiana sylvestris]
          Length = 433

 Score =  594 bits (1532), Expect = e-167
 Identities = 279/433 (64%), Positives = 342/433 (78%), Gaps = 13/433 (3%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVSGASS--------IDQQQKWRKWMSPGASSSEA---NRIGASV 558
            MG GK I ++IFV ++V+ +++        + QQQKW KWMS G+++S +     + +S+
Sbjct: 1    MGGGKIIGMVIFVAVSVAVSAAAGYGDNQQLQQQQKWWKWMSSGSAASSSVVKPVVSSSI 60

Query: 559  IFPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPS 738
            + PLYGN+YP G+Y+VQ+ +G P +PYFLDPDTGSDLTWLQCDAPCV CT+  HP Y+P+
Sbjct: 61   VLPLYGNIYPIGYYYVQLNIGQPSKPYFLDPDTGSDLTWLQCDAPCVRCTRAPHPFYKPN 120

Query: 739  NDLVVCKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINP 918
            NDLV CKDPLCASLH  DY+C++PEQCDY+V+YADGGSSLGVL+ND+F  N TSG RI P
Sbjct: 121  NDLVPCKDPLCASLHHVDYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNGTSGARIIP 180

Query: 919  RLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFF 1095
            RL LGCGYDQLPG S HPLDGVLGLGKGK+SIVSQL  +G+V+NVVGHCLSG+ GGFLFF
Sbjct: 181  RLALGCGYDQLPGQSHHPLDGVLGLGKGKASIVSQLHSKGLVRNVVGHCLSGRGGGFLFF 240

Query: 1096 GEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY 1275
            G++VYDSSRI WTPM  D  KHYSAG+ EL+FGGK+TG KNL V+FDSGSS++Y NSQ Y
Sbjct: 241  GDEVYDSSRIVWTPMAHDRMKHYSAGSGELIFGGKATGFKNLFVVFDSGSSFSYLNSQTY 300

Query: 1276 YA-XXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKP 1452
                            +EA DD TLP CWKG++PFK+  DVKKYFK   L FP+GWK K 
Sbjct: 301  QGFISLLKKELNGKPLKEAKDDYTLPLCWKGRRPFKTINDVKKYFKNFVLSFPHGWKSKA 360

Query: 1453 QFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAA 1632
             FEI PE+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGW+ A
Sbjct: 361  HFEIPPESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWSPA 420

Query: 1633 NCDQPPRSNRVIL 1671
            NCD+PP+SN +I+
Sbjct: 421  NCDRPPKSNNMIM 433


>ref|XP_006346815.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum]
          Length = 437

 Score =  588 bits (1517), Expect = e-165
 Identities = 277/428 (64%), Positives = 338/428 (78%), Gaps = 8/428 (1%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVS------GASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLY 573
            MG GK + I+IFVV+ VS      G +   QQQ+ +KWMS  ++++    + +S++ PLY
Sbjct: 10   MGGGKIVGILIFVVVVVSAAGGGGGENHQQQQQQQQKWMSSTSAAAVNPVVSSSIVLPLY 69

Query: 574  GNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVV 753
            GNVYP G+Y+VQ+ +G P RP+FLDPDTGSDLTWLQCDAPCV CT   HP Y+P+NDLV 
Sbjct: 70   GNVYPLGYYYVQLNIGQPSRPFFLDPDTGSDLTWLQCDAPCVRCTTAPHPFYKPNNDLVP 129

Query: 754  CKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLG 933
            CKDPLCASLH + Y+C++PEQCDY+V+YADGGSSLGVL+ND+F  N+TSG R+ PRL+LG
Sbjct: 130  CKDPLCASLHPAGYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNMTSGARMIPRLSLG 189

Query: 934  CGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVY 1110
            CGYDQLPG S HPLDGVLGLG+GK+SIVSQL  +GVV+NVVGHCLSG+ GGFLFFG++VY
Sbjct: 190  CGYDQLPGQSYHPLDGVLGLGRGKTSIVSQLHSKGVVQNVVGHCLSGRGGGFLFFGDEVY 249

Query: 1111 DSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XX 1287
            DSSRI WTPM  D  KHYSAG+ EL+FGGK TGLKNL V+FDSGSS++Y N+  Y     
Sbjct: 250  DSSRIVWTPMAHDRMKHYSAGSGELIFGGKGTGLKNLFVVFDSGSSFSYLNAHTYEGFIS 309

Query: 1288 XXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEIS 1467
                       RE  DD TLP CWKG++PFK+  DVKKYFK  AL F NGWK K  FEI 
Sbjct: 310  LLKKELNGKPLRETKDDYTLPLCWKGRRPFKTINDVKKYFKQFALSFGNGWKSKAHFEIP 369

Query: 1468 PEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQP 1647
            PE+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGWT+ANCD+P
Sbjct: 370  PESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWTSANCDRP 429

Query: 1648 PRSNRVIL 1671
            P+S+ +I+
Sbjct: 430  PKSSNMIM 437


>ref|NP_001297397.1| aspartic proteinase Asp1 precursor [Solanum lycopersicum]
          Length = 427

 Score =  587 bits (1514), Expect = e-165
 Identities = 276/427 (64%), Positives = 334/427 (78%), Gaps = 7/427 (1%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVS-----GASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYG 576
            MG GK + I+IFVV+ VS     G +   QQQKW KWMS  +++     + +S++ PLYG
Sbjct: 1    MGGGKIVGILIFVVVVVSAAGGGGENHHHQQQKWWKWMSSTSAAMVNPVVSSSIVLPLYG 60

Query: 577  NVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVC 756
            NVYP G+Y+VQ+ +G P RP+FLDPDTGSDLTWLQCDAPCV CT   HP Y+P+NDLV C
Sbjct: 61   NVYPLGYYYVQLNIGQPSRPFFLDPDTGSDLTWLQCDAPCVRCTTAPHPFYKPNNDLVPC 120

Query: 757  KDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGC 936
            KDPLCASLH + Y+C++PEQCDY+V+YADGGSSLGVL+ND+F  N+TSG R+ PRL+LGC
Sbjct: 121  KDPLCASLHPAGYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNMTSGARMIPRLSLGC 180

Query: 937  GYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYD 1113
            GYDQLPG S HPLDGVLGLG+GK+SIVSQL  +G V+NVVGHCLSG+ GGFLFFG++VYD
Sbjct: 181  GYDQLPGQSYHPLDGVLGLGRGKTSIVSQLHSKGAVQNVVGHCLSGRGGGFLFFGDEVYD 240

Query: 1114 SSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXX 1290
            SSRI WTPM  D  KHYSAG+ EL+FGGK TGLKNL V+FDSGSS++Y N+  Y      
Sbjct: 241  SSRIVWTPMAHDRMKHYSAGSGELIFGGKGTGLKNLFVVFDSGSSFSYLNAHTYEGFISL 300

Query: 1291 XXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISP 1470
                      RE  DD TLP CWKG++PFK+  D KKYFK  AL F NGWK K  FEI P
Sbjct: 301  LKKELNGKPLRETKDDYTLPLCWKGRRPFKTINDAKKYFKQFALSFGNGWKSKAHFEIPP 360

Query: 1471 EAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPP 1650
            E+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGW +ANCD+PP
Sbjct: 361  ESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWMSANCDRPP 420

Query: 1651 RSNRVIL 1671
            +S+ +I+
Sbjct: 421  KSSNMIM 427


>emb|CDP04854.1| unnamed protein product [Coffea canephora]
          Length = 410

 Score =  583 bits (1502), Expect = e-163
 Identities = 277/405 (68%), Positives = 324/405 (80%), Gaps = 6/405 (1%)
 Frame = +1

Query: 475  SIDQQQKWRKW---MSPGASSSEANRIGA--SVIFPLYGNVYPNGFYFVQVFVGYPPRPY 639
            S DQ QKW KW   +S  ASSSEAN + +  S++F LYGNV+P+G+YF QV VG PP+PY
Sbjct: 6    SRDQLQKWCKWKSRVSTEASSSEANPVSSYSSILFKLYGNVHPDGYYFAQVNVGQPPKPY 65

Query: 640  FLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASLHSSDYRCDNPEQC 819
            FLDPDTGSDLTWLQCDAPCV CT+  HPLYRP+NDLVVC+DPLCASLHS  Y C NPEQC
Sbjct: 66   FLDPDTGSDLTWLQCDAPCVRCTEAPHPLYRPTNDLVVCRDPLCASLHSGAYECPNPEQC 125

Query: 820  DYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGISDHPLDGVLGLGK 999
            DYEVEYADGGSS GVLVND+FSLNLT+GIR+  RL  GCGYDQLP +   PLDGVLGLGK
Sbjct: 126  DYEVEYADGGSSFGVLVNDVFSLNLTTGIRLGLRLAFGCGYDQLPSVYAPPLDGVLGLGK 185

Query: 1000 GKSSIVSQLRDQGVVKNVVGHCLSGQGGFLFFGEDVYDSSRITWTPMLRDYTKHYSAGNA 1179
            G SSIVSQL +QG+V+N++GHCLS  GGFLFFG+D+YD+S++ W PM +D TK YS  +A
Sbjct: 186  GNSSIVSQLHNQGIVRNIIGHCLSATGGFLFFGDDLYDASQVNWAPMSQDSTKRYSVSSA 245

Query: 1180 ELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXXXXXREATDDRTLPFC 1356
            EL FGGK  G+KNL+VIFDSGSSY+Y NSQ Y A              +EA DDRTLP C
Sbjct: 246  ELTFGGKGVGIKNLDVIFDSGSSYSYLNSQAYRAIISLIEKDLKGKPLKEAKDDRTLPEC 305

Query: 1357 WKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISSKGNACLGILNGTDI 1536
            W+G+KPFKS  DV+KYFKPL L F +G + + QFEI P+AYLIISSKGNACLGILNGT+I
Sbjct: 306  WRGRKPFKSVHDVRKYFKPLGLSFHHGQRVRTQFEIPPDAYLIISSKGNACLGILNGTEI 365

Query: 1537 GLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRVIL 1671
            GL N N+IGDISMQDK++IYDNEK AIGW+ ANC +PP+SN  I+
Sbjct: 366  GLQNVNLIGDISMQDKMVIYDNEKGAIGWSPANCSRPPKSNTFIM 410


>ref|XP_007048530.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma
            cacao] gi|508700791|gb|EOX92687.1| Eukaryotic aspartyl
            protease family protein isoform 1 [Theobroma cacao]
          Length = 421

 Score =  573 bits (1476), Expect = e-160
 Identities = 274/420 (65%), Positives = 331/420 (78%), Gaps = 5/420 (1%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWM--SPGASSSEANRIGASVIFPLYGNVY 585
            MGKG+  V+++ +  +   AS     QKWRK M  +   SS   NR+G+S++FP++GNVY
Sbjct: 1    MGKGRMSVLLLLLFFSFCSASD----QKWRKAMISTDKGSSMMMNRVGSSILFPIHGNVY 56

Query: 586  PNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDP 765
            P G+Y V + +G PP+PYFLD DTGSDLTWLQCDAPCV C +  HPLYRP+NDLV CKDP
Sbjct: 57   PTGYYNVTISIGQPPKPYFLDLDTGSDLTWLQCDAPCVHCVEAPHPLYRPTNDLVPCKDP 116

Query: 766  LCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGY 942
            LCA+LH   DY+C+NPEQCDYEVEYADGGSSLGVLV D+FSLN T+GIR++PRL LGCGY
Sbjct: 117  LCAALHPPGDYKCENPEQCDYEVEYADGGSSLGVLVRDVFSLNYTNGIRLSPRLALGCGY 176

Query: 943  DQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSS 1119
            DQ+PG S HPLDG+LGLG+GK+SIVSQL+ QG+V+NVVGHCLSG+ GGFLFFG+ +YDSS
Sbjct: 177  DQIPGSSYHPLDGILGLGRGKASIVSQLQSQGLVRNVVGHCLSGRGGGFLFFGDGLYDSS 236

Query: 1120 RITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXX 1296
            R+TWT M ++ TK+YS G AEL FGGK+T +KNL V+FDSGSSYTY NSQ Y        
Sbjct: 237  RVTWTSMSQELTKYYSPGIAELQFGGKATSVKNLIVVFDSGSSYTYLNSQAYQTLTVLLK 296

Query: 1297 XXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEA 1476
                    +EA +D+TLP CWKG+KPFK+ RDVKKYFK LAL F +  + K QFE+ PEA
Sbjct: 297  KELSGRSLKEAPEDQTLPLCWKGRKPFKNVRDVKKYFKTLALAFASSSRTKTQFELPPEA 356

Query: 1477 YLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRS 1656
            YLIIS+KGN CLGILNGT +GL N N+IGDISMQD+++IYDNEKQ IGW  ANCDQ PRS
Sbjct: 357  YLIISNKGNVCLGILNGTQVGLQNLNVIGDISMQDRMVIYDNEKQVIGWAPANCDQLPRS 416


>ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
            gi|296082608|emb|CBI21613.3| unnamed protein product
            [Vitis vinifera]
          Length = 426

 Score =  572 bits (1474), Expect = e-160
 Identities = 272/419 (64%), Positives = 327/419 (78%), Gaps = 5/419 (1%)
 Frame = +1

Query: 430  IVIMIFVVLAVSGASSIDQQQKWRK---WMSPGASSSEANRIGASVIFPLYGNVYPNGFY 600
            +++++ V++ +SG SS    Q  RK   +  P ASSS  N I +SV+FPLYGNVYP G+Y
Sbjct: 8    VLVVLVVLVGLSGWSSASDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYY 67

Query: 601  FVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASL 780
            +V + +G PP+PYFLDPDTGSDL+WLQCDAPCV CTK  HPLYRP+N+LV+CKDP+CASL
Sbjct: 68   YVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPMCASL 127

Query: 781  HSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGI 960
            H   Y+C++PEQCDYEVEYADGGSSLGVLV D+F LN T+G+R+ PRL LGCGYDQ+PG 
Sbjct: 128  HPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGYDQIPGQ 187

Query: 961  SDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRITWTP 1137
            S HPLDGVLGLGKGKSSIVSQL  QGV++NVVGHC+S + GGFLFFG+D+YDSSR+ WTP
Sbjct: 188  SYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYDSSRVVWTP 247

Query: 1138 MLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXXX 1314
            MLRD   HYS+G AEL+ GGK+T  KNL V FDSGSSYTY NS  Y A            
Sbjct: 248  MLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEK 307

Query: 1315 XXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISS 1494
              REA DD+TLP CW+GK+PFKS RDVKK+FKPLAL FP G + K Q++I  E+YLIIS 
Sbjct: 308  PVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKTQYDIPLESYLIISL 367

Query: 1495 KGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRVIL 1671
            KGN CLGILNGT+ GL +FN+IGDISMQDK+++YDNEK  IGW   NCD+ P+    IL
Sbjct: 368  KGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKAAIL 426


>ref|XP_012489372.1| PREDICTED: aspartic proteinase Asp1 [Gossypium raimondii]
            gi|763773371|gb|KJB40494.1| hypothetical protein
            B456_007G066800 [Gossypium raimondii]
          Length = 426

 Score =  565 bits (1456), Expect = e-158
 Identities = 269/423 (63%), Positives = 332/423 (78%), Gaps = 9/423 (2%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWM------SPGASSSEANRIGASVIFPLY 573
            M KG+  V  + + L++S AS     QKWRK M      S  +SS   NR+G+S++FP++
Sbjct: 1    MRKGQVNVFFLLLFLSLSSASD----QKWRKAMMSAYNGSSSSSSMMMNRVGSSILFPIH 56

Query: 574  GNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVV 753
            GNVYP G+Y V + +G+PP+PYFLD DTGSDLTWLQC+APCV C +  HPLY+PSNDLV 
Sbjct: 57   GNVYPTGYYNVTINIGHPPKPYFLDLDTGSDLTWLQCNAPCVHCIEAPHPLYQPSNDLVA 116

Query: 754  CKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLG 933
            C+ PLCA+LH  DY+C++P+QCDYEVEYADGGSSLGVLV D+FSLN T+G+R++PRL LG
Sbjct: 117  CRHPLCAALHPPDYKCESPDQCDYEVEYADGGSSLGVLVRDVFSLNYTNGVRLSPRLALG 176

Query: 934  CGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVY 1110
            CGYDQ+PG S HPLDG+LGLG+GKSSIVSQL+ QG+V+NVVGHCLSG+ GGFLFFG+ +Y
Sbjct: 177  CGYDQIPGTSYHPLDGILGLGRGKSSIVSQLQSQGLVRNVVGHCLSGRGGGFLFFGDGLY 236

Query: 1111 DSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XX 1287
            DSS +TWT M +++TK+YS G+AEL FGGK+TG+KNL VIFDSGSSYTY NSQ Y A   
Sbjct: 237  DSSHVTWTSMSQEFTKYYSPGSAELHFGGKATGIKNLIVIFDSGSSYTYLNSQAYQALTL 296

Query: 1288 XXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFK-PLALRFPNGWKGKPQFEI 1464
                       +EA +D+TLP CWKG+KPF+S  D KKYFK  LAL F N  + K QFE+
Sbjct: 297  LLKKELSGRSLKEAPEDQTLPLCWKGRKPFRSVHDAKKYFKTSLALAFANSGRRKTQFEL 356

Query: 1465 SPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQ 1644
             PEAYLIIS+KGN CLGILNGT +GL N N+IGDISMQD++++YDNEKQ IGW+ ANCD 
Sbjct: 357  HPEAYLIISNKGNVCLGILNGTQVGLQNLNVIGDISMQDRMVVYDNEKQVIGWSPANCDH 416

Query: 1645 PPR 1653
             PR
Sbjct: 417  LPR 419


>ref|XP_002522918.1| nucellin, putative [Ricinus communis] gi|223537845|gb|EEF39461.1|
            nucellin, putative [Ricinus communis]
          Length = 433

 Score =  555 bits (1430), Expect = e-155
 Identities = 276/433 (63%), Positives = 332/433 (76%), Gaps = 13/433 (3%)
 Frame = +1

Query: 412  MGKGKQ---IVIMIFVVLAVSG---ASSIDQQQKWRKWMSPG--ASSSEANRIGASVIFP 567
            MGKG     +V M+ ++  +SG   ASS D+QQ+WRK +  G   SS   NR G+S++FP
Sbjct: 1    MGKGDVGFWVVTMLVLIGLISGSSAASSDDRQQRWRKAVLSGEITSSMMINRAGSSLVFP 60

Query: 568  LYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDL 747
            L+GNVYP G+Y V + +G P +PYFLD DTGSDLTWLQCDAPC  C +  HPLYRPSN+L
Sbjct: 61   LHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQCIEAPHPLYRPSNNL 120

Query: 748  VVCKDPLCASLHSSD-YRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRL 924
            V+C+DPLCASL     + C +P+QCDYEVEYADGGSSLGVLV D+F LN T+G R+NP L
Sbjct: 121  VICEDPLCASLQPPGVHNCQDPDQCDYEVEYADGGSSLGVLVKDVFVLNFTNGKRLNPLL 180

Query: 925  TLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGE 1101
             LGCGYDQLPG S+HPLDG+LGLG+G SSI SQL  QG+V NV+GHCLSG+ GGFLFFGE
Sbjct: 181  ALGCGYDQLPGRSNHPLDGILGLGRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGE 240

Query: 1102 DVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY-Y 1278
            D+YDSS +TWTPM RD+ KHYS G AEL+F GKSTG++NL V+FDSGSSYTY N+Q Y +
Sbjct: 241  DIYDSSGVTWTPMSRDHLKHYSPGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQH 300

Query: 1279 AXXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRF--PNGWKGKP 1452
                           EA DD+TLP CWKGK+PFKS RDVKKYFKP AL F   +G   K 
Sbjct: 301  LVFSLKRELSRKPISEALDDQTLPLCWKGKRPFKSIRDVKKYFKPFALVFKTSSGRSSKT 360

Query: 1453 QFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAA 1632
            QFE SPEAYLIISSKGNACLGILNGT++GL + N+IGD+SM D+L+IY+NEKQ IGW AA
Sbjct: 361  QFEFSPEAYLIISSKGNACLGILNGTEVGLRDLNVIGDVSMLDRLVIYNNEKQMIGWAAA 420

Query: 1633 NCDQPPRSNRVIL 1671
            +CD+ P+S R I+
Sbjct: 421  SCDRLPKSKRNII 433


>emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
          Length = 424

 Score =  554 bits (1428), Expect = e-155
 Identities = 266/419 (63%), Positives = 320/419 (76%), Gaps = 5/419 (1%)
 Frame = +1

Query: 430  IVIMIFVVLAVSGASSIDQQQKWRK---WMSPGASSSEANRIGASVIFPLYGNVYPNGFY 600
            +++++ V++ +SG SS    Q  RK   +  P ASSS  N I +SV+FPLYGNVYP G+Y
Sbjct: 8    VLVVLVVLVGLSGWSSASDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYY 67

Query: 601  FVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASL 780
            +V + +G PP PYFLDP TGSDL+WLQCDAPCV CTK  H LYRP+N+LV+CKDP+CA L
Sbjct: 68   YVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHXLYRPNNNLVICKDPMCAXL 127

Query: 781  HSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGI 960
            H   Y+C++PEQCDYEVEYADGGSSLGVLV D+F LN T+G+R+ PRL LGCGYDQ+PG 
Sbjct: 128  HPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGYDQIPGX 187

Query: 961  SDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRITWTP 1137
            S HPLDGVLGLGKGKSSIVSQL  QGV++NVVGHC+S   GGFLFFG+D+YDSSR+ WTP
Sbjct: 188  SYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYDSSRVVWTP 247

Query: 1138 MLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXXX 1314
            MLRD   HYS+G AEL+ GGK+T  KNL V FDSGSSYTY NS  Y A            
Sbjct: 248  MLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEK 307

Query: 1315 XXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISS 1494
              REA DD+TLP CW+GK+PFKS RDV+K+FKPLAL F  G + K Q++I  E+YLIIS 
Sbjct: 308  PVREALDDQTLPLCWRGKRPFKSVRDVRKFFKPLALSFAGGGRTKTQYDIPLESYLIIS- 366

Query: 1495 KGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRVIL 1671
             GN CLGILNGT+ GL +FN+IGDISMQDK+++YDNEK  IGW   NCD+ P+    IL
Sbjct: 367  -GNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKAAIL 424


>ref|XP_012087626.1| PREDICTED: aspartic proteinase Asp1 [Jatropha curcas]
            gi|643710892|gb|KDP24798.1| hypothetical protein
            JCGZ_25323 [Jatropha curcas]
          Length = 424

 Score =  550 bits (1417), Expect = e-153
 Identities = 269/423 (63%), Positives = 323/423 (76%), Gaps = 8/423 (1%)
 Frame = +1

Query: 412  MGKGK------QIVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLY 573
            MGKGK       +++++ +VL  S ASS   QQKWRK M    SS   +++G+S++FPL+
Sbjct: 1    MGKGKVGFSVLALMLLLAMVLVSSAASSDGAQQKWRKAM---LSSMMLSKVGSSLVFPLH 57

Query: 574  GNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVV 753
            GNVYP G+Y V + +G P +PYFLD DTGSDLTWLQCDAPC  CT+  HPLYRPSN+LVV
Sbjct: 58   GNVYPAGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCRQCTEAPHPLYRPSNNLVV 117

Query: 754  CKDPLCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTL 930
            C DPLC SL +  +++C++PEQCDYEVEYADGGSSLGVLV D+F LN T+G R+NP L L
Sbjct: 118  CNDPLCRSLQAPGEHKCEDPEQCDYEVEYADGGSSLGVLVRDVFLLNFTNGQRLNPLLAL 177

Query: 931  GCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGG-FLFFGEDV 1107
            GCGYDQLPG S HPLDG+LGLG+G SSI SQL  QG+VKNV+GHCLSG+GG FLFFG+D+
Sbjct: 178  GCGYDQLPGRSHHPLDGILGLGRGISSIPSQLSSQGLVKNVIGHCLSGRGGGFLFFGDDI 237

Query: 1108 YDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYAXX 1287
            YDSSRITWT M RD++K+YS G +EL+F GKSTG++NL V FDSGSSYTY NSQ Y    
Sbjct: 238  YDSSRITWTQMSRDHSKYYSPGFSELMFDGKSTGIQNLLVAFDSGSSYTYLNSQAYRGLL 297

Query: 1288 XXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEIS 1467
                            D+TLP CWKGKKPFKS RDVKKYFK  AL F N  + +  FE  
Sbjct: 298  YSLRTALSGKPLSEVPDQTLPVCWKGKKPFKSLRDVKKYFKSFALGFANSGRARTHFEFP 357

Query: 1468 PEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQP 1647
            PEAYLIISSKGNACLGILNGT IGL + N+IGDISMQD+++IY+NEKQ IGW  ANC++ 
Sbjct: 358  PEAYLIISSKGNACLGILNGTQIGLRDLNVIGDISMQDRMMIYNNEKQVIGWAPANCERL 417

Query: 1648 PRS 1656
            P+S
Sbjct: 418  PKS 420


>ref|XP_004147327.2| PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus]
          Length = 429

 Score =  546 bits (1406), Expect = e-152
 Identities = 269/427 (62%), Positives = 321/427 (75%), Gaps = 12/427 (2%)
 Frame = +1

Query: 412  MGKGKQIVIMIFVV----LAVSGASSIDQQQKWRKWMS----PGASSSEANRIGASVIFP 567
            MGK   +V+++ V     LA   ASS  + + W +       P ASSS A+   +S++ P
Sbjct: 1    MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFAS---SSIVLP 57

Query: 568  LYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDL 747
            L GNVYPNGFY V ++VG PP+PYFLDPDTGSDLTWLQCDAPC  CT+  HPLY+PSNDL
Sbjct: 58   LQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDL 117

Query: 748  VVCKDPLCASLHSS-DYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRL 924
            V CKDPLC SLHSS D+RC+NP+QCDYEVEYADGGSSLGVLV D+F LNLT+G  I PRL
Sbjct: 118  VPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRL 177

Query: 925  TLGCGYDQLPGISD-HPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGG-FLFFG 1098
             LGCGYDQ PG S  HP+DG+LGLG+G  SIVSQL +QG+V+NVVGHC + +GG +LFFG
Sbjct: 178  ALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFG 237

Query: 1099 EDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYY 1278
            + +YD  R+ WTPM RDY KHYS G  EL+F G+STGL+NL V+FDSGSSYTYFN+Q Y 
Sbjct: 238  DGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ 297

Query: 1279 AXXXXXXXXXXXXX-REATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQ 1455
                           REA DD TLP CW+G+KP KS RDV+KYFKPLAL F +G + K  
Sbjct: 298  VLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAV 357

Query: 1456 FEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAAN 1635
            FEI  E Y+IISS GN CLGILNGTD+GL N N+IGDISMQDK+++Y+NEKQAIGW  AN
Sbjct: 358  FEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAN 417

Query: 1636 CDQPPRS 1656
            CD+ P+S
Sbjct: 418  CDRVPKS 424


>ref|XP_009355861.1| PREDICTED: aspartic proteinase Asp1-like [Pyrus x bretschneideri]
          Length = 430

 Score =  545 bits (1403), Expect = e-152
 Identities = 260/411 (63%), Positives = 316/411 (76%), Gaps = 3/411 (0%)
 Frame = +1

Query: 430  IVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYGNVYPNGFYFVQ 609
            +V+M++ V  +S AS  DQ  +  +     +S   +    +S++FP++GNVYP G Y V 
Sbjct: 13   MVVMVWCV-TLSSASFGDQYYRGSRKTDATSSLGFSRAAPSSIVFPVHGNVYPTGSYNVT 71

Query: 610  VFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASLHS- 786
            + +G PP+PYFLDPDTGSDLTWLQCDAPCV CT+  HP YRPSNDLV CKDPLC +LHS 
Sbjct: 72   LNIGQPPKPYFLDPDTGSDLTWLQCDAPCVSCTQAPHPYYRPSNDLVACKDPLCEALHSP 131

Query: 787  SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGISD 966
              ++CD PEQCDYEVEYADGGSSLGVLV D FSLN TSG+++ P+L LGCGYDQLPG S 
Sbjct: 132  GSHKCDAPEQCDYEVEYADGGSSLGVLVRDSFSLNFTSGLQLRPKLALGCGYDQLPGSSY 191

Query: 967  HPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGGFLF-FGEDVYDSSRITWTPML 1143
            HP+DGVLGLG+GK+SI+SQL  QG+V+NV+GHCLSG+GG  F FG+D+YD SRI WTPM 
Sbjct: 192  HPIDGVLGLGRGKTSIISQLSSQGLVRNVIGHCLSGRGGGYFVFGDDIYDYSRIVWTPMS 251

Query: 1144 RDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY-YAXXXXXXXXXXXXX 1320
             DY+KHYS G AEL+  GKSTG  NL+++FDSGSSYTY +SQ+Y +              
Sbjct: 252  LDYSKHYSPGPAELMVDGKSTGFGNLHMVFDSGSSYTYLSSQVYQFLTSWLKRELTEKPL 311

Query: 1321 REATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISSKG 1500
            +EA DD TLP CWKG+KPFKS RDVKKYFKPLALRF +G K   Q+E+ PEAYLI+SSKG
Sbjct: 312  KEAPDDGTLPLCWKGRKPFKSIRDVKKYFKPLALRFGSGRKDTAQYELPPEAYLILSSKG 371

Query: 1501 NACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPR 1653
            N CLGILNGT++GL + N+IGDISMQDK++IYDNEKQ IGW   NCD  P+
Sbjct: 372  NVCLGILNGTEVGLQDNNIIGDISMQDKMVIYDNEKQMIGWAPGNCDHLPK 422


>ref|XP_006432088.1| hypothetical protein CICLE_v10001122mg [Citrus clementina]
            gi|557534210|gb|ESR45328.1| hypothetical protein
            CICLE_v10001122mg [Citrus clementina]
          Length = 451

 Score =  545 bits (1403), Expect = e-152
 Identities = 268/442 (60%), Positives = 334/442 (75%), Gaps = 15/442 (3%)
 Frame = +1

Query: 376  IITQIMGQREKVMGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEA------ 537
            I+TQ MG+    +G    +V+M FV+   S +SS + Q +WRK +   A++S +      
Sbjct: 11   IVTQKMGKER--VGLVLALVLMSFVI---STSSSDEHQLRWRKSLFSTATTSSSSSSSSS 65

Query: 538  ------NRIGASVIFPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCV 699
                  NR+G+S++F + GNVYP G+Y V V+VG PP+PYFLD DTGSDL WLQCDAPCV
Sbjct: 66   SSSLLFNRVGSSLLFRVQGNVYPTGYYNVTVYVGQPPKPYFLDLDTGSDLIWLQCDAPCV 125

Query: 700  CCTKGFHPLYRPSNDLVVCKDPLCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVND 876
             C +  HPLYRPSNDLV C+DP+CASLH+   ++C++P QCDYEVEYADGGSSLGVLV D
Sbjct: 126  QCVEAPHPLYRPSNDLVPCEDPICASLHAPGQHKCEDPTQCDYEVEYADGGSSLGVLVKD 185

Query: 877  LFSLNLTSGIRINPRLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVV 1056
             F+ N T+G R+NPRL LGCGYDQ+PG S HPLDG+LGLGKGKSSIVSQL  Q +++NVV
Sbjct: 186  AFAFNYTNGQRLNPRLALGCGYDQVPGASHHPLDGILGLGKGKSSIVSQLHSQKLIRNVV 245

Query: 1057 GHCLSGQ-GGFLFFGEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIF 1233
            GHCLSG+ GGFLFFG+D+YDSSR+ WT M  DYTK+YS G AEL+FGGK+TGLKNL ++F
Sbjct: 246  GHCLSGRGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELLFGGKTTGLKNLPLVF 305

Query: 1234 DSGSSYTYFNSQIYYA-XXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFK 1410
            DSGSSYTY +   Y                +EA +DRTLP CWKGK+PFK+ RDVKKYFK
Sbjct: 306  DSGSSYTYLSHVAYQTLTSMMKREISAKSLKEAPEDRTLPLCWKGKRPFKNVRDVKKYFK 365

Query: 1411 PLALRFPNGWKGKPQFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLL 1590
             LAL F +G K +  FE++PEAYLIIS++GN CLGILNG ++GL + N+IGDISMQD+++
Sbjct: 366  ALALSFTDG-KTRTLFELTPEAYLIISNRGNVCLGILNGAEVGLQDLNVIGDISMQDRVV 424

Query: 1591 IYDNEKQAIGWTAANCDQPPRS 1656
            IYDNEKQ IGW  ANCD+ P+S
Sbjct: 425  IYDNEKQRIGWMPANCDRIPKS 446


>ref|XP_010111255.1| Aspartic proteinase Asp1 [Morus notabilis]
            gi|587944251|gb|EXC30733.1| Aspartic proteinase Asp1
            [Morus notabilis]
          Length = 432

 Score =  544 bits (1402), Expect = e-152
 Identities = 267/416 (64%), Positives = 320/416 (76%), Gaps = 6/416 (1%)
 Frame = +1

Query: 430  IVIMIFVVLAVSGASSIDQQQKWRKWMS-PGASSSEANRIGASVIFPLYGNVYPNGFYFV 606
            +V+ + +   +S A+ ++ + + +     PG SS E NR+G+SV+FP++GNVYP GFY V
Sbjct: 13   LVLFMGLCTTISSAAFLENRHRRKSTHPVPGTSSFELNRVGSSVVFPIHGNVYPIGFYNV 72

Query: 607  QVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASLH- 783
             + +G PP+PYFLDPDTGSDLTWLQCDAPCV CT+  HPLYRPSNDLV C+DPLC +LH 
Sbjct: 73   TLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVQCTETPHPLYRPSNDLVGCRDPLCIALHL 132

Query: 784  SSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGIS 963
                +CDNPEQCDYEVEYADGGSSLGVLV D F  N T G ++ PRL LGCGYDQ+PG S
Sbjct: 133  PGTPKCDNPEQCDYEVEYADGGSSLGVLVKDAFYFNSTKGDQLKPRLALGCGYDQVPG-S 191

Query: 964  DH--PLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRITWT 1134
             H  PLDGVLGLG+GK+SIVSQL  QG+++NVVGHCLSG+ GGFLFFG++VYDSSR+ WT
Sbjct: 192  SHPLPLDGVLGLGRGKTSIVSQLHSQGLMRNVVGHCLSGRGGGFLFFGDNVYDSSRVDWT 251

Query: 1135 PMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXX 1311
            PM  DY KHYS G+AEL F GK TGLKNL  +FDSGSSYTY  SQ Y             
Sbjct: 252  PMSSDYLKHYSPGSAELRFDGKPTGLKNLLTVFDSGSSYTYLTSQAYQTLTFLIKRELPR 311

Query: 1312 XXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIIS 1491
               REATDD+TLP CWKGK+PFK   DV+KYFKPLAL F  G K K  +E+ PEAYLI+S
Sbjct: 312  KVLREATDDQTLPLCWKGKRPFKRVSDVRKYFKPLALDFTTGGKTK-TYELPPEAYLIVS 370

Query: 1492 SKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSN 1659
            SKGN CLGILNG++IGL N N+IGDISMQDK++IYDNEKQ IGW +ANCD+ P+++
Sbjct: 371  SKGNVCLGILNGSEIGLQNSNIIGDISMQDKMVIYDNEKQMIGWASANCDKLPKTS 426


>ref|XP_007211687.1| hypothetical protein PRUPE_ppa005961mg [Prunus persica]
            gi|462407552|gb|EMJ12886.1| hypothetical protein
            PRUPE_ppa005961mg [Prunus persica]
          Length = 435

 Score =  543 bits (1400), Expect = e-151
 Identities = 267/428 (62%), Positives = 323/428 (75%), Gaps = 11/428 (2%)
 Frame = +1

Query: 406  KVMGKGKQIVIMIFVVL-----AVSGASSIDQQQKWR-KWMSPGASSSEA--NRIGASVI 561
            K  GK   +++++ +++      +S AS  DQ  + R K M P  ++S    NR  +S++
Sbjct: 2    KTEGKSGWLLLLMSLLVMGLSATMSSASFGDQYHRGRRKTMLPDEATSSLGLNRAASSIV 61

Query: 562  FPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSN 741
             P++GNVYP G Y V + +G PP+PYFLDPDTGSDLTWLQCDAPCV CT+  HP YRP+N
Sbjct: 62   LPVHGNVYPIGSYNVTLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVRCTEAPHPFYRPNN 121

Query: 742  DLVVCKDPLCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINP 918
            DLVVCKDPLC +LH+   ++CDNPEQCDYEVEYADGGSSLGVLV D F LN T+G +   
Sbjct: 122  DLVVCKDPLCEALHAPGSHKCDNPEQCDYEVEYADGGSSLGVLVRDAFLLNFTNGNQRTT 181

Query: 919  RLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGG-FLFF 1095
             L LGCGYDQLPG S HP+DGVLGLGKGKSSIVSQL +QG+V++V+GHCLSG+GG F F 
Sbjct: 182  HLALGCGYDQLPGSSYHPIDGVLGLGKGKSSIVSQLSNQGLVRHVIGHCLSGRGGGFFFL 241

Query: 1096 GEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY 1275
            G+ +YDSSRI WTPM  DY KHYS G AEL+ GGKSTG +NL ++FDSGSSYTY NSQ Y
Sbjct: 242  GDGLYDSSRIVWTPMSPDYAKHYSPGLAELIVGGKSTGFRNLVMVFDSGSSYTYLNSQAY 301

Query: 1276 -YAXXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKP 1452
             +              +EA DDRTLP CWKG+KPF++ RDVK YFKPLALRF +G K   
Sbjct: 302  QFLTSWLKRELTGKPLKEALDDRTLPLCWKGRKPFRNIRDVKTYFKPLALRFASGRKDTT 361

Query: 1453 QFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAA 1632
            QFE+ PEAYLIISSKGN CLGILNG+++GL N N+IGDISMQDK++IYDNEKQ IGW   
Sbjct: 362  QFELPPEAYLIISSKGNVCLGILNGSEVGLQNSNIIGDISMQDKMVIYDNEKQMIGWGPG 421

Query: 1633 NCDQPPRS 1656
            NCD+ P+S
Sbjct: 422  NCDKLPKS 429


Top