BLASTX nr result
ID: Forsythia22_contig00031892
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00031892 (1956 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_011099684.1| PREDICTED: aspartic proteinase Asp1 [Sesamum...  706   0.0
ref|XP_012857588.1| PREDICTED: aspartic proteinase Asp1 [Erythra...  678   0.0
gb|EPS67652.1| hypothetical protein M569_07122 [Genlisea aurea]      624   e-175
dbj|BAD80835.1| nucellin-like protein [Daucus carota]                610   e-171
ref|XP_009631909.1| PREDICTED: aspartic proteinase Asp1 [Nicotia...  604   e-170
ref|XP_009766881.1| PREDICTED: aspartic proteinase Asp1 [Nicotia...  594   e-167
ref|XP_006346815.1| PREDICTED: aspartic proteinase Asp1-like [So...  588   e-165
ref|NP_001297397.1| aspartic proteinase Asp1 precursor [Solanum ...  587   e-165
emb|CDP04854.1| unnamed protein product [Coffea canephora]           583   e-163
ref|XP_007048530.1| Eukaryotic aspartyl protease family protein ...  573   e-160
ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis v...  572   e-160
ref|XP_012489372.1| PREDICTED: aspartic proteinase Asp1 [Gossypi...  565   e-158
ref|XP_002522918.1| nucellin, putative [Ricinus communis] gi|223...  555   e-155
emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]  554   e-155
ref|XP_012087626.1| PREDICTED: aspartic proteinase Asp1 [Jatroph...  550   e-153
ref|XP_004147327.2| PREDICTED: aspartic proteinase Asp1 isoform ...  546   e-152
ref|XP_009355861.1| PREDICTED: aspartic proteinase Asp1-like [Py...  545   e-152
ref|XP_006432088.1| hypothetical protein CICLE_v10001122mg [Citr...  545   e-152
ref|XP_010111255.1| Aspartic proteinase Asp1 [Morus notabilis] g...
544 e-152 ref|XP_007211687.1| hypothetical protein PRUPE_ppa005961mg [Prun... 543 e-151 >ref|XP_011099684.1| PREDICTED: aspartic proteinase Asp1 [Sesamum indicum] Length = 422 Score = 706 bits (1823), Expect = 0.0 Identities = 332/420 (79%), Positives = 363/420 (86%), Gaps = 2/420 (0%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVSGASSI-DQQQKWRKWMSPGASSSEANRIGASVIFPLYGNVYP 588 M K K + ++ F VLA S A S DQQ KWRKW +S S NR+G+S+IFPLYGNVYP Sbjct: 1 MKKEKLVFMIAFAVLAASCAGSCNDQQLKWRKWGPSSSSPSSINRLGSSIIFPLYGNVYP 60 Query: 589 NGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPL 768 NGFYFVQV++GYPP+PYFLDPDTGSDLTWLQCDAPCV CT GFHPLYRPSN+LV+CKDPL Sbjct: 61 NGFYFVQVYLGYPPKPYFLDPDTGSDLTWLQCDAPCVRCTTGFHPLYRPSNELVICKDPL 120 Query: 769 CASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQ 948 CASLHS DY CDNPEQCDYEVEYADGGSSLGVLVND F+LNLTSG+R++PRLT+GCGYDQ Sbjct: 121 CASLHSQDYNCDNPEQCDYEVEYADGGSSLGVLVNDFFTLNLTSGVRMSPRLTIGCGYDQ 180 Query: 949 LPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGGFLFFGEDVYDSSRIT 1128 LPG SDHPLDGVLGLGKGKSSIVSQLR+QGV+KNV+GHCLSGQGGFLFFGEDVYDSSR+T Sbjct: 181 LPGASDHPLDGVLGLGKGKSSIVSQLREQGVMKNVIGHCLSGQGGFLFFGEDVYDSSRVT 240 Query: 1129 WTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXX 1305 WT M RDYTKHY+AG+AEL FGGKSTG KNLNVIFDSGSSYTYF+S IY+ Sbjct: 241 WTSMARDYTKHYAAGSAELRFGGKSTGFKNLNVIFDSGSSYTYFSSHIYHTLLSLINKEL 300 Query: 1306 XXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLI 1485 REATDD TLPFCWKGKKPFKSTRDV+KYFKPL+L FPNGW+ K QFEI PE YLI Sbjct: 301 RRTSLREATDDHTLPFCWKGKKPFKSTRDVRKYFKPLSLSFPNGWRAKAQFEIPPEGYLI 360 Query: 1486 ISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRV 1665 ISSKGNACLGILNGTD+GL+NFNMIGDISMQDK+LIYDNEK IGWT ANCDQ P SN V Sbjct: 361 ISSKGNACLGILNGTDVGLSNFNMIGDISMQDKMLIYDNEKHVIGWTPANCDQRPTSNSV 420 >ref|XP_012857588.1| PREDICTED: aspartic proteinase Asp1 [Erythranthe guttatus] gi|604300852|gb|EYU20602.1| hypothetical protein MIMGU_mgv1a007047mg [Erythranthe guttata] Length = 422 Score = 678 bits (1749), Expect = 0.0 
Identities = 320/420 (76%), Positives = 360/420 (85%), Gaps = 4/420 (0%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVS---GASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYGNV 582 M K K I I+I VV+ V+ G + QQ KWRKW P S S+ + G+S+IFPLYGNV Sbjct: 1 MKKEKLIFIIIVVVVLVASCAGCNKDQQQLKWRKW-GPSCSPSKFTKFGSSIIFPLYGNV 59 Query: 583 YPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKD 762 YPNGFYFVQV++GYPP+PYFLDPDTGSDLTWLQCDAPCV CT GFHPLYRPSNDLV+CKD Sbjct: 60 YPNGFYFVQVYLGYPPKPYFLDPDTGSDLTWLQCDAPCVRCTTGFHPLYRPSNDLVICKD 119 Query: 763 PLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGY 942 PLCASLHSSDY+C+N EQCDYEVEYADGGSSLGVLVND F+LNLT+G+R++PRLTLGCGY Sbjct: 120 PLCASLHSSDYKCENTEQCDYEVEYADGGSSLGVLVNDFFTLNLTTGVRMSPRLTLGCGY 179 Query: 943 DQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGGFLFFGEDVYDSSR 1122 DQLPG SDHPLDGV GLG+GKSSIVSQLR+QG+VKN+VGHCLS QGG+LF GED YDSSR Sbjct: 180 DQLPGPSDHPLDGVFGLGRGKSSIVSQLREQGIVKNIVGHCLSEQGGYLFLGEDAYDSSR 239 Query: 1123 ITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXX 1299 +TWTPM RD TKHY+AG+AEL FGGKSTG KNLNVIFDSGSSYTYFNSQIY+ Sbjct: 240 MTWTPMSRDDTKHYTAGSAELRFGGKSTGFKNLNVIFDSGSSYTYFNSQIYHTLISLIKK 299 Query: 1300 XXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAY 1479 +EA+DDRTLPFCWKGKKPFK+TRDV+KYFK L+L F NGW+ K QF+I PE Y Sbjct: 300 ELSGKSLKEASDDRTLPFCWKGKKPFKTTRDVRKYFKSLSLSFVNGWRTKAQFDIPPEGY 359 Query: 1480 LIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSN 1659 LIISSKGNACLGILNGTD+GL+NFNMIGDISMQDKLL+YDNEKQ IGWT ANCDQ P+S+ Sbjct: 360 LIISSKGNACLGILNGTDVGLSNFNMIGDISMQDKLLVYDNEKQVIGWTPANCDQIPKSS 419 >gb|EPS67652.1| hypothetical protein M569_07122 [Genlisea aurea] Length = 401 Score = 624 bits (1608), Expect = e-175 Identities = 286/375 (76%), Positives = 325/375 (86%), Gaps = 3/375 (0%) Frame = +1 Query: 529 SEANRIGASVIFPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCT 708 S N G+S++ P+YGNVYP+GFYFVQV++GYPPRPYFLDPDTGSDLTWLQCDAPCV CT Sbjct: 17 SATNTFGSSIMLPVYGNVYPDGFYFVQVYLGYPPRPYFLDPDTGSDLTWLQCDAPCVRCT 76 Query: 709 
KGFHPLYRPSNDLVVCKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSL 888 +GFHPLYRPSNDLVVCKDPLCASLHSSDY CDNPEQCDYEVEYADGGSSLGVLVND F+L Sbjct: 77 EGFHPLYRPSNDLVVCKDPLCASLHSSDYTCDNPEQCDYEVEYADGGSSLGVLVNDFFTL 136 Query: 889 NLTSGIRINPRLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCL 1068 NLT+G+R++PRLT+GCGYDQL G SDHPLDGVLGLGKGKSSIVSQLRDQGVVKNV+GHCL Sbjct: 137 NLTAGVRMSPRLTIGCGYDQLAGSSDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVIGHCL 196 Query: 1069 S--GQGGFLFFGEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSG 1242 S G+GGF+FFG+D+YDSSR+TWTPM ++ HY+AG AEL FGG+STG KNLNV+FDSG Sbjct: 197 SRVGKGGFVFFGDDLYDSSRVTWTPMSHEHNNHYAAGLAELRFGGRSTGFKNLNVVFDSG 256 Query: 1243 SSYTYFNSQIYYAXXXXXXXXXXXXXREA-TDDRTLPFCWKGKKPFKSTRDVKKYFKPLA 1419 SSYTYF S IY A A +D+TLP CWKGKKPF++TRDVKKYFK LA Sbjct: 257 SSYTYFTSHIYQAVVSMITKDLNGKPLTAEPEDQTLPMCWKGKKPFRTTRDVKKYFKTLA 316 Query: 1420 LRFPNGWKGKPQFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYD 1599 FPNGW+ K F+++PE YL++SSKGNACLGILNGT +GL NFN+IGDISMQDK++IYD Sbjct: 317 FAFPNGWRSKASFDVTPEGYLVVSSKGNACLGILNGTSVGLENFNVIGDISMQDKMVIYD 376 Query: 1600 NEKQAIGWTAANCDQ 1644 NEKQ IGWTAANCDQ Sbjct: 377 NEKQMIGWTAANCDQ 391 >dbj|BAD80835.1| nucellin-like protein [Daucus carota] Length = 426 Score = 610 bits (1574), Expect = e-171 Identities = 281/418 (67%), Positives = 339/418 (81%), Gaps = 2/418 (0%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYGNVYPN 591 M K + ++ +F+VL + G SS DQQQ W KW S GASSS + +G+SV+ PLYGNVYP+ Sbjct: 5 MAKICKQIMSVFLVLMIVGVSSDDQQQSWWKWFSSGASSSVVSSVGSSVVLPLYGNVYPS 64 Query: 592 GFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLC 771 G+Y VQ +G PP+PYFLDPDTGSDLTWLQCDAPC+ CT HPLY+P+NDLVVCKDP+C Sbjct: 65 GYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKDPIC 124 Query: 772 ASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQL 951 ASLH +YRCD+P+QCDYEVEYADGGSS+GVLVNDLF +NLTSG+R PRLT+GCGYDQL Sbjct: 125 ASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCGYDQL 184 Query: 952 
PGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRIT 1128 PGI+ HPLDGVLGLG+G SSIV+QL QG+V+NVVGHC S + GG+LFFG+D+YDSS++ Sbjct: 185 PGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYDSSKVI 244 Query: 1129 WTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXX 1305 WTPM RDY KHY+ G AEL+ G+S+GLKNL V+FDSGSSYTYFN+Q Y Sbjct: 245 WTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLSFIKKDL 304 Query: 1306 XXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLI 1485 +EA +D TLP CW+GKKPFKS RD KKYFKPLAL F +GWK K QFEI E+YLI Sbjct: 305 HGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWKTKSQFEIQQESYLI 364 Query: 1486 ISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSN 1659 ISSKG+ CLGILNGT++GL N+N+IGDISMQ+KL+IYDNEKQ IGW +NCD+PP+ + Sbjct: 365 ISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCDRPPKGD 422 >ref|XP_009631909.1| PREDICTED: aspartic proteinase Asp1 [Nicotiana tomentosiformis] Length = 431 Score = 604 bits (1558), Expect = e-170 Identities = 288/431 (66%), Positives = 343/431 (79%), Gaps = 11/431 (2%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVSGASS------IDQQQKWRKWMSPG--ASSSEANRIGAS-VIF 564 MG GK I +++FVV+AVS A+ + QQQKW KWMS G ASSS + +S ++ Sbjct: 1 MGGGKIIGMVMFVVIAVSAAAGSGDNQQLQQQQKWWKWMSSGSAASSSVVKPVASSSIVL 60 Query: 565 PLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSND 744 PLYGNVYP G+Y+VQ+ +G P +PYFLDPDTGSDLTWLQCDAPCV CT+ HP Y+P+ND Sbjct: 61 PLYGNVYPIGYYYVQLNIGQPSKPYFLDPDTGSDLTWLQCDAPCVRCTRAPHPFYKPNND 120 Query: 745 LVVCKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRL 924 LV CKDPLCASLH DY+C++PEQCDY+V+YADGGSSLGVL+ND+F+ N TSG RI PRL Sbjct: 121 LVPCKDPLCASLHHVDYKCESPEQCDYQVDYADGGSSLGVLLNDVFNFNATSGARIIPRL 180 Query: 925 TLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGE 1101 LGCGYDQLPG S HPLDGVLGLGKGK+SIVSQL +G+V+NVVGHCLSG+ GGFLFFG+ Sbjct: 181 ALGCGYDQLPGQSHHPLDGVLGLGKGKASIVSQLHSKGLVRNVVGHCLSGRGGGFLFFGD 240 Query: 1102 DVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA 1281 +VYDSSRI WTPM D KHYSAG+ EL+FGGK+TG KNL 
V+FDSGSS++Y NSQ Y Sbjct: 241 EVYDSSRIVWTPMAHDRMKHYSAGSGELIFGGKATGFKNLFVVFDSGSSFSYLNSQTYQG 300 Query: 1282 -XXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQF 1458 REA DD TLP CWKG++PFK+ DVKKYFK AL FP+GWK K F Sbjct: 301 FISLLKKELNGKPLREAKDDYTLPLCWKGRRPFKTINDVKKYFKNFALSFPHGWKSKAHF 360 Query: 1459 EISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANC 1638 EI PE+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGW+ ANC Sbjct: 361 EIPPESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWSPANC 420 Query: 1639 DQPPRSNRVIL 1671 D+PP+SN +I+ Sbjct: 421 DRPPKSNNMIM 431 >ref|XP_009766881.1| PREDICTED: aspartic proteinase Asp1 [Nicotiana sylvestris] Length = 433 Score = 594 bits (1532), Expect = e-167 Identities = 279/433 (64%), Positives = 342/433 (78%), Gaps = 13/433 (3%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVSGASS--------IDQQQKWRKWMSPGASSSEA---NRIGASV 558 MG GK I ++IFV ++V+ +++ + QQQKW KWMS G+++S + + +S+ Sbjct: 1 MGGGKIIGMVIFVAVSVAVSAAAGYGDNQQLQQQQKWWKWMSSGSAASSSVVKPVVSSSI 60 Query: 559 IFPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPS 738 + PLYGN+YP G+Y+VQ+ +G P +PYFLDPDTGSDLTWLQCDAPCV CT+ HP Y+P+ Sbjct: 61 VLPLYGNIYPIGYYYVQLNIGQPSKPYFLDPDTGSDLTWLQCDAPCVRCTRAPHPFYKPN 120 Query: 739 NDLVVCKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINP 918 NDLV CKDPLCASLH DY+C++PEQCDY+V+YADGGSSLGVL+ND+F N TSG RI P Sbjct: 121 NDLVPCKDPLCASLHHVDYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNGTSGARIIP 180 Query: 919 RLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFF 1095 RL LGCGYDQLPG S HPLDGVLGLGKGK+SIVSQL +G+V+NVVGHCLSG+ GGFLFF Sbjct: 181 RLALGCGYDQLPGQSHHPLDGVLGLGKGKASIVSQLHSKGLVRNVVGHCLSGRGGGFLFF 240 Query: 1096 GEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY 1275 G++VYDSSRI WTPM D KHYSAG+ EL+FGGK+TG KNL V+FDSGSS++Y NSQ Y Sbjct: 241 GDEVYDSSRIVWTPMAHDRMKHYSAGSGELIFGGKATGFKNLFVVFDSGSSFSYLNSQTY 300 Query: 1276 YA-XXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKP 1452 +EA DD TLP CWKG++PFK+ DVKKYFK L FP+GWK K Sbjct: 301 
QGFISLLKKELNGKPLKEAKDDYTLPLCWKGRRPFKTINDVKKYFKNFVLSFPHGWKSKA 360 Query: 1453 QFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAA 1632 FEI PE+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGW+ A Sbjct: 361 HFEIPPESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWSPA 420 Query: 1633 NCDQPPRSNRVIL 1671 NCD+PP+SN +I+ Sbjct: 421 NCDRPPKSNNMIM 433 >ref|XP_006346815.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum] Length = 437 Score = 588 bits (1517), Expect = e-165 Identities = 277/428 (64%), Positives = 338/428 (78%), Gaps = 8/428 (1%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVS------GASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLY 573 MG GK + I+IFVV+ VS G + QQQ+ +KWMS ++++ + +S++ PLY Sbjct: 10 MGGGKIVGILIFVVVVVSAAGGGGGENHQQQQQQQQKWMSSTSAAAVNPVVSSSIVLPLY 69 Query: 574 GNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVV 753 GNVYP G+Y+VQ+ +G P RP+FLDPDTGSDLTWLQCDAPCV CT HP Y+P+NDLV Sbjct: 70 GNVYPLGYYYVQLNIGQPSRPFFLDPDTGSDLTWLQCDAPCVRCTTAPHPFYKPNNDLVP 129 Query: 754 CKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLG 933 CKDPLCASLH + Y+C++PEQCDY+V+YADGGSSLGVL+ND+F N+TSG R+ PRL+LG Sbjct: 130 CKDPLCASLHPAGYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNMTSGARMIPRLSLG 189 Query: 934 CGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVY 1110 CGYDQLPG S HPLDGVLGLG+GK+SIVSQL +GVV+NVVGHCLSG+ GGFLFFG++VY Sbjct: 190 CGYDQLPGQSYHPLDGVLGLGRGKTSIVSQLHSKGVVQNVVGHCLSGRGGGFLFFGDEVY 249 Query: 1111 DSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XX 1287 DSSRI WTPM D KHYSAG+ EL+FGGK TGLKNL V+FDSGSS++Y N+ Y Sbjct: 250 DSSRIVWTPMAHDRMKHYSAGSGELIFGGKGTGLKNLFVVFDSGSSFSYLNAHTYEGFIS 309 Query: 1288 XXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEIS 1467 RE DD TLP CWKG++PFK+ DVKKYFK AL F NGWK K FEI Sbjct: 310 LLKKELNGKPLRETKDDYTLPLCWKGRRPFKTINDVKKYFKQFALSFGNGWKSKAHFEIP 369 Query: 1468 PEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQP 1647 PE+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGWT+ANCD+P Sbjct: 370 
PESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWTSANCDRP 429 Query: 1648 PRSNRVIL 1671 P+S+ +I+ Sbjct: 430 PKSSNMIM 437 >ref|NP_001297397.1| aspartic proteinase Asp1 precursor [Solanum lycopersicum] Length = 427 Score = 587 bits (1514), Expect = e-165 Identities = 276/427 (64%), Positives = 334/427 (78%), Gaps = 7/427 (1%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVS-----GASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYG 576 MG GK + I+IFVV+ VS G + QQQKW KWMS +++ + +S++ PLYG Sbjct: 1 MGGGKIVGILIFVVVVVSAAGGGGENHHHQQQKWWKWMSSTSAAMVNPVVSSSIVLPLYG 60 Query: 577 NVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVC 756 NVYP G+Y+VQ+ +G P RP+FLDPDTGSDLTWLQCDAPCV CT HP Y+P+NDLV C Sbjct: 61 NVYPLGYYYVQLNIGQPSRPFFLDPDTGSDLTWLQCDAPCVRCTTAPHPFYKPNNDLVPC 120 Query: 757 KDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGC 936 KDPLCASLH + Y+C++PEQCDY+V+YADGGSSLGVL+ND+F N+TSG R+ PRL+LGC Sbjct: 121 KDPLCASLHPAGYKCESPEQCDYQVDYADGGSSLGVLLNDVFHFNMTSGARMIPRLSLGC 180 Query: 937 GYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYD 1113 GYDQLPG S HPLDGVLGLG+GK+SIVSQL +G V+NVVGHCLSG+ GGFLFFG++VYD Sbjct: 181 GYDQLPGQSYHPLDGVLGLGRGKTSIVSQLHSKGAVQNVVGHCLSGRGGGFLFFGDEVYD 240 Query: 1114 SSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXX 1290 SSRI WTPM D KHYSAG+ EL+FGGK TGLKNL V+FDSGSS++Y N+ Y Sbjct: 241 SSRIVWTPMAHDRMKHYSAGSGELIFGGKGTGLKNLFVVFDSGSSFSYLNAHTYEGFISL 300 Query: 1291 XXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISP 1470 RE DD TLP CWKG++PFK+ D KKYFK AL F NGWK K FEI P Sbjct: 301 LKKELNGKPLRETKDDYTLPLCWKGRRPFKTINDAKKYFKQFALSFGNGWKSKAHFEIPP 360 Query: 1471 EAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPP 1650 E+YLIISSKG+ CLG+LNGT+ GL N N+IGDISMQDK++IYDNEKQAIGW +ANCD+PP Sbjct: 361 ESYLIISSKGSVCLGVLNGTEAGLQNVNLIGDISMQDKMVIYDNEKQAIGWMSANCDRPP 420 Query: 1651 RSNRVIL 1671 +S+ +I+ Sbjct: 421 KSSNMIM 427 >emb|CDP04854.1| unnamed protein product [Coffea canephora] Length = 410 Score = 583 bits (1502), Expect = e-163 Identities = 277/405 (68%), 
Positives = 324/405 (80%), Gaps = 6/405 (1%) Frame = +1 Query: 475 SIDQQQKWRKW---MSPGASSSEANRIGA--SVIFPLYGNVYPNGFYFVQVFVGYPPRPY 639 S DQ QKW KW +S ASSSEAN + + S++F LYGNV+P+G+YF QV VG PP+PY Sbjct: 6 SRDQLQKWCKWKSRVSTEASSSEANPVSSYSSILFKLYGNVHPDGYYFAQVNVGQPPKPY 65 Query: 640 FLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASLHSSDYRCDNPEQC 819 FLDPDTGSDLTWLQCDAPCV CT+ HPLYRP+NDLVVC+DPLCASLHS Y C NPEQC Sbjct: 66 FLDPDTGSDLTWLQCDAPCVRCTEAPHPLYRPTNDLVVCRDPLCASLHSGAYECPNPEQC 125 Query: 820 DYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGISDHPLDGVLGLGK 999 DYEVEYADGGSS GVLVND+FSLNLT+GIR+ RL GCGYDQLP + PLDGVLGLGK Sbjct: 126 DYEVEYADGGSSFGVLVNDVFSLNLTTGIRLGLRLAFGCGYDQLPSVYAPPLDGVLGLGK 185 Query: 1000 GKSSIVSQLRDQGVVKNVVGHCLSGQGGFLFFGEDVYDSSRITWTPMLRDYTKHYSAGNA 1179 G SSIVSQL +QG+V+N++GHCLS GGFLFFG+D+YD+S++ W PM +D TK YS +A Sbjct: 186 GNSSIVSQLHNQGIVRNIIGHCLSATGGFLFFGDDLYDASQVNWAPMSQDSTKRYSVSSA 245 Query: 1180 ELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXXXXXREATDDRTLPFC 1356 EL FGGK G+KNL+VIFDSGSSY+Y NSQ Y A +EA DDRTLP C Sbjct: 246 ELTFGGKGVGIKNLDVIFDSGSSYSYLNSQAYRAIISLIEKDLKGKPLKEAKDDRTLPEC 305 Query: 1357 WKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISSKGNACLGILNGTDI 1536 W+G+KPFKS DV+KYFKPL L F +G + + QFEI P+AYLIISSKGNACLGILNGT+I Sbjct: 306 WRGRKPFKSVHDVRKYFKPLGLSFHHGQRVRTQFEIPPDAYLIISSKGNACLGILNGTEI 365 Query: 1537 GLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRVIL 1671 GL N N+IGDISMQDK++IYDNEK AIGW+ ANC +PP+SN I+ Sbjct: 366 GLQNVNLIGDISMQDKMVIYDNEKGAIGWSPANCSRPPKSNTFIM 410 >ref|XP_007048530.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma cacao] gi|508700791|gb|EOX92687.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma cacao] Length = 421 Score = 573 bits (1476), Expect = e-160 Identities = 274/420 (65%), Positives = 331/420 (78%), Gaps = 5/420 (1%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWM--SPGASSSEANRIGASVIFPLYGNVY 585 MGKG+ V+++ + + AS QKWRK M + SS NR+G+S++FP++GNVY Sbjct: 1 
MGKGRMSVLLLLLFFSFCSASD----QKWRKAMISTDKGSSMMMNRVGSSILFPIHGNVY 56 Query: 586 PNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDP 765 P G+Y V + +G PP+PYFLD DTGSDLTWLQCDAPCV C + HPLYRP+NDLV CKDP Sbjct: 57 PTGYYNVTISIGQPPKPYFLDLDTGSDLTWLQCDAPCVHCVEAPHPLYRPTNDLVPCKDP 116 Query: 766 LCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGY 942 LCA+LH DY+C+NPEQCDYEVEYADGGSSLGVLV D+FSLN T+GIR++PRL LGCGY Sbjct: 117 LCAALHPPGDYKCENPEQCDYEVEYADGGSSLGVLVRDVFSLNYTNGIRLSPRLALGCGY 176 Query: 943 DQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSS 1119 DQ+PG S HPLDG+LGLG+GK+SIVSQL+ QG+V+NVVGHCLSG+ GGFLFFG+ +YDSS Sbjct: 177 DQIPGSSYHPLDGILGLGRGKASIVSQLQSQGLVRNVVGHCLSGRGGGFLFFGDGLYDSS 236 Query: 1120 RITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXX 1296 R+TWT M ++ TK+YS G AEL FGGK+T +KNL V+FDSGSSYTY NSQ Y Sbjct: 237 RVTWTSMSQELTKYYSPGIAELQFGGKATSVKNLIVVFDSGSSYTYLNSQAYQTLTVLLK 296 Query: 1297 XXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEA 1476 +EA +D+TLP CWKG+KPFK+ RDVKKYFK LAL F + + K QFE+ PEA Sbjct: 297 KELSGRSLKEAPEDQTLPLCWKGRKPFKNVRDVKKYFKTLALAFASSSRTKTQFELPPEA 356 Query: 1477 YLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRS 1656 YLIIS+KGN CLGILNGT +GL N N+IGDISMQD+++IYDNEKQ IGW ANCDQ PRS Sbjct: 357 YLIISNKGNVCLGILNGTQVGLQNLNVIGDISMQDRMVIYDNEKQVIGWAPANCDQLPRS 416 >ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera] gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera] Length = 426 Score = 572 bits (1474), Expect = e-160 Identities = 272/419 (64%), Positives = 327/419 (78%), Gaps = 5/419 (1%) Frame = +1 Query: 430 IVIMIFVVLAVSGASSIDQQQKWRK---WMSPGASSSEANRIGASVIFPLYGNVYPNGFY 600 +++++ V++ +SG SS Q RK + P ASSS N I +SV+FPLYGNVYP G+Y Sbjct: 8 VLVVLVVLVGLSGWSSASDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYY 67 Query: 601 FVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASL 780 +V + +G PP+PYFLDPDTGSDL+WLQCDAPCV CTK HPLYRP+N+LV+CKDP+CASL Sbjct: 68 
YVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPMCASL 127 Query: 781 HSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGI 960 H Y+C++PEQCDYEVEYADGGSSLGVLV D+F LN T+G+R+ PRL LGCGYDQ+PG Sbjct: 128 HPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGYDQIPGQ 187 Query: 961 SDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRITWTP 1137 S HPLDGVLGLGKGKSSIVSQL QGV++NVVGHC+S + GGFLFFG+D+YDSSR+ WTP Sbjct: 188 SYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYDSSRVVWTP 247 Query: 1138 MLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXXX 1314 MLRD HYS+G AEL+ GGK+T KNL V FDSGSSYTY NS Y A Sbjct: 248 MLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEK 307 Query: 1315 XXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISS 1494 REA DD+TLP CW+GK+PFKS RDVKK+FKPLAL FP G + K Q++I E+YLIIS Sbjct: 308 PVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKTQYDIPLESYLIISL 367 Query: 1495 KGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRVIL 1671 KGN CLGILNGT+ GL +FN+IGDISMQDK+++YDNEK IGW NCD+ P+ IL Sbjct: 368 KGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKAAIL 426 >ref|XP_012489372.1| PREDICTED: aspartic proteinase Asp1 [Gossypium raimondii] gi|763773371|gb|KJB40494.1| hypothetical protein B456_007G066800 [Gossypium raimondii] Length = 426 Score = 565 bits (1456), Expect = e-158 Identities = 269/423 (63%), Positives = 332/423 (78%), Gaps = 9/423 (2%) Frame = +1 Query: 412 MGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWM------SPGASSSEANRIGASVIFPLY 573 M KG+ V + + L++S AS QKWRK M S +SS NR+G+S++FP++ Sbjct: 1 MRKGQVNVFFLLLFLSLSSASD----QKWRKAMMSAYNGSSSSSSMMMNRVGSSILFPIH 56 Query: 574 GNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVV 753 GNVYP G+Y V + +G+PP+PYFLD DTGSDLTWLQC+APCV C + HPLY+PSNDLV Sbjct: 57 GNVYPTGYYNVTINIGHPPKPYFLDLDTGSDLTWLQCNAPCVHCIEAPHPLYQPSNDLVA 116 Query: 754 CKDPLCASLHSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLG 933 C+ PLCA+LH DY+C++P+QCDYEVEYADGGSSLGVLV D+FSLN T+G+R++PRL LG Sbjct: 117 
CRHPLCAALHPPDYKCESPDQCDYEVEYADGGSSLGVLVRDVFSLNYTNGVRLSPRLALG 176 Query: 934 CGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVY 1110 CGYDQ+PG S HPLDG+LGLG+GKSSIVSQL+ QG+V+NVVGHCLSG+ GGFLFFG+ +Y Sbjct: 177 CGYDQIPGTSYHPLDGILGLGRGKSSIVSQLQSQGLVRNVVGHCLSGRGGGFLFFGDGLY 236 Query: 1111 DSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XX 1287 DSS +TWT M +++TK+YS G+AEL FGGK+TG+KNL VIFDSGSSYTY NSQ Y A Sbjct: 237 DSSHVTWTSMSQEFTKYYSPGSAELHFGGKATGIKNLIVIFDSGSSYTYLNSQAYQALTL 296 Query: 1288 XXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFK-PLALRFPNGWKGKPQFEI 1464 +EA +D+TLP CWKG+KPF+S D KKYFK LAL F N + K QFE+ Sbjct: 297 LLKKELSGRSLKEAPEDQTLPLCWKGRKPFRSVHDAKKYFKTSLALAFANSGRRKTQFEL 356 Query: 1465 SPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQ 1644 PEAYLIIS+KGN CLGILNGT +GL N N+IGDISMQD++++YDNEKQ IGW+ ANCD Sbjct: 357 HPEAYLIISNKGNVCLGILNGTQVGLQNLNVIGDISMQDRMVVYDNEKQVIGWSPANCDH 416 Query: 1645 PPR 1653 PR Sbjct: 417 LPR 419 >ref|XP_002522918.1| nucellin, putative [Ricinus communis] gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis] Length = 433 Score = 555 bits (1430), Expect = e-155 Identities = 276/433 (63%), Positives = 332/433 (76%), Gaps = 13/433 (3%) Frame = +1 Query: 412 MGKGKQ---IVIMIFVVLAVSG---ASSIDQQQKWRKWMSPG--ASSSEANRIGASVIFP 567 MGKG +V M+ ++ +SG ASS D+QQ+WRK + G SS NR G+S++FP Sbjct: 1 MGKGDVGFWVVTMLVLIGLISGSSAASSDDRQQRWRKAVLSGEITSSMMINRAGSSLVFP 60 Query: 568 LYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDL 747 L+GNVYP G+Y V + +G P +PYFLD DTGSDLTWLQCDAPC C + HPLYRPSN+L Sbjct: 61 LHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQCIEAPHPLYRPSNNL 120 Query: 748 VVCKDPLCASLHSSD-YRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRL 924 V+C+DPLCASL + C +P+QCDYEVEYADGGSSLGVLV D+F LN T+G R+NP L Sbjct: 121 VICEDPLCASLQPPGVHNCQDPDQCDYEVEYADGGSSLGVLVKDVFVLNFTNGKRLNPLL 180 Query: 925 TLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGE 1101 LGCGYDQLPG S+HPLDG+LGLG+G SSI SQL QG+V NV+GHCLSG+ GGFLFFGE Sbjct: 181 
ALGCGYDQLPGRSNHPLDGILGLGRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGE 240 Query: 1102 DVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY-Y 1278 D+YDSS +TWTPM RD+ KHYS G AEL+F GKSTG++NL V+FDSGSSYTY N+Q Y + Sbjct: 241 DIYDSSGVTWTPMSRDHLKHYSPGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQH 300 Query: 1279 AXXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRF--PNGWKGKP 1452 EA DD+TLP CWKGK+PFKS RDVKKYFKP AL F +G K Sbjct: 301 LVFSLKRELSRKPISEALDDQTLPLCWKGKRPFKSIRDVKKYFKPFALVFKTSSGRSSKT 360 Query: 1453 QFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAA 1632 QFE SPEAYLIISSKGNACLGILNGT++GL + N+IGD+SM D+L+IY+NEKQ IGW AA Sbjct: 361 QFEFSPEAYLIISSKGNACLGILNGTEVGLRDLNVIGDVSMLDRLVIYNNEKQMIGWAAA 420 Query: 1633 NCDQPPRSNRVIL 1671 +CD+ P+S R I+ Sbjct: 421 SCDRLPKSKRNII 433 >emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera] Length = 424 Score = 554 bits (1428), Expect = e-155 Identities = 266/419 (63%), Positives = 320/419 (76%), Gaps = 5/419 (1%) Frame = +1 Query: 430 IVIMIFVVLAVSGASSIDQQQKWRK---WMSPGASSSEANRIGASVIFPLYGNVYPNGFY 600 +++++ V++ +SG SS Q RK + P ASSS N I +SV+FPLYGNVYP G+Y Sbjct: 8 VLVVLVVLVGLSGWSSASDHQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYY 67 Query: 601 FVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASL 780 +V + +G PP PYFLDP TGSDL+WLQCDAPCV CTK H LYRP+N+LV+CKDP+CA L Sbjct: 68 YVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHXLYRPNNNLVICKDPMCAXL 127 Query: 781 HSSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGI 960 H Y+C++PEQCDYEVEYADGGSSLGVLV D+F LN T+G+R+ PRL LGCGYDQ+PG Sbjct: 128 HPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGYDQIPGX 187 Query: 961 SDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRITWTP 1137 S HPLDGVLGLGKGKSSIVSQL QGV++NVVGHC+S GGFLFFG+D+YDSSR+ WTP Sbjct: 188 SYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYDSSRVVWTP 247 Query: 1138 MLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXXX 1314 MLRD HYS+G AEL+ GGK+T KNL V FDSGSSYTY NS Y A Sbjct: 248 MLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEK 
307 Query: 1315 XXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISS 1494 REA DD+TLP CW+GK+PFKS RDV+K+FKPLAL F G + K Q++I E+YLIIS Sbjct: 308 PVREALDDQTLPLCWRGKRPFKSVRDVRKFFKPLALSFAGGGRTKTQYDIPLESYLIIS- 366 Query: 1495 KGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSNRVIL 1671 GN CLGILNGT+ GL +FN+IGDISMQDK+++YDNEK IGW NCD+ P+ IL Sbjct: 367 -GNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKAAIL 424 >ref|XP_012087626.1| PREDICTED: aspartic proteinase Asp1 [Jatropha curcas] gi|643710892|gb|KDP24798.1| hypothetical protein JCGZ_25323 [Jatropha curcas] Length = 424 Score = 550 bits (1417), Expect = e-153 Identities = 269/423 (63%), Positives = 323/423 (76%), Gaps = 8/423 (1%) Frame = +1 Query: 412 MGKGK------QIVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLY 573 MGKGK +++++ +VL S ASS QQKWRK M SS +++G+S++FPL+ Sbjct: 1 MGKGKVGFSVLALMLLLAMVLVSSAASSDGAQQKWRKAM---LSSMMLSKVGSSLVFPLH 57 Query: 574 GNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVV 753 GNVYP G+Y V + +G P +PYFLD DTGSDLTWLQCDAPC CT+ HPLYRPSN+LVV Sbjct: 58 GNVYPAGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCRQCTEAPHPLYRPSNNLVV 117 Query: 754 CKDPLCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTL 930 C DPLC SL + +++C++PEQCDYEVEYADGGSSLGVLV D+F LN T+G R+NP L L Sbjct: 118 CNDPLCRSLQAPGEHKCEDPEQCDYEVEYADGGSSLGVLVRDVFLLNFTNGQRLNPLLAL 177 Query: 931 GCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGG-FLFFGEDV 1107 GCGYDQLPG S HPLDG+LGLG+G SSI SQL QG+VKNV+GHCLSG+GG FLFFG+D+ Sbjct: 178 GCGYDQLPGRSHHPLDGILGLGRGISSIPSQLSSQGLVKNVIGHCLSGRGGGFLFFGDDI 237 Query: 1108 YDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYAXX 1287 YDSSRITWT M RD++K+YS G +EL+F GKSTG++NL V FDSGSSYTY NSQ Y Sbjct: 238 YDSSRITWTQMSRDHSKYYSPGFSELMFDGKSTGIQNLLVAFDSGSSYTYLNSQAYRGLL 297 Query: 1288 XXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEIS 1467 D+TLP CWKGKKPFKS RDVKKYFK AL F N + + FE Sbjct: 298 YSLRTALSGKPLSEVPDQTLPVCWKGKKPFKSLRDVKKYFKSFALGFANSGRARTHFEFP 357 Query: 1468 
PEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQP 1647 PEAYLIISSKGNACLGILNGT IGL + N+IGDISMQD+++IY+NEKQ IGW ANC++ Sbjct: 358 PEAYLIISSKGNACLGILNGTQIGLRDLNVIGDISMQDRMMIYNNEKQVIGWAPANCERL 417 Query: 1648 PRS 1656 P+S Sbjct: 418 PKS 420 >ref|XP_004147327.2| PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus] Length = 429 Score = 546 bits (1406), Expect = e-152 Identities = 269/427 (62%), Positives = 321/427 (75%), Gaps = 12/427 (2%) Frame = +1 Query: 412 MGKGKQIVIMIFVV----LAVSGASSIDQQQKWRKWMS----PGASSSEANRIGASVIFP 567 MGK +V+++ V LA ASS + + W + P ASSS A+ +S++ P Sbjct: 1 MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFAS---SSIVLP 57 Query: 568 LYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDL 747 L GNVYPNGFY V ++VG PP+PYFLDPDTGSDLTWLQCDAPC CT+ HPLY+PSNDL Sbjct: 58 LQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDL 117 Query: 748 VVCKDPLCASLHSS-DYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRL 924 V CKDPLC SLHSS D+RC+NP+QCDYEVEYADGGSSLGVLV D+F LNLT+G I PRL Sbjct: 118 VPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRL 177 Query: 925 TLGCGYDQLPGISD-HPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGG-FLFFG 1098 LGCGYDQ PG S HP+DG+LGLG+G SIVSQL +QG+V+NVVGHC + +GG +LFFG Sbjct: 178 ALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFG 237 Query: 1099 EDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYY 1278 + +YD R+ WTPM RDY KHYS G EL+F G+STGL+NL V+FDSGSSYTYFN+Q Y Sbjct: 238 DGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ 297 Query: 1279 AXXXXXXXXXXXXX-REATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQ 1455 REA DD TLP CW+G+KP KS RDV+KYFKPLAL F +G + K Sbjct: 298 VLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAV 357 Query: 1456 FEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAAN 1635 FEI E Y+IISS GN CLGILNGTD+GL N N+IGDISMQDK+++Y+NEKQAIGW AN Sbjct: 358 FEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAN 417 Query: 1636 CDQPPRS 1656 CD+ P+S Sbjct: 418 CDRVPKS 424 >ref|XP_009355861.1| 
PREDICTED: aspartic proteinase Asp1-like [Pyrus x bretschneideri] Length = 430 Score = 545 bits (1403), Expect = e-152 Identities = 260/411 (63%), Positives = 316/411 (76%), Gaps = 3/411 (0%) Frame = +1 Query: 430 IVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEANRIGASVIFPLYGNVYPNGFYFVQ 609 +V+M++ V +S AS DQ + + +S + +S++FP++GNVYP G Y V Sbjct: 13 MVVMVWCV-TLSSASFGDQYYRGSRKTDATSSLGFSRAAPSSIVFPVHGNVYPTGSYNVT 71 Query: 610 VFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASLHS- 786 + +G PP+PYFLDPDTGSDLTWLQCDAPCV CT+ HP YRPSNDLV CKDPLC +LHS Sbjct: 72 LNIGQPPKPYFLDPDTGSDLTWLQCDAPCVSCTQAPHPYYRPSNDLVACKDPLCEALHSP 131 Query: 787 SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGISD 966 ++CD PEQCDYEVEYADGGSSLGVLV D FSLN TSG+++ P+L LGCGYDQLPG S Sbjct: 132 GSHKCDAPEQCDYEVEYADGGSSLGVLVRDSFSLNFTSGLQLRPKLALGCGYDQLPGSSY 191 Query: 967 HPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGGFLF-FGEDVYDSSRITWTPML 1143 HP+DGVLGLG+GK+SI+SQL QG+V+NV+GHCLSG+GG F FG+D+YD SRI WTPM Sbjct: 192 HPIDGVLGLGRGKTSIISQLSSQGLVRNVIGHCLSGRGGGYFVFGDDIYDYSRIVWTPMS 251 Query: 1144 RDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY-YAXXXXXXXXXXXXX 1320 DY+KHYS G AEL+ GKSTG NL+++FDSGSSYTY +SQ+Y + Sbjct: 252 LDYSKHYSPGPAELMVDGKSTGFGNLHMVFDSGSSYTYLSSQVYQFLTSWLKRELTEKPL 311 Query: 1321 REATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIISSKG 1500 +EA DD TLP CWKG+KPFKS RDVKKYFKPLALRF +G K Q+E+ PEAYLI+SSKG Sbjct: 312 KEAPDDGTLPLCWKGRKPFKSIRDVKKYFKPLALRFGSGRKDTAQYELPPEAYLILSSKG 371 Query: 1501 NACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPR 1653 N CLGILNGT++GL + N+IGDISMQDK++IYDNEKQ IGW NCD P+ Sbjct: 372 NVCLGILNGTEVGLQDNNIIGDISMQDKMVIYDNEKQMIGWAPGNCDHLPK 422 >ref|XP_006432088.1| hypothetical protein CICLE_v10001122mg [Citrus clementina] gi|557534210|gb|ESR45328.1| hypothetical protein CICLE_v10001122mg [Citrus clementina] Length = 451 Score = 545 bits (1403), Expect = e-152 Identities = 268/442 (60%), Positives = 334/442 (75%), Gaps = 15/442 (3%) Frame = +1 Query: 376 
IITQIMGQREKVMGKGKQIVIMIFVVLAVSGASSIDQQQKWRKWMSPGASSSEA------ 537 I+TQ MG+ +G +V+M FV+ S +SS + Q +WRK + A++S + Sbjct: 11 IVTQKMGKER--VGLVLALVLMSFVI---STSSSDEHQLRWRKSLFSTATTSSSSSSSSS 65 Query: 538 ------NRIGASVIFPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCV 699 NR+G+S++F + GNVYP G+Y V V+VG PP+PYFLD DTGSDL WLQCDAPCV Sbjct: 66 SSSLLFNRVGSSLLFRVQGNVYPTGYYNVTVYVGQPPKPYFLDLDTGSDLIWLQCDAPCV 125 Query: 700 CCTKGFHPLYRPSNDLVVCKDPLCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVND 876 C + HPLYRPSNDLV C+DP+CASLH+ ++C++P QCDYEVEYADGGSSLGVLV D Sbjct: 126 QCVEAPHPLYRPSNDLVPCEDPICASLHAPGQHKCEDPTQCDYEVEYADGGSSLGVLVKD 185 Query: 877 LFSLNLTSGIRINPRLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVV 1056 F+ N T+G R+NPRL LGCGYDQ+PG S HPLDG+LGLGKGKSSIVSQL Q +++NVV Sbjct: 186 AFAFNYTNGQRLNPRLALGCGYDQVPGASHHPLDGILGLGKGKSSIVSQLHSQKLIRNVV 245 Query: 1057 GHCLSGQ-GGFLFFGEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIF 1233 GHCLSG+ GGFLFFG+D+YDSSR+ WT M DYTK+YS G AEL+FGGK+TGLKNL ++F Sbjct: 246 GHCLSGRGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELLFGGKTTGLKNLPLVF 305 Query: 1234 DSGSSYTYFNSQIYYA-XXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFK 1410 DSGSSYTY + Y +EA +DRTLP CWKGK+PFK+ RDVKKYFK Sbjct: 306 DSGSSYTYLSHVAYQTLTSMMKREISAKSLKEAPEDRTLPLCWKGKRPFKNVRDVKKYFK 365 Query: 1411 PLALRFPNGWKGKPQFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLL 1590 LAL F +G K + FE++PEAYLIIS++GN CLGILNG ++GL + N+IGDISMQD+++ Sbjct: 366 ALALSFTDG-KTRTLFELTPEAYLIISNRGNVCLGILNGAEVGLQDLNVIGDISMQDRVV 424 Query: 1591 IYDNEKQAIGWTAANCDQPPRS 1656 IYDNEKQ IGW ANCD+ P+S Sbjct: 425 IYDNEKQRIGWMPANCDRIPKS 446 >ref|XP_010111255.1| Aspartic proteinase Asp1 [Morus notabilis] gi|587944251|gb|EXC30733.1| Aspartic proteinase Asp1 [Morus notabilis] Length = 432 Score = 544 bits (1402), Expect = e-152 Identities = 267/416 (64%), Positives = 320/416 (76%), Gaps = 6/416 (1%) Frame = +1 Query: 430 IVIMIFVVLAVSGASSIDQQQKWRKWMS-PGASSSEANRIGASVIFPLYGNVYPNGFYFV 606 +V+ + + +S A+ ++ + + + PG SS E NR+G+SV+FP++GNVYP GFY V Sbjct: 13 
LVLFMGLCTTISSAAFLENRHRRKSTHPVPGTSSFELNRVGSSVVFPIHGNVYPIGFYNV 72 Query: 607 QVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSNDLVVCKDPLCASLH- 783 + +G PP+PYFLDPDTGSDLTWLQCDAPCV CT+ HPLYRPSNDLV C+DPLC +LH Sbjct: 73 TLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVQCTETPHPLYRPSNDLVGCRDPLCIALHL 132 Query: 784 SSDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINPRLTLGCGYDQLPGIS 963 +CDNPEQCDYEVEYADGGSSLGVLV D F N T G ++ PRL LGCGYDQ+PG S Sbjct: 133 PGTPKCDNPEQCDYEVEYADGGSSLGVLVKDAFYFNSTKGDQLKPRLALGCGYDQVPG-S 191 Query: 964 DH--PLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQ-GGFLFFGEDVYDSSRITWT 1134 H PLDGVLGLG+GK+SIVSQL QG+++NVVGHCLSG+ GGFLFFG++VYDSSR+ WT Sbjct: 192 SHPLPLDGVLGLGRGKTSIVSQLHSQGLMRNVVGHCLSGRGGGFLFFGDNVYDSSRVDWT 251 Query: 1135 PMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIYYA-XXXXXXXXXX 1311 PM DY KHYS G+AEL F GK TGLKNL +FDSGSSYTY SQ Y Sbjct: 252 PMSSDYLKHYSPGSAELRFDGKPTGLKNLLTVFDSGSSYTYLTSQAYQTLTFLIKRELPR 311 Query: 1312 XXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKPQFEISPEAYLIIS 1491 REATDD+TLP CWKGK+PFK DV+KYFKPLAL F G K K +E+ PEAYLI+S Sbjct: 312 KVLREATDDQTLPLCWKGKRPFKRVSDVRKYFKPLALDFTTGGKTK-TYELPPEAYLIVS 370 Query: 1492 SKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAANCDQPPRSN 1659 SKGN CLGILNG++IGL N N+IGDISMQDK++IYDNEKQ IGW +ANCD+ P+++ Sbjct: 371 SKGNVCLGILNGSEIGLQNSNIIGDISMQDKMVIYDNEKQMIGWASANCDKLPKTS 426 >ref|XP_007211687.1| hypothetical protein PRUPE_ppa005961mg [Prunus persica] gi|462407552|gb|EMJ12886.1| hypothetical protein PRUPE_ppa005961mg [Prunus persica] Length = 435 Score = 543 bits (1400), Expect = e-151 Identities = 267/428 (62%), Positives = 323/428 (75%), Gaps = 11/428 (2%) Frame = +1 Query: 406 KVMGKGKQIVIMIFVVL-----AVSGASSIDQQQKWR-KWMSPGASSSEA--NRIGASVI 561 K GK +++++ +++ +S AS DQ + R K M P ++S NR +S++ Sbjct: 2 KTEGKSGWLLLLMSLLVMGLSATMSSASFGDQYHRGRRKTMLPDEATSSLGLNRAASSIV 61 Query: 562 FPLYGNVYPNGFYFVQVFVGYPPRPYFLDPDTGSDLTWLQCDAPCVCCTKGFHPLYRPSN 741 P++GNVYP G Y V + +G PP+PYFLDPDTGSDLTWLQCDAPCV CT+ HP YRP+N Sbjct: 62 
LPVHGNVYPIGSYNVTLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVRCTEAPHPFYRPNN 121 Query: 742 DLVVCKDPLCASLHS-SDYRCDNPEQCDYEVEYADGGSSLGVLVNDLFSLNLTSGIRINP 918 DLVVCKDPLC +LH+ ++CDNPEQCDYEVEYADGGSSLGVLV D F LN T+G + Sbjct: 122 DLVVCKDPLCEALHAPGSHKCDNPEQCDYEVEYADGGSSLGVLVRDAFLLNFTNGNQRTT 181 Query: 919 RLTLGCGYDQLPGISDHPLDGVLGLGKGKSSIVSQLRDQGVVKNVVGHCLSGQGG-FLFF 1095 L LGCGYDQLPG S HP+DGVLGLGKGKSSIVSQL +QG+V++V+GHCLSG+GG F F Sbjct: 182 HLALGCGYDQLPGSSYHPIDGVLGLGKGKSSIVSQLSNQGLVRHVIGHCLSGRGGGFFFL 241 Query: 1096 GEDVYDSSRITWTPMLRDYTKHYSAGNAELVFGGKSTGLKNLNVIFDSGSSYTYFNSQIY 1275 G+ +YDSSRI WTPM DY KHYS G AEL+ GGKSTG +NL ++FDSGSSYTY NSQ Y Sbjct: 242 GDGLYDSSRIVWTPMSPDYAKHYSPGLAELIVGGKSTGFRNLVMVFDSGSSYTYLNSQAY 301 Query: 1276 -YAXXXXXXXXXXXXXREATDDRTLPFCWKGKKPFKSTRDVKKYFKPLALRFPNGWKGKP 1452 + +EA DDRTLP CWKG+KPF++ RDVK YFKPLALRF +G K Sbjct: 302 QFLTSWLKRELTGKPLKEALDDRTLPLCWKGRKPFRNIRDVKTYFKPLALRFASGRKDTT 361 Query: 1453 QFEISPEAYLIISSKGNACLGILNGTDIGLNNFNMIGDISMQDKLLIYDNEKQAIGWTAA 1632 QFE+ PEAYLIISSKGN CLGILNG+++GL N N+IGDISMQDK++IYDNEKQ IGW Sbjct: 362 QFELPPEAYLIISSKGNVCLGILNGSEVGLQNSNIIGDISMQDKMVIYDNEKQMIGWGPG 421 Query: 1633 NCDQPPRS 1656 NCD+ P+S Sbjct: 422 NCDKLPKS 429