BLASTX nr result
ID: Catharanthus23_contig00017321
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00017321
         (948 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002512416.1| serine protease htra2, putative [Ricinus com...   269   1e-69
ref|XP_002318995.2| hypothetical protein POPTR_0013s01900g [Popu...   268   2e-69
gb|EOY11481.1| Trypsin family protein with PDZ domain isoform 4 ...   266   1e-68
gb|EOY11478.1| Protease Do-like 14, putative isoform 1 [Theobrom...   266   1e-68
ref|XP_006394951.1| hypothetical protein EUTSA_v10004245mg [Eutr...   265   1e-68
ref|XP_002279678.2| PREDICTED: putative protease Do-like 14-like...   265   2e-68
emb|CBI39500.3| unnamed protein product [Vitis vinifera]              265   2e-68
gb|EOY11482.1| Trypsin family protein with PDZ domain isoform 5 ...   262   2e-67
gb|EOY11480.1| Trypsin family protein with PDZ domain isoform 3 ...   262   2e-67
gb|EOY11479.1| Trypsin family protein with PDZ domain isoform 2,...   262   2e-67
ref|XP_006287782.1| hypothetical protein CARUB_v10000993mg [Caps...   262   2e-67
ref|XP_006472073.1| PREDICTED: putative protease Do-like 14-like...   256   1e-65
ref|XP_006433397.1| hypothetical protein CICLE_v10001167mg [Citr...   256   1e-65
ref|NP_198118.3| Trypsin family protein with PDZ domain [Arabido...   254   3e-65
ref|XP_002872263.1| serine-type peptidase/ trypsin [Arabidopsis ...   254   3e-65
sp|Q3E6S8.2|DGP14_ARATH RecName: Full=Putative protease Do-like 14    254   3e-65
ref|XP_004494604.1| PREDICTED: putative protease Do-like 14-like...   248   2e-63
ref|XP_004302401.1| PREDICTED: putative protease Do-like 14-like...   245   2e-62
ref|XP_006858692.1| hypothetical protein AMTR_s00066p00095010 [A...
234 3e-59 gb|EMJ06461.1| hypothetical protein PRUPE_ppa006348mg [Prunus pe... 234 4e-59 >ref|XP_002512416.1| serine protease htra2, putative [Ricinus communis] gi|223548377|gb|EEF49868.1| serine protease htra2, putative [Ricinus communis] Length = 428 Score = 269 bits (688), Expect = 1e-69 Identities = 150/272 (55%), Positives = 185/272 (68%) Frame = +2 Query: 131 LLRKFPVSDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLE 310 L+RK P+ R S+ R + Y++ DS ++S+S PA L + L ++ Sbjct: 3 LMRKAPL--RNSIIRTLAYAASGSGILYANINSDSDAAVSLSFPAHLRESLSEALISLNP 60 Query: 311 QAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKE 490 C DNW +PLF SR A+PV +DI +E Sbjct: 61 SFICA------DNWHFGNLPLFSSR-------------ASPV----------PAADIDRE 91 Query: 491 TSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADG 670 +SG AG+ + SC CLGRDTIA+AAA+V P+VVNLSVP GF+G++ G+SIGSGTIID+DG Sbjct: 92 SSGFAGEDKKPSCGCLGRDTIADAAAKVAPAVVNLSVPLGFYGISTGESIGSGTIIDSDG 151 Query: 671 TILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTA 850 TILTCAHVVVD QG R+LSKGKV VTLQDGRT+EGTVVNADL SDIA+VKI SKTPLPTA Sbjct: 152 TILTCAHVVVDSQGRRALSKGKVHVTLQDGRTFEGTVVNADLHSDIAMVKIKSKTPLPTA 211 Query: 851 KFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 K G+SSKLRPGDWV+A+GCPL+LQNT+TAGIV Sbjct: 212 KLGSSSKLRPGDWVIAMGCPLSLQNTVTAGIV 243 >ref|XP_002318995.2| hypothetical protein POPTR_0013s01900g [Populus trichocarpa] gi|550324725|gb|EEE94918.2| hypothetical protein POPTR_0013s01900g [Populus trichocarpa] Length = 422 Score = 268 bits (686), Expect = 2e-69 Identities = 148/240 (61%), Positives = 170/240 (70%), Gaps = 1/240 (0%) Frame = +2 Query: 230 DSGTSISVSVPA-ALCDKLKWPWSNMLEQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSS 406 DS T IS+S A +L + L PW L+ +W +PLF SR Sbjct: 43 DSDTRISLSFRAESLHESLLLPWRTPLDLT--------QHSWHFGNLPLFSSR------- 87 Query: 407 DIDIKKAAPVGITDGSNHQYSTSDIKKETSGEAGDGPRHSCNCLGRDTIANAAARVGPSV 586 +PV + DIK E G G+ P+ SC CLGRDTIANAAARVGP+V Sbjct: 88 ------ISPV----------PSGDIKNENPGVVGESPKPSCGCLGRDTIANAAARVGPAV 131 Query: 587 
VNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRT 766 VNLSVP+GF+G+T GKSIGSGTIID++GTILTCAHVVVDFQ +R SKGKVDVTLQDGRT Sbjct: 132 VNLSVPKGFYGITTGKSIGSGTIIDSNGTILTCAHVVVDFQDMRDSSKGKVDVTLQDGRT 191 Query: 767 YEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 +EGTVVNADL SDIAIVKI SKTPLPTAK G+SSKLRPGDWVVA+GCPL+LQNT+TAGIV Sbjct: 192 FEGTVVNADLHSDIAIVKIKSKTPLPTAKLGSSSKLRPGDWVVAMGCPLSLQNTVTAGIV 251 >gb|EOY11481.1| Trypsin family protein with PDZ domain isoform 4 [Theobroma cacao] Length = 358 Score = 266 bits (679), Expect = 1e-68 Identities = 151/273 (55%), Positives = 178/273 (65%), Gaps = 1/273 (0%) Frame = +2 Query: 131 LLRKFPVS-DRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNML 307 LLR VS R SL R+V Y + DS T++ +S+P L + L + W Sbjct: 4 LLRNASVSCSRSSLIRIVAIGTAGSGLLYWNTNPDSETTVKLSIPVPLREHLSFQWRRPF 63 Query: 308 EQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKK 487 + +W+ +PLF SR +A AP G D K Sbjct: 64 LSSY---------HWEIGNLPLFSSRVSA-----------APAG------------DTTK 91 Query: 488 ETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDAD 667 E D + C CL RD+IANAAA+VGP+VVNLSVPQG +G+T G+SIGSGTIIDAD Sbjct: 92 EAPVAVWDDKKPCCGCLSRDSIANAAAKVGPAVVNLSVPQGIYGITTGRSIGSGTIIDAD 151 Query: 668 GTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPT 847 GTILTCAHVVV+FQG+RS KGKVDVTLQDGRT+EGTVVNADL SDIAIVKI SKTPLPT Sbjct: 152 GTILTCAHVVVEFQGMRSTIKGKVDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPT 211 Query: 848 AKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 AKFG+SS LRPGDWV+A+GCPL+LQNTITAGIV Sbjct: 212 AKFGSSSNLRPGDWVIAMGCPLSLQNTITAGIV 244 >gb|EOY11478.1| Protease Do-like 14, putative isoform 1 [Theobroma cacao] Length = 429 Score = 266 bits (679), Expect = 1e-68 Identities = 151/273 (55%), Positives = 178/273 (65%), Gaps = 1/273 (0%) Frame = +2 Query: 131 LLRKFPVS-DRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNML 307 LLR VS R SL R+V Y + DS T++ +S+P L + L + W Sbjct: 4 LLRNASVSCSRSSLIRIVAIGTAGSGLLYWNTNPDSETTVKLSIPVPLREHLSFQWRRPF 63 Query: 308 
EQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKK 487 + +W+ +PLF SR +A AP G D K Sbjct: 64 LSSY---------HWEIGNLPLFSSRVSA-----------APAG------------DTTK 91 Query: 488 ETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDAD 667 E D + C CL RD+IANAAA+VGP+VVNLSVPQG +G+T G+SIGSGTIIDAD Sbjct: 92 EAPVAVWDDKKPCCGCLSRDSIANAAAKVGPAVVNLSVPQGIYGITTGRSIGSGTIIDAD 151 Query: 668 GTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPT 847 GTILTCAHVVV+FQG+RS KGKVDVTLQDGRT+EGTVVNADL SDIAIVKI SKTPLPT Sbjct: 152 GTILTCAHVVVEFQGMRSTIKGKVDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPT 211 Query: 848 AKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 AKFG+SS LRPGDWV+A+GCPL+LQNTITAGIV Sbjct: 212 AKFGSSSNLRPGDWVIAMGCPLSLQNTITAGIV 244 >ref|XP_006394951.1| hypothetical protein EUTSA_v10004245mg [Eutrema salsugineum] gi|557091590|gb|ESQ32237.1| hypothetical protein EUTSA_v10004245mg [Eutrema salsugineum] Length = 437 Score = 265 bits (678), Expect = 1e-68 Identities = 149/275 (54%), Positives = 180/275 (65%) Frame = +2 Query: 122 MRYLLRKFPVSDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSN 301 M +L R S + SL R+V Y+ D+GT+IS+++P ++ + L PW Sbjct: 2 MNFLRRAVSSSKQSSLIRIVAVATTTSGIVYAKTNPDAGTTISLAIPESVKESLSLPWQ- 60 Query: 302 MLEQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDI 481 + H F AA SS + K AP + +D Sbjct: 61 ------------IPQGLIHRPDQSLFGNIAAF-SSRVSPKSEAPA----------NANDD 97 Query: 482 KKETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIID 661 ++ EA D P+ S LGRDTIANAAAR+GP+VVNLSVPQGF+G++ GKSIGSGTIID Sbjct: 98 EERVPVEASDSPKPSSGYLGRDTIANAAARIGPAVVNLSVPQGFYGISTGKSIGSGTIID 157 Query: 662 ADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPL 841 ADGTILTCAHVVVDFQ +R SKG+VDVTLQDGRT+EG VVNADLQSDIA+VKI SKTPL Sbjct: 158 ADGTILTCAHVVVDFQNIRQSSKGRVDVTLQDGRTFEGVVVNADLQSDIALVKIKSKTPL 217 Query: 842 PTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 PTAK G SSKL PGDWV+A+GCPL+LQNTITAGIV Sbjct: 218 PTAKLGFSSKLCPGDWVIAVGCPLSLQNTITAGIV 252 >ref|XP_002279678.2| PREDICTED: putative protease Do-like 14-like [Vitis 
vinifera] Length = 431 Score = 265 bits (676), Expect = 2e-68 Identities = 146/266 (54%), Positives = 173/266 (65%) Frame = +2 Query: 149 VSDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPA 328 +S S+ R V Y DS T +S+SVPA + L PW + + Sbjct: 4 ISTMNSVLRKVSVAAAASGLLYLCRDSDSKTMVSISVPAQFREPLLRPWQIAQDIIHRSS 63 Query: 329 YTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGEAG 508 + D + +P FSR PV ++D+ KE G+ G Sbjct: 64 FLSQGDASQSGNLPPIFSR-------------IGPV----------PSADVNKEAFGKVG 100 Query: 509 DGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCA 688 DG + SC LGRD+IANAAA VGP+VVN+SVPQGF+GMT+GKSIGSGTIID DGTILTCA Sbjct: 101 DGVKPSCGFLGRDSIANAAAMVGPAVVNISVPQGFNGMTIGKSIGSGTIIDPDGTILTCA 160 Query: 689 HVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSS 868 HVVVDF GL SKGKVDVTLQDGR+++GTV+NADL SDIAIVKI S TPLPTAK GTSS Sbjct: 161 HVVVDFHGLNDSSKGKVDVTLQDGRSFQGTVLNADLHSDIAIVKIKSSTPLPTAKLGTSS 220 Query: 869 KLRPGDWVVALGCPLTLQNTITAGIV 946 LRPGDWV+ALGCPL+LQNT+TAGIV Sbjct: 221 MLRPGDWVIALGCPLSLQNTVTAGIV 246 >emb|CBI39500.3| unnamed protein product [Vitis vinifera] Length = 486 Score = 265 bits (676), Expect = 2e-68 Identities = 146/266 (54%), Positives = 173/266 (65%) Frame = +2 Query: 149 VSDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPA 328 +S S+ R V Y DS T +S+SVPA + L PW + + Sbjct: 59 ISTMNSVLRKVSVAAAASGLLYLCRDSDSKTMVSISVPAQFREPLLRPWQIAQDIIHRSS 118 Query: 329 YTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGEAG 508 + D + +P FSR PV ++D+ KE G+ G Sbjct: 119 FLSQGDASQSGNLPPIFSR-------------IGPV----------PSADVNKEAFGKVG 155 Query: 509 DGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCA 688 DG + SC LGRD+IANAAA VGP+VVN+SVPQGF+GMT+GKSIGSGTIID DGTILTCA Sbjct: 156 DGVKPSCGFLGRDSIANAAAMVGPAVVNISVPQGFNGMTIGKSIGSGTIIDPDGTILTCA 215 Query: 689 HVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSS 868 HVVVDF GL SKGKVDVTLQDGR+++GTV+NADL SDIAIVKI S TPLPTAK GTSS Sbjct: 216 HVVVDFHGLNDSSKGKVDVTLQDGRSFQGTVLNADLHSDIAIVKIKSSTPLPTAKLGTSS 275 Query: 
869 KLRPGDWVVALGCPLTLQNTITAGIV 946 LRPGDWV+ALGCPL+LQNT+TAGIV Sbjct: 276 MLRPGDWVIALGCPLSLQNTVTAGIV 301 >gb|EOY11482.1| Trypsin family protein with PDZ domain isoform 5 [Theobroma cacao] Length = 366 Score = 262 bits (669), Expect = 2e-67 Identities = 145/263 (55%), Positives = 173/263 (65%) Frame = +2 Query: 158 RKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPAYTP 337 R +L R+V Y + DS T++ +S+P L + L + W + Sbjct: 22 RTALIRIVAIGTAGSGLLYWNTNPDSETTVKLSIPVPLREHLSFQWRRPFLSSY------ 75 Query: 338 LNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGEAGDGP 517 +W+ +PLF SR +A AP G D KE D Sbjct: 76 ---HWEIGNLPLFSSRVSA-----------APAG------------DTTKEAPVAVWDDK 109 Query: 518 RHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCAHVV 697 + C CL RD+IANAAA+VGP+VVNLSVPQG +G+T G+SIGSGTIIDADGTILTCAHVV Sbjct: 110 KPCCGCLSRDSIANAAAKVGPAVVNLSVPQGIYGITTGRSIGSGTIIDADGTILTCAHVV 169 Query: 698 VDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSSKLR 877 V+FQG+RS KGKVDVTLQDGRT+EGTVVNADL SDIAIVKI SKTPLPTAKFG+SS LR Sbjct: 170 VEFQGMRSTIKGKVDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPTAKFGSSSNLR 229 Query: 878 PGDWVVALGCPLTLQNTITAGIV 946 PGDWV+A+GCPL+LQNTITAGIV Sbjct: 230 PGDWVIAMGCPLSLQNTITAGIV 252 >gb|EOY11480.1| Trypsin family protein with PDZ domain isoform 3 [Theobroma cacao] Length = 353 Score = 262 bits (669), Expect = 2e-67 Identities = 145/263 (55%), Positives = 173/263 (65%) Frame = +2 Query: 158 RKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPAYTP 337 R +L R+V Y + DS T++ +S+P L + L + W + Sbjct: 22 RTALIRIVAIGTAGSGLLYWNTNPDSETTVKLSIPVPLREHLSFQWRRPFLSSY------ 75 Query: 338 LNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGEAGDGP 517 +W+ +PLF SR +A AP G D KE D Sbjct: 76 ---HWEIGNLPLFSSRVSA-----------APAG------------DTTKEAPVAVWDDK 109 Query: 518 RHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCAHVV 697 + C CL RD+IANAAA+VGP+VVNLSVPQG +G+T G+SIGSGTIIDADGTILTCAHVV Sbjct: 110 KPCCGCLSRDSIANAAAKVGPAVVNLSVPQGIYGITTGRSIGSGTIIDADGTILTCAHVV 169 Query: 698 
VDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSSKLR 877 V+FQG+RS KGKVDVTLQDGRT+EGTVVNADL SDIAIVKI SKTPLPTAKFG+SS LR Sbjct: 170 VEFQGMRSTIKGKVDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPTAKFGSSSNLR 229 Query: 878 PGDWVVALGCPLTLQNTITAGIV 946 PGDWV+A+GCPL+LQNTITAGIV Sbjct: 230 PGDWVIAMGCPLSLQNTITAGIV 252 >gb|EOY11479.1| Trypsin family protein with PDZ domain isoform 2, partial [Theobroma cacao] Length = 418 Score = 262 bits (669), Expect = 2e-67 Identities = 145/263 (55%), Positives = 173/263 (65%) Frame = +2 Query: 158 RKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPAYTP 337 R +L R+V Y + DS T++ +S+P L + L + W + Sbjct: 24 RTALIRIVAIGTAGSGLLYWNTNPDSETTVKLSIPVPLREHLSFQWRRPFLSSY------ 77 Query: 338 LNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGEAGDGP 517 +W+ +PLF SR +A AP G D KE D Sbjct: 78 ---HWEIGNLPLFSSRVSA-----------APAG------------DTTKEAPVAVWDDK 111 Query: 518 RHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCAHVV 697 + C CL RD+IANAAA+VGP+VVNLSVPQG +G+T G+SIGSGTIIDADGTILTCAHVV Sbjct: 112 KPCCGCLSRDSIANAAAKVGPAVVNLSVPQGIYGITTGRSIGSGTIIDADGTILTCAHVV 171 Query: 698 VDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSSKLR 877 V+FQG+RS KGKVDVTLQDGRT+EGTVVNADL SDIAIVKI SKTPLPTAKFG+SS LR Sbjct: 172 VEFQGMRSTIKGKVDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPTAKFGSSSNLR 231 Query: 878 PGDWVVALGCPLTLQNTITAGIV 946 PGDWV+A+GCPL+LQNTITAGIV Sbjct: 232 PGDWVIAMGCPLSLQNTITAGIV 254 >ref|XP_006287782.1| hypothetical protein CARUB_v10000993mg [Capsella rubella] gi|482556488|gb|EOA20680.1| hypothetical protein CARUB_v10000993mg [Capsella rubella] Length = 435 Score = 262 bits (669), Expect = 2e-67 Identities = 147/265 (55%), Positives = 175/265 (66%) Frame = +2 Query: 152 SDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPAY 331 S R L R+V Y++ D GT IS ++P ++ + + PW Sbjct: 13 SKRSELIRIVAVATATSGIVYANCNPDLGTRISFAIPESVRESVSLPWR----------- 61 Query: 332 TPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGEAGD 511 ++ H F A SS + K AP+ +D +K S EA D Sbjct: 62 
--ISQGLIHRPDQSLFGNFAF--SSRVSPKSEAPI------------NDDEKGVSVEASD 105 Query: 512 GPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCAH 691 + S LGRDTIANAAAR+GP+VVNLSVPQGFHG+++GKSIGSGTIIDADGTILTCAH Sbjct: 106 SSKPSNGYLGRDTIANAAARIGPAVVNLSVPQGFHGISMGKSIGSGTIIDADGTILTCAH 165 Query: 692 VVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSSK 871 VVVDFQ +R SKG+VDVTLQDGRT+EG VVNADLQSDIA+VKI SKTPLPTAK G SSK Sbjct: 166 VVVDFQNIRQSSKGRVDVTLQDGRTFEGVVVNADLQSDIALVKIQSKTPLPTAKIGFSSK 225 Query: 872 LRPGDWVVALGCPLTLQNTITAGIV 946 LRPGDWV+A+GCPL+LQNTITAGIV Sbjct: 226 LRPGDWVIAVGCPLSLQNTITAGIV 250 >ref|XP_006472073.1| PREDICTED: putative protease Do-like 14-like [Citrus sinensis] Length = 449 Score = 256 bits (653), Expect = 1e-65 Identities = 146/268 (54%), Positives = 176/268 (65%), Gaps = 5/268 (1%) Frame = +2 Query: 158 RKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPAYTP 337 R SL R+V Y DS T IS+S+PA L + + ++++ ++TP Sbjct: 14 RNSLIRVVAIAAAGSGLFYGSSNPDSKTRISLSIPATLHESV------LVQRQMSQSFTP 67 Query: 338 -----LNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGE 502 +D W+ V L SR + +S IKK PV + +K+ET+G+ Sbjct: 68 HSPFISSDCWQFGNVSLVSSR--VNPASSGSIKKEYPVT---------EEAPVKEETTGD 116 Query: 503 AGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILT 682 DG C CLGRDTIANAAARV P+VVNLS P+ F G+ G+ IGSG I+DADGTILT Sbjct: 117 VKDGKDSCCRCLGRDTIANAAARVCPAVVNLSAPREFLGILSGRGIGSGAIVDADGTILT 176 Query: 683 CAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGT 862 CAHVVVDF G R+L KGKVDVTLQDGRT+EGTV+NAD SDIAIVKINSKTPLP AK GT Sbjct: 177 CAHVVVDFHGSRALPKGKVDVTLQDGRTFEGTVLNADFHSDIAIVKINSKTPLPAAKLGT 236 Query: 863 SSKLRPGDWVVALGCPLTLQNTITAGIV 946 SSKL PGDWVVA+GCP LQNT+TAGIV Sbjct: 237 SSKLCPGDWVVAMGCPHYLQNTVTAGIV 264 >ref|XP_006433397.1| hypothetical protein CICLE_v10001167mg [Citrus clementina] gi|557535519|gb|ESR46637.1| hypothetical protein CICLE_v10001167mg [Citrus clementina] Length = 443 Score = 256 bits (653), Expect = 1e-65 Identities = 146/268 (54%), Positives = 176/268 (65%), 
Gaps = 5/268 (1%) Frame = +2 Query: 158 RKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPAYTP 337 R SL R+V Y DS T IS+S+PA L + + ++++ ++TP Sbjct: 8 RNSLIRVVAIAAAGSGLFYGSSNPDSKTRISLSIPATLHESV------LVQRQMSQSFTP 61 Query: 338 -----LNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSGE 502 +D W+ V L SR + +S IKK PV + +K+ET+G+ Sbjct: 62 HSPFISSDCWQFGNVSLVSSR--VNPASSGSIKKEYPVT---------EEAPVKEETTGD 110 Query: 503 AGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILT 682 DG C CLGRDTIANAAARV P+VVNLS P+ F G+ G+ IGSG I+DADGTILT Sbjct: 111 VKDGKDSCCRCLGRDTIANAAARVCPAVVNLSAPREFLGILSGRGIGSGAIVDADGTILT 170 Query: 683 CAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGT 862 CAHVVVDF G R+L KGKVDVTLQDGRT+EGTV+NAD SDIAIVKINSKTPLP AK GT Sbjct: 171 CAHVVVDFHGSRALPKGKVDVTLQDGRTFEGTVLNADFHSDIAIVKINSKTPLPAAKLGT 230 Query: 863 SSKLRPGDWVVALGCPLTLQNTITAGIV 946 SSKL PGDWVVA+GCP LQNT+TAGIV Sbjct: 231 SSKLCPGDWVVAMGCPHYLQNTVTAGIV 258 >ref|NP_198118.3| Trypsin family protein with PDZ domain [Arabidopsis thaliana] gi|332006329|gb|AED93712.1| Trypsin family protein with PDZ domain [Arabidopsis thaliana] Length = 428 Score = 254 bits (649), Expect = 3e-65 Identities = 144/276 (52%), Positives = 179/276 (64%), Gaps = 1/276 (0%) Frame = +2 Query: 122 MRYLLRKFPVSDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKW-PWS 298 M +L R S R L R++ Y+ D+ T +S+++P ++ + L PW Sbjct: 2 MNFLRRAVSSSKRSELIRIISVATATSGILYASTNPDARTRVSLAIPESVRESLSLLPW- 60 Query: 299 NMLEQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSD 478 + +P + P + + + SS + K AP+ G + + S S Sbjct: 61 ---QISPGLIHRPEQSLFGNFVF-----------SSRVSPKSEAPINDEKGVSVEASDSS 106 Query: 479 IKKETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTII 658 K S LGRDTIANAAAR+GP+VVNLSVPQGFHG+++GKSIGSGTII Sbjct: 107 SKP------------SNGYLGRDTIANAAARIGPAVVNLSVPQGFHGISMGKSIGSGTII 154 Query: 659 DADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTP 838 DADGTILTCAHVVVDFQ +R SKG+VDVTLQDGRT+EG VVNADLQSDIA+VKI SKTP Sbjct: 155 
DADGTILTCAHVVVDFQNIRHSSKGRVDVTLQDGRTFEGVVVNADLQSDIALVKIKSKTP 214 Query: 839 LPTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 LPTAK G SSKLRPGDWV+A+GCPL+LQNT+TAGIV Sbjct: 215 LPTAKLGFSSKLRPGDWVIAVGCPLSLQNTVTAGIV 250 >ref|XP_002872263.1| serine-type peptidase/ trypsin [Arabidopsis lyrata subsp. lyrata] gi|297318100|gb|EFH48522.1| serine-type peptidase/ trypsin [Arabidopsis lyrata subsp. lyrata] Length = 428 Score = 254 bits (649), Expect = 3e-65 Identities = 148/277 (53%), Positives = 182/277 (65%), Gaps = 2/277 (0%) Frame = +2 Query: 122 MRYLLRKFPVSDRKS-LFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKW-PW 295 M +L R S ++S L R+V Y++ D+ T IS+++P ++ + L PW Sbjct: 1 MNFLRRAVSSSSKRSELIRIVAVATATSGIVYANSNPDARTRISLAIPESVRESLLLLPW 60 Query: 296 SNMLEQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTS 475 +P + P + + F SR + + ++ +K PV Sbjct: 61 ----RISPGLIHRPDQSLFGNFA---FSSRVSPKSEAAVNDEKGVPV------------- 100 Query: 476 DIKKETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTI 655 EA D + S LGRDTIANAAARVGP+VVNLSVPQGFHG+++GKSIGSGTI Sbjct: 101 --------EASDSSKPSNGYLGRDTIANAAARVGPAVVNLSVPQGFHGISMGKSIGSGTI 152 Query: 656 IDADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKT 835 IDADGTILTCAHVVVDFQ +R SKG+VDVTLQDGRT+EG VVNADLQSDIA+VKI SKT Sbjct: 153 IDADGTILTCAHVVVDFQNIRQSSKGRVDVTLQDGRTFEGVVVNADLQSDIALVKIKSKT 212 Query: 836 PLPTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 PLPTAK G SSKLRPGDWV+A+GCPL+LQNTITAGIV Sbjct: 213 PLPTAKLGFSSKLRPGDWVIAVGCPLSLQNTITAGIV 249 >sp|Q3E6S8.2|DGP14_ARATH RecName: Full=Putative protease Do-like 14 Length = 429 Score = 254 bits (649), Expect = 3e-65 Identities = 144/276 (52%), Positives = 179/276 (64%), Gaps = 1/276 (0%) Frame = +2 Query: 122 MRYLLRKFPVSDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKW-PWS 298 M +L R S R L R++ Y+ D+ T +S+++P ++ + L PW Sbjct: 2 MNFLRRAVSSSKRSELIRIISVATATSGILYASTNPDARTRVSLAIPESVRESLSLLPW- 60 Query: 299 NMLEQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSD 478 + +P + P + + + SS + K AP+ G + + S S Sbjct: 61 
---QISPGLIHRPEQSLFGNFVF-----------SSRVSPKSEAPINDEKGVSVEASDSS 106 Query: 479 IKKETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGTII 658 K S LGRDTIANAAAR+GP+VVNLSVPQGFHG+++GKSIGSGTII Sbjct: 107 SKP------------SNGYLGRDTIANAAARIGPAVVNLSVPQGFHGISMGKSIGSGTII 154 Query: 659 DADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSKTP 838 DADGTILTCAHVVVDFQ +R SKG+VDVTLQDGRT+EG VVNADLQSDIA+VKI SKTP Sbjct: 155 DADGTILTCAHVVVDFQNIRHSSKGRVDVTLQDGRTFEGVVVNADLQSDIALVKIKSKTP 214 Query: 839 LPTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 LPTAK G SSKLRPGDWV+A+GCPL+LQNT+TAGIV Sbjct: 215 LPTAKLGFSSKLRPGDWVIAVGCPLSLQNTVTAGIV 250 >ref|XP_004494604.1| PREDICTED: putative protease Do-like 14-like [Cicer arietinum] Length = 432 Score = 248 bits (633), Expect = 2e-63 Identities = 119/160 (74%), Positives = 139/160 (86%) Frame = +2 Query: 467 STSDIKKETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGS 646 S+SDI KE SG DG + C C GRDTIANAAA+VGP+VVN+S+PQ F+G+T G+SIGS Sbjct: 89 SSSDISKEASGAVHDGSK-PCGCFGRDTIANAAAKVGPAVVNISIPQDFYGVTTGRSIGS 147 Query: 647 GTIIDADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKIN 826 GTIID DGTILTCAHVVVDF G RS SKGK++VTLQDGRT+EG VVNAD+ SDIA+VKIN Sbjct: 148 GTIIDKDGTILTCAHVVVDFHGSRSSSKGKIEVTLQDGRTFEGKVVNADMHSDIAVVKIN 207 Query: 827 SKTPLPTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 S+TPLP AK G SS+LRPGDWV+A+GCPL+LQNT+TAGIV Sbjct: 208 SETPLPDAKLGNSSRLRPGDWVIAMGCPLSLQNTVTAGIV 247 >ref|XP_004302401.1| PREDICTED: putative protease Do-like 14-like [Fragaria vesca subsp. 
vesca] Length = 423 Score = 245 bits (626), Expect = 2e-62 Identities = 141/278 (50%), Positives = 177/278 (63%), Gaps = 3/278 (1%) Frame = +2 Query: 122 MRYLLRKFPVSDRKSLFRLVXXXXXXXXXXYSHYGGDSGTSISVSVPAALCDKLKWPWSN 301 M + LR +++R R++ Y+ +S SVS+PA + L P Sbjct: 1 MSHFLRNVLIANRNRN-RILAIAAVGSALLYAKGNESFNSSFSVSIPAPWRESLLLP--- 56 Query: 302 MLEQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNHQYSTSDI 481 NW +VPLF +T+GS + DI Sbjct: 57 -------------RQNWPFGVVPLF--------------------SVTNGSA---PSPDI 80 Query: 482 KKETSG--EAGDGPRHSCN-CLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSIGSGT 652 K+ SG AG+ P+ C+ CLG+DTIA AAA+VGP+VVN+S+ QG +G+ VGK IGSGT Sbjct: 81 GKDVSGFSVAGESPKPCCSGCLGKDTIAKAAAKVGPAVVNISLQQGMYGVGVGKGIGSGT 140 Query: 653 IIDADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVKINSK 832 IID DGTILTCAH VVDF GLR+ SKGKV VTLQDGRT+EGTVVNADLQSD+AIVKINSK Sbjct: 141 IIDEDGTILTCAHAVVDFHGLRASSKGKVGVTLQDGRTFEGTVVNADLQSDVAIVKINSK 200 Query: 833 TPLPTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 TPLP+AK GTSS+L+PGDWV+A+GCPL+LQNT+T+GIV Sbjct: 201 TPLPSAKLGTSSRLQPGDWVIAVGCPLSLQNTVTSGIV 238 >ref|XP_006858692.1| hypothetical protein AMTR_s00066p00095010 [Amborella trichopoda] gi|548862803|gb|ERN20159.1| hypothetical protein AMTR_s00066p00095010 [Amborella trichopoda] Length = 485 Score = 234 bits (598), Expect = 3e-59 Identities = 129/222 (58%), Positives = 154/222 (69%) Frame = +2 Query: 281 LKWPWSNMLEQAPCPAYTPLNDNWKHCIVPLFFSRPAADRSSDIDIKKAAPVGITDGSNH 460 L+W W ++L + +P D +P+ FSR D + ++ +GI + Sbjct: 84 LRW-WQSLLGEVH--QLSPWLDTTNKGYLPVNFSR--TDGTDTLNSNSPPFIGIKKKGST 138 Query: 461 QYSTSDIKKETSGEAGDGPRHSCNCLGRDTIANAAARVGPSVVNLSVPQGFHGMTVGKSI 640 +K++ S + GD S CLGR++IANAAA VGP+VVNLSV QGF GMT+GK+I Sbjct: 139 SPPYIGVKEKGSDDVGDENNSSSGCLGRNSIANAAALVGPAVVNLSVTQGFSGMTLGKNI 198 Query: 641 GSGTIIDADGTILTCAHVVVDFQGLRSLSKGKVDVTLQDGRTYEGTVVNADLQSDIAIVK 820 GSGTIID DGTILTCAHVVV FQ RS K KVDVTLQDGRT+EG VVNAD SDIA+VK Sbjct: 199 GSGTIIDPDGTILTCAHVVVGFQSARSPYKRKVDVTLQDGRTFEGEVVNADFHSDIAVVK 258 Query: 821 
INSKTPLPTAKFGTSSKLRPGDWVVALGCPLTLQNTITAGIV 946 I SKTPLP AK G+S LRPGDWVVALGCPL+LQNTITAGIV Sbjct: 259 IKSKTPLPAAKLGSSGMLRPGDWVVALGCPLSLQNTITAGIV 300 >gb|EMJ06461.1| hypothetical protein PRUPE_ppa006348mg [Prunus persica] Length = 416 Score = 234 bits (597), Expect = 4e-59 Identities = 132/247 (53%), Positives = 161/247 (65%), Gaps = 2/247 (0%) Frame = +2 Query: 212 YSHYGGDSGTSISVSVPAALCDKLKWPWSNMLEQAPCPAYTPLNDNWKHCIVPLFFSRPA 391 Y++ +S + SVS+PA L + L PW + L VPLF Sbjct: 27 YANGNRNSDYTASVSLPAPLRESLWLPWQSELS------------------VPLFSL--- 65 Query: 392 ADRSSDIDIKKAAPVGITDGSNHQYSTSDIKKETSG--EAGDGPRHSCNCLGRDTIANAA 565 N +SDI K+ SG AG+ P+ CLGRD+ A AA Sbjct: 66 --------------------GNSSVPSSDISKDVSGVSAAGEIPKTCSGCLGRDSFAKAA 105 Query: 566 ARVGPSVVNLSVPQGFHGMTVGKSIGSGTIIDADGTILTCAHVVVDFQGLRSLSKGKVDV 745 A+VGP+VVN+S PQG G++ GK +GSGTII+ DGTILTCAH VVDF GLR+ SKGKV V Sbjct: 106 AKVGPAVVNVSAPQGEFGISPGKGMGSGTIINQDGTILTCAHAVVDFHGLRASSKGKVHV 165 Query: 746 TLQDGRTYEGTVVNADLQSDIAIVKINSKTPLPTAKFGTSSKLRPGDWVVALGCPLTLQN 925 TLQDGRT+EGTVVNADLQSD+AIVKINSKTPLPTAK G+SSKL+PGD V+A+GCPL+LQN Sbjct: 166 TLQDGRTFEGTVVNADLQSDVAIVKINSKTPLPTAKLGSSSKLQPGDCVIAVGCPLSLQN 225 Query: 926 TITAGIV 946 T+T+GIV Sbjct: 226 TVTSGIV 232