BLASTX nr result
ID: Akebia23_contig00030719
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00030719 (901 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007037887.1| Cysteine proteinases superfamily protein, pu... 280 4e-73 ref|XP_007037882.1| Cysteine proteinases superfamily protein, pu... 280 4e-73 ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251... 273 9e-71 ref|XP_007037888.1| Cysteine proteinases superfamily protein, pu... 267 4e-69 ref|XP_007037885.1| Cysteine proteinases superfamily protein, pu... 267 4e-69 ref|XP_007037884.1| Cysteine proteinases superfamily protein, pu... 267 4e-69 ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Popu... 256 8e-66 ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like ... 249 1e-63 ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citr... 249 1e-63 ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Popu... 247 5e-63 ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ri... 240 5e-61 ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305... 239 1e-60 ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303... 237 5e-60 ref|XP_007037883.1| Cysteine proteinases superfamily protein, pu... 236 9e-60 gb|EXC04030.1| Ubiquitin-like-specific protease 1D [Morus notabi... 227 6e-57 ref|XP_007037886.1| Cysteine proteinases superfamily protein, pu... 223 8e-56 gb|AFK37750.1| unknown [Lotus japonicus] 221 3e-55 ref|XP_007137950.1| hypothetical protein PHAVU_009G168700g [Phas... 214 3e-53 gb|ADN33966.1| sentrin/sumo-specific protease [Cucumis melo subs... 213 1e-52 ref|XP_007210601.1| hypothetical protein PRUPE_ppa017098mg [Prun... 212 1e-52 >ref|XP_007037887.1| Cysteine proteinases superfamily protein, putative isoform 6, partial [Theobroma cacao] gi|508775132|gb|EOY22388.1| Cysteine proteinases superfamily protein, putative isoform 6, partial [Theobroma cacao] Length = 232 Score = 280 bits (717), Expect = 4e-73 Identities = 136/218 (62%), Positives = 165/218 (75%), Gaps = 1/218 (0%) Frame = -2 Query: 651 DLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 472 DL SS E Y+ H SC H+ +A +++K++A++++ F L P F G IP Sbjct: 2 DLASSDPE-YNGYPISKHRSCWVHVIGSLKARKKKISKQEAQKLRDFRLTAPCFLGNIPC 60 Query: 471 RERSKR-ILIKNSISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWS 295 R+RSKR + KNSISKQ +LDS FE ++EKLWSSF EEK+ SF Y DC WFA YRK S Sbjct: 61 RQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKAS 120 Query: 294 TKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMA 115 + KVL+WIK + IFS+KYV VP++CW HWSLLIFCHFGESLQS+T+TPCMLLLDSLE+A Sbjct: 121 FREKVLSWIKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIA 180 Query: 114 NPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPK 1 NP+RLEPDIRKFVLDIYR EGRPEKKE I +IP LVPK Sbjct: 181 NPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLLVPK 218 >ref|XP_007037882.1| Cysteine proteinases superfamily protein, putative isoform 1 [Theobroma cacao] gi|508775127|gb|EOY22383.1| Cysteine proteinases superfamily protein, putative isoform 1 [Theobroma cacao] Length = 291 Score = 280 bits (717), Expect = 4e-73 Identities = 136/218 (62%), Positives = 165/218 (75%), Gaps = 1/218 (0%) Frame = -2 Query: 651 DLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 472 DL SS E Y+ H SC H+ +A +++K++A++++ F L P F G IP Sbjct: 15 DLASSDPE-YNGYPISKHRSCWVHVIGSLKARKKKISKQEAQKLRDFRLTAPCFLGNIPC 73 Query: 471 RERSKR-ILIKNSISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWS 295 R+RSKR + KNSISKQ +LDS FE ++EKLWSSF EEK+ SF Y DC WFA YRK S Sbjct: 74 RQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKAS 133 Query: 294 TKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMA 115 + KVL+WIK + IFS+KYV VP++CW HWSLLIFCHFGESLQS+T+TPCMLLLDSLE+A Sbjct: 134 FREKVLSWIKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIA 193 Query: 114 NPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPK 1 NP+RLEPDIRKFVLDIYR EGRPEKKE I +IP LVPK Sbjct: 194 NPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLLVPK 231 >ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251251 [Vitis vinifera] gi|297733618|emb|CBI14865.3| unnamed protein product [Vitis vinifera] Length = 295 Score = 273 bits (697), Expect = 9e-71 Identities = 141/230 (61%), Positives = 163/230 (70%), Gaps = 3/230 (1%) Frame = -2 Query: 681 KKKLKEIASFDLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKS-FEL 505 KK A DL S+ E Y D H SC H+ A QA R+TK + EIK FE Sbjct: 4 KKPRNSNAPIDLASADSESYLDYS--KHRSCWRHMVAHLQAQNKRMTKHEIEEIKEIFEF 61 Query: 504 AFPFFSGTIPRRERSKR-ILIKNSI-SKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYL 331 P FS T PR ERSKR I KN I K+ +KLD+ FE + LW SFS++KK+SF YL Sbjct: 62 TTPCFSNTFPRHERSKRRINCKNIIIRKEKKKLDTAAFEWYFRNLWKSFSDDKKSSFGYL 121 Query: 330 DCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRT 151 DCLWF+ Y K S++ KVL WIK K IFSRKYVFVPI+CW+HWSLLI CHFGESL+SK R Sbjct: 122 DCLWFSFYLKTSSREKVLNWIKKKRIFSRKYVFVPIVCWNHWSLLILCHFGESLESKIRA 181 Query: 150 PCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPK 1 PCMLLLDSL+MANPKRLEP+IRKFV DIY+EEGRPE K+ I+KIP LVPK Sbjct: 182 PCMLLLDSLQMANPKRLEPNIRKFVFDIYKEEGRPESKQLISKIPLLVPK 231 >ref|XP_007037888.1| Cysteine proteinases superfamily protein, putative isoform 7 [Theobroma cacao] gi|508775133|gb|EOY22389.1| Cysteine proteinases superfamily protein, putative isoform 7 [Theobroma cacao] Length = 227 Score = 267 bits (683), Expect = 4e-69 Identities = 126/189 (66%), Positives = 153/189 (80%), Gaps = 1/189 (0%) Frame = -2 Query: 564 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKR-ILIKNSISKQHRKLDSNVFESF 388 +A +++K++A++++ F L P F G IP R+RSKR + KNSISKQ +LDS FE + Sbjct: 25 KARKKKISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECY 84 Query: 387 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 208 +EKLWSSF EEK+ SF Y DC WFA YRK S + KVL+WIK + IFS+KYV VP++CW H Sbjct: 85 MEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSH 144 Query: 207 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 28 WSLLIFCHFGESLQS+T+TPCMLLLDSLE+ANP+RLEPDIRKFVLDIYR EGRPEKKE I Sbjct: 145 WSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMI 204 Query: 27 AKIPFLVPK 1 +IP LVPK Sbjct: 205 YRIPLLVPK 213 >ref|XP_007037885.1| Cysteine proteinases superfamily protein, putative isoform 4 [Theobroma cacao] gi|508775130|gb|EOY22386.1| Cysteine proteinases superfamily protein, putative isoform 4 [Theobroma cacao] Length = 270 Score = 267 bits (683), Expect = 4e-69 Identities = 126/189 (66%), Positives = 153/189 (80%), Gaps = 1/189 (0%) Frame = -2 Query: 564 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKR-ILIKNSISKQHRKLDSNVFESF 388 +A +++K++A++++ F L P F G IP R+RSKR + KNSISKQ +LDS FE + Sbjct: 25 KARKKKISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECY 84 Query: 387 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 208 +EKLWSSF EEK+ SF Y DC WFA YRK S + KVL+WIK + IFS+KYV VP++CW H Sbjct: 85 MEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSH 144 Query: 207 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 28 WSLLIFCHFGESLQS+T+TPCMLLLDSLE+ANP+RLEPDIRKFVLDIYR EGRPEKKE I Sbjct: 145 WSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMI 204 Query: 27 AKIPFLVPK 1 +IP LVPK Sbjct: 205 YRIPLLVPK 213 >ref|XP_007037884.1| Cysteine proteinases superfamily protein, putative isoform 3 [Theobroma cacao] gi|508775129|gb|EOY22385.1| Cysteine proteinases superfamily protein, putative isoform 3 [Theobroma cacao] Length = 273 Score = 267 bits (683), Expect = 4e-69 Identities = 126/189 (66%), Positives = 153/189 (80%), Gaps = 1/189 (0%) Frame = -2 Query: 564 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKR-ILIKNSISKQHRKLDSNVFESF 388 +A +++K++A++++ F L P F G IP R+RSKR + KNSISKQ +LDS FE + Sbjct: 25 KARKKKISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECY 84 Query: 387 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 208 +EKLWSSF EEK+ SF Y DC WFA YRK S + KVL+WIK + IFS+KYV VP++CW H Sbjct: 85 MEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSH 144 Query: 207 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 28 WSLLIFCHFGESLQS+T+TPCMLLLDSLE+ANP+RLEPDIRKFVLDIYR EGRPEKKE I Sbjct: 145 WSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMI 204 Query: 27 AKIPFLVPK 1 +IP LVPK Sbjct: 205 YRIPLLVPK 213 >ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Populus trichocarpa] gi|222864154|gb|EEF01285.1| hypothetical protein POPTR_0010s18760g [Populus trichocarpa] Length = 298 Score = 256 bits (654), Expect = 8e-66 Identities = 128/205 (62%), Positives = 150/205 (73%), Gaps = 1/205 (0%) Frame = -2 Query: 612 KPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSK-RILIKNS 436 +P H +C HI A A R+TKK+A EI+SF+L P F TIP RERSK R N+ Sbjct: 31 QPSKHRTCWKHIQARMHARRTRMTKKQAEEIESFKLTSPCFLQTIPCRERSKKRFKRNNA 90 Query: 435 ISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKH 256 +SK ++LDS F ++E LW SFSE+KK SF YLD LWF +Y + S+ KVL WIK KH Sbjct: 91 VSKLKKELDSVSFNCYMENLWKSFSEDKKMSFAYLDSLWFTMYTEASSGVKVLEWIKRKH 150 Query: 255 IFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFV 76 IFS+KYV VPI+ W HWSLLIFCHFGESL S+ TPCMLLLDSLEMA+PKRLEPDIRKFV Sbjct: 151 IFSKKYVLVPIVRWCHWSLLIFCHFGESLLSENITPCMLLLDSLEMASPKRLEPDIRKFV 210 Query: 75 LDIYREEGRPEKKESIAKIPFLVPK 1 DIY EGRPE K I++IP LVPK Sbjct: 211 WDIYESEGRPENKHMISQIPLLVPK 235 >ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like isoform X1 [Citrus sinensis] gi|568883543|ref|XP_006494525.1| PREDICTED: sentrin-specific protease 1-like isoform X1 [Citrus sinensis] Length = 303 Score = 249 bits (635), Expect = 1e-63 Identities = 136/248 (54%), Positives = 166/248 (66%), Gaps = 19/248 (7%) Frame = -2 Query: 687 MGKKKLKEIAS-FDLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSF 511 MGK+K + + D+VSS+ E D H +C H A A +++K+K I++F Sbjct: 1 MGKRKRGDANNPIDIVSSTPE--DPGHLSKHRTCWLHTVAFLHARKMKISKQK---IRNF 55 Query: 510 ELAFPFFSGTIPRRERSKR-----------------ILIKNSISKQHR-KLDSNVFESFL 385 EL P F GT R RSKR + K+ I+K+ + KLDS FE L Sbjct: 56 ELTAPCFLGTFSCRRRSKRRVKCKNTSLIKGKNSSSVKCKDMITKRKKNKLDSGKFEHLL 115 Query: 384 EKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHW 205 + LW SFSE+KKA FTYLD LWF LYRK S+KAKVLTWIK KHIFS+KYV VPI+CW HW Sbjct: 116 DNLWRSFSEDKKAGFTYLDSLWFDLYRKPSSKAKVLTWIKRKHIFSKKYVLVPIVCWRHW 175 Query: 204 SLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIA 25 +LLI C+FG S +SKTRTPCMLLLDSLEM+NP R EPDIRKFV+DIY+ E RPE KE I+ Sbjct: 176 NLLILCNFGGSFESKTRTPCMLLLDSLEMSNPWRFEPDIRKFVMDIYKAEDRPETKELIS 235 Query: 24 KIPFLVPK 1 +IP LVPK Sbjct: 236 RIPLLVPK 243 >ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citrus clementina] gi|557542301|gb|ESR53279.1| hypothetical protein CICLE_v10021330mg [Citrus clementina] Length = 303 Score = 249 bits (635), Expect = 1e-63 Identities = 136/248 (54%), Positives = 166/248 (66%), Gaps = 19/248 (7%) Frame = -2 Query: 687 MGKKKLKEIAS-FDLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSF 511 MGK+K + + D+VSS+ E D H +C H A A +++K+K I++F Sbjct: 1 MGKRKRGDANNPIDIVSSTPE--DPGHLSKHRTCWLHTVAFLHARKMKISKQK---IRNF 55 Query: 510 ELAFPFFSGTIPRRERSKR-----------------ILIKNSISKQHR-KLDSNVFESFL 385 EL P F GT R RSKR + K+ I+K+ + KLDS FE L Sbjct: 56 ELTAPCFLGTFSCRRRSKRRVKCKNTSLIKGKNSSSVKCKDMITKRRKNKLDSGKFEHLL 115 Query: 384 EKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHW 205 + LW SFSE+KKA FTYLD LWF LYRK S+KAKVLTWIK KHIFS+KYV VPI+CW HW Sbjct: 116 DNLWRSFSEDKKAGFTYLDSLWFDLYRKPSSKAKVLTWIKRKHIFSKKYVLVPIVCWRHW 175 Query: 204 SLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIA 25 +LLI C+FG S +SKTRTPCMLLLDSLEM+NP R EPDIRKFV+DIY+ E RPE KE I+ Sbjct: 176 NLLILCNFGGSFESKTRTPCMLLLDSLEMSNPWRFEPDIRKFVMDIYKAEERPETKELIS 235 Query: 24 KIPFLVPK 1 +IP LVPK Sbjct: 236 RIPLLVPK 243 >ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Populus trichocarpa] gi|550322421|gb|EEF06353.2| hypothetical protein POPTR_0015s10250g [Populus trichocarpa] Length = 292 Score = 247 bits (630), Expect = 5e-63 Identities = 132/234 (56%), Positives = 160/234 (68%), Gaps = 5/234 (2%) Frame = -2 Query: 687 MGKKKLKE-IASFDLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSF 511 M K+K ++ I+S D S E Y+ + H SC H+ A A ++TK++A E++SF Sbjct: 1 MAKRKREDGISSADTKSPISETYE--RMAKHRSCWIHMLAHMYAGGKKITKQEAEELRSF 58 Query: 510 ELAFPFFSGTIPRRERSKR-ILIKNSISKQHR---KLDSNVFESFLEKLWSSFSEEKKAS 343 +L + GT P RSKR I K +I K+ R KLDS F+ + E +W +FSE+K+ Sbjct: 59 KLTSQCYLGTFPCSARSKRRIKRKKAIVKEIREKIKLDSGAFDCYFEHMWRNFSEDKRTF 118 Query: 342 FTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQS 163 TY DCLWF LY K S K KVLTWIK K IFS+KYV VPI+ W HWSLLIFCH GESLQS Sbjct: 119 ITYFDCLWFNLYTKASFKGKVLTWIKKKQIFSKKYVLVPIVHWSHWSLLIFCHLGESLQS 178 Query: 162 KTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPK 1 K RTPCMLLLDSLE A P+ LEPDIRKFVLDIY+ EGR E KE I+KIP LVPK Sbjct: 179 KLRTPCMLLLDSLEKAGPRCLEPDIRKFVLDIYKSEGRAENKELISKIPLLVPK 232 >ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ricinus communis] gi|223550366|gb|EEF51853.1| sentrin/sumo-specific protease, putative [Ricinus communis] Length = 294 Score = 240 bits (613), Expect = 5e-61 Identities = 123/234 (52%), Positives = 154/234 (65%), Gaps = 7/234 (2%) Frame = -2 Query: 681 KKKLKEIASFDLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSFELA 502 +K E D+ S EV+ + H SC H+ ++ KK+A +++ F+L Sbjct: 4 RKPQDEFIVVDVDSPMSEVF--ARISKHRSCWKHMVTSLYTHGKKIKKKEAEKLRRFDLI 61 Query: 501 FPFFSGTIPRRERSKRILIKNSIS-------KQHRKLDSNVFESFLEKLWSSFSEEKKAS 343 F GT P R+RS+R IK+ + K+ ++LDS F+ + + LW SFS+EK+ S Sbjct: 62 SQCFLGTFPTRQRSRR-RIKHKFAITRVIKEKEKKRLDSGEFDCYFQNLWKSFSKEKRTS 120 Query: 342 FTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQS 163 F YLD LWF Y K S K KVLTWIK K IFS+KYV VPI+CW HWSLLIFCH GE +S Sbjct: 121 FVYLDSLWFYWYLKASWKGKVLTWIKRKQIFSKKYVLVPIVCWGHWSLLIFCHLGEVSES 180 Query: 162 KTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPK 1 RTPCMLLLDSLEMANP+RLEPDIRKFVLDIY EGRPE K+ I++IP LVPK Sbjct: 181 NDRTPCMLLLDSLEMANPRRLEPDIRKFVLDIYTSEGRPEDKKLISQIPLLVPK 234 >ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305332 [Fragaria vesca subsp. vesca] Length = 330 Score = 239 bits (610), Expect = 1e-60 Identities = 122/249 (48%), Positives = 157/249 (63%), Gaps = 32/249 (12%) Frame = -2 Query: 651 DLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 472 DL S +Y+ + H +C H+ A +A + ++ EIK P F + P Sbjct: 25 DLKCSVSGIYNIDEMSKHRTCWMHVLAFSKAQRQSLGLRETEEIKKIS---PCFLTSCPH 81 Query: 471 RERSKR------------------------------ILIKNS--ISKQHRKLDSNVFESF 388 R RS R +L+ +S++ ++LDS F+ + Sbjct: 82 RRRSVRSFKTKYVNLEVSRKTQNQESKACAVSRRKPVLVSRGCRVSRRKQELDSGTFQCY 141 Query: 387 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 208 E LW SFSE+KK SFTYLDC+WF+LY K +TK KVLTWIK KHIFS+KYVFVPI+CW H Sbjct: 142 FESLWKSFSEDKKTSFTYLDCIWFSLYIKPTTKDKVLTWIKKKHIFSKKYVFVPIVCWSH 201 Query: 207 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 28 W+LLI CHFGE+L+SKT+ PCMLLLDSLEMA+P+RLEPDIRKFV+DI+REEGRPE + + Sbjct: 202 WNLLILCHFGENLESKTQRPCMLLLDSLEMADPRRLEPDIRKFVVDIFREEGRPENMDLL 261 Query: 27 AKIPFLVPK 1 KIP LVPK Sbjct: 262 RKIPLLVPK 270 >ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303677 [Fragaria vesca subsp. vesca] Length = 360 Score = 237 bits (604), Expect = 5e-60 Identities = 125/247 (50%), Positives = 160/247 (64%), Gaps = 30/247 (12%) Frame = -2 Query: 651 DLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 472 DL S E+Y+DQ K H +C H+ A +A +++++ +EIK F F P Sbjct: 23 DLNCSVSEIYNDQMSK-HRTCWMHVLAASKAQRQSLSQRETQEIKKISPCFLTFH---PH 78 Query: 471 RERSKR----------ILIKNS--------------------ISKQHRKLDSNVFESFLE 382 R+RS R +L K +S++ ++LDS F+S E Sbjct: 79 RQRSVRSFKTKYVKRQVLRKKQNQESKACAVSRRKPVSRGCRVSRKKQELDSGSFQSCFE 138 Query: 381 KLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWS 202 LW SFSE+KK FTYLDCLWF+LY + +TK KVLTWIK KHIFS+KYVFVPI+CW HWS Sbjct: 139 SLWKSFSEDKKTYFTYLDCLWFSLYIEPTTKDKVLTWIKKKHIFSKKYVFVPIVCWCHWS 198 Query: 201 LLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAK 22 LLI CHFGE+L+SKT+ PCMLLLDSLEM +PKRLEP+IR+FV+DI+REEGR E + + K Sbjct: 199 LLILCHFGENLESKTQRPCMLLLDSLEMTDPKRLEPNIRRFVVDIFREEGRRENMDLLRK 258 Query: 21 IPFLVPK 1 IP LVPK Sbjct: 259 IPLLVPK 265 >ref|XP_007037883.1| Cysteine proteinases superfamily protein, putative isoform 2 [Theobroma cacao] gi|508775128|gb|EOY22384.1| Cysteine proteinases superfamily protein, putative isoform 2 [Theobroma cacao] Length = 277 Score = 236 bits (602), Expect = 9e-60 Identities = 122/218 (55%), Positives = 151/218 (69%), Gaps = 1/218 (0%) Frame = -2 Query: 651 DLVSSSLEVYDDQKPKSHSSCCHHIAAGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 472 DL SS E Y+ H SC H+ +A +++K++A++++ F L P F G IP Sbjct: 15 DLASSDPE-YNGYPISKHRSCWVHVIGSLKARKKKISKQEAQKLRDFRLTAPCFLGNIPC 73 Query: 471 RERSKR-ILIKNSISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWS 295 R+RSKR + KNSISKQ +LDS FE ++EKLWSSF EEK+ SF Y DC WFA YRK S Sbjct: 74 RQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKAS 133 Query: 294 TKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMA 115 + KVL+WIK + IFS+KYV VP++C LQS+T+TPCMLLLDSLE+A Sbjct: 134 FREKVLSWIKREQIFSKKYVLVPVVC--------------CLQSETKTPCMLLLDSLEIA 179 Query: 114 NPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPK 1 NP+RLEPDIRKFVLDIYR EGRPEKKE I +IP LVPK Sbjct: 180 NPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLLVPK 217 >gb|EXC04030.1| Ubiquitin-like-specific protease 1D [Morus notabilis] Length = 316 Score = 227 bits (578), Expect = 6e-57 Identities = 125/251 (49%), Positives = 158/251 (62%), Gaps = 22/251 (8%) Frame = -2 Query: 687 MGKKKL-KEIASFDLVS------------SSLEVYD------DQKPKSHSSCCHHIAAGC 565 MGK+KL KEI + DL S S L V+ D H SC H+ A Sbjct: 1 MGKRKLSKEIITIDLESPTSPVAGKSFLASLLGVFGVRNVALDYGFSQHRSCWKHVLATL 60 Query: 564 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKRILIKNS---ISKQHRKLDSNVFE 394 +A R+TKK+ I SF+L P ++ +N+ +SK +++L S+ FE Sbjct: 61 KARKKRLTKKETEAIDSFKLTAPCLLNHTCGERSKRKTTYENAGHGVSKLNKELLSSTFE 120 Query: 393 SFLEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICW 214 + E LW FSE+K AS YLDCLWF+LY+K K+KVL WIK K+IFS+KYV VPI+ W Sbjct: 121 MYFEFLWRGFSEDKGASCAYLDCLWFSLYKKRDYKSKVLKWIKDKNIFSKKYVLVPIVIW 180 Query: 213 HHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKE 34 HWS LIFC+F ESL+S TRTPCMLLLDSLE A+P+RLEPDIRKFV DIYR E RP+ ++ Sbjct: 181 SHWSFLIFCNFDESLESTTRTPCMLLLDSLESADPRRLEPDIRKFVYDIYRTEDRPQTQK 240 Query: 33 SIAKIPFLVPK 1 SI KIP L P+ Sbjct: 241 SILKIPLLTPQ 251 >ref|XP_007037886.1| Cysteine proteinases superfamily protein, putative isoform 5 [Theobroma cacao] gi|508775131|gb|EOY22387.1| Cysteine proteinases superfamily protein, putative isoform 5 [Theobroma cacao] Length = 259 Score = 223 bits (568), Expect = 8e-56 Identities = 112/189 (59%), Positives = 139/189 (73%), Gaps = 1/189 (0%) Frame = -2 Query: 564 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKR-ILIKNSISKQHRKLDSNVFESF 388 +A +++K++A++++ F L P F G IP R+RSKR + KNSISKQ +LDS FE + Sbjct: 25 KARKKKISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECY 84 Query: 387 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 208 +EKLWSSF EEK+ SF Y DC WFA YRK S + KVL+WIK + IFS+KYV VP++C Sbjct: 85 MEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVC--- 141 Query: 207 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 28 LQS+T+TPCMLLLDSLE+ANP+RLEPDIRKFVLDIYR EGRPEKKE I Sbjct: 142 -----------CLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMI 190 Query: 27 AKIPFLVPK 1 +IP LVPK Sbjct: 191 YRIPLLVPK 199 >gb|AFK37750.1| unknown [Lotus japonicus] Length = 284 Score = 221 bits (563), Expect = 3e-55 Identities = 108/184 (58%), Positives = 134/184 (72%), Gaps = 4/184 (2%) Frame = -2 Query: 540 KKKAREIK----SFELAFPFFSGTIPRRERSKRILIKNSISKQHRKLDSNVFESFLEKLW 373 +KK + ++ S + P + IPRR R+K+ K + KLDS VF++ L K+W Sbjct: 42 RKKGKPVRDVIGSVISSLPSYLSDIPRRPRTKKKKFKAEEALPRPKLDSGVFDNNLVKIW 101 Query: 372 SSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLI 193 +SFSE+K+ F Y D LWF+LYR S+K KVLTWIK +HIFS+ YVFVPI+CW HWSLLI Sbjct: 102 NSFSEDKRKPFAYFDSLWFSLYRAASSKDKVLTWIKKEHIFSKAYVFVPIVCWGHWSLLI 161 Query: 192 FCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPF 13 FCHFGESLQS TR+ CMLLLDSLEM NP+RLEPDIR+FV+DIY+ RPE K I +IP Sbjct: 162 FCHFGESLQSTTRSRCMLLLDSLEMVNPRRLEPDIRRFVVDIYKAWDRPETKNLIYQIPL 221 Query: 12 LVPK 1 LVPK Sbjct: 222 LVPK 225 >ref|XP_007137950.1| hypothetical protein PHAVU_009G168700g [Phaseolus vulgaris] gi|561011037|gb|ESW09944.1| hypothetical protein PHAVU_009G168700g [Phaseolus vulgaris] Length = 268 Score = 214 bits (546), Expect = 3e-53 Identities = 106/204 (51%), Positives = 135/204 (66%), Gaps = 24/204 (11%) Frame = -2 Query: 540 KKKAREIKSFELAFPFFSGTIPRRERSKR------------------------ILIKNSI 433 + K ++S FPF +P+R R+KR K ++ Sbjct: 6 RSKPYVMESSSSPFPFVWSNVPQRLRTKRKRKLNGKKALSRPNKEHSRPKEAPCRPKETL 65 Query: 432 SKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHI 253 S+ KLDS +F++FL+K+W F E++K FTY D LWF+LYR S+K KVL WIK + I Sbjct: 66 SRIKEKLDSGIFDTFLKKIWKIFPEDRKGQFTYFDSLWFSLYRSASSKDKVLAWIKREPI 125 Query: 252 FSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVL 73 FS+ YVFVPI+CW HWSLLI CHFGESLQS TR+ CMLLLDSLEMANP+RLEP+IR+FVL Sbjct: 126 FSKAYVFVPIVCWGHWSLLILCHFGESLQSSTRSRCMLLLDSLEMANPRRLEPEIRRFVL 185 Query: 72 DIYREEGRPEKKESIAKIPFLVPK 1 DIY+ RPE K +++IPFLVPK Sbjct: 186 DIYKSGDRPETKNILSQIPFLVPK 209 >gb|ADN33966.1| sentrin/sumo-specific protease [Cucumis melo subsp. melo] Length = 274 Score = 213 bits (541), Expect = 1e-52 Identities = 104/185 (56%), Positives = 128/185 (69%), Gaps = 3/185 (1%) Frame = -2 Query: 546 VTKKKAREIKSFELAFPFFSGTIP---RRERSKRILIKNSISKQHRKLDSNVFESFLEKL 376 V +++ +K F+ P SGT P RR+ K++ +I + RKLDS FE + L Sbjct: 26 VELEESENVKKFQPVSPSVSGTGPVRRRRQLKKKVGCNGAIPVRKRKLDSRAFEYCFQNL 85 Query: 375 WSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLL 196 W S EEKK FTYLDCLWF LY K S + KVL WIK K IFS+KYVFVPI+CW HWSLL Sbjct: 86 WRSSPEEKKIQFTYLDCLWFNLYLKASHRRKVLKWIKDKEIFSKKYVFVPIVCWSHWSLL 145 Query: 195 IFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIP 16 IFCHF S +SK R PCMLLLDSL+ ANP+RLEP+IRKFV DI++E+G+ + I KIP Sbjct: 146 IFCHFDASPESKRRKPCMLLLDSLQEANPRRLEPEIRKFVFDIFKEDGKCKNLNVICKIP 205 Query: 15 FLVPK 1 +VPK Sbjct: 206 LMVPK 210 >ref|XP_007210601.1| hypothetical protein PRUPE_ppa017098mg [Prunus persica] gi|462406336|gb|EMJ11800.1| hypothetical protein PRUPE_ppa017098mg [Prunus persica] Length = 303 Score = 212 bits (540), Expect = 1e-52 Identities = 113/230 (49%), Positives = 143/230 (62%), Gaps = 30/230 (13%) Frame = -2 Query: 600 HSSCCHHIAAGCQALPDRVTKKKAREIKSFELA---------FP--FFSGTIPRRERSKR 454 H SC H+ A + +KK +K EL FP F G +R+ + Sbjct: 19 HRSCWRHVFAYL------IVQKKKLALKDIELIKKRYPCLLEFPCRFHRGERLKRKGKRE 72 Query: 453 IL--------IKNSISKQHRKLDSNVFES-----------FLEKLWSSFSEEKKASFTYL 331 + KN++S++ KLDS FE + + LW + SE+K+ SF YL Sbjct: 73 EMKELRPPKDAKNAVSRKKEKLDSQAFECKEKLGSEAFDRYFQNLWKNLSEDKRTSFAYL 132 Query: 330 DCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRT 151 DC+WF+LY + S++ KVLTWIK KHIFS+KYV VPI+CW HW+LLIFCHFGES QS+T Sbjct: 133 DCMWFSLYLQPSSRDKVLTWIKKKHIFSKKYVIVPIVCWGHWNLLIFCHFGESEQSETHK 192 Query: 150 PCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPK 1 PCMLLLDSLE A+P+R EPDIRKFVLDIY EGR E K+ I +IPFLVPK Sbjct: 193 PCMLLLDSLENADPRRYEPDIRKFVLDIYEAEGRSETKDFIYRIPFLVPK 242