BLASTX nr result
ID: Akebia24_contig00017627
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00017627 (1230 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score  E
Sequences producing significant alignments:                         (bits) Value

ref|XP_007037882.1| Cysteine proteinases superfamily protein, pu...  356  1e-95
ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251...  347  5e-93
ref|XP_007037884.1| Cysteine proteinases superfamily protein, pu...  344  5e-92
ref|XP_007037885.1| Cysteine proteinases superfamily protein, pu...  332  2e-88
ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Popu...  327  8e-87
ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like ...  321  4e-85
ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citr...  321  4e-85
ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ri...  314  4e-83
ref|XP_007037883.1| Cysteine proteinases superfamily protein, pu...  311  3e-82
ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305...  306  1e-80
ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Popu...  306  2e-80
ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303...  306  2e-80
ref|XP_007037886.1| Cysteine proteinases superfamily protein, pu...  300  1e-78
gb|AFK37750.1| unknown [Lotus japonicus]                             292  2e-76
ref|XP_007210601.1| hypothetical protein PRUPE_ppa017098mg [Prun...  291  4e-76
gb|EXC04030.1| Ubiquitin-like-specific protease 1D [Morus notabi...  285  2e-74
gb|ADN33966.1| sentrin/sumo-specific protease [Cucumis melo subs...  285  2e-74
ref|XP_007137950.1| hypothetical protein PHAVU_009G168700g [Phas...  284  5e-74
ref|XP_004501822.1| PREDICTED: probable ubiquitin-like-specific ... 
281 4e-73 ref|XP_007037887.1| Cysteine proteinases superfamily protein, pu... 281 5e-73 >ref|XP_007037882.1| Cysteine proteinases superfamily protein, putative isoform 1 [Theobroma cacao] gi|508775127|gb|EOY22383.1| Cysteine proteinases superfamily protein, putative isoform 1 [Theobroma cacao] Length = 291 Score = 356 bits (913), Expect = 1e-95 Identities = 171/277 (61%), Positives = 214/277 (77%), Gaps = 1/277 (0%) Frame = -1 Query: 1038 DLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 859 DL SS E Y+ H SC H+ +A +++K++A++++ F L P F G IP Sbjct: 15 DLASSDPE-YNGYPISKHRSCWVHVIGSLKARKKKISKQEAQKLRDFRLTAPCFLGNIPC 73 Query: 858 RERSKR-ILIKNSISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWS 682 R+RSKR + KNSISKQ +LDS FE ++EKLWSSF EEK+ SF Y DC WFA YRK S Sbjct: 74 RQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKAS 133 Query: 681 TKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMA 502 + KVL+WIK + IFS+KYV VP++CW HWSLLIFCHFGESLQS+T+TPCMLLLDSLE+A Sbjct: 134 FREKVLSWIKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIA 193 Query: 501 NPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKVPQQKNSEDCGIFVLYFMNLFL 322 NP+RLEPDIRKFVLDIYR EGRPEKKE I +IP LVPKVPQQ++ E+CG FVLYF+NLF+ Sbjct: 194 NPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFV 253 Query: 321 ESAPEDFNIFGGYPYFMNENWFSSEGLARFCKKFCTF 211 E APE+F+I GYPYFM ++WF++EG+ FC+K +F Sbjct: 254 EGAPENFSI-EGYPYFMRKDWFNAEGVECFCEKLDSF 289 >ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251251 [Vitis vinifera] gi|297733618|emb|CBI14865.3| unnamed protein product [Vitis vinifera] Length = 295 Score = 347 bits (891), Expect = 5e-93 Identities = 173/285 (60%), Positives = 205/285 (71%), Gaps = 3/285 (1%) Frame = -1 Query: 1068 KKKLKEIASFDLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKS-FEL 892 KK A DL S+ E Y D H SC H+ QA R+TK + EIK FE Sbjct: 4 KKPRNSNAPIDLASADSESYLDYS--KHRSCWRHMVAHLQAQNKRMTKHEIEEIKEIFEF 61 Query: 891 AFPFFSGTIPRRERSKR-ILIKNSI-SKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYL 718 P FS T PR ERSKR I KN I K+ +KLD+ FE 
+ LW SFS++KK+SF YL Sbjct: 62 TTPCFSNTFPRHERSKRRINCKNIIIRKEKKKLDTAAFEWYFRNLWKSFSDDKKSSFGYL 121 Query: 717 DCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRT 538 DCLWF+ Y K S++ KVL WIK K IFSRKYVFVPI+CW+HWSLLI CHFGESL+SK R Sbjct: 122 DCLWFSFYLKTSSREKVLNWIKKKRIFSRKYVFVPIVCWNHWSLLILCHFGESLESKIRA 181 Query: 537 PCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKVPQQKNSEDC 358 PCMLLLDSL+MANPKRLEP+IRKFV DIY+EEGRPE K+ I+KIP LVPKVPQQ+N E+C Sbjct: 182 PCMLLLDSLQMANPKRLEPNIRKFVFDIYKEEGRPESKQLISKIPLLVPKVPQQRNGEEC 241 Query: 357 GIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARFCKK 223 G FVLYF+NLF++ APE+F++ GYPYFM +NWF E L F +K Sbjct: 242 GNFVLYFINLFMDGAPENFSVSEGYPYFMKKNWFGPEALEHFFRK 286 >ref|XP_007037884.1| Cysteine proteinases superfamily protein, putative isoform 3 [Theobroma cacao] gi|508775129|gb|EOY22385.1| Cysteine proteinases superfamily protein, putative isoform 3 [Theobroma cacao] Length = 273 Score = 344 bits (882), Expect = 5e-92 Identities = 161/248 (64%), Positives = 202/248 (81%), Gaps = 1/248 (0%) Frame = -1 Query: 951 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKR-ILIKNSISKQHRKLDSNVFESF 775 +A +++K++A++++ F L P F G IP R+RSKR + KNSISKQ +LDS FE + Sbjct: 25 KARKKKISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECY 84 Query: 774 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 595 +EKLWSSF EEK+ SF Y DC WFA YRK S + KVL+WIK + IFS+KYV VP++CW H Sbjct: 85 MEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSH 144 Query: 594 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 415 WSLLIFCHFGESLQS+T+TPCMLLLDSLE+ANP+RLEPDIRKFVLDIYR EGRPEKKE I Sbjct: 145 WSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMI 204 Query: 414 AKIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLAR 235 +IP LVPKVPQQ++ E+CG FVLYF+NLF+E APE+F+I GYPYFM ++WF++EG+ Sbjct: 205 YRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVEC 263 Query: 234 FCKKFCTF 211 FC+K +F Sbjct: 264 FCEKLDSF 271 >ref|XP_007037885.1| Cysteine proteinases superfamily protein, putative 
isoform 4 [Theobroma cacao] gi|508775130|gb|EOY22386.1| Cysteine proteinases superfamily protein, putative isoform 4 [Theobroma cacao] Length = 270 Score = 332 bits (852), Expect = 2e-88 Identities = 158/248 (63%), Positives = 199/248 (80%), Gaps = 1/248 (0%) Frame = -1 Query: 951 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKR-ILIKNSISKQHRKLDSNVFESF 775 +A +++K++A++++ F L P F G IP R+RSKR + KNSISKQ +LDS FE + Sbjct: 25 KARKKKISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECY 84 Query: 774 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 595 +EKLWSSF EEK+ SF Y DC WFA YRK S + KVL+WIK + IFS+KYV VP++CW H Sbjct: 85 MEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSH 144 Query: 594 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 415 WSLLIFCHFGESLQS+T+TPCMLLLDSLE+ANP+RLEPDIRKFVLDIYR EGRPEKKE I Sbjct: 145 WSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMI 204 Query: 414 AKIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLAR 235 +IP LVPK Q++ E+CG FVLYF+NLF+E APE+F+I GYPYFM ++WF++EG+ Sbjct: 205 YRIPLLVPK---QRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVEC 260 Query: 234 FCKKFCTF 211 FC+K +F Sbjct: 261 FCEKLDSF 268 >ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Populus trichocarpa] gi|222864154|gb|EEF01285.1| hypothetical protein POPTR_0010s18760g [Populus trichocarpa] Length = 298 Score = 327 bits (837), Expect = 8e-87 Identities = 160/260 (61%), Positives = 193/260 (74%), Gaps = 1/260 (0%) Frame = -1 Query: 999 KPKSHGSCCHHIATGCQALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSK-RILIKNS 823 +P H +C HI A R+TKK+A EI+SF+L P F TIP RERSK R N+ Sbjct: 31 QPSKHRTCWKHIQARMHARRTRMTKKQAEEIESFKLTSPCFLQTIPCRERSKKRFKRNNA 90 Query: 822 ISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKH 643 +SK ++LDS F ++E LW SFSE+KK SF YLD LWF +Y + S+ KVL WIK KH Sbjct: 91 VSKLKKELDSVSFNCYMENLWKSFSEDKKMSFAYLDSLWFTMYTEASSGVKVLEWIKRKH 150 Query: 642 IFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFV 463 IFS+KYV VPI+ W HWSLLIFCHFGESL S+ 
TPCMLLLDSLEMA+PKRLEPDIRKFV Sbjct: 151 IFSKKYVLVPIVRWCHWSLLIFCHFGESLLSENITPCMLLLDSLEMASPKRLEPDIRKFV 210 Query: 462 LDIYREEGRPEKKESIAKIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGY 283 DIY EGRPE K I++IP LVPKVPQQ+N +CG +VL F+NLF++ APE+F++ GY Sbjct: 211 WDIYESEGRPENKHMISQIPLLVPKVPQQRNGVECGNYVLNFINLFVQDAPENFHM-EGY 269 Query: 282 PYFMNENWFSSEGLARFCKK 223 PYFM +NWFS EGL FC+K Sbjct: 270 PYFMKDNWFSPEGLEHFCEK 289 >ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like isoform X1 [Citrus sinensis] gi|568883543|ref|XP_006494525.1| PREDICTED: sentrin-specific protease 1-like isoform X1 [Citrus sinensis] Length = 303 Score = 321 bits (823), Expect = 4e-85 Identities = 169/303 (55%), Positives = 210/303 (69%), Gaps = 19/303 (6%) Frame = -1 Query: 1074 MGKKKLKEIAS-FDLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSF 898 MGK+K + + D+VSS+ E D H +C H A +++K+K I++F Sbjct: 1 MGKRKRGDANNPIDIVSSTPE--DPGHLSKHRTCWLHTVAFLHARKMKISKQK---IRNF 55 Query: 897 ELAFPFFSGTIPRRERSKR-----------------ILIKNSISKQHR-KLDSNVFESFL 772 EL P F GT R RSKR + K+ I+K+ + KLDS FE L Sbjct: 56 ELTAPCFLGTFSCRRRSKRRVKCKNTSLIKGKNSSSVKCKDMITKRKKNKLDSGKFEHLL 115 Query: 771 EKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHW 592 + LW SFSE+KKA FTYLD LWF LYRK S+KAKVLTWIK KHIFS+KYV VPI+CW HW Sbjct: 116 DNLWRSFSEDKKAGFTYLDSLWFDLYRKPSSKAKVLTWIKRKHIFSKKYVLVPIVCWRHW 175 Query: 591 SLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIA 412 +LLI C+FG S +SKTRTPCMLLLDSLEM+NP R EPDIRKFV+DIY+ E RPE KE I+ Sbjct: 176 NLLILCNFGGSFESKTRTPCMLLLDSLEMSNPWRFEPDIRKFVMDIYKAEDRPETKELIS 235 Query: 411 KIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARF 232 +IP LVPKVPQQ+N E+CG FVLYF+NLF+E APE+FN+ YPYFM +NWF++E L F Sbjct: 236 RIPLLVPKVPQQRNGEECGNFVLYFINLFVEGAPENFNL-EDYPYFMEKNWFTAEDLDCF 294 Query: 231 CKK 223 C++ Sbjct: 295 CER 297 >ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citrus clementina] gi|557542301|gb|ESR53279.1| hypothetical protein CICLE_v10021330mg [Citrus clementina] Length = 303 Score = 321 bits 
(823), Expect = 4e-85 Identities = 169/303 (55%), Positives = 210/303 (69%), Gaps = 19/303 (6%) Frame = -1 Query: 1074 MGKKKLKEIAS-FDLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSF 898 MGK+K + + D+VSS+ E D H +C H A +++K+K I++F Sbjct: 1 MGKRKRGDANNPIDIVSSTPE--DPGHLSKHRTCWLHTVAFLHARKMKISKQK---IRNF 55 Query: 897 ELAFPFFSGTIPRRERSKR-----------------ILIKNSISKQHR-KLDSNVFESFL 772 EL P F GT R RSKR + K+ I+K+ + KLDS FE L Sbjct: 56 ELTAPCFLGTFSCRRRSKRRVKCKNTSLIKGKNSSSVKCKDMITKRRKNKLDSGKFEHLL 115 Query: 771 EKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHW 592 + LW SFSE+KKA FTYLD LWF LYRK S+KAKVLTWIK KHIFS+KYV VPI+CW HW Sbjct: 116 DNLWRSFSEDKKAGFTYLDSLWFDLYRKPSSKAKVLTWIKRKHIFSKKYVLVPIVCWRHW 175 Query: 591 SLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIA 412 +LLI C+FG S +SKTRTPCMLLLDSLEM+NP R EPDIRKFV+DIY+ E RPE KE I+ Sbjct: 176 NLLILCNFGGSFESKTRTPCMLLLDSLEMSNPWRFEPDIRKFVMDIYKAEERPETKELIS 235 Query: 411 KIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARF 232 +IP LVPKVPQQ+N E+CG FVLYF+NLF+E APE+FN+ YPYFM +NWF++E L F Sbjct: 236 RIPLLVPKVPQQRNGEECGNFVLYFINLFVEGAPENFNL-EDYPYFMEKNWFTAEDLDCF 294 Query: 231 CKK 223 C++ Sbjct: 295 CER 297 >ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ricinus communis] gi|223550366|gb|EEF51853.1| sentrin/sumo-specific protease, putative [Ricinus communis] Length = 294 Score = 314 bits (805), Expect = 4e-83 Identities = 159/293 (54%), Positives = 201/293 (68%), Gaps = 7/293 (2%) Frame = -1 Query: 1068 KKKLKEIASFDLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSFELA 889 +K E D+ S EV+ + H SC H+ T ++ KK+A +++ F+L Sbjct: 4 RKPQDEFIVVDVDSPMSEVF--ARISKHRSCWKHMVTSLYTHGKKIKKKEAEKLRRFDLI 61 Query: 888 FPFFSGTIPRRERSKRILIKNSIS-------KQHRKLDSNVFESFLEKLWSSFSEEKKAS 730 F GT P R+RS+R IK+ + K+ ++LDS F+ + + LW SFS+EK+ S Sbjct: 62 SQCFLGTFPTRQRSRR-RIKHKFAITRVIKEKEKKRLDSGEFDCYFQNLWKSFSKEKRTS 120 Query: 729 FTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQS 550 F YLD LWF Y K S K KVLTWIK K IFS+KYV VPI+CW HWSLLIFCH GE +S 
Sbjct: 121 FVYLDSLWFYWYLKASWKGKVLTWIKRKQIFSKKYVLVPIVCWGHWSLLIFCHLGEVSES 180 Query: 549 KTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKVPQQKN 370 RTPCMLLLDSLEMANP+RLEPDIRKFVLDIY EGRPE K+ I++IP LVPKVPQQ+N Sbjct: 181 NDRTPCMLLLDSLEMANPRRLEPDIRKFVLDIYTSEGRPEDKKLISQIPLLVPKVPQQRN 240 Query: 369 SEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARFCKKFCTF 211 E+CG +VLYF+NLF+ AP+DF+I YPYFMN+NWFS E L RF ++ +F Sbjct: 241 GEECGNYVLYFINLFMLGAPDDFSI-KDYPYFMNKNWFSPECLERFSEELESF 292 >ref|XP_007037883.1| Cysteine proteinases superfamily protein, putative isoform 2 [Theobroma cacao] gi|508775128|gb|EOY22384.1| Cysteine proteinases superfamily protein, putative isoform 2 [Theobroma cacao] Length = 277 Score = 311 bits (798), Expect = 3e-82 Identities = 157/277 (56%), Positives = 200/277 (72%), Gaps = 1/277 (0%) Frame = -1 Query: 1038 DLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 859 DL SS E Y+ H SC H+ +A +++K++A++++ F L P F G IP Sbjct: 15 DLASSDPE-YNGYPISKHRSCWVHVIGSLKARKKKISKQEAQKLRDFRLTAPCFLGNIPC 73 Query: 858 RERSKR-ILIKNSISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWS 682 R+RSKR + KNSISKQ +LDS FE ++EKLWSSF EEK+ SF Y DC WFA YRK S Sbjct: 74 RQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKAS 133 Query: 681 TKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMA 502 + KVL+WIK + IFS+KYV VP++C LQS+T+TPCMLLLDSLE+A Sbjct: 134 FREKVLSWIKREQIFSKKYVLVPVVC--------------CLQSETKTPCMLLLDSLEIA 179 Query: 501 NPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKVPQQKNSEDCGIFVLYFMNLFL 322 NP+RLEPDIRKFVLDIYR EGRPEKKE I +IP LVPKVPQQ++ E+CG FVLYF+NLF+ Sbjct: 180 NPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFV 239 Query: 321 ESAPEDFNIFGGYPYFMNENWFSSEGLARFCKKFCTF 211 E APE+F+I GYPYFM ++WF++EG+ FC+K +F Sbjct: 240 EGAPENFSI-EGYPYFMRKDWFNAEGVECFCEKLDSF 275 >ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305332 [Fragaria vesca subsp. 
vesca] Length = 330 Score = 306 bits (784), Expect = 1e-80 Identities = 153/303 (50%), Positives = 199/303 (65%), Gaps = 32/303 (10%) Frame = -1 Query: 1038 DLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 859 DL S +Y+ + H +C H+ +A + ++ EIK P F + P Sbjct: 25 DLKCSVSGIYNIDEMSKHRTCWMHVLAFSKAQRQSLGLRETEEIKKIS---PCFLTSCPH 81 Query: 858 RERSKR------------------------------ILIKNS--ISKQHRKLDSNVFESF 775 R RS R +L+ +S++ ++LDS F+ + Sbjct: 82 RRRSVRSFKTKYVNLEVSRKTQNQESKACAVSRRKPVLVSRGCRVSRRKQELDSGTFQCY 141 Query: 774 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 595 E LW SFSE+KK SFTYLDC+WF+LY K +TK KVLTWIK KHIFS+KYVFVPI+CW H Sbjct: 142 FESLWKSFSEDKKTSFTYLDCIWFSLYIKPTTKDKVLTWIKKKHIFSKKYVFVPIVCWSH 201 Query: 594 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 415 W+LLI CHFGE+L+SKT+ PCMLLLDSLEMA+P+RLEPDIRKFV+DI+REEGRPE + + Sbjct: 202 WNLLILCHFGENLESKTQRPCMLLLDSLEMADPRRLEPDIRKFVVDIFREEGRPENMDLL 261 Query: 414 AKIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLAR 235 KIP LVPKVPQQ+N ++CG FVLYF+NLF+ESAP+ F++ YPYFM +NWF+ E L Sbjct: 262 RKIPLLVPKVPQQRNDQECGNFVLYFINLFMESAPQTFSM-EEYPYFMKKNWFAYESLDC 320 Query: 234 FCK 226 FC+ Sbjct: 321 FCQ 323 >ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Populus trichocarpa] gi|550322421|gb|EEF06353.2| hypothetical protein POPTR_0015s10250g [Populus trichocarpa] Length = 292 Score = 306 bits (783), Expect = 2e-80 Identities = 160/289 (55%), Positives = 198/289 (68%), Gaps = 5/289 (1%) Frame = -1 Query: 1074 MGKKKLKE-IASFDLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSF 898 M K+K ++ I+S D S E Y+ + H SC H+ A ++TK++A E++SF Sbjct: 1 MAKRKREDGISSADTKSPISETYE--RMAKHRSCWIHMLAHMYAGGKKITKQEAEELRSF 58 Query: 897 ELAFPFFSGTIPRRERSKR-ILIKNSISKQHR---KLDSNVFESFLEKLWSSFSEEKKAS 730 +L + GT P RSKR I K +I K+ R KLDS F+ + E +W +FSE+K+ Sbjct: 59 KLTSQCYLGTFPCSARSKRRIKRKKAIVKEIREKIKLDSGAFDCYFEHMWRNFSEDKRTF 118 Query: 729 FTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQS 550 TY DCLWF LY K S K KVLTWIK K 
IFS+KYV VPI+ W HWSLLIFCH GESLQS Sbjct: 119 ITYFDCLWFNLYTKASFKGKVLTWIKKKQIFSKKYVLVPIVHWSHWSLLIFCHLGESLQS 178 Query: 549 KTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKVPQQKN 370 K RTPCMLLLDSLE A P+ LEPDIRKFVLDIY+ EGR E KE I+KIP LVPKVPQQ+ Sbjct: 179 KLRTPCMLLLDSLEKAGPRCLEPDIRKFVLDIYKSEGRAENKELISKIPLLVPKVPQQRG 238 Query: 369 SEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARFCKK 223 E+CG +VLY++NLF++ APE+F YPYFM +NWFS L F +K Sbjct: 239 GEECGNYVLYYINLFVQGAPENF-CMDDYPYFMKQNWFSPGCLEAFFEK 286 >ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303677 [Fragaria vesca subsp. vesca] Length = 360 Score = 306 bits (783), Expect = 2e-80 Identities = 156/302 (51%), Positives = 204/302 (67%), Gaps = 30/302 (9%) Frame = -1 Query: 1038 DLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 859 DL S E+Y+DQ K H +C H+ +A +++++ +EIK F F P Sbjct: 23 DLNCSVSEIYNDQMSK-HRTCWMHVLAASKAQRQSLSQRETQEIKKISPCFLTFH---PH 78 Query: 858 RERSKR----------ILIKNS--------------------ISKQHRKLDSNVFESFLE 769 R+RS R +L K +S++ ++LDS F+S E Sbjct: 79 RQRSVRSFKTKYVKRQVLRKKQNQESKACAVSRRKPVSRGCRVSRKKQELDSGSFQSCFE 138 Query: 768 KLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWS 589 LW SFSE+KK FTYLDCLWF+LY + +TK KVLTWIK KHIFS+KYVFVPI+CW HWS Sbjct: 139 SLWKSFSEDKKTYFTYLDCLWFSLYIEPTTKDKVLTWIKKKHIFSKKYVFVPIVCWCHWS 198 Query: 588 LLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAK 409 LLI CHFGE+L+SKT+ PCMLLLDSLEM +PKRLEP+IR+FV+DI+REEGR E + + K Sbjct: 199 LLILCHFGENLESKTQRPCMLLLDSLEMTDPKRLEPNIRRFVVDIFREEGRRENMDLLRK 258 Query: 408 IPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARFC 229 IP LVPKVP+Q+N ++CG FVLYF+NLF+ESAP+ F++ GYPYFM +NWF+ E L FC Sbjct: 259 IPLLVPKVPKQRNDQECGNFVLYFINLFMESAPQTFSM-EGYPYFMKKNWFAYESLDCFC 317 Query: 228 KK 223 ++ Sbjct: 318 QE 319 >ref|XP_007037886.1| Cysteine proteinases superfamily protein, putative isoform 5 [Theobroma cacao] gi|508775131|gb|EOY22387.1| Cysteine proteinases superfamily protein, putative isoform 5 [Theobroma cacao] Length = 259 Score 
= 300 bits (767), Expect = 1e-78 Identities = 147/248 (59%), Positives = 188/248 (75%), Gaps = 1/248 (0%) Frame = -1 Query: 951 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKR-ILIKNSISKQHRKLDSNVFESF 775 +A +++K++A++++ F L P F G IP R+RSKR + KNSISKQ +LDS FE + Sbjct: 25 KARKKKISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECY 84 Query: 774 LEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHH 595 +EKLWSSF EEK+ SF Y DC WFA YRK S + KVL+WIK + IFS+KYV VP++C Sbjct: 85 MEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVC--- 141 Query: 594 WSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESI 415 LQS+T+TPCMLLLDSLE+ANP+RLEPDIRKFVLDIYR EGRPEKKE I Sbjct: 142 -----------CLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMI 190 Query: 414 AKIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLAR 235 +IP LVPKVPQQ++ E+CG FVLYF+NLF+E APE+F+I GYPYFM ++WF++EG+ Sbjct: 191 YRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVEC 249 Query: 234 FCKKFCTF 211 FC+K +F Sbjct: 250 FCEKLDSF 257 >gb|AFK37750.1| unknown [Lotus japonicus] Length = 284 Score = 292 bits (748), Expect = 2e-76 Identities = 140/239 (58%), Positives = 178/239 (74%), Gaps = 4/239 (1%) Frame = -1 Query: 927 KKKAREIK----SFELAFPFFSGTIPRRERSKRILIKNSISKQHRKLDSNVFESFLEKLW 760 +KK + ++ S + P + IPRR R+K+ K + KLDS VF++ L K+W Sbjct: 42 RKKGKPVRDVIGSVISSLPSYLSDIPRRPRTKKKKFKAEEALPRPKLDSGVFDNNLVKIW 101 Query: 759 SSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLI 580 +SFSE+K+ F Y D LWF+LYR S+K KVLTWIK +HIFS+ YVFVPI+CW HWSLLI Sbjct: 102 NSFSEDKRKPFAYFDSLWFSLYRAASSKDKVLTWIKKEHIFSKAYVFVPIVCWGHWSLLI 161 Query: 579 FCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPF 400 FCHFGESLQS TR+ CMLLLDSLEM NP+RLEPDIR+FV+DIY+ RPE K I +IP Sbjct: 162 FCHFGESLQSTTRSRCMLLLDSLEMVNPRRLEPDIRRFVVDIYKAWDRPETKNLIYQIPL 221 Query: 399 LVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARFCKK 223 LVPKVPQQ++ +CG FVLYF+NLFL APE+F++ GGYPYFM ++WF+ E RFC++ Sbjct: 222 LVPKVPQQRDGNECGNFVLYFINLFLRCAPENFSM-GGYPYFMKKDWFTFEDFDRFCER 279 
>ref|XP_007210601.1| hypothetical protein PRUPE_ppa017098mg [Prunus persica] gi|462406336|gb|EMJ11800.1| hypothetical protein PRUPE_ppa017098mg [Prunus persica] Length = 303 Score = 291 bits (745), Expect = 4e-76 Identities = 133/214 (62%), Positives = 166/214 (77%), Gaps = 11/214 (5%) Frame = -1 Query: 831 KNSISKQHRKLDSNVFES-----------FLEKLWSSFSEEKKASFTYLDCLWFALYRKW 685 KN++S++ KLDS FE + + LW + SE+K+ SF YLDC+WF+LY + Sbjct: 84 KNAVSRKKEKLDSQAFECKEKLGSEAFDRYFQNLWKNLSEDKRTSFAYLDCMWFSLYLQP 143 Query: 684 STKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEM 505 S++ KVLTWIK KHIFS+KYV VPI+CW HW+LLIFCHFGES QS+T PCMLLLDSLE Sbjct: 144 SSRDKVLTWIKKKHIFSKKYVIVPIVCWGHWNLLIFCHFGESEQSETHKPCMLLLDSLEN 203 Query: 504 ANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKVPQQKNSEDCGIFVLYFMNLF 325 A+P+R EPDIRKFVLDIY EGR E K+ I +IPFLVPKVPQQ+N +CG FVLY++NLF Sbjct: 204 ADPRRYEPDIRKFVLDIYEAEGRSETKDFIYRIPFLVPKVPQQRNDVECGNFVLYYINLF 263 Query: 324 LESAPEDFNIFGGYPYFMNENWFSSEGLARFCKK 223 +E APE+F+I GGYPYFM +NWF+ EGL FC++ Sbjct: 264 IEGAPENFSIEGGYPYFMKKNWFTPEGLECFCQQ 297 >gb|EXC04030.1| Ubiquitin-like-specific protease 1D [Morus notabilis] Length = 316 Score = 285 bits (730), Expect = 2e-74 Identities = 152/305 (49%), Positives = 197/305 (64%), Gaps = 22/305 (7%) Frame = -1 Query: 1074 MGKKKL-KEIASFDLVS------------SSLEVYD------DQKPKSHGSCCHHIATGC 952 MGK+KL KEI + DL S S L V+ D H SC H+ Sbjct: 1 MGKRKLSKEIITIDLESPTSPVAGKSFLASLLGVFGVRNVALDYGFSQHRSCWKHVLATL 60 Query: 951 QALPDRVTKKKAREIKSFELAFPFFSGTIPRRERSKRILIKNS---ISKQHRKLDSNVFE 781 +A R+TKK+ I SF+L P ++ +N+ +SK +++L S+ FE Sbjct: 61 KARKKRLTKKETEAIDSFKLTAPCLLNHTCGERSKRKTTYENAGHGVSKLNKELLSSTFE 120 Query: 780 SFLEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICW 601 + E LW FSE+K AS YLDCLWF+LY+K K+KVL WIK K+IFS+KYV VPI+ W Sbjct: 121 MYFEFLWRGFSEDKGASCAYLDCLWFSLYKKRDYKSKVLKWIKDKNIFSKKYVLVPIVIW 180 Query: 600 HHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKE 421 HWS LIFC+F ESL+S TRTPCMLLLDSLE A+P+RLEPDIRKFV DIYR E RP+ ++ Sbjct: 181 
SHWSFLIFCNFDESLESTTRTPCMLLLDSLESADPRRLEPDIRKFVYDIYRTEDRPQTQK 240 Query: 420 SIAKIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGL 241 SI KIP L P+VPQQ++ +CG FVLYF+ LF++ APE+F+I +PYFM NWF+ E + Sbjct: 241 SILKIPLLTPQVPQQRSDWECGNFVLYFIKLFMDGAPENFSI-KDFPYFMKRNWFTPEDV 299 Query: 240 ARFCK 226 FC+ Sbjct: 300 DCFCE 304 >gb|ADN33966.1| sentrin/sumo-specific protease [Cucumis melo subsp. melo] Length = 274 Score = 285 bits (730), Expect = 2e-74 Identities = 138/241 (57%), Positives = 172/241 (71%), Gaps = 3/241 (1%) Frame = -1 Query: 933 VTKKKAREIKSFELAFPFFSGTIP---RRERSKRILIKNSISKQHRKLDSNVFESFLEKL 763 V +++ +K F+ P SGT P RR+ K++ +I + RKLDS FE + L Sbjct: 26 VELEESENVKKFQPVSPSVSGTGPVRRRRQLKKKVGCNGAIPVRKRKLDSRAFEYCFQNL 85 Query: 762 WSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLL 583 W S EEKK FTYLDCLWF LY K S + KVL WIK K IFS+KYVFVPI+CW HWSLL Sbjct: 86 WRSSPEEKKIQFTYLDCLWFNLYLKASHRRKVLKWIKDKEIFSKKYVFVPIVCWSHWSLL 145 Query: 582 IFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIP 403 IFCHF S +SK R PCMLLLDSL+ ANP+RLEP+IRKFV DI++E+G+ + I KIP Sbjct: 146 IFCHFDASPESKRRKPCMLLLDSLQEANPRRLEPEIRKFVFDIFKEDGKCKNLNVICKIP 205 Query: 402 FLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARFCKK 223 +VPKVPQQKN ++CG FVLYF++LF+E+AP +F I YPYFM ENWF+ EG+ +F K Sbjct: 206 LMVPKVPQQKNGDECGKFVLYFIHLFMEAAPANFRI-KDYPYFMKENWFTEEGVCQFYKT 264 Query: 222 F 220 F Sbjct: 265 F 265 >ref|XP_007137950.1| hypothetical protein PHAVU_009G168700g [Phaseolus vulgaris] gi|561011037|gb|ESW09944.1| hypothetical protein PHAVU_009G168700g [Phaseolus vulgaris] Length = 268 Score = 284 bits (727), Expect = 5e-74 Identities = 138/256 (53%), Positives = 178/256 (69%), Gaps = 24/256 (9%) Frame = -1 Query: 927 KKKAREIKSFELAFPFFSGTIPRRERSKR------------------------ILIKNSI 820 + K ++S FPF +P+R R+KR K ++ Sbjct: 6 RSKPYVMESSSSPFPFVWSNVPQRLRTKRKRKLNGKKALSRPNKEHSRPKEAPCRPKETL 65 Query: 819 SKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWSTKAKVLTWIKSKHI 640 S+ KLDS +F++FL+K+W F E++K FTY D LWF+LYR S+K KVL WIK 
+ I Sbjct: 66 SRIKEKLDSGIFDTFLKKIWKIFPEDRKGQFTYFDSLWFSLYRSASSKDKVLAWIKREPI 125 Query: 639 FSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMANPKRLEPDIRKFVL 460 FS+ YVFVPI+CW HWSLLI CHFGESLQS TR+ CMLLLDSLEMANP+RLEP+IR+FVL Sbjct: 126 FSKAYVFVPIVCWGHWSLLILCHFGESLQSSTRSRCMLLLDSLEMANPRRLEPEIRRFVL 185 Query: 459 DIYREEGRPEKKESIAKIPFLVPKVPQQKNSEDCGIFVLYFMNLFLESAPEDFNIFGGYP 280 DIY+ RPE K +++IPFLVPKVPQQ++ +CG FVLYF+NLFLE AP++F++ GYP Sbjct: 186 DIYKSGDRPETKNILSQIPFLVPKVPQQRDGNECGFFVLYFINLFLEHAPDNFSM-EGYP 244 Query: 279 YFMNENWFSSEGLARF 232 YFM ++WFS +GL RF Sbjct: 245 YFMTKDWFSFDGLDRF 260 >ref|XP_004501822.1| PREDICTED: probable ubiquitin-like-specific protease 2A-like [Cicer arietinum] Length = 385 Score = 281 bits (719), Expect = 4e-73 Identities = 141/225 (62%), Positives = 170/225 (75%), Gaps = 4/225 (1%) Frame = -1 Query: 888 FPFFSGTIPRRER--SKRILIKNSI-SKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYL 718 FPF S IPRR R SKR N S+ KL+S VF+++L K+W SFSE++K SF YL Sbjct: 158 FPFDSNIIPRRPRTKSKRKFNGNEAPSRPKEKLNSEVFDNYLAKIWKSFSEDRKRSFAYL 217 Query: 717 DCLWFALYRKWSTKAKVLTWIKSK-HIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTR 541 D LWF+LYR S+K KVL WIK K HIF++ YVFVPI+CW HWSLLI CHFGE LQ T Sbjct: 218 DSLWFSLYRNASSKDKVLNWIKKKEHIFTKAYVFVPIVCWGHWSLLILCHFGEDLQLVTG 277 Query: 540 TPCMLLLDSLEMANPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKVPQQKNSED 361 + CMLLLDSLEMA+P+RLEP+IR+FV DIY+ RPE K I+KIP LVPKVPQQK+ D Sbjct: 278 SRCMLLLDSLEMADPRRLEPEIRRFVQDIYKAGDRPETKHLISKIPLLVPKVPQQKDGTD 337 Query: 360 CGIFVLYFMNLFLESAPEDFNIFGGYPYFMNENWFSSEGLARFCK 226 CG FVLYF+ LFLE AP++F+I GYPYFM ++WF+ E L RFC+ Sbjct: 338 CGNFVLYFIKLFLELAPKNFSI-EGYPYFMKKDWFTFEDLDRFCE 381 >ref|XP_007037887.1| Cysteine proteinases superfamily protein, putative isoform 6, partial [Theobroma cacao] gi|508775132|gb|EOY22388.1| Cysteine proteinases superfamily protein, putative isoform 6, partial [Theobroma cacao] Length = 232 Score = 281 bits (718), Expect = 5e-73 Identities = 137/219 (62%), Positives = 166/219 (75%), Gaps = 1/219 (0%) Frame = -1 Query: 1038 
DLVSSSLEVYDDQKPKSHGSCCHHIATGCQALPDRVTKKKAREIKSFELAFPFFSGTIPR 859 DL SS E Y+ H SC H+ +A +++K++A++++ F L P F G IP Sbjct: 2 DLASSDPE-YNGYPISKHRSCWVHVIGSLKARKKKISKQEAQKLRDFRLTAPCFLGNIPC 60 Query: 858 RERSKR-ILIKNSISKQHRKLDSNVFESFLEKLWSSFSEEKKASFTYLDCLWFALYRKWS 682 R+RSKR + KNSISKQ +LDS FE ++EKLWSSF EEK+ SF Y DC WFA YRK S Sbjct: 61 RQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKAS 120 Query: 681 TKAKVLTWIKSKHIFSRKYVFVPIICWHHWSLLIFCHFGESLQSKTRTPCMLLLDSLEMA 502 + KVL+WIK + IFS+KYV VP++CW HWSLLIFCHFGESLQS+T+TPCMLLLDSLE+A Sbjct: 121 FREKVLSWIKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIA 180 Query: 501 NPKRLEPDIRKFVLDIYREEGRPEKKESIAKIPFLVPKV 385 NP+RLEPDIRKFVLDIYR EGRPEKKE I +IP LVPKV Sbjct: 181 NPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLLVPKV 219