BLASTX nr result
ID: Rehmannia31_contig00015316
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00015316
         (1558 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020550086.1| ubiquitin-like protease 4 isoform X2 [Sesamu...   461   e-156
ref|XP_020550087.1| ubiquitin-like protease 4 isoform X3 [Sesamu...   454   e-154
ref|XP_011080477.1| ubiquitin-like protease 4 isoform X1 [Sesamu...   454   e-154
ref|XP_020550088.1| ubiquitin-like protease 4 isoform X4 [Sesamu...   435   e-147
ref|XP_012836834.1| PREDICTED: probable ubiquitin-like-specific ...   360   e-118
gb|EYU46158.1| hypothetical protein MIMGU_mgv1a013645mg [Erythra...   303   2e-97
gb|KZV30590.1| Cysteine protein superfamily protein isoform 1 [D...   291   1e-92
ref|XP_022898367.1| probable ubiquitin-like-specific protease 2A...   258   6e-78
ref|XP_022898365.1| probable ubiquitin-like-specific protease 2A...   256   2e-77
gb|EOY22385.1| Cysteine proteinases superfamily protein, putativ...   252   6e-77
ref|XP_007037884.2| PREDICTED: probable ubiquitin-like-specific ...   252   8e-77
gb|EOY22383.1| Cysteine proteinases superfamily protein, putativ...   252   1e-76
ref|XP_007037882.2| PREDICTED: probable ubiquitin-like-specific ...   252   2e-76
ref|XP_020548056.1| probable ubiquitin-like-specific protease 2B...   253   2e-76
ref|XP_016455022.1| PREDICTED: probable ubiquitin-like-specific ...   252   2e-76
ref|XP_009626322.1| PREDICTED: probable ubiquitin-like-specific ...   251   4e-76
ref|XP_011092003.2| uncharacterized protein LOC105172313 isoform...
253 9e-76 ref|XP_011092000.2| uncharacterized protein LOC105172313 isoform... 253 9e-76 ref|XP_019267197.1| PREDICTED: probable ubiquitin-like-specific ... 250 1e-75 dbj|GAV68013.1| Peptidase_C48 domain-containing protein [Cephalo... 249 2e-75 >ref|XP_020550086.1| ubiquitin-like protease 4 isoform X2 [Sesamum indicum] Length = 405 Score = 461 bits (1186), Expect = e-156 Identities = 244/373 (65%), Positives = 275/373 (73%), Gaps = 5/373 (1%) Frame = +1 Query: 145 KGFPGTESHDQENVMSGGYNIHRC-LTLI---EVYEKTQYVLVDEESKFLNTSSQFPCFP 312 KGF G++SHDQ+N GGY I + LTL+ VYEK +L+ +E K +TSS F CFP Sbjct: 39 KGFAGSKSHDQDN---GGYKIQKHHLTLMLNPRVYEK---ILLAKERKLFDTSSTFSCFP 92 Query: 313 RGPRSKAKAACKQTEMNNNIAGVSAEETFNECPCPTSSTARPRAHRNGPKRKTKPKDERS 492 RGPRSKAKA + T + ++AG S + EC CPTSST A R G KRKTKPK+E S Sbjct: 93 RGPRSKAKARNQMT--SRHMAGFSLGVS-KECQCPTSSTTCRPARRRGRKRKTKPKNELS 149 Query: 493 DREVIVPXXXXXXXXXXXXQ-NSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYF 669 D EV VP Q SNT T+ERGKLDSATFLHYFMHIWSAFPEEK KSV YF Sbjct: 150 DSEVTVPLRIHFRGQARRGQYGSNTTTDERGKLDSATFLHYFMHIWSAFPEEKVKSVAYF 209 Query: 670 DPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINT 849 DPLWFDLYANE NR MVLNWIKE +F KKYVLVPIV WSHWSLLIFC+LGES S+ T Sbjct: 210 DPLWFDLYANENNRVMVLNWIKEKNIFSKKYVLVPIVMWSHWSLLIFCHLGESRHSRTAT 269 Query: 850 PCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDEC 1029 PCMLLLDSLH+IGPTRLEPL+RRLL DI SEE+ E KEQLKK+ L+IPKVPQQ+KG+EC Sbjct: 270 PCMLLLDSLHAIGPTRLEPLIRRLLFDIFVSEEKLESKEQLKKIPLLIPKVPQQRKGEEC 329 Query: 1030 GFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESFCKSLDAFPLVLXXXXXXX 1209 GFYVLY+I LFLESAP+NFDI EGYPYFMKKDWFT E VESFCK LD+FP+V Sbjct: 330 GFYVLYYINLFLESAPQNFDISEGYPYFMKKDWFTAEGVESFCKRLDSFPVVSSDHDDSA 389 Query: 1210 XXXXXXCVEFIKT 1248 VE I+T Sbjct: 390 SVDSSNSVELIET 402 >ref|XP_020550087.1| ubiquitin-like protease 4 isoform X3 [Sesamum indicum] Length = 398 Score = 454 bits (1169), Expect = e-154 Identities = 244/379 (64%), Positives = 275/379 (72%), Gaps = 11/379 (2%) Frame = +1 Query: 145 
KGFPGTESHDQENVMSGGYNIHRC-LTLI---EVYEKTQYVLVDEESKFLNTSSQFPCFP 312 KGF G++SHDQ+N GGY I + LTL+ VYEK +L+ +E K +TSS F CFP Sbjct: 26 KGFAGSKSHDQDN---GGYKIQKHHLTLMLNPRVYEK---ILLAKERKLFDTSSTFSCFP 79 Query: 313 RGPRSKAKAACKQTEMNNNIAGVSAEETFNECPCPTSSTARPRAHRNGPKRKTKPKDERS 492 RGPRSKAKA + T + ++AG S + EC CPTSST A R G KRKTKPK+E S Sbjct: 80 RGPRSKAKARNQMT--SRHMAGFSLGVS-KECQCPTSSTTCRPARRRGRKRKTKPKNELS 136 Query: 493 DREVIVPXXXXXXXXXXXXQ-NSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYF 669 D EV VP Q SNT T+ERGKLDSATFLHYFMHIWSAFPEEK KSV YF Sbjct: 137 DSEVTVPLRIHFRGQARRGQYGSNTTTDERGKLDSATFLHYFMHIWSAFPEEKVKSVAYF 196 Query: 670 DPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINT 849 DPLWFDLYANE NR MVLNWIKE +F KKYVLVPIV WSHWSLLIFC+LGES S+ T Sbjct: 197 DPLWFDLYANENNRVMVLNWIKEKNIFSKKYVLVPIVMWSHWSLLIFCHLGESRHSRTAT 256 Query: 850 PCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDEC 1029 PCMLLLDSLH+IGPTRLEPL+RRLL DI SEE+ E KEQLKK+ L+IPKVPQQ+KG+EC Sbjct: 257 PCMLLLDSLHAIGPTRLEPLIRRLLFDIFVSEEKLESKEQLKKIPLLIPKVPQQRKGEEC 316 Query: 1030 GFYVLYFIKLFLESAPENFDIFEGYPYF------MKKDWFTVEEVESFCKSLDAFPLVLX 1191 GFYVLY+I LFLESAP+NFDI EGYPYF MKKDWFT E VESFCK LD+FP+V Sbjct: 317 GFYVLYYINLFLESAPQNFDISEGYPYFFPLYLQMKKDWFTAEGVESFCKRLDSFPVVSS 376 Query: 1192 XXXXXXXXXXXXCVEFIKT 1248 VE I+T Sbjct: 377 DHDDSASVDSSNSVELIET 395 >ref|XP_011080477.1| ubiquitin-like protease 4 isoform X1 [Sesamum indicum] Length = 411 Score = 454 bits (1169), Expect = e-154 Identities = 244/379 (64%), Positives = 275/379 (72%), Gaps = 11/379 (2%) Frame = +1 Query: 145 KGFPGTESHDQENVMSGGYNIHRC-LTLI---EVYEKTQYVLVDEESKFLNTSSQFPCFP 312 KGF G++SHDQ+N GGY I + LTL+ VYEK +L+ +E K +TSS F CFP Sbjct: 39 KGFAGSKSHDQDN---GGYKIQKHHLTLMLNPRVYEK---ILLAKERKLFDTSSTFSCFP 92 Query: 313 RGPRSKAKAACKQTEMNNNIAGVSAEETFNECPCPTSSTARPRAHRNGPKRKTKPKDERS 492 RGPRSKAKA + T + ++AG S + EC CPTSST A R G KRKTKPK+E S Sbjct: 93 RGPRSKAKARNQMT--SRHMAGFSLGVS-KECQCPTSSTTCRPARRRGRKRKTKPKNELS 149 Query: 493 DREVIVPXXXXXXXXXXXXQ-NSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYF 
669 D EV VP Q SNT T+ERGKLDSATFLHYFMHIWSAFPEEK KSV YF Sbjct: 150 DSEVTVPLRIHFRGQARRGQYGSNTTTDERGKLDSATFLHYFMHIWSAFPEEKVKSVAYF 209 Query: 670 DPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINT 849 DPLWFDLYANE NR MVLNWIKE +F KKYVLVPIV WSHWSLLIFC+LGES S+ T Sbjct: 210 DPLWFDLYANENNRVMVLNWIKEKNIFSKKYVLVPIVMWSHWSLLIFCHLGESRHSRTAT 269 Query: 850 PCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDEC 1029 PCMLLLDSLH+IGPTRLEPL+RRLL DI SEE+ E KEQLKK+ L+IPKVPQQ+KG+EC Sbjct: 270 PCMLLLDSLHAIGPTRLEPLIRRLLFDIFVSEEKLESKEQLKKIPLLIPKVPQQRKGEEC 329 Query: 1030 GFYVLYFIKLFLESAPENFDIFEGYPYF------MKKDWFTVEEVESFCKSLDAFPLVLX 1191 GFYVLY+I LFLESAP+NFDI EGYPYF MKKDWFT E VESFCK LD+FP+V Sbjct: 330 GFYVLYYINLFLESAPQNFDISEGYPYFFPLYLQMKKDWFTAEGVESFCKRLDSFPVVSS 389 Query: 1192 XXXXXXXXXXXXCVEFIKT 1248 VE I+T Sbjct: 390 DHDDSASVDSSNSVELIET 408 >ref|XP_020550088.1| ubiquitin-like protease 4 isoform X4 [Sesamum indicum] ref|XP_020550089.1| ubiquitin-like protease 4 isoform X4 [Sesamum indicum] Length = 348 Score = 435 bits (1118), Expect = e-147 Identities = 228/346 (65%), Positives = 254/346 (73%), Gaps = 7/346 (2%) Frame = +1 Query: 232 VYEKTQYVLVDEESKFLNTSSQFPCFPRGPRSKAKAACKQTEMNNNIAGVSAEETFNECP 411 VYEK +L+ +E K +TSS F CFPRGPRSKAKA + T + ++AG S + EC Sbjct: 6 VYEK---ILLAKERKLFDTSSTFSCFPRGPRSKAKARNQMT--SRHMAGFSLGVS-KECQ 59 Query: 412 CPTSSTARPRAHRNGPKRKTKPKDERSDREVIVPXXXXXXXXXXXXQ-NSNTITNERGKL 588 CPTSST A R G KRKTKPK+E SD EV VP Q SNT T+ERGKL Sbjct: 60 CPTSSTTCRPARRRGRKRKTKPKNELSDSEVTVPLRIHFRGQARRGQYGSNTTTDERGKL 119 Query: 589 DSATFLHYFMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVL 768 DSATFLHYFMHIWSAFPEEK KSV YFDPLWFDLYANE NR MVLNWIKE +F KKYVL Sbjct: 120 DSATFLHYFMHIWSAFPEEKVKSVAYFDPLWFDLYANENNRVMVLNWIKEKNIFSKKYVL 179 Query: 769 VPIVTWSHWSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEE 948 VPIV WSHWSLLIFC+LGES S+ TPCMLLLDSLH+IGPTRLEPL+RRLL DI SEE Sbjct: 180 VPIVMWSHWSLLIFCHLGESRHSRTATPCMLLLDSLHAIGPTRLEPLIRRLLFDIFVSEE 239 Query: 949 
RQEGKEQLKKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYF----- 1113 + E KEQLKK+ L+IPKVPQQ+KG+ECGFYVLY+I LFLESAP+NFDI EGYPYF Sbjct: 240 KLESKEQLKKIPLLIPKVPQQRKGEECGFYVLYYINLFLESAPQNFDISEGYPYFFPLYL 299 Query: 1114 -MKKDWFTVEEVESFCKSLDAFPLVLXXXXXXXXXXXXXCVEFIKT 1248 MKKDWFT E VESFCK LD+FP+V VE I+T Sbjct: 300 QMKKDWFTAEGVESFCKRLDSFPVVSSDHDDSASVDSSNSVELIET 345 >ref|XP_012836834.1| PREDICTED: probable ubiquitin-like-specific protease 2A [Erythranthe guttata] Length = 293 Score = 360 bits (924), Expect = e-118 Identities = 181/280 (64%), Positives = 206/280 (73%), Gaps = 3/280 (1%) Frame = +1 Query: 358 MNNNIAGVSAEETFNECPCPTSSTARPRAHRNGPKRKTKPKDERSDR--EVIVPXXXXXX 531 M + +AGVSAE E PC S KRKTK K+E+SD EV +P Sbjct: 1 MTSRMAGVSAEGISEEYPCSLKSR----------KRKTKTKNEQSDYSDEVTIPRRIHLR 50 Query: 532 XXXXXXQNS-NTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYFDPLWFDLYANEKN 708 +N+ NTIT ERGKLDS+TF HYFMHIW AFPEEK S+ YFDPLWF+LY N+ Sbjct: 51 GRAGRGRNNCNTITKERGKLDSSTFFHYFMHIWRAFPEEKMNSIAYFDPLWFELYTNKHY 110 Query: 709 RGMVLNWIKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINTPCMLLLDSLHSIG 888 V++WIK VF KKYV VPIV WSHWSLLIFC++ ES DSK NTPCMLLLDSLH+IG Sbjct: 111 GPKVVDWIKAKSVFSKKYVFVPIVMWSHWSLLIFCHMSESPDSKTNTPCMLLLDSLHAIG 170 Query: 889 PTRLEPLMRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDECGFYVLYFIKLFLE 1068 PTRLE + RRLL D+H SEER E KEQLKK+R +IP VPQQK GDECGFYVLY+IKLFLE Sbjct: 171 PTRLESIARRLLFDMHVSEERLESKEQLKKMRFLIPNVPQQKNGDECGFYVLYYIKLFLE 230 Query: 1069 SAPENFDIFEGYPYFMKKDWFTVEEVESFCKSLDAFPLVL 1188 SAPENF+I EGYPYFMKK+WFT EEVESFCK+LD P+ L Sbjct: 231 SAPENFNISEGYPYFMKKEWFTEEEVESFCKNLDTLPVDL 270 >gb|EYU46158.1| hypothetical protein MIMGU_mgv1a013645mg [Erythranthe guttata] Length = 214 Score = 303 bits (775), Expect = 2e-97 Identities = 141/191 (73%), Positives = 158/191 (82%) Frame = +1 Query: 616 MHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHW 795 MHIW AFPEEK S+ YFDPLWF+LY N+ V++WIK VF KKYV VPIV WSHW Sbjct: 1 MHIWRAFPEEKMNSIAYFDPLWFELYTNKHYGPKVVDWIKAKSVFSKKYVFVPIVMWSHW 60 Query: 796 
SLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLK 975 SLLIFC++ ES DSK NTPCMLLLDSLH+IGPTRLE + RRLL D+H SEER E KEQLK Sbjct: 61 SLLIFCHMSESPDSKTNTPCMLLLDSLHAIGPTRLESIARRLLFDMHVSEERLESKEQLK 120 Query: 976 KVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESF 1155 K+R +IP VPQQK GDECGFYVLY+IKLFLESAPENF+I EGYPYFMKK+WFT EEVESF Sbjct: 121 KMRFLIPNVPQQKNGDECGFYVLYYIKLFLESAPENFNISEGYPYFMKKEWFTEEEVESF 180 Query: 1156 CKSLDAFPLVL 1188 CK+LD P+ L Sbjct: 181 CKNLDTLPVDL 191 >gb|KZV30590.1| Cysteine protein superfamily protein isoform 1 [Dorcoceras hygrometricum] Length = 223 Score = 291 bits (744), Expect = 1e-92 Identities = 134/188 (71%), Positives = 158/188 (84%) Frame = +1 Query: 616 MHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHW 795 MHIW+ F EEK K V YFDPLWF+LYA+E++R MVL+WIKE G+F KKYVLVPIV WSHW Sbjct: 1 MHIWNDFAEEKVKPVAYFDPLWFNLYADERHRSMVLDWIKEMGIFSKKYVLVPIVLWSHW 60 Query: 796 SLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLK 975 SLLIFCNLGESLDS+ +TPC+LLLDSLH+IGP RLEPL+RRLL DI+K E R E ++QLK Sbjct: 61 SLLIFCNLGESLDSESDTPCLLLLDSLHAIGPKRLEPLIRRLLSDIYKIEGRSETRDQLK 120 Query: 976 KVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESF 1155 K+ L+IPKVPQQKKG+ECG+ VLY++ LF+E APE F GYPYFM KDWFT E +ESF Sbjct: 121 KMPLLIPKVPQQKKGEECGYVVLYYVSLFVECAPETFRASNGYPYFMNKDWFTDEGLESF 180 Query: 1156 CKSLDAFP 1179 K LD+FP Sbjct: 181 YKRLDSFP 188 >ref|XP_022898367.1| probable ubiquitin-like-specific protease 2A isoform X2 [Olea europaea var. 
sylvestris] Length = 352 Score = 258 bits (658), Expect = 6e-78 Identities = 135/304 (44%), Positives = 181/304 (59%), Gaps = 7/304 (2%) Frame = +1 Query: 283 NTSSQFPCFPRGPRSKAKAACKQTEMNNNIAGVSAEETFNECPCPTSSTARPRAHRNG-- 456 N+ + F PRG RS+ + A ++T + AG AEE EC P R H+ Sbjct: 59 NSQAPFLFIPRGQRSRGRVAKRRTAELLHTAGDFAEEIVEECQSPRKCQKRKCRHKTNSS 118 Query: 457 -----PKRKTKPKDERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHYFMH 621 P R+++P R R V SNT +N RGK++ A F F+ Sbjct: 119 NLKGIPARRSRPA--RRSRGV-----------------SNTTSNLRGKINDALFQRCFLK 159 Query: 622 IWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWSL 801 IW PEEK T+ + +WF +Y N+ R VL WIK+ +F K+YVLVPIV WSHW L Sbjct: 160 IWEESPEEKRNLCTFMECIWFSMYTNDHWRKRVLTWIKKINIFSKEYVLVPIVLWSHWYL 219 Query: 802 LIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKKV 981 LIFC+LGES +SK PC+LLL+SL + LEPL+RRL+ DI+++EER K+ + K+ Sbjct: 220 LIFCHLGESSESKTRNPCILLLNSLREL-DRGLEPLIRRLVIDIYETEERPVDKKLITKI 278 Query: 982 RLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESFCK 1161 L++PKVPQQ +ECG +VLY+ LFL++ PENF +GYPYFMK DWF EE+E FCK Sbjct: 279 PLLVPKVPQQTNSEECGIFVLYYANLFLQNTPENFSTSDGYPYFMKDDWFGKEELEGFCK 338 Query: 1162 SLDA 1173 L+A Sbjct: 339 RLEA 342 >ref|XP_022898365.1| probable ubiquitin-like-specific protease 2A isoform X1 [Olea europaea var. 
sylvestris] Length = 353 Score = 256 bits (654), Expect = 2e-77 Identities = 135/305 (44%), Positives = 182/305 (59%), Gaps = 8/305 (2%) Frame = +1 Query: 283 NTSSQFPCFPRGPRSKAKAACKQT-EMNNNIAGVSAEETFNECPCPTSSTARPRAHRNG- 456 N+ + F PRG RS+ + A ++T E+ + G AEE EC P R H+ Sbjct: 59 NSQAPFLFIPRGQRSRGRVAKRRTAELLHTAEGDFAEEIVEECQSPRKCQKRKCRHKTNS 118 Query: 457 ------PKRKTKPKDERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHYFM 618 P R+++P R R V SNT +N RGK++ A F F+ Sbjct: 119 SNLKGIPARRSRPA--RRSRGV-----------------SNTTSNLRGKINDALFQRCFL 159 Query: 619 HIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWS 798 IW PEEK T+ + +WF +Y N+ R VL WIK+ +F K+YVLVPIV WSHW Sbjct: 160 KIWEESPEEKRNLCTFMECIWFSMYTNDHWRKRVLTWIKKINIFSKEYVLVPIVLWSHWY 219 Query: 799 LLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKK 978 LLIFC+LGES +SK PC+LLL+SL + LEPL+RRL+ DI+++EER K+ + K Sbjct: 220 LLIFCHLGESSESKTRNPCILLLNSLREL-DRGLEPLIRRLVIDIYETEERPVDKKLITK 278 Query: 979 VRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESFC 1158 + L++PKVPQQ +ECG +VLY+ LFL++ PENF +GYPYFMK DWF EE+E FC Sbjct: 279 IPLLVPKVPQQTNSEECGIFVLYYANLFLQNTPENFSTSDGYPYFMKDDWFGKEELEGFC 338 Query: 1159 KSLDA 1173 K L+A Sbjct: 339 KRLEA 343 >gb|EOY22385.1| Cysteine proteinases superfamily protein, putative isoform 3 [Theobroma cacao] Length = 273 Score = 252 bits (644), Expect = 6e-77 Identities = 118/209 (56%), Positives = 149/209 (71%) Frame = +1 Query: 550 QNSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNW 729 ++ N+I+ + +LDS F Y +WS+FPEEK S YFD WF Y R VL+W Sbjct: 64 KSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSW 123 Query: 730 IKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPL 909 IK +F KKYVLVP+V WSHWSLLIFC+ GESL S+ TPCMLLLDSL P RLEP Sbjct: 124 IKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPD 183 Query: 910 MRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFD 1089 +R+ + DI+++E R E KE + ++ L++PKVPQQ+ G+ECG +VLYFI LF+E APENF Sbjct: 184 
IRKFVLDIYRAEGRPEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFS 243 Query: 1090 IFEGYPYFMKKDWFTVEEVESFCKSLDAF 1176 I EGYPYFM+KDWF E VE FC+ LD+F Sbjct: 244 I-EGYPYFMRKDWFNAEGVECFCEKLDSF 271 >ref|XP_007037884.2| PREDICTED: probable ubiquitin-like-specific protease 2A isoform X4 [Theobroma cacao] Length = 273 Score = 252 bits (643), Expect = 8e-77 Identities = 118/209 (56%), Positives = 149/209 (71%) Frame = +1 Query: 550 QNSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNW 729 ++ N+I+ + +LDS F Y +WS+FPEEK S YFD WF Y R VL+W Sbjct: 64 KSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSW 123 Query: 730 IKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPL 909 IK +F KKYVLVP+V WSHWSLLIFC+ GESL S+ TPCMLLLDSL P RLEP Sbjct: 124 IKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPD 183 Query: 910 MRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFD 1089 +R+ + DI+++E R E KE + ++ L++PKVPQQ+ G+ECG +VLYFI LF+E APENF Sbjct: 184 IRKFVLDIYRAEGRPEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFS 243 Query: 1090 IFEGYPYFMKKDWFTVEEVESFCKSLDAF 1176 I EGYPYFM+KDWF E VE FC+ LD+F Sbjct: 244 I-EGYPYFMRKDWFDAEGVECFCEKLDSF 271 >gb|EOY22383.1| Cysteine proteinases superfamily protein, putative isoform 1 [Theobroma cacao] Length = 291 Score = 252 bits (644), Expect = 1e-76 Identities = 118/209 (56%), Positives = 149/209 (71%) Frame = +1 Query: 550 QNSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNW 729 ++ N+I+ + +LDS F Y +WS+FPEEK S YFD WF Y R VL+W Sbjct: 82 KSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSW 141 Query: 730 IKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPL 909 IK +F KKYVLVP+V WSHWSLLIFC+ GESL S+ TPCMLLLDSL P RLEP Sbjct: 142 IKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPD 201 Query: 910 MRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFD 1089 +R+ + DI+++E R E KE + ++ L++PKVPQQ+ G+ECG +VLYFI LF+E APENF Sbjct: 202 IRKFVLDIYRAEGRPEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFS 261 Query: 1090 
IFEGYPYFMKKDWFTVEEVESFCKSLDAF 1176 I EGYPYFM+KDWF E VE FC+ LD+F Sbjct: 262 I-EGYPYFMRKDWFNAEGVECFCEKLDSF 289 >ref|XP_007037882.2| PREDICTED: probable ubiquitin-like-specific protease 2A isoform X1 [Theobroma cacao] Length = 291 Score = 252 bits (643), Expect = 2e-76 Identities = 118/209 (56%), Positives = 149/209 (71%) Frame = +1 Query: 550 QNSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNW 729 ++ N+I+ + +LDS F Y +WS+FPEEK S YFD WF Y R VL+W Sbjct: 82 KSKNSISKQTNRLDSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSW 141 Query: 730 IKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPL 909 IK +F KKYVLVP+V WSHWSLLIFC+ GESL S+ TPCMLLLDSL P RLEP Sbjct: 142 IKREQIFSKKYVLVPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPD 201 Query: 910 MRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFD 1089 +R+ + DI+++E R E KE + ++ L++PKVPQQ+ G+ECG +VLYFI LF+E APENF Sbjct: 202 IRKFVLDIYRAEGRPEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFS 261 Query: 1090 IFEGYPYFMKKDWFTVEEVESFCKSLDAF 1176 I EGYPYFM+KDWF E VE FC+ LD+F Sbjct: 262 I-EGYPYFMRKDWFDAEGVECFCEKLDSF 289 >ref|XP_020548056.1| probable ubiquitin-like-specific protease 2B isoform X6 [Sesamum indicum] ref|XP_020548058.1| probable ubiquitin-like-specific protease 2B isoform X6 [Sesamum indicum] Length = 325 Score = 253 bits (645), Expect = 2e-76 Identities = 142/307 (46%), Positives = 169/307 (55%) Frame = +1 Query: 253 VLVDEESKFLNTSSQFPCFPRGPRSKAKAACKQTEMNNNIAGVSAEETFNECPCPTSSTA 432 + V EE K N S PRG RSK + K T + AG + A Sbjct: 15 IQVAEEEKCHNLESSLSQVPRGQRSKGRGTEKVTAAQD--AGSNDISIMAVSRSQLDFAA 72 Query: 433 RPRAHRNGPKRKTKPKDERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHY 612 PR RN +TK E SD EV + S T+ + GKLDS F Y Sbjct: 73 HPRKPRNV---RTKRAAEPSDSEVTCERSLRSCRQLRRLKCSTTVVQKWGKLDSEKFEVY 129 Query: 613 FMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSH 792 +W F EE+ TY D LWF LYA E + VL+WIK +F K YV VPIV W H Sbjct: 130 MESVWRRFTEERKNIFTYLDSLWFSLYAKEPLKTKVLSWIKRKDIFSKTYVFVPIVQWGH 189 Query: 793 
WSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQL 972 W LLIFC+ GES S CMLLLDSL +LEP +RR + DI K E R E KE + Sbjct: 190 WFLLIFCHFGESPQSTTKRRCMLLLDSLQKANSKQLEPEIRRFVFDIFKIEARPEKKELI 249 Query: 973 KKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVES 1152 K+ L+IPKVPQQK G+ECG +VLY+I LFLE AP+NF GYP+FMK+DWFT EEVES Sbjct: 250 SKIPLLIPKVPQQKYGEECGLFVLYYINLFLEMAPDNFSFSMGYPHFMKEDWFTHEEVES 309 Query: 1153 FCKSLDA 1173 F K LD+ Sbjct: 310 FAKGLDS 316 >ref|XP_016455022.1| PREDICTED: probable ubiquitin-like-specific protease 2B [Nicotiana tabacum] Length = 314 Score = 252 bits (644), Expect = 2e-76 Identities = 131/285 (45%), Positives = 170/285 (59%), Gaps = 2/285 (0%) Frame = +1 Query: 307 FPRGPRSKAKAACKQTEMN--NNIAGVSAEETFNECPCPTSSTARPRAHRNGPKRKTKPK 480 F R P K+++MN N + G + T E P + S PR + KRK+K Sbjct: 3 FQRAPTLLRSCQGKESQMNCRNIVKGFGKKGTDEELPSQSRSKEHPRTCQ---KRKSKKV 59 Query: 481 DERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSV 660 D V++ + GKL+S F Y +IW PE+K S Sbjct: 60 AAALDSAVLLRRSARLRGLSNKRNGKSD-----GKLNSTDFDCYLENIWRELPEDKKSSF 114 Query: 661 TYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSK 840 Y + +WF LY + + VL WIK +F KKYV VPIV W HW LLIFC+LGESL+S+ Sbjct: 115 AYLESMWFYLYTTKLFKPKVLRWIKGLDIFSKKYVFVPIVLWDHWCLLIFCHLGESLESE 174 Query: 841 INTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKG 1020 TPCMLLLDSLH GP R EP +R+ + DI K+EER E ++ ++K++L+IPKVPQQ G Sbjct: 175 SKTPCMLLLDSLHMAGPLRYEPEIRKFVLDIFKNEERPESQQLIRKIKLLIPKVPQQTNG 234 Query: 1021 DECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESF 1155 +CG + LY+I LFLESAPENF I EGYPYFMKKDWFT +++ESF Sbjct: 235 TDCGKFALYYISLFLESAPENFSISEGYPYFMKKDWFTTDQLESF 279 >ref|XP_009626322.1| PREDICTED: probable ubiquitin-like-specific protease 2B isoform X1 [Nicotiana tomentosiformis] Length = 314 Score = 251 bits (642), Expect = 4e-76 Identities = 131/285 (45%), Positives = 170/285 (59%), Gaps = 2/285 (0%) Frame = +1 Query: 307 FPRGPRSKAKAACKQTEMN--NNIAGVSAEETFNECPCPTSSTARPRAHRNGPKRKTKPK 480 F R P K+++MN N + G + T 
E P + S PR + KRK+K Sbjct: 3 FKRAPTLLRSCQGKESQMNCRNIVKGFGKKGTDEELPSQSRSKEHPRTCQ---KRKSKKV 59 Query: 481 DERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSV 660 D V++ + GKL+S F Y +IW PE+K S Sbjct: 60 AAALDSAVLLRRSARLRGLSNKRNGKSD-----GKLNSTDFDCYLENIWRELPEDKKSSF 114 Query: 661 TYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSK 840 Y + +WF LY + + VL WIK +F KKYV VPIV W HW LLIFC+LGESL+S+ Sbjct: 115 AYLESMWFYLYTTKLFKPKVLRWIKGLDIFSKKYVFVPIVLWDHWCLLIFCHLGESLESE 174 Query: 841 INTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKG 1020 TPCMLLLDSLH GP R EP +R+ + DI K+EER E ++ ++K++L+IPKVPQQ G Sbjct: 175 SKTPCMLLLDSLHMAGPLRYEPEIRKFVLDIFKNEERPESQQLIRKIKLLIPKVPQQTNG 234 Query: 1021 DECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESF 1155 +CG + LY+I LFLESAPENF I EGYPYFMKKDWFT +++ESF Sbjct: 235 TDCGKFALYYISLFLESAPENFSISEGYPYFMKKDWFTSDQLESF 279 >ref|XP_011092003.2| uncharacterized protein LOC105172313 isoform X2 [Sesamum indicum] Length = 372 Score = 253 bits (645), Expect = 9e-76 Identities = 142/307 (46%), Positives = 169/307 (55%) Frame = +1 Query: 253 VLVDEESKFLNTSSQFPCFPRGPRSKAKAACKQTEMNNNIAGVSAEETFNECPCPTSSTA 432 + V EE K N S PRG RSK + K T + AG + A Sbjct: 62 IQVAEEEKCHNLESSLSQVPRGQRSKGRGTEKVTAAQD--AGSNDISIMAVSRSQLDFAA 119 Query: 433 RPRAHRNGPKRKTKPKDERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHY 612 PR RN +TK E SD EV + S T+ + GKLDS F Y Sbjct: 120 HPRKPRNV---RTKRAAEPSDSEVTCERSLRSCRQLRRLKCSTTVVQKWGKLDSEKFEVY 176 Query: 613 FMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSH 792 +W F EE+ TY D LWF LYA E + VL+WIK +F K YV VPIV W H Sbjct: 177 MESVWRRFTEERKNIFTYLDSLWFSLYAKEPLKTKVLSWIKRKDIFSKTYVFVPIVQWGH 236 Query: 793 WSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQL 972 W LLIFC+ GES S CMLLLDSL +LEP +RR + DI K E R E KE + Sbjct: 237 WFLLIFCHFGESPQSTTKRRCMLLLDSLQKANSKQLEPEIRRFVFDIFKIEARPEKKELI 296 Query: 973 KKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVES 1152 K+ L+IPKVPQQK G+ECG +VLY+I LFLE AP+NF GYP+FMK+DWFT EEVES Sbjct: 297 
SKIPLLIPKVPQQKYGEECGLFVLYYINLFLEMAPDNFSFSMGYPHFMKEDWFTHEEVES 356 Query: 1153 FCKSLDA 1173 F K LD+ Sbjct: 357 FAKGLDS 363 >ref|XP_011092000.2| uncharacterized protein LOC105172313 isoform X1 [Sesamum indicum] Length = 373 Score = 253 bits (645), Expect = 9e-76 Identities = 142/307 (46%), Positives = 169/307 (55%) Frame = +1 Query: 253 VLVDEESKFLNTSSQFPCFPRGPRSKAKAACKQTEMNNNIAGVSAEETFNECPCPTSSTA 432 + V EE K N S PRG RSK + K T + AG + A Sbjct: 63 IQVAEEEKCHNLESSLSQVPRGQRSKGRGTEKVTAAQD--AGSNDISIMAVSRSQLDFAA 120 Query: 433 RPRAHRNGPKRKTKPKDERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHY 612 PR RN +TK E SD EV + S T+ + GKLDS F Y Sbjct: 121 HPRKPRNV---RTKRAAEPSDSEVTCERSLRSCRQLRRLKCSTTVVQKWGKLDSEKFEVY 177 Query: 613 FMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSH 792 +W F EE+ TY D LWF LYA E + VL+WIK +F K YV VPIV W H Sbjct: 178 MESVWRRFTEERKNIFTYLDSLWFSLYAKEPLKTKVLSWIKRKDIFSKTYVFVPIVQWGH 237 Query: 793 WSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQL 972 W LLIFC+ GES S CMLLLDSL +LEP +RR + DI K E R E KE + Sbjct: 238 WFLLIFCHFGESPQSTTKRRCMLLLDSLQKANSKQLEPEIRRFVFDIFKIEARPEKKELI 297 Query: 973 KKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVES 1152 K+ L+IPKVPQQK G+ECG +VLY+I LFLE AP+NF GYP+FMK+DWFT EEVES Sbjct: 298 SKIPLLIPKVPQQKYGEECGLFVLYYINLFLEMAPDNFSFSMGYPHFMKEDWFTHEEVES 357 Query: 1153 FCKSLDA 1173 F K LD+ Sbjct: 358 FAKGLDS 364 >ref|XP_019267197.1| PREDICTED: probable ubiquitin-like-specific protease 2B isoform X1 [Nicotiana attenuata] ref|XP_019267198.1| PREDICTED: probable ubiquitin-like-specific protease 2B isoform X1 [Nicotiana attenuata] Length = 315 Score = 250 bits (639), Expect = 1e-75 Identities = 131/285 (45%), Positives = 170/285 (59%), Gaps = 2/285 (0%) Frame = +1 Query: 307 FPRGPRSKAKAACKQTEMN--NNIAGVSAEETFNECPCPTSSTARPRAHRNGPKRKTKPK 480 F R P K++++N N + G + T E P + ST PR + KRK+K Sbjct: 3 FKRAPTLLRSCQGKESQVNCRNIVKGFGKKGTDEELPSQSRSTEHPRTCQ---KRKSKKV 59 Query: 481 DERSDREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHYFMHIWSAFPEEKTKSV 660 D 
V++ + GKL+S F Y +IW PE+K S Sbjct: 60 AAALDSAVLLRRSARLRGLSNKRNGKSD-----GKLNSVDFDCYLENIWRELPEDKKSSF 114 Query: 661 TYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSHWSLLIFCNLGESLDSK 840 TY + +WF LY + + VL WIK +F KKYV VPIV W HW LLIFCNLG SL+S+ Sbjct: 115 TYLESMWFYLYTTKLFKAKVLRWIKGLDIFSKKYVFVPIVLWDHWCLLIFCNLGGSLESE 174 Query: 841 INTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQLKKVRLMIPKVPQQKKG 1020 TPCMLLLDSLH GP+ E +R+ + DI K+EER E ++ +KK++L+IPKVPQQ G Sbjct: 175 SKTPCMLLLDSLHMAGPSPYESEIRKFVLDIFKNEERPESQQLIKKIKLLIPKVPQQTNG 234 Query: 1021 DECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVESF 1155 +CG + LYFI LFLESAPENF I EGYPYFMK+DWFT +++ESF Sbjct: 235 TDCGKFALYFISLFLESAPENFSISEGYPYFMKRDWFTPDQLESF 279 >dbj|GAV68013.1| Peptidase_C48 domain-containing protein [Cephalotus follicularis] Length = 288 Score = 249 bits (636), Expect = 2e-75 Identities = 121/247 (48%), Positives = 163/247 (65%), Gaps = 3/247 (1%) Frame = +1 Query: 442 AHRNGPKRKTKPKDERS---DREVIVPXXXXXXXXXXXXQNSNTITNERGKLDSATFLHY 612 AH + K K KP++ S + + ++ + + + KL+SATF Y Sbjct: 40 AHLHARKMKIKPENLNSFQLNSPCFIHTFPHRERSKRRVRHKSAVPKLKKKLNSATFDRY 99 Query: 613 FMHIWSAFPEEKTKSVTYFDPLWFDLYANEKNRGMVLNWIKETGVFLKKYVLVPIVTWSH 792 ++W +FP+EK S +Y D LWF LY N +R VL+WIK + +F KKYV VPIV WSH Sbjct: 100 LENLWRSFPKEKITSFSYIDSLWFSLYTNASSRTKVLDWIKRSHIFSKKYVFVPIVLWSH 159 Query: 793 WSLLIFCNLGESLDSKINTPCMLLLDSLHSIGPTRLEPLMRRLLCDIHKSEERQEGKEQL 972 WSLLIFC+ GESL SK TPCMLLLDSL GP RLEP +R+ + DI+K+E R E K+ + Sbjct: 160 WSLLIFCHFGESLQSKRRTPCMLLLDSLEMAGPKRLEPDIRKFVLDIYKAEGRPETKKMI 219 Query: 973 KKVRLMIPKVPQQKKGDECGFYVLYFIKLFLESAPENFDIFEGYPYFMKKDWFTVEEVES 1152 ++ L++PKVPQQ+ +ECG +VLY+I LF+E APENF E +PYFMKKDWF+ E +E Sbjct: 220 SRIPLLVPKVPQQRDDEECGRFVLYYINLFVEGAPENFST-ENFPYFMKKDWFSEEGLER 278 Query: 1153 FCKSLDA 1173 FC+ LD+ Sbjct: 279 FCERLDS 285