BLASTX nr result
ID: Catharanthus22_contig00003542
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00003542 (1756 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006366658.1| PREDICTED: probable ubiquitin-like-specific ... 267 1e-68 ref|XP_004253246.1| PREDICTED: uncharacterized protein LOC101254... 264 8e-68 ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251... 258 7e-66 gb|AAT40499.2| Ulp1 protease family protein, putative [Solanum d... 251 5e-64 gb|EOY22383.1| Cysteine proteinases superfamily protein, putativ... 251 9e-64 gb|EMJ11800.1| hypothetical protein PRUPE_ppa017098mg [Prunus pe... 250 2e-63 gb|EOY22385.1| Cysteine proteinases superfamily protein, putativ... 249 3e-63 ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305... 247 1e-62 ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Popu... 246 3e-62 ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like ... 245 4e-62 ref|XP_006340363.1| PREDICTED: sentrin-specific protease 1-like ... 245 4e-62 ref|XP_004251247.1| PREDICTED: uncharacterized protein LOC101243... 245 4e-62 ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ri... 244 7e-62 ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citr... 244 1e-61 ref|XP_004501822.1| PREDICTED: probable ubiquitin-like-specific ... 241 7e-61 ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Popu... 239 2e-60 gb|EOY22386.1| Cysteine proteinases superfamily protein, putativ... 238 8e-60 gb|AFK37750.1| unknown [Lotus japonicus] 235 4e-59 gb|ESW09944.1| hypothetical protein PHAVU_009G168700g [Phaseolus... 234 1e-58 ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303... 233 2e-58 >ref|XP_006366658.1| PREDICTED: probable ubiquitin-like-specific protease 2B-like [Solanum tuberosum] Length = 427 Score = 267 bits (682), Expect = 1e-68 Identities = 151/307 (49%), Positives = 198/307 (64%), Gaps = 4/307 (1%) Frame = +2 Query: 326 EDLHYKLSSIFQNIPRSKRSKGRYLLKKMANVVDSDYVPHSRVPRKQIGSKHARNSQKRI 505 E + + S Q+ PR S+ R K AN DS+ +P QI K +S+KR Sbjct: 119 EVIPQQASCCLQSKPRP-HSRKRTKNKITANTTDSEAIPQQASCCLQI--KPRPHSRKR- 174 Query: 506 TEKRKGKADLSDSETYAPRL-RVRGQKRALLHRSNSITQQGLFDTNIFQVYFENIWNGIS 682 K K AD +DSE R R GQ R R+NS Q+GL + F++Y E+IW Sbjct: 175 -SKSKITADSTDSEVIPQRASRCHGQSR----RNNS--QKGLGSSK-FELYLESIWKLHP 226 Query: 683 VEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVPIVMWSHWSLLILCHF 862 ++RN+F+YLDSLWF++Y+E S KVLNWI KK IFSK+YV VPIV+W HWSLLI CH Sbjct: 227 EDRRNTFSYLDSLWFSLYSERSHKAKVLNWIAKKKIFSKEYVFVPIVLWGHWSLLIFCHL 286 Query: 863 GE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDRPVTRESIQKIPFLVPK 1033 GE QSK SPCM+LLDSL AN +P IRKFV++++K E RP T++ I KIP ++PK Sbjct: 287 GESLQSKERSPCMLLLDSLHMANPERFDPGIRKFVVDLFKAEQRPETKDQIMKIPLMIPK 346 Query: 1034 VPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFASQDLDEFCQSLGSVR 1213 VPQQ+N E+CG +VL+Y++LFLE+APE+FSIS GYPYFM +DWF + L+ F Q + S Sbjct: 347 VPQQRNDEDCGNFVLYYINLFLESAPENFSISKGYPYFMTEDWFTPERLECFLQKVQSTC 406 Query: 1214 AESSTQD 1234 + D Sbjct: 407 GSTCDSD 413 >ref|XP_004253246.1| PREDICTED: uncharacterized protein LOC101254774 [Solanum lycopersicum] Length = 460 Score = 264 bits (675), Expect = 8e-68 Identities = 151/309 (48%), Positives = 197/309 (63%), Gaps = 6/309 (1%) Frame = +2 Query: 326 EDLHYKLSSIFQNIPRSKRSKGRYLLKKMANVVDSDYVPHSRVPRKQIGSK-HARN-SQK 499 E + + S Q+ PR K R K AN DS+ + QI + H+R S+ Sbjct: 152 EVITQQASCFLQSKPRPHPRK-RKKNKITANTTDSEAIQQQASCCSQIKPRPHSRKRSKS 210 Query: 500 RITEKRKGKADLSDSETYAPRL-RVRGQKRALLHRSNSITQQGLFDTNIFQVYFENIWNG 676 RIT AD +DSE R R GQ R R+NS Q+GL + F++Y E+IW Sbjct: 211 RIT------ADSTDSEVIPLRASRCHGQSR----RNNS--QKGLGSSK-FELYLESIWKL 257 Query: 677 ISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVPIVMWSHWSLLILC 856 ++RN+F YLDSLWF++Y+E S KVLNWI KK IFSK+YV VPIV+W HWSLLI C Sbjct: 258 HPEDRRNTFTYLDSLWFSLYSERSHKAKVLNWIAKKKIFSKEYVFVPIVLWGHWSLLIFC 317 Query: 857 HFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDRPVTRESIQKIPFLV 1027 H GE QSK SPCM+LLDSL AN +P IRKFV++++K E RP T++ I KIP ++ Sbjct: 318 HLGESLQSKERSPCMLLLDSLHMANPERFDPGIRKFVIDLFKAEQRPETKDQIMKIPLMI 377 Query: 1028 PKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFASQDLDEFCQSLGS 1207 PKVPQQ+N E+CG +VL+Y++LFLE+APE+FSIS GYPYFM +DWF + L+ F Q + S Sbjct: 378 PKVPQQQNDEDCGNFVLYYINLFLESAPENFSISKGYPYFMTEDWFTPERLECFLQEVQS 437 Query: 1208 VRAESSTQD 1234 +S D Sbjct: 438 ASGSTSDSD 446 >ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251251 [Vitis vinifera] gi|297733618|emb|CBI14865.3| unnamed protein product [Vitis vinifera] Length = 295 Score = 258 bits (658), Expect = 7e-66 Identities = 119/196 (60%), Positives = 152/196 (77%), Gaps = 3/196 (1%) Frame = +2 Query: 632 DTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVL 811 DT F+ YF N+W S +K++SF YLD LWF+ Y + S KVLNWI KK IFS+KYV Sbjct: 95 DTAAFEWYFRNLWKSFSDDKKSSFGYLDCLWFSFYLKTSSREKVLNWIKKKRIFSRKYVF 154 Query: 812 VPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKED 982 VPIV W+HWSLLILCHFGE +SK +PCM+LLDSLQ AN LEP IRKFV +IYK+E Sbjct: 155 VPIVCWNHWSLLILCHFGESLESKIRAPCMLLLDSLQMANPKRLEPNIRKFVFDIYKEEG 214 Query: 983 RPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDW 1162 RP +++ I KIP LVPKVPQQ+NGEECG +VL++++LF++ APE+FS+S+GYPYFMKK+W Sbjct: 215 RPESKQLISKIPLLVPKVPQQRNGEECGNFVLYFINLFMDGAPENFSVSEGYPYFMKKNW 274 Query: 1163 FASQDLDEFCQSLGSV 1210 F + L+ F + L S+ Sbjct: 275 FGPEALEHFFRKLDSI 290 >gb|AAT40499.2| Ulp1 protease family protein, putative [Solanum demissum] Length = 440 Score = 251 bits (642), Expect = 5e-64 Identities = 132/262 (50%), Positives = 170/262 (64%), Gaps = 9/262 (3%) Frame = +2 Query: 476 KHARNSQKRITEKRKGKADLSDSETYAPRLRVRGQK------RALLHRSNSITQQGLFDT 637 +H RN Q +E + D S+ E R + R LL R NS ++ L Sbjct: 170 EHMRNDQ---SEGERTGNDRSEGERLRELSNSRNSRSEGKRFRGLLKRRNSRSEGKLNSI 226 Query: 638 NIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVP 817 N F Y ENIW + +K+N FA LDS+WF+ Y KVL WI KDIFSKKYV VP Sbjct: 227 N-FDCYLENIWMKLPEDKKNLFACLDSMWFSSYRNKQYESKVLRWIKSKDIFSKKYVFVP 285 Query: 818 IVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSANHN-LEPQIRKFVLEIYKKEDRP 988 IV+W HW LLI CH GE +S+S++PCM+LLDSLQ A+ + P+IRKFV I+ E+RP Sbjct: 286 IVLWGHWCLLIFCHLGESLESESTTPCMLLLDSLQIADSSRFAPEIRKFVSSIFNNEERP 345 Query: 989 VTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFA 1168 +++ I+KIP LVP+VPQQ+N +CG +VLFY+SLFLENAPE+FSIS+GYPYFMK+DWF Sbjct: 346 ESKQLIKKIPLLVPQVPQQRNATDCGKFVLFYISLFLENAPETFSISEGYPYFMKEDWFT 405 Query: 1169 SQDLDEFCQSLGSVRAESSTQD 1234 L+ F Q L +V SS+ D Sbjct: 406 HDQLESFWQDLQTVNKNSSSAD 427 >gb|EOY22383.1| Cysteine proteinases superfamily protein, putative isoform 1 [Theobroma cacao] Length = 291 Score = 251 bits (640), Expect = 9e-64 Identities = 131/255 (51%), Positives = 176/255 (69%), Gaps = 8/255 (3%) Frame = +2 Query: 467 IGSKHARNSQKRITEKRKGKADLSDSETYAP----RLRVRGQKRALLHRSNSITQQ-GLF 631 IGS AR +K+I+++ K L D AP + R + + + NSI++Q Sbjct: 39 IGSLKAR--KKKISKQEAQK--LRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRL 94 Query: 632 DTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVL 811 D+ F+ Y E +W+ EKR SFAY D WF Y + S KVL+WI ++ IFSKKYVL Sbjct: 95 DSGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVL 154 Query: 812 VPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKED 982 VP+V WSHWSLLI CHFGE QS++ +PCM+LLDSL+ AN LEP IRKFVL+IY+ E Sbjct: 155 VPVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEG 214 Query: 983 RPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDW 1162 RP +E I +IP LVPKVPQQ++GEECG +VL++++LF+E APE+FSI +GYPYFM+KDW Sbjct: 215 RPEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDW 273 Query: 1163 FASQDLDEFCQSLGS 1207 F ++ ++ FC+ L S Sbjct: 274 FNAEGVECFCEKLDS 288 >gb|EMJ11800.1| hypothetical protein PRUPE_ppa017098mg [Prunus persica] Length = 303 Score = 250 bits (638), Expect = 2e-63 Identities = 125/240 (52%), Positives = 162/240 (67%), Gaps = 8/240 (3%) Frame = +2 Query: 512 KRKGKADLSDSETYAPRLRVRGQKRALLHRSNSITQQGL-----FDTNIFQVYFENIWNG 676 KRKGK + E P+ K A+ + + Q + F YF+N+W Sbjct: 66 KRKGKRE-EMKELRPPK----DAKNAVSRKKEKLDSQAFECKEKLGSEAFDRYFQNLWKN 120 Query: 677 ISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVPIVMWSHWSLLILC 856 +S +KR SFAYLD +WF++Y + S KVL WI KK IFSKKYV+VPIV W HW+LLI C Sbjct: 121 LSEDKRTSFAYLDCMWFSLYLQPSSRDKVLTWIKKKHIFSKKYVIVPIVCWGHWNLLIFC 180 Query: 857 HFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDRPVTRESIQKIPFLV 1027 HFGE QS++ PCM+LLDSL++A+ EP IRKFVL+IY+ E R T++ I +IPFLV Sbjct: 181 HFGESEQSETHKPCMLLLDSLENADPRRYEPDIRKFVLDIYEAEGRSETKDFIYRIPFLV 240 Query: 1028 PKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFASQDLDEFCQSLGS 1207 PKVPQQ+N ECG +VL+Y++LF+E APE+FSI GYPYFMKK+WF + L+ FCQ L S Sbjct: 241 PKVPQQRNDVECGNFVLYYINLFIEGAPENFSIEGGYPYFMKKNWFTPEGLECFCQQLYS 300 >gb|EOY22385.1| Cysteine proteinases superfamily protein, putative isoform 3 [Theobroma cacao] Length = 273 Score = 249 bits (636), Expect = 3e-63 Identities = 130/254 (51%), Positives = 175/254 (68%), Gaps = 8/254 (3%) Frame = +2 Query: 470 GSKHARNSQKRITEKRKGKADLSDSETYAP----RLRVRGQKRALLHRSNSITQQ-GLFD 634 GS AR +K+I+++ K L D AP + R + + + NSI++Q D Sbjct: 22 GSLKAR--KKKISKQEAQK--LRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLD 77 Query: 635 TNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLV 814 + F+ Y E +W+ EKR SFAY D WF Y + S KVL+WI ++ IFSKKYVLV Sbjct: 78 SGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLV 137 Query: 815 PIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDR 985 P+V WSHWSLLI CHFGE QS++ +PCM+LLDSL+ AN LEP IRKFVL+IY+ E R Sbjct: 138 PVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGR 197 Query: 986 PVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWF 1165 P +E I +IP LVPKVPQQ++GEECG +VL++++LF+E APE+FSI +GYPYFM+KDWF Sbjct: 198 PEKKEMIYRIPLLVPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWF 256 Query: 1166 ASQDLDEFCQSLGS 1207 ++ ++ FC+ L S Sbjct: 257 NAEGVECFCEKLDS 270 >ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305332 [Fragaria vesca subsp. vesca] Length = 330 Score = 247 bits (630), Expect = 1e-62 Identities = 123/260 (47%), Positives = 175/260 (67%), Gaps = 4/260 (1%) Frame = +2 Query: 440 PHSRVPRKQIGSKHARNSQKRITEKRKGKADLSDSETYAPRLRVRGQKRALLHRSNSITQ 619 PH R + +K+ R T+ ++ KA V +K L+ R +++ Sbjct: 80 PHRRRSVRSFKTKYVNLEVSRKTQNQESKA-----------CAVSRRKPVLVSRGCRVSR 128 Query: 620 QGL-FDTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFS 796 + D+ FQ YFE++W S +K+ SF YLD +WF++Y + + KVL WI KK IFS Sbjct: 129 RKQELDSGTFQCYFESLWKSFSEDKKTSFTYLDCIWFSLYIKPTTKDKVLTWIKKKHIFS 188 Query: 797 KKYVLVPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEI 967 KKYV VPIV WSHW+LLILCHFGE +SK+ PCM+LLDSL+ A+ LEP IRKFV++I Sbjct: 189 KKYVFVPIVCWSHWNLLILCHFGENLESKTQRPCMLLLDSLEMADPRRLEPDIRKFVVDI 248 Query: 968 YKKEDRPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYF 1147 +++E RP + ++KIP LVPKVPQQ+N +ECG +VL++++LF+E+AP++FS+ + YPYF Sbjct: 249 FREEGRPENMDLLRKIPLLVPKVPQQRNDQECGNFVLYFINLFMESAPQTFSMEE-YPYF 307 Query: 1148 MKKDWFASQDLDEFCQSLGS 1207 MKK+WFA + LD FCQ + S Sbjct: 308 MKKNWFAYESLDCFCQDIYS 327 >ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Populus trichocarpa] gi|222864154|gb|EEF01285.1| hypothetical protein POPTR_0010s18760g [Populus trichocarpa] Length = 298 Score = 246 bits (627), Expect = 3e-62 Identities = 122/251 (48%), Positives = 171/251 (68%), Gaps = 4/251 (1%) Frame = +2 Query: 479 HARNSQKRITEKRKGKADLSDSETYAPRLRVRGQKRALLHRSNSITQ-QGLFDTNIFQVY 655 HAR ++ + + ++ S + + R + + R+N++++ + D+ F Y Sbjct: 47 HARRTRMTKKQAEEIESFKLTSPCFLQTIPCRERSKKRFKRNNAVSKLKKELDSVSFNCY 106 Query: 656 FENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVPIVMWSH 835 EN+W S +K+ SFAYLDSLWF MYTE S KVL WI +K IFSKKYVLVPIV W H Sbjct: 107 MENLWKSFSEDKKMSFAYLDSLWFTMYTEASSGVKVLEWIKRKHIFSKKYVLVPIVRWCH 166 Query: 836 WSLLILCHFGEQ--SKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDRPVTRESI 1006 WSLLI CHFGE S++ +PCM+LLDSL+ A+ LEP IRKFV +IY+ E RP + I Sbjct: 167 WSLLIFCHFGESLLSENITPCMLLLDSLEMASPKRLEPDIRKFVWDIYESEGRPENKHMI 226 Query: 1007 QKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFASQDLDE 1186 +IP LVPKVPQQ+NG ECG YVL +++LF+++APE+F + +GYPYFMK +WF+ + L+ Sbjct: 227 SQIPLLVPKVPQQRNGVECGNYVLNFINLFVQDAPENFHM-EGYPYFMKDNWFSPEGLEH 285 Query: 1187 FCQSLGSVRAE 1219 FC+ L S+ ++ Sbjct: 286 FCEKLESLESD 296 >ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like isoform X1 [Citrus sinensis] gi|568883543|ref|XP_006494525.1| PREDICTED: sentrin-specific protease 1-like isoform X1 [Citrus sinensis] Length = 303 Score = 245 bits (626), Expect = 4e-62 Identities = 116/218 (53%), Positives = 161/218 (73%), Gaps = 5/218 (2%) Frame = +2 Query: 569 VRGQKRALLHRSNSITQQ--GLFDTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTE 742 ++G+ + + + IT++ D+ F+ +N+W S +K+ F YLDSLWF++Y + Sbjct: 84 IKGKNSSSVKCKDMITKRKKNKLDSGKFEHLLDNLWRSFSEDKKAGFTYLDSLWFDLYRK 143 Query: 743 GSGNGKVLNWITKKDIFSKKYVLVPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQ 916 S KVL WI +K IFSKKYVLVPIV W HW+LLILC+FG +SK+ +PCM+LLDSL+ Sbjct: 144 PSSKAKVLTWIKRKHIFSKKYVLVPIVCWRHWNLLILCNFGGSFESKTRTPCMLLLDSLE 203 Query: 917 SAN-HNLEPQIRKFVLEIYKKEDRPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSL 1093 +N EP IRKFV++IYK EDRP T+E I +IP LVPKVPQQ+NGEECG +VL++++L Sbjct: 204 MSNPWRFEPDIRKFVMDIYKAEDRPETKELISRIPLLVPKVPQQRNGEECGNFVLYFINL 263 Query: 1094 FLENAPESFSISDGYPYFMKKDWFASQDLDEFCQSLGS 1207 F+E APE+F++ D YPYFM+K+WF ++DLD FC+ L S Sbjct: 264 FVEGAPENFNLED-YPYFMEKNWFTAEDLDCFCERLNS 300 >ref|XP_006340363.1| PREDICTED: sentrin-specific protease 1-like [Solanum tuberosum] Length = 371 Score = 245 bits (626), Expect = 4e-62 Identities = 128/309 (41%), Positives = 190/309 (61%), Gaps = 11/309 (3%) Frame = +2 Query: 341 KLSSIFQNIPRSKRSKGRYLLKKMA-------NVVDSDYVPHSRVPRKQIGSKHARNSQK 499 +LS+ + R++RS+G + ++ ++ + +++ + + ++ G+ + + Sbjct: 53 ELSNSRNSRSRNERSEGEHTGNDISEGEHAGNDISEGEHMGNDQSEGERTGNDRSEGERL 112 Query: 500 R-ITEKRKGKADLSDSETYAPRLRVRGQKRALLHRSNSITQQGLFDTNIFQVYFENIWNG 676 R ++ R +++ R R + L S S +G ++ F Y ENIW Sbjct: 113 RELSNSRNSRSEGKRLRGLPKRRNSRSEGERLRRNSRS---EGKLNSINFDCYLENIWRK 169 Query: 677 ISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVPIVMWSHWSLLILC 856 + +K+N FA LDS+WF+ Y KVL WI KDIFSKKYV VPIV+W HW LLI C Sbjct: 170 LPEDKKNLFACLDSMWFSSYRNKQYESKVLRWIKNKDIFSKKYVFVPIVLWGHWCLLIFC 229 Query: 857 HFGE--QSKSSSPCMVLLDSLQSANHN-LEPQIRKFVLEIYKKEDRPVTRESIQKIPFLV 1027 H GE +S+S++PCM+LLDSLQ A+ + P+IRKFV I+ E+RP ++ I+KIP LV Sbjct: 230 HLGESLESESTTPCMLLLDSLQIADSSRFAPEIRKFVSSIFNNEERPESKRLIKKIPLLV 289 Query: 1028 PKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFASQDLDEFCQSLGS 1207 P+VPQQ+N +CG +VL+Y+S FLENAPE+FSIS+GYPYFMK+DWF L+ F Q L + Sbjct: 290 PQVPQQRNATDCGIFVLYYISRFLENAPETFSISEGYPYFMKEDWFTHDQLESFWQDLQT 349 Query: 1208 VRAESSTQD 1234 V SS+ D Sbjct: 350 VNKNSSSAD 358 >ref|XP_004251247.1| PREDICTED: uncharacterized protein LOC101243669 [Solanum lycopersicum] Length = 479 Score = 245 bits (626), Expect = 4e-62 Identities = 130/263 (49%), Positives = 170/263 (64%), Gaps = 9/263 (3%) Frame = +2 Query: 473 SKHARNSQKRITEKRKGKADLS-DSETYAPRLRVRGQKR-----ALLHRSNSITQQGLFD 634 S+ R R E+R K S +S + RLR ++R R NS ++ L Sbjct: 205 SEGERTGNDRSEEERLCKLSCSRNSRSEGERLRGLSKRRNSRSEGERFRRNSRSEGKLNS 264 Query: 635 TNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLV 814 N F Y ENIW + +K+N FA LDSLWF+ Y KVL WI KDIFSKKYV V Sbjct: 265 IN-FDCYLENIWRKLPEDKKNLFACLDSLWFSSYRNKRFESKVLRWIKNKDIFSKKYVFV 323 Query: 815 PIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSANHN-LEPQIRKFVLEIYKKEDR 985 PIV+W HW LLI CH GE +S+S++PCM+LLDSLQ A+ + P+IRKF+ I+ E+R Sbjct: 324 PIVLWGHWCLLIFCHLGESLESESTTPCMLLLDSLQIADSSRFAPEIRKFISSIFNNEER 383 Query: 986 PVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWF 1165 P +++ I+ IP LVP+VPQQ+N +CG +VL+Y+SLFLENAPE+FSIS+GYPYFMK+DWF Sbjct: 384 PESKQLIKNIPLLVPQVPQQRNATDCGKFVLYYISLFLENAPETFSISEGYPYFMKEDWF 443 Query: 1166 ASQDLDEFCQSLGSVRAESSTQD 1234 L+ F Q L +V SS+ D Sbjct: 444 THDQLESFWQDLQTVNKNSSSAD 466 >ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ricinus communis] gi|223550366|gb|EEF51853.1| sentrin/sumo-specific protease, putative [Ricinus communis] Length = 294 Score = 244 bits (624), Expect = 7e-62 Identities = 128/252 (50%), Positives = 166/252 (65%), Gaps = 9/252 (3%) Frame = +2 Query: 479 HARNSQKRITEKRKGKADLSDSETYAPRLRVRGQKRALLHRSNSIT------QQGLFDTN 640 H + +K+ EK + + DL S+ + R + R + +IT ++ D+ Sbjct: 43 HGKKIKKKEAEKLR-RFDLI-SQCFLGTFPTRQRSRRRIKHKFAITRVIKEKEKKRLDSG 100 Query: 641 IFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVPI 820 F YF+N+W S EKR SF YLDSLWF Y + S GKVL WI +K IFSKKYVLVPI Sbjct: 101 EFDCYFQNLWKSFSKEKRTSFVYLDSLWFYWYLKASWKGKVLTWIKRKQIFSKKYVLVPI 160 Query: 821 VMWSHWSLLILCHFGEQSKSS--SPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDRPV 991 V W HWSLLI CH GE S+S+ +PCM+LLDSL+ AN LEP IRKFVL+IY E RP Sbjct: 161 VCWGHWSLLIFCHLGEVSESNDRTPCMLLLDSLEMANPRRLEPDIRKFVLDIYTSEGRPE 220 Query: 992 TRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFAS 1171 ++ I +IP LVPKVPQQ+NGEECG YVL++++LF+ AP+ FSI D YPYFM K+WF+ Sbjct: 221 DKKLISQIPLLVPKVPQQRNGEECGNYVLYFINLFMLGAPDDFSIKD-YPYFMNKNWFSP 279 Query: 1172 QDLDEFCQSLGS 1207 + L+ F + L S Sbjct: 280 ECLERFSEELES 291 >ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citrus clementina] gi|557542301|gb|ESR53279.1| hypothetical protein CICLE_v10021330mg [Citrus clementina] Length = 303 Score = 244 bits (622), Expect = 1e-61 Identities = 115/218 (52%), Positives = 161/218 (73%), Gaps = 5/218 (2%) Frame = +2 Query: 569 VRGQKRALLHRSNSITQQ--GLFDTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTE 742 ++G+ + + + IT++ D+ F+ +N+W S +K+ F YLDSLWF++Y + Sbjct: 84 IKGKNSSSVKCKDMITKRRKNKLDSGKFEHLLDNLWRSFSEDKKAGFTYLDSLWFDLYRK 143 Query: 743 GSGNGKVLNWITKKDIFSKKYVLVPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQ 916 S KVL WI +K IFSKKYVLVPIV W HW+LLILC+FG +SK+ +PCM+LLDSL+ Sbjct: 144 PSSKAKVLTWIKRKHIFSKKYVLVPIVCWRHWNLLILCNFGGSFESKTRTPCMLLLDSLE 203 Query: 917 SAN-HNLEPQIRKFVLEIYKKEDRPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSL 1093 +N EP IRKFV++IYK E+RP T+E I +IP LVPKVPQQ+NGEECG +VL++++L Sbjct: 204 MSNPWRFEPDIRKFVMDIYKAEERPETKELISRIPLLVPKVPQQRNGEECGNFVLYFINL 263 Query: 1094 FLENAPESFSISDGYPYFMKKDWFASQDLDEFCQSLGS 1207 F+E APE+F++ D YPYFM+K+WF ++DLD FC+ L S Sbjct: 264 FVEGAPENFNLED-YPYFMEKNWFTAEDLDCFCERLNS 300 >ref|XP_004501822.1| PREDICTED: probable ubiquitin-like-specific protease 2A-like [Cicer arietinum] Length = 385 Score = 241 bits (615), Expect = 7e-61 Identities = 134/292 (45%), Positives = 182/292 (62%), Gaps = 14/292 (4%) Frame = +2 Query: 368 PRSKRSKGRYLLKKMANVVDSDYVPHSRVPRKQIGSKH----ARNSQKRITEKRKGKADL 535 P+ S+ R K + +V++ D ++I S+H A + TE K Sbjct: 94 PKLVYSRRRNKTKTVRDVIEIDPEVDFANQEREINSRHGTEIAYAHHRNKTENVKDAIGC 153 Query: 536 S------DSETYAPRLRVRGQKRALLHRSNSITQQGLFDTNIFQVYFENIWNGISVEKRN 697 DS R R + +++ + + S ++ L ++ +F Y IW S +++ Sbjct: 154 VSSNFPFDSNIIPRRPRTKSKRKFNGNEAPSRPKEKL-NSEVFDNYLAKIWKSFSEDRKR 212 Query: 698 SFAYLDSLWFNMYTEGSGNGKVLNWITKKD-IFSKKYVLVPIVMWSHWSLLILCHFGE-- 868 SFAYLDSLWF++Y S KVLNWI KK+ IF+K YV VPIV W HWSLLILCHFGE Sbjct: 213 SFAYLDSLWFSLYRNASSKDKVLNWIKKKEHIFTKAYVFVPIVCWGHWSLLILCHFGEDL 272 Query: 869 QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDRPVTRESIQKIPFLVPKVPQQ 1045 Q + S CM+LLDSL+ A+ LEP+IR+FV +IYK DRP T+ I KIP LVPKVPQQ Sbjct: 273 QLVTGSRCMLLLDSLEMADPRRLEPEIRRFVQDIYKAGDRPETKHLISKIPLLVPKVPQQ 332 Query: 1046 KNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFASQDLDEFCQSL 1201 K+G +CG +VL+++ LFLE AP++FSI +GYPYFMKKDWF +DLD FC++L Sbjct: 333 KDGTDCGNFVLYFIKLFLELAPKNFSI-EGYPYFMKKDWFTFEDLDRFCENL 383 >ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Populus trichocarpa] gi|550322421|gb|EEF06353.2| hypothetical protein POPTR_0015s10250g [Populus trichocarpa] Length = 292 Score = 239 bits (611), Expect = 2e-60 Identities = 119/211 (56%), Positives = 149/211 (70%), Gaps = 3/211 (1%) Frame = +2 Query: 578 QKRALLHRSNSITQQGLFDTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNG 757 +K+A++ I ++ D+ F YFE++W S +KR Y D LWFN+YT+ S G Sbjct: 81 RKKAIV---KEIREKIKLDSGAFDCYFEHMWRNFSEDKRTFITYFDCLWFNLYTKASFKG 137 Query: 758 KVLNWITKKDIFSKKYVLVPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSANHN 931 KVL WI KK IFSKKYVLVPIV WSHWSLLI CH GE QSK +PCM+LLDSL+ A Sbjct: 138 KVLTWIKKKQIFSKKYVLVPIVHWSHWSLLIFCHLGESLQSKLRTPCMLLLDSLEKAGPR 197 Query: 932 -LEPQIRKFVLEIYKKEDRPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENA 1108 LEP IRKFVL+IYK E R +E I KIP LVPKVPQQ+ GEECG YVL+Y++LF++ A Sbjct: 198 CLEPDIRKFVLDIYKSEGRAENKELISKIPLLVPKVPQQRGGEECGNYVLYYINLFVQGA 257 Query: 1109 PESFSISDGYPYFMKKDWFASQDLDEFCQSL 1201 PE+F + D YPYFMK++WF+ L+ F + L Sbjct: 258 PENFCMDD-YPYFMKQNWFSPGCLEAFFEKL 287 >gb|EOY22386.1| Cysteine proteinases superfamily protein, putative isoform 4 [Theobroma cacao] Length = 270 Score = 238 bits (606), Expect = 8e-60 Identities = 127/254 (50%), Positives = 172/254 (67%), Gaps = 8/254 (3%) Frame = +2 Query: 470 GSKHARNSQKRITEKRKGKADLSDSETYAP----RLRVRGQKRALLHRSNSITQQ-GLFD 634 GS AR +K+I+++ K L D AP + R + + + NSI++Q D Sbjct: 22 GSLKAR--KKKISKQEAQK--LRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLD 77 Query: 635 TNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLV 814 + F+ Y E +W+ EKR SFAY D WF Y + S KVL+WI ++ IFSKKYVLV Sbjct: 78 SGAFECYMEKLWSSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLV 137 Query: 815 PIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDR 985 P+V WSHWSLLI CHFGE QS++ +PCM+LLDSL+ AN LEP IRKFVL+IY+ E R Sbjct: 138 PVVCWSHWSLLIFCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGR 197 Query: 986 PVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWF 1165 P +E I +IP LVPK Q++GEECG +VL++++LF+E APE+FSI +GYPYFM+KDWF Sbjct: 198 PEKKEMIYRIPLLVPK---QRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWF 253 Query: 1166 ASQDLDEFCQSLGS 1207 ++ ++ FC+ L S Sbjct: 254 NAEGVECFCEKLDS 267 >gb|AFK37750.1| unknown [Lotus japonicus] Length = 284 Score = 235 bits (600), Expect = 4e-59 Identities = 115/199 (57%), Positives = 142/199 (71%), Gaps = 3/199 (1%) Frame = +2 Query: 623 GLFDTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKK 802 G+FD N+ + IWN S +KR FAY DSLWF++Y S KVL WI K+ IFSK Sbjct: 91 GVFDNNLVK-----IWNSFSEDKRKPFAYFDSLWFSLYRAASSKDKVLTWIKKEHIFSKA 145 Query: 803 YVLVPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYK 973 YV VPIV W HWSLLI CHFGE QS + S CM+LLDSL+ N LEP IR+FV++IYK Sbjct: 146 YVFVPIVCWGHWSLLIFCHFGESLQSTTRSRCMLLLDSLEMVNPRRLEPDIRRFVVDIYK 205 Query: 974 KEDRPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMK 1153 DRP T+ I +IP LVPKVPQQ++G ECG +VL++++LFL APE+FS+ GYPYFMK Sbjct: 206 AWDRPETKNLIYQIPLLVPKVPQQRDGNECGNFVLYFINLFLRCAPENFSMG-GYPYFMK 264 Query: 1154 KDWFASQDLDEFCQSLGSV 1210 KDWF +D D FC+ L S+ Sbjct: 265 KDWFTFEDFDRFCERLYSL 283 >gb|ESW09944.1| hypothetical protein PHAVU_009G168700g [Phaseolus vulgaris] Length = 268 Score = 234 bits (596), Expect = 1e-58 Identities = 116/244 (47%), Positives = 160/244 (65%), Gaps = 3/244 (1%) Frame = +2 Query: 488 NSQKRITEKRKGKADLSDSETYAPRLRVRGQKRALLHRSNSITQQGLFDTNIFQVYFENI 667 N +R+ KRK K + + + + R ++ + + D+ IF + + I Sbjct: 25 NVPQRLRTKRKRKLNGKKALSRPNKEHSRPKEAPCRPKETLSRIKEKLDSGIFDTFLKKI 84 Query: 668 WNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFSKKYVLVPIVMWSHWSLL 847 W +++ F Y DSLWF++Y S KVL WI ++ IFSK YV VPIV W HWSLL Sbjct: 85 WKIFPEDRKGQFTYFDSLWFSLYRSASSKDKVLAWIKREPIFSKAYVFVPIVCWGHWSLL 144 Query: 848 ILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEIYKKEDRPVTRESIQKIP 1018 ILCHFGE QS + S CM+LLDSL+ AN LEP+IR+FVL+IYK DRP T+ + +IP Sbjct: 145 ILCHFGESLQSSTRSRCMLLLDSLEMANPRRLEPEIRRFVLDIYKSGDRPETKNILSQIP 204 Query: 1019 FLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYFMKKDWFASQDLDEFCQS 1198 FLVPKVPQQ++G ECG +VL++++LFLE+AP++FS+ +GYPYFM KDWF+ LD F + Sbjct: 205 FLVPKVPQQRDGNECGFFVLYFINLFLEHAPDNFSM-EGYPYFMTKDWFSFDGLDRFHEG 263 Query: 1199 LGSV 1210 L S+ Sbjct: 264 LNSL 267 >ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303677 [Fragaria vesca subsp. vesca] Length = 360 Score = 233 bits (594), Expect = 2e-58 Identities = 119/260 (45%), Positives = 171/260 (65%), Gaps = 4/260 (1%) Frame = +2 Query: 440 PHSRVPRKQIGSKHARNSQKRITEKRKGKA-DLSDSETYAPRLRVRGQKRALLHRSNSIT 616 PH + + +K+ + R + ++ KA +S + + RV +K+ L Sbjct: 77 PHRQRSVRSFKTKYVKRQVLRKKQNQESKACAVSRRKPVSRGCRVSRKKQEL-------- 128 Query: 617 QQGLFDTNIFQVYFENIWNGISVEKRNSFAYLDSLWFNMYTEGSGNGKVLNWITKKDIFS 796 D+ FQ FE++W S +K+ F YLD LWF++Y E + KVL WI KK IFS Sbjct: 129 -----DSGSFQSCFESLWKSFSEDKKTYFTYLDCLWFSLYIEPTTKDKVLTWIKKKHIFS 183 Query: 797 KKYVLVPIVMWSHWSLLILCHFGE--QSKSSSPCMVLLDSLQSAN-HNLEPQIRKFVLEI 967 KKYV VPIV W HWSLLILCHFGE +SK+ PCM+LLDSL+ + LEP IR+FV++I Sbjct: 184 KKYVFVPIVCWCHWSLLILCHFGENLESKTQRPCMLLLDSLEMTDPKRLEPNIRRFVVDI 243 Query: 968 YKKEDRPVTRESIQKIPFLVPKVPQQKNGEECGCYVLFYVSLFLENAPESFSISDGYPYF 1147 +++E R + ++KIP LVPKVP+Q+N +ECG +VL++++LF+E+AP++FS+ +GYPYF Sbjct: 244 FREEGRRENMDLLRKIPLLVPKVPKQRNDQECGNFVLYFINLFMESAPQTFSM-EGYPYF 302 Query: 1148 MKKDWFASQDLDEFCQSLGS 1207 MKK+WFA + LD FCQ + S Sbjct: 303 MKKNWFAYESLDCFCQEIYS 322