BLASTX nr result
ID: Forsythia22_contig00025637
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia22_contig00025637 (783 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240... 306 7e-81 ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098... 306 9e-81 ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253... 273 7e-71 ref|XP_011101645.1| PREDICTED: OTU domain-containing protein 6B-... 271 3e-70 ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606... 270 7e-70 ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245... 248 2e-63 ref|XP_007010219.1| Cysteine proteinases superfamily protein iso... 246 1e-62 emb|CDO99851.1| unnamed protein product [Coffea canephora] 243 1e-61 ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793... 241 3e-61 ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Popu... 240 6e-61 ref|XP_002315401.2| hypothetical protein POPTR_0010s24050g [Popu... 240 6e-61 ref|XP_007010220.1| Cysteine proteinases superfamily protein iso... 240 6e-61 ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125... 240 8e-61 ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Popu... 240 8e-61 ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3... 239 1e-60 ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phas... 239 1e-60 ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3... 239 2e-60 ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3... 238 3e-60 gb|KJB73389.1| hypothetical protein B456_011G230700 [Gossypium r... 237 5e-60 ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777... 237 5e-60 >ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240043 [Nicotiana sylvestris] Length = 328 Score = 306 bits (785), Expect = 7e-81 Identities = 163/261 (62%), Positives = 187/261 (71%), Gaps = 22/261 (8%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYS----DRWRSVVAGGGESFD---YAGRCR 220 MLGVLC +RP+PWL SLSLS+AH S+PA Y+ +SV+ GG + ++ CR Sbjct: 1 MLGVLC-ARPKPWLFASLSLSHAHGSAPAAYNRLIGTPTKSVLVGGSDQLQRRHHSSHCR 59 Query: 221 -------GGEASIWKVILPVGRRT-------AVFGWHEHEVAKMAGGEGSWNVAWDARPA 358 GG ASIW ILP GRR VF H +E+AK GEGSWNVAWD RPA Sbjct: 60 LGASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYELAKK--GEGSWNVAWDTRPA 117 Query: 359 RWLHHPDSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPN 538 RWLH+PDSAWLL+GV +CLA P LD D NS+V + + GFS+N V SD AD S N Sbjct: 118 RWLHNPDSAWLLFGVCSCLAAPSLDLPDSNSDVVAPIENMSQGFSSNTVNSDEADRNSAN 177 Query: 539 YRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWF 718 Y VTGVPADGRCLFRAIAHMACLRNG+ APDENRQ ELADELRAQVV ELLKR+KE EWF Sbjct: 178 YTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEAEWF 237 Query: 719 IEEDFDAYVKRIQQPYVWGGE 781 IE DFDAYV+RI++PYVWGGE Sbjct: 238 IEGDFDAYVERIEKPYVWGGE 258 >ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098494 [Nicotiana tomentosiformis] Length = 328 Score = 306 bits (784), Expect = 9e-81 Identities = 163/261 (62%), Positives = 186/261 (71%), Gaps = 22/261 (8%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYS----DRWRSVVAGGGESFD---YAGRCR 220 MLGVLC +RP+PWL SLSLS+AH S+PA Y+ +SV+ GG + ++ CR Sbjct: 1 MLGVLC-ARPKPWLFASLSLSHAHGSAPAAYNRLIGTPTKSVLVGGSDQLQRRHHSSHCR 59 Query: 221 -------GGEASIWKVILPVGRRT-------AVFGWHEHEVAKMAGGEGSWNVAWDARPA 358 GG ASIW ILP GRR VF H + +AK GEGSWNVAWD RPA Sbjct: 60 LGASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYVLAKK--GEGSWNVAWDTRPA 117 Query: 359 RWLHHPDSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPN 538 RWLH+PDSAWLL+GV +CLA P LD D NSEV +K+ GFS+N V SD D S N Sbjct: 118 RWLHNPDSAWLLFGVCSCLAAPTLDLPDSNSEVVAPIENKSQGFSSNTVNSDEVDRNSAN 177 Query: 539 YRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWF 718 Y VTGVPADGRCLFRAIAHMACLRNG+ APDENRQ ELADELRAQVV ELLKR+KE EWF Sbjct: 178 YTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEAEWF 237 Query: 719 IEEDFDAYVKRIQQPYVWGGE 781 IE DFDAYV+RI++PYVWGGE Sbjct: 238 IEGDFDAYVERIEKPYVWGGE 258 >ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253339 [Solanum lycopersicum] Length = 338 Score = 273 bits (699), Expect = 7e-71 Identities = 157/277 (56%), Positives = 187/277 (67%), Gaps = 38/277 (13%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYSDRWRS---------VVAGGG-------- 190 MLGVLC +RP+PWL SL LS+AH S+P+ YS + +++GGG Sbjct: 1 MLGVLC-ARPKPWLFASLCLSHAHGSTPSGYSRLIPTNTANKSSLLLISGGGGGGGGGIG 59 Query: 191 --ESFDYAGRCR--------GGEASIWKVILPVGRRT---------AVFGWHEHEVAKMA 313 + +++ CR GG ASIW ILP GRR VF H +E+AK Sbjct: 60 VDQRRNHSSHCRIASSVNRVGGAASIWHAILPAGRRNKKDINRRNNTVFK-HHYELAKK- 117 Query: 314 GGEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDY-SDFNSEVSTSDVDKADGF 490 GEGSWNV WD+RPARWLH+PDSAWLL+GV +CLA P LD D NS+V+ +DK Sbjct: 118 -GEGSWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANSDVAVP-IDKQSAV 175 Query: 491 STNVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRA 670 ++ SD D S NYRVTGVPADGRCLFRAIAHMACLRNG++APDENRQ ELADELRA Sbjct: 176 NS----SDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRA 231 Query: 671 QVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGE 781 QVV ELLKR+KE EWFIE DFDAYV+RI++PYVWGGE Sbjct: 232 QVVDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGE 268 >ref|XP_011101645.1| PREDICTED: OTU domain-containing protein 6B-like [Sesamum indicum] Length = 284 Score = 271 bits (694), Expect = 3e-70 Identities = 154/244 (63%), Positives = 171/244 (70%), Gaps = 5/244 (2%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRW-RSVVAGGGESFDYAGR-CRGGEASI 238 MLGVLC +RPRPWLLTSLSLSYAH S A DR RS + + +++ C GG ASI Sbjct: 1 MLGVLC-ARPRPWLLTSLSLSYAHGSAAAPFDRLTRSSLHPPRDPCNHSPPPCGGGAASI 59 Query: 239 WKVILPVG---RRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLYGVFA 409 W ILP RRTAV G E E K GGEGSWNVAWDARPARWLHHP+SAWLL+ Sbjct: 60 WHTILPSHWRRRRTAVLGRRERESVK--GGEGSWNVAWDARPARWLHHPESAWLLFA--- 114 Query: 410 CLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGVPADGRCLFRAI 589 A P +D SD + D K+D NYRVTGV ADGRCLFRA+ Sbjct: 115 --AAPAID-SDPIPNPAAEDELKSDVIC--------------NYRVTGVVADGRCLFRAV 157 Query: 590 AHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYV 769 AHMACLRNG++APDENRQ ELADELRAQVV+ELLKR+KEVEWFIEEDFD YVKRIQ+PYV Sbjct: 158 AHMACLRNGEEAPDENRQRELADELRAQVVEELLKRRKEVEWFIEEDFDVYVKRIQEPYV 217 Query: 770 WGGE 781 WGGE Sbjct: 218 WGGE 221 >ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606023 isoform X1 [Solanum tuberosum] Length = 338 Score = 270 bits (690), Expect = 7e-70 Identities = 159/277 (57%), Positives = 183/277 (66%), Gaps = 38/277 (13%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYSDRWRSVVA---------------GGGES 196 MLGVLC +RP+PWL SL LS+AH S+P+ YS + A GGG Sbjct: 1 MLGVLC-ARPKPWLFASLCLSHAHGSTPSGYSRLIATNTANKSSLLLISGGGSGGGGGTG 59 Query: 197 FD----YAGRCR--------GGEASIWKVILPVGRRT---------AVFGWHEHEVAKMA 313 D ++ CR GG ASIW ILP GRR VF H +E+AK Sbjct: 60 VDQRRNHSIHCRIASSVNRGGGAASIWHAILPAGRRNKKDINRRNNTVFK-HHYELAKK- 117 Query: 314 GGEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDY-SDFNSEVSTSDVDKADGF 490 GEGSWNV WD+RPARWLH+PDSAWLL+GV +CLA P LD D N +V+ +DK Sbjct: 118 -GEGSWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANFDVAVP-IDK---- 171 Query: 491 STNVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRA 670 + V SD D S NYRVTGVPADGRCLFRAIAHMACLRNG++APDENRQ ELADELRA Sbjct: 172 QSVVNSSDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRA 231 Query: 671 QVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGE 781 QVV ELLKR+KE EWFIE DFDAYV+RI++PYVWGGE Sbjct: 232 QVVDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGE 268 >ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245448 [Vitis vinifera] gi|296090402|emb|CBI40221.3| unnamed protein product [Vitis vinifera] Length = 317 Score = 248 bits (634), Expect = 2e-63 Identities = 143/255 (56%), Positives = 173/255 (67%), Gaps = 16/255 (6%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRSVVA------GGGESF---DYAGRC 217 MLGVLC +R +PW+L +LS + ++ ++ GGG+ ++ C Sbjct: 1 MLGVLC-ARHKPWILATLSFVHGSATHHHLHLNHHHLLGTPIQFNGGGDDHRRRHHSRAC 59 Query: 218 R-----GGEASIWKVILPVG--RRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHP 376 R GG ASIW ILP G RR+++ H+ GEGSWNVAWDARPARWLH P Sbjct: 60 RQGSSGGGAASIWHAILPSGGDRRSSLRPALLHDQK----GEGSWNVAWDARPARWLHRP 115 Query: 377 DSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGV 556 DSAWLL+GV ACLA LD D ++EV D DK +G + SD + S +YRVTGV Sbjct: 116 DSAWLLFGVCACLAP--LDSFDVDNEVVAVD-DKIEGCNQVNEISDENNNSSADYRVTGV 172 Query: 557 PADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFD 736 PADGRCLFRAIAH ACLR+G++APDENRQTELAD+LRAQVV ELLKR++E EWFIE +FD Sbjct: 173 PADGRCLFRAIAHSACLRSGEEAPDENRQTELADDLRAQVVDELLKRREETEWFIEGNFD 232 Query: 737 AYVKRIQQPYVWGGE 781 AYVKRIQQPYVWGGE Sbjct: 233 AYVKRIQQPYVWGGE 247 >ref|XP_007010219.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] gi|508727132|gb|EOY19029.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] Length = 327 Score = 246 bits (627), Expect = 1e-62 Identities = 141/264 (53%), Positives = 166/264 (62%), Gaps = 25/264 (9%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSL-------SYAHSSPAL-YSDRWRSVVAGGGESFDYAGRCR 220 MLGVLC P+PW+L SLSL ++ H S + + + + A ++ CR Sbjct: 1 MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHHSTACR 60 Query: 221 -----GGEASIWKVILPVGRRTAVFGWHEHEVAKMAG--GEGSWNVAWDARPARWLHHPD 379 GG ASIW ILP G G EV K GEGSWNVAWDARPARWLH PD Sbjct: 61 LGGSDGGAASIWHAILPCGGGGG--GRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPD 118 Query: 380 SAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAAD----------CC 529 SAWLL+GV ACLA P++++ D N + DK +G N+V +AD Sbjct: 119 SAWLLFGVCACLA-PMIEFVDVNPDAD----DKIEGAELNLVSRLSADEKSSSSSSSVAA 173 Query: 530 SPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEV 709 + N +VTGV ADGRCLFRAIAH ACLR+G+ APDEN Q ELADELRAQVV ELLKR++E Sbjct: 174 ADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRREET 233 Query: 710 EWFIEEDFDAYVKRIQQPYVWGGE 781 EWFIE DFDAYVK IQQPYVWGGE Sbjct: 234 EWFIEGDFDAYVKEIQQPYVWGGE 257 >emb|CDO99851.1| unnamed protein product [Coffea canephora] Length = 337 Score = 243 bits (620), Expect = 1e-61 Identities = 139/260 (53%), Positives = 165/260 (63%), Gaps = 21/260 (8%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPA-------LYSDRWRSVV-AGGGESFDYAGRCR 220 ML LC +RP+ WL T+L LS+AHSS A + S +SVV A + ++ CR Sbjct: 1 MLSALC-ARPKSWLFTALFLSHAHSSAAALVHNRLIGSPLLKSVVVANADQRRHHSSSCR 59 Query: 221 -------GGEASIWKVILPVG------RRTAVFGWHEHEVAKMAGGEGSWNVAWDARPAR 361 GG ASIW ILP G RT H M GEGSWNVAWDARPAR Sbjct: 60 LVDTSAQGGAASIWHAILPAGDGDLDLHRTKRNVLVHHHDELMNKGEGSWNVAWDARPAR 119 Query: 362 WLHHPDSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNY 541 WLH+ DSAWLL+GV ACLA P L +SE + D+ + + + C N+ Sbjct: 120 WLHNRDSAWLLFGVCACLAAPPLPLLADSSEFVDGETDEFRHEAAAMTVVENGKCA--NF 177 Query: 542 RVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFI 721 RVTGVPADGRCLFRAIAH+A LR G+ PDENRQ ELADELRA VV+ELLKR+K+ EWFI Sbjct: 178 RVTGVPADGRCLFRAIAHVAWLRKGESVPDENRQRELADELRALVVEELLKRRKDAEWFI 237 Query: 722 EEDFDAYVKRIQQPYVWGGE 781 E DFDAYV+RI++PYVWGGE Sbjct: 238 EGDFDAYVERIEKPYVWGGE 257 >ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793001 [Glycine max] Length = 296 Score = 241 bits (616), Expect = 3e-61 Identities = 137/247 (55%), Positives = 160/247 (64%), Gaps = 8/247 (3%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRSVVAGGGESFDYAGRCR-----GGE 229 MLGVLC +RP+PWLL SL + H+S S A ++ C+ G Sbjct: 1 MLGVLCATRPKPWLL---SLVHVHASLPRLPHSPLSPSASPPPRRRHSTACKLFLSGGAA 57 Query: 230 ASIWKVILPVGR---RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLYG 400 ASIW I+P G R V H+ + GEGSWNVAWDARPARWLH PDSAWLL+G Sbjct: 58 ASIWHAIMPRGDDGLRRGVVAVHDLK------GEGSWNVAWDARPARWLHRPDSAWLLFG 111 Query: 401 VFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGVPADGRCLF 580 V ACLA P D ++ + VD++ G D S +YRVTGVPADGRCLF Sbjct: 112 VCACLAPPP-GCVDADTNSAGIAVDESCGLLDKEREEDEV---SADYRVTGVPADGRCLF 167 Query: 581 RAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQ 760 RAIAH ACLRNG+KAPDENRQ ELADELRA+VV ELLKR++E EWFIE DFD Y++RIQQ Sbjct: 168 RAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQ 227 Query: 761 PYVWGGE 781 PYVWGGE Sbjct: 228 PYVWGGE 234 >ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Populus trichocarpa] gi|222865463|gb|EEF02594.1| hypothetical protein POPTR_0010s24050g [Populus trichocarpa] Length = 318 Score = 240 bits (613), Expect = 6e-61 Identities = 136/256 (53%), Positives = 161/256 (62%), Gaps = 17/256 (6%) Frame = +2 Query: 65 MLGVLCGSRPRP-WLLTSLSLSYAHSSPALYSDRWRSVVAGGGESF----DYAGRCR--- 220 MLGVLC +RP+P W+L SL + + ++ R + G S ++ C Sbjct: 1 MLGVLC-ARPKPNWILNSLFTHFHLNHHHHHNSNNRLSLHLSGSSTAARRHHSNLCSADS 59 Query: 221 --GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLL 394 GG A+IW VI P W + GEGSWN AWD RPARWLH PDSAWLL Sbjct: 60 GCGGAAAIWHVIQPAD-------WRRRTERRSVRGEGSWNAAWDGRPARWLHRPDSAWLL 112 Query: 395 YGVFACLALPLLDYSDFNS--EVSTSDVDKADGFSTNVVGSDAADCCSP-----NYRVTG 553 +GV ACLA + SD N+ +V + ++ DG N DA S +Y+VTG Sbjct: 113 FGVCACLAPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDAKQDNSDATVGSDYKVTG 172 Query: 554 VPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDF 733 V ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E EWFIE DF Sbjct: 173 VLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDF 232 Query: 734 DAYVKRIQQPYVWGGE 781 DAYVKRIQQPYVWGGE Sbjct: 233 DAYVKRIQQPYVWGGE 248 >ref|XP_002315401.2| hypothetical protein POPTR_0010s24050g [Populus trichocarpa] gi|550330486|gb|EEF01572.2| hypothetical protein POPTR_0010s24050g [Populus trichocarpa] Length = 303 Score = 240 bits (613), Expect = 6e-61 Identities = 136/256 (53%), Positives = 161/256 (62%), Gaps = 17/256 (6%) Frame = +2 Query: 65 MLGVLCGSRPRP-WLLTSLSLSYAHSSPALYSDRWRSVVAGGGESF----DYAGRCR--- 220 MLGVLC +RP+P W+L SL + + ++ R + G S ++ C Sbjct: 1 MLGVLC-ARPKPNWILNSLFTHFHLNHHHHHNSNNRLSLHLSGSSTAARRHHSNLCSADS 59 Query: 221 --GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLL 394 GG A+IW VI P W + GEGSWN AWD RPARWLH PDSAWLL Sbjct: 60 GCGGAAAIWHVIQPAD-------WRRRTERRSVRGEGSWNAAWDGRPARWLHRPDSAWLL 112 Query: 395 YGVFACLALPLLDYSDFNS--EVSTSDVDKADGFSTNVVGSDAADCCSP-----NYRVTG 553 +GV ACLA + SD N+ +V + ++ DG N DA S +Y+VTG Sbjct: 113 FGVCACLAPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDAKQDNSDATVGSDYKVTG 172 Query: 554 VPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDF 733 V ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E EWFIE DF Sbjct: 173 VLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDF 232 Query: 734 DAYVKRIQQPYVWGGE 781 DAYVKRIQQPYVWGGE Sbjct: 233 DAYVKRIQQPYVWGGE 248 >ref|XP_007010220.1| Cysteine proteinases superfamily protein isoform 2 [Theobroma cacao] gi|508727133|gb|EOY19030.1| Cysteine proteinases superfamily protein isoform 2 [Theobroma cacao] Length = 330 Score = 240 bits (613), Expect = 6e-61 Identities = 141/267 (52%), Positives = 166/267 (62%), Gaps = 28/267 (10%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSL-------SYAHSSPAL-YSDRWRSVVAGGGESFDYAGRCR 220 MLGVLC P+PW+L SLSL ++ H S + + + + A ++ CR Sbjct: 1 MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHHSTACR 60 Query: 221 -----GGEASIWKVILPVGRRTAVFGWHEHEVAKMAG--GEGSWNVAWDARPARWLHHPD 379 GG ASIW ILP G G EV K GEGSWNVAWDARPARWLH PD Sbjct: 61 LGGSDGGAASIWHAILPCGGGGG--GRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPD 118 Query: 380 SAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAAD----------CC 529 SAWLL+GV ACLA P++++ D N + DK +G N+V +AD Sbjct: 119 SAWLLFGVCACLA-PMIEFVDVNPDAD----DKIEGAELNLVSRLSADEKSSSSSSSVAA 173 Query: 530 SPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQ---VVQELLKRK 700 + N +VTGV ADGRCLFRAIAH ACLR+G+ APDEN Q ELADELRAQ VV ELLKR+ Sbjct: 174 ADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVSLVVNELLKRR 233 Query: 701 KEVEWFIEEDFDAYVKRIQQPYVWGGE 781 +E EWFIE DFDAYVK IQQPYVWGGE Sbjct: 234 EETEWFIEGDFDAYVKEIQQPYVWGGE 260 >ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125498 [Populus euphratica] Length = 320 Score = 240 bits (612), Expect = 8e-61 Identities = 135/258 (52%), Positives = 163/258 (63%), Gaps = 19/258 (7%) Frame = +2 Query: 65 MLGVLCGSRPRP-WLLTSL----SLSYAHSSPALYSDRWRSVVAGGGESF--DYAGRCR- 220 MLGVLC +RP+P W+L SL L++ H ++R ++G + ++ C Sbjct: 1 MLGVLC-ARPKPNWILNSLFTHFHLNHHHHQHHNSNNRLSLHLSGSSTAARRHHSSLCSA 59 Query: 221 ----GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAW 388 GG A+IW VI P W + GEGSWN AWD RPARWLH PDSAW Sbjct: 60 DSGCGGAAAIWHVIQPAD-------WRRRTERRSVRGEGSWNAAWDGRPARWLHRPDSAW 112 Query: 389 LLYGVFACLALPLLDYSDFNS--EVSTSDVDKADGFSTNVVGSDAADCCSPN-----YRV 547 LL+GV AC+ + SD N+ +V + ++ DG N DA S + Y+V Sbjct: 113 LLFGVCACVTPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDARQDSSDSTVGSDYKV 172 Query: 548 TGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEE 727 TGV ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E EWFIE Sbjct: 173 TGVLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEG 232 Query: 728 DFDAYVKRIQQPYVWGGE 781 DFDAYVKRIQQPYVWGGE Sbjct: 233 DFDAYVKRIQQPYVWGGE 250 >ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Populus trichocarpa] gi|222850861|gb|EEE88408.1| hypothetical protein POPTR_0008s02620g [Populus trichocarpa] Length = 326 Score = 240 bits (612), Expect = 8e-61 Identities = 137/264 (51%), Positives = 161/264 (60%), Gaps = 25/264 (9%) Frame = +2 Query: 65 MLGVLCGSRPRP-WLLTSLSLSYAHSSP--------ALYSDRWRSVVAGGGESFDYAGRC 217 MLGVLC +RP+P W+L SL + H +L+ + SF A Sbjct: 1 MLGVLC-ARPKPNWILNSLFTHFHHQHHHHQSNDRLSLHLPHSFTAARRHHSSFCSADCG 59 Query: 218 RGGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLY 397 GG A+IW V+ P W + GEGSWNVAWD RPARWLH PDSAWLL+ Sbjct: 60 GGGAAAIWHVVQPAD-------WRRRRGRRSVRGEGSWNVAWDGRPARWLHRPDSAWLLF 112 Query: 398 GVFACLALPLLDYSDFNSE--------VSTSDVDKADGFSTNV--VGSD------AADCC 529 GV ACLA + + D N E V + ++ DG N V SD ++ Sbjct: 113 GVCACLAPAIELFCDVNIEGGENVVVDVDHQEKERIDGGDLNASAVNSDDVKQDSSSSTA 172 Query: 530 SPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEV 709 +Y+VTGV ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E Sbjct: 173 GSDYKVTGVLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREET 232 Query: 710 EWFIEEDFDAYVKRIQQPYVWGGE 781 EWFIE DFDAYVKRIQQPYVWGGE Sbjct: 233 EWFIEGDFDAYVKRIQQPYVWGGE 256 >ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis melo] Length = 313 Score = 239 bits (611), Expect = 1e-60 Identities = 139/255 (54%), Positives = 171/255 (67%), Gaps = 16/255 (6%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRS-VVAGGGESFD-----YAGRCR-- 220 MLGVLC +RP+PW+L SLS ++ H S + +S ++ FD ++ C+ Sbjct: 1 MLGVLC-ARPKPWILVSLS-NFIHGSAVYHHHHHQSRLLVQSPIQFDRRQRHHSSACKLA 58 Query: 221 -GGEASIWKVILPVGR-------RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHP 376 GG ASIW ILP G R A+ H HE GEGSWNVAWDARPARWLH P Sbjct: 59 GGGAASIWHAILPSGAGSSSNLCRPAI---HCHE----RKGEGSWNVAWDARPARWLHRP 111 Query: 377 DSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGV 556 DSAWLL+GV AC+A LD+ D + E + D K + ++ + D S +YRVTGV Sbjct: 112 DSAWLLFGVCACIAP--LDWVDASHEAVSLD-QKKEVCESSGPEFNQNDESSADYRVTGV 168 Query: 557 PADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFD 736 ADGRCLFRAIAH ACLR+G++APD++RQ ELADELRA+VV ELLKR+KE EW+IE DFD Sbjct: 169 LADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFD 228 Query: 737 AYVKRIQQPYVWGGE 781 AYVKRIQQP+VWGGE Sbjct: 229 AYVKRIQQPFVWGGE 243 >ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris] gi|561017018|gb|ESW15822.1| hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris] Length = 305 Score = 239 bits (611), Expect = 1e-60 Identities = 136/245 (55%), Positives = 158/245 (64%), Gaps = 6/245 (2%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSY---AHSSPALYSDRWRSVVAGGGESFDYAGRCRGGEAS 235 MLGVLC +RPRPWL + + S H+S +L + R + + F AG G AS Sbjct: 17 MLGVLCATRPRPWLFSHVHASLPRLVHASVSLSASPPRRHHSSACKIFGSAG----GAAS 72 Query: 236 IWKVILPVGR---RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLYGVF 406 IW I+P R V H+ + GEGSWNVAWD RPARWLH PDSAWLL+GV Sbjct: 73 IWHAIMPRSGDRFRRGVVPVHDLK------GEGSWNVAWDTRPARWLHRPDSAWLLFGVC 126 Query: 407 ACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGVPADGRCLFRA 586 ACLA P D ++ VD++ G +D AD YRVTGVPADGRCLFRA Sbjct: 127 ACLAPP--GCVDVVTDFEAVAVDESCGVLKVEASADYAD-----YRVTGVPADGRCLFRA 179 Query: 587 IAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPY 766 IAH CLRNG+KAPDEN Q ELADELRA+VV ELLKR++E EWFIE DFD YVKRIQQP+ Sbjct: 180 IAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKRIQQPF 239 Query: 767 VWGGE 781 VWGGE Sbjct: 240 VWGGE 244 >ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis sativus] gi|700197033|gb|KGN52210.1| hypothetical protein Csa_5G615810 [Cucumis sativus] Length = 313 Score = 239 bits (609), Expect = 2e-60 Identities = 138/255 (54%), Positives = 171/255 (67%), Gaps = 16/255 (6%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRS-VVAGGGESFD-----YAGRCR-- 220 MLGVLC +RP+PW+L SLS ++ H S + +S ++ FD ++ C+ Sbjct: 1 MLGVLC-ARPKPWILVSLS-NFIHGSAVYHHHHHQSRLLVQSPIQFDRRQRHHSSACKLA 58 Query: 221 -GGEASIWKVILPVGR-------RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHP 376 GG ASIW I+P G R A+ H HE GEGSWNVAWDARPARWLH P Sbjct: 59 GGGAASIWHAIMPSGAGSSSNLCRPAI---HCHE----RKGEGSWNVAWDARPARWLHRP 111 Query: 377 DSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGV 556 DSAWLL+GV AC+A LD+ D + E + D K + ++ + D S +YRVTGV Sbjct: 112 DSAWLLFGVCACIAP--LDWVDASHEAVSLD-QKKEVCESSGPEFNQNDESSADYRVTGV 168 Query: 557 PADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFD 736 ADGRCLFRAIAH ACLR+G++APD++RQ ELADELRA+VV ELLKR+KE EW+IE DFD Sbjct: 169 LADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFD 228 Query: 737 AYVKRIQQPYVWGGE 781 AYVKRIQQP+VWGGE Sbjct: 229 AYVKRIQQPFVWGGE 243 >ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3g57810-like [Glycine max] gi|734312743|gb|KHN00921.1| OTU domain-containing protein [Glycine soja] Length = 294 Score = 238 bits (607), Expect = 3e-60 Identities = 135/250 (54%), Positives = 160/250 (64%), Gaps = 11/250 (4%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRSVVAGGGESFDYAGRCR-----GGE 229 MLGVLC +R +PWL S H+S S S A ++ C+ GG Sbjct: 1 MLGVLCATRSKPWLF-----SLVHASLPRLSHAPLSPSASPPPRRRHSTACKLFLSAGGA 55 Query: 230 ASIWKVILPV-----GRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLL 394 ASIW I+P G R V +H+ + GEGSWNVAWDARPARWLH PDSAWLL Sbjct: 56 ASIWHAIMPRVNDDDGFRRGVVAFHDMK------GEGSWNVAWDARPARWLHRPDSAWLL 109 Query: 395 YGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADC-CSPNYRVTGVPADGR 571 +GV ACLA P D ++ VD+ S ++ + + S +YRVTGVPADGR Sbjct: 110 FGVCACLAPPS-SCVDADTNTDAIAVDE----SCRLLDKEREEYEVSADYRVTGVPADGR 164 Query: 572 CLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKR 751 CLFRAIAH ACLRNG+KAPDENRQ ELADELRA+VV EL+KR++E EWFIE DFD YV+R Sbjct: 165 CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYVQR 224 Query: 752 IQQPYVWGGE 781 IQQPYVWGGE Sbjct: 225 IQQPYVWGGE 234 >gb|KJB73389.1| hypothetical protein B456_011G230700 [Gossypium raimondii] Length = 263 Score = 237 bits (605), Expect = 5e-60 Identities = 139/257 (54%), Positives = 164/257 (63%), Gaps = 18/257 (7%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDR-----WRS----VVAGGGESFDYAGRC 217 MLGVLC P+PW+L SLSL AH A + W S + A ++ C Sbjct: 1 MLGVLCARPPKPWILNSLSL-IAHGGSAAHHHENRLLHWPSHFADLSAANRRCRHHSTAC 59 Query: 218 R------GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPD 379 R GG ASIW ILP G V + GEGSWNV+WDARPARWL D Sbjct: 60 RLGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDARPARWLRS-D 118 Query: 380 SAWLLYGVFACLA-LPLLDYSDFNSEVS--TSDVDKADGFSTNVVGSDAADCCSPNYRVT 550 SAWLL+GV ACLA +P+ ++ D N + T +D S+N + S AA + NY+VT Sbjct: 119 SAWLLFGVCACLAPMPMDEFDDVNLDADNKTDASLNSDENSSNHLSSVAA---ADNYKVT 175 Query: 551 GVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEED 730 G+ ADGRCLFRAIAH ACLR+G++APDENRQ ELADELRAQVV ELLKR++E EWFIE D Sbjct: 176 GILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNELLKRREETEWFIEGD 235 Query: 731 FDAYVKRIQQPYVWGGE 781 FDAYVK IQQPYVWGGE Sbjct: 236 FDAYVKEIQQPYVWGGE 252 >ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777394 [Gossypium raimondii] gi|763806450|gb|KJB73388.1| hypothetical protein B456_011G230700 [Gossypium raimondii] Length = 319 Score = 237 bits (605), Expect = 5e-60 Identities = 139/257 (54%), Positives = 164/257 (63%), Gaps = 18/257 (7%) Frame = +2 Query: 65 MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDR-----WRS----VVAGGGESFDYAGRC 217 MLGVLC P+PW+L SLSL AH A + W S + A ++ C Sbjct: 1 MLGVLCARPPKPWILNSLSL-IAHGGSAAHHHENRLLHWPSHFADLSAANRRCRHHSTAC 59 Query: 218 R------GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPD 379 R GG ASIW ILP G V + GEGSWNV+WDARPARWL D Sbjct: 60 RLGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDARPARWLRS-D 118 Query: 380 SAWLLYGVFACLA-LPLLDYSDFNSEVS--TSDVDKADGFSTNVVGSDAADCCSPNYRVT 550 SAWLL+GV ACLA +P+ ++ D N + T +D S+N + S AA + NY+VT Sbjct: 119 SAWLLFGVCACLAPMPMDEFDDVNLDADNKTDASLNSDENSSNHLSSVAA---ADNYKVT 175 Query: 551 GVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEED 730 G+ ADGRCLFRAIAH ACLR+G++APDENRQ ELADELRAQVV ELLKR++E EWFIE D Sbjct: 176 GILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNELLKRREETEWFIEGD 235 Query: 731 FDAYVKRIQQPYVWGGE 781 FDAYVK IQQPYVWGGE Sbjct: 236 FDAYVKEIQQPYVWGGE 252