BLASTX nr result

ID: Forsythia22_contig00025637 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00025637
         (783 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240...   306   7e-81
ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098...   306   9e-81
ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253...   273   7e-71
ref|XP_011101645.1| PREDICTED: OTU domain-containing protein 6B-...   271   3e-70
ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606...   270   7e-70
ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245...   248   2e-63
ref|XP_007010219.1| Cysteine proteinases superfamily protein iso...   246   1e-62
emb|CDO99851.1| unnamed protein product [Coffea canephora]            243   1e-61
ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793...   241   3e-61
ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Popu...   240   6e-61
ref|XP_002315401.2| hypothetical protein POPTR_0010s24050g [Popu...   240   6e-61
ref|XP_007010220.1| Cysteine proteinases superfamily protein iso...   240   6e-61
ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125...   240   8e-61
ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Popu...   240   8e-61
ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3...   239   1e-60
ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phas...   239   1e-60
ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3...   239   2e-60
ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3...   238   3e-60
gb|KJB73389.1| hypothetical protein B456_011G230700 [Gossypium r...   237   5e-60
ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777...   237   5e-60

>ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240043 [Nicotiana
           sylvestris]
          Length = 328

 Score =  306 bits (785), Expect = 7e-81
 Identities = 163/261 (62%), Positives = 187/261 (71%), Gaps = 22/261 (8%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYS----DRWRSVVAGGGESFD---YAGRCR 220
           MLGVLC +RP+PWL  SLSLS+AH S+PA Y+       +SV+ GG +      ++  CR
Sbjct: 1   MLGVLC-ARPKPWLFASLSLSHAHGSAPAAYNRLIGTPTKSVLVGGSDQLQRRHHSSHCR 59

Query: 221 -------GGEASIWKVILPVGRRT-------AVFGWHEHEVAKMAGGEGSWNVAWDARPA 358
                  GG ASIW  ILP GRR         VF  H +E+AK   GEGSWNVAWD RPA
Sbjct: 60  LGASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYELAKK--GEGSWNVAWDTRPA 117

Query: 359 RWLHHPDSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPN 538
           RWLH+PDSAWLL+GV +CLA P LD  D NS+V     + + GFS+N V SD AD  S N
Sbjct: 118 RWLHNPDSAWLLFGVCSCLAAPSLDLPDSNSDVVAPIENMSQGFSSNTVNSDEADRNSAN 177

Query: 539 YRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWF 718
           Y VTGVPADGRCLFRAIAHMACLRNG+ APDENRQ ELADELRAQVV ELLKR+KE EWF
Sbjct: 178 YTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEAEWF 237

Query: 719 IEEDFDAYVKRIQQPYVWGGE 781
           IE DFDAYV+RI++PYVWGGE
Sbjct: 238 IEGDFDAYVERIEKPYVWGGE 258


>ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098494 [Nicotiana
           tomentosiformis]
          Length = 328

 Score =  306 bits (784), Expect = 9e-81
 Identities = 163/261 (62%), Positives = 186/261 (71%), Gaps = 22/261 (8%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYS----DRWRSVVAGGGESFD---YAGRCR 220
           MLGVLC +RP+PWL  SLSLS+AH S+PA Y+       +SV+ GG +      ++  CR
Sbjct: 1   MLGVLC-ARPKPWLFASLSLSHAHGSAPAAYNRLIGTPTKSVLVGGSDQLQRRHHSSHCR 59

Query: 221 -------GGEASIWKVILPVGRRT-------AVFGWHEHEVAKMAGGEGSWNVAWDARPA 358
                  GG ASIW  ILP GRR         VF  H + +AK   GEGSWNVAWD RPA
Sbjct: 60  LGASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYVLAKK--GEGSWNVAWDTRPA 117

Query: 359 RWLHHPDSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPN 538
           RWLH+PDSAWLL+GV +CLA P LD  D NSEV     +K+ GFS+N V SD  D  S N
Sbjct: 118 RWLHNPDSAWLLFGVCSCLAAPTLDLPDSNSEVVAPIENKSQGFSSNTVNSDEVDRNSAN 177

Query: 539 YRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWF 718
           Y VTGVPADGRCLFRAIAHMACLRNG+ APDENRQ ELADELRAQVV ELLKR+KE EWF
Sbjct: 178 YTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEAEWF 237

Query: 719 IEEDFDAYVKRIQQPYVWGGE 781
           IE DFDAYV+RI++PYVWGGE
Sbjct: 238 IEGDFDAYVERIEKPYVWGGE 258


>ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253339 [Solanum
           lycopersicum]
          Length = 338

 Score =  273 bits (699), Expect = 7e-71
 Identities = 157/277 (56%), Positives = 187/277 (67%), Gaps = 38/277 (13%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYSDRWRS---------VVAGGG-------- 190
           MLGVLC +RP+PWL  SL LS+AH S+P+ YS    +         +++GGG        
Sbjct: 1   MLGVLC-ARPKPWLFASLCLSHAHGSTPSGYSRLIPTNTANKSSLLLISGGGGGGGGGIG 59

Query: 191 --ESFDYAGRCR--------GGEASIWKVILPVGRRT---------AVFGWHEHEVAKMA 313
             +  +++  CR        GG ASIW  ILP GRR           VF  H +E+AK  
Sbjct: 60  VDQRRNHSSHCRIASSVNRVGGAASIWHAILPAGRRNKKDINRRNNTVFK-HHYELAKK- 117

Query: 314 GGEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDY-SDFNSEVSTSDVDKADGF 490
            GEGSWNV WD+RPARWLH+PDSAWLL+GV +CLA P LD   D NS+V+   +DK    
Sbjct: 118 -GEGSWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANSDVAVP-IDKQSAV 175

Query: 491 STNVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRA 670
           ++    SD  D  S NYRVTGVPADGRCLFRAIAHMACLRNG++APDENRQ ELADELRA
Sbjct: 176 NS----SDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRA 231

Query: 671 QVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGE 781
           QVV ELLKR+KE EWFIE DFDAYV+RI++PYVWGGE
Sbjct: 232 QVVDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGE 268


>ref|XP_011101645.1| PREDICTED: OTU domain-containing protein 6B-like [Sesamum indicum]
          Length = 284

 Score =  271 bits (694), Expect = 3e-70
 Identities = 154/244 (63%), Positives = 171/244 (70%), Gaps = 5/244 (2%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRW-RSVVAGGGESFDYAGR-CRGGEASI 238
           MLGVLC +RPRPWLLTSLSLSYAH S A   DR  RS +    +  +++   C GG ASI
Sbjct: 1   MLGVLC-ARPRPWLLTSLSLSYAHGSAAAPFDRLTRSSLHPPRDPCNHSPPPCGGGAASI 59

Query: 239 WKVILPVG---RRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLYGVFA 409
           W  ILP     RRTAV G  E E  K  GGEGSWNVAWDARPARWLHHP+SAWLL+    
Sbjct: 60  WHTILPSHWRRRRTAVLGRRERESVK--GGEGSWNVAWDARPARWLHHPESAWLLFA--- 114

Query: 410 CLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGVPADGRCLFRAI 589
             A P +D SD     +  D  K+D                 NYRVTGV ADGRCLFRA+
Sbjct: 115 --AAPAID-SDPIPNPAAEDELKSDVIC--------------NYRVTGVVADGRCLFRAV 157

Query: 590 AHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYV 769
           AHMACLRNG++APDENRQ ELADELRAQVV+ELLKR+KEVEWFIEEDFD YVKRIQ+PYV
Sbjct: 158 AHMACLRNGEEAPDENRQRELADELRAQVVEELLKRRKEVEWFIEEDFDVYVKRIQEPYV 217

Query: 770 WGGE 781
           WGGE
Sbjct: 218 WGGE 221


>ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606023 isoform X1 [Solanum
           tuberosum]
          Length = 338

 Score =  270 bits (690), Expect = 7e-70
 Identities = 159/277 (57%), Positives = 183/277 (66%), Gaps = 38/277 (13%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAH-SSPALYSDRWRSVVA---------------GGGES 196
           MLGVLC +RP+PWL  SL LS+AH S+P+ YS    +  A               GGG  
Sbjct: 1   MLGVLC-ARPKPWLFASLCLSHAHGSTPSGYSRLIATNTANKSSLLLISGGGSGGGGGTG 59

Query: 197 FD----YAGRCR--------GGEASIWKVILPVGRRT---------AVFGWHEHEVAKMA 313
            D    ++  CR        GG ASIW  ILP GRR           VF  H +E+AK  
Sbjct: 60  VDQRRNHSIHCRIASSVNRGGGAASIWHAILPAGRRNKKDINRRNNTVFK-HHYELAKK- 117

Query: 314 GGEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDY-SDFNSEVSTSDVDKADGF 490
            GEGSWNV WD+RPARWLH+PDSAWLL+GV +CLA P LD   D N +V+   +DK    
Sbjct: 118 -GEGSWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANFDVAVP-IDK---- 171

Query: 491 STNVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRA 670
            + V  SD  D  S NYRVTGVPADGRCLFRAIAHMACLRNG++APDENRQ ELADELRA
Sbjct: 172 QSVVNSSDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRA 231

Query: 671 QVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGE 781
           QVV ELLKR+KE EWFIE DFDAYV+RI++PYVWGGE
Sbjct: 232 QVVDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGE 268


>ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245448 [Vitis vinifera]
           gi|296090402|emb|CBI40221.3| unnamed protein product
           [Vitis vinifera]
          Length = 317

 Score =  248 bits (634), Expect = 2e-63
 Identities = 143/255 (56%), Positives = 173/255 (67%), Gaps = 16/255 (6%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRSVVA------GGGESF---DYAGRC 217
           MLGVLC +R +PW+L +LS  +  ++          ++       GGG+      ++  C
Sbjct: 1   MLGVLC-ARHKPWILATLSFVHGSATHHHLHLNHHHLLGTPIQFNGGGDDHRRRHHSRAC 59

Query: 218 R-----GGEASIWKVILPVG--RRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHP 376
           R     GG ASIW  ILP G  RR+++     H+      GEGSWNVAWDARPARWLH P
Sbjct: 60  RQGSSGGGAASIWHAILPSGGDRRSSLRPALLHDQK----GEGSWNVAWDARPARWLHRP 115

Query: 377 DSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGV 556
           DSAWLL+GV ACLA   LD  D ++EV   D DK +G +     SD  +  S +YRVTGV
Sbjct: 116 DSAWLLFGVCACLAP--LDSFDVDNEVVAVD-DKIEGCNQVNEISDENNNSSADYRVTGV 172

Query: 557 PADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFD 736
           PADGRCLFRAIAH ACLR+G++APDENRQTELAD+LRAQVV ELLKR++E EWFIE +FD
Sbjct: 173 PADGRCLFRAIAHSACLRSGEEAPDENRQTELADDLRAQVVDELLKRREETEWFIEGNFD 232

Query: 737 AYVKRIQQPYVWGGE 781
           AYVKRIQQPYVWGGE
Sbjct: 233 AYVKRIQQPYVWGGE 247


>ref|XP_007010219.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma
           cacao] gi|508727132|gb|EOY19029.1| Cysteine proteinases
           superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  246 bits (627), Expect = 1e-62
 Identities = 141/264 (53%), Positives = 166/264 (62%), Gaps = 25/264 (9%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSL-------SYAHSSPAL-YSDRWRSVVAGGGESFDYAGRCR 220
           MLGVLC   P+PW+L SLSL       ++ H S  + +   +  + A       ++  CR
Sbjct: 1   MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHHSTACR 60

Query: 221 -----GGEASIWKVILPVGRRTAVFGWHEHEVAKMAG--GEGSWNVAWDARPARWLHHPD 379
                GG ASIW  ILP G      G    EV K     GEGSWNVAWDARPARWLH PD
Sbjct: 61  LGGSDGGAASIWHAILPCGGGGG--GRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPD 118

Query: 380 SAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAAD----------CC 529
           SAWLL+GV ACLA P++++ D N +      DK +G   N+V   +AD            
Sbjct: 119 SAWLLFGVCACLA-PMIEFVDVNPDAD----DKIEGAELNLVSRLSADEKSSSSSSSVAA 173

Query: 530 SPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEV 709
           + N +VTGV ADGRCLFRAIAH ACLR+G+ APDEN Q ELADELRAQVV ELLKR++E 
Sbjct: 174 ADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRREET 233

Query: 710 EWFIEEDFDAYVKRIQQPYVWGGE 781
           EWFIE DFDAYVK IQQPYVWGGE
Sbjct: 234 EWFIEGDFDAYVKEIQQPYVWGGE 257


>emb|CDO99851.1| unnamed protein product [Coffea canephora]
          Length = 337

 Score =  243 bits (620), Expect = 1e-61
 Identities = 139/260 (53%), Positives = 165/260 (63%), Gaps = 21/260 (8%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPA-------LYSDRWRSVV-AGGGESFDYAGRCR 220
           ML  LC +RP+ WL T+L LS+AHSS A       + S   +SVV A   +   ++  CR
Sbjct: 1   MLSALC-ARPKSWLFTALFLSHAHSSAAALVHNRLIGSPLLKSVVVANADQRRHHSSSCR 59

Query: 221 -------GGEASIWKVILPVG------RRTAVFGWHEHEVAKMAGGEGSWNVAWDARPAR 361
                  GG ASIW  ILP G       RT       H    M  GEGSWNVAWDARPAR
Sbjct: 60  LVDTSAQGGAASIWHAILPAGDGDLDLHRTKRNVLVHHHDELMNKGEGSWNVAWDARPAR 119

Query: 362 WLHHPDSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNY 541
           WLH+ DSAWLL+GV ACLA P L     +SE    + D+    +  +   +   C   N+
Sbjct: 120 WLHNRDSAWLLFGVCACLAAPPLPLLADSSEFVDGETDEFRHEAAAMTVVENGKCA--NF 177

Query: 542 RVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFI 721
           RVTGVPADGRCLFRAIAH+A LR G+  PDENRQ ELADELRA VV+ELLKR+K+ EWFI
Sbjct: 178 RVTGVPADGRCLFRAIAHVAWLRKGESVPDENRQRELADELRALVVEELLKRRKDAEWFI 237

Query: 722 EEDFDAYVKRIQQPYVWGGE 781
           E DFDAYV+RI++PYVWGGE
Sbjct: 238 EGDFDAYVERIEKPYVWGGE 257


>ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793001 [Glycine max]
          Length = 296

 Score =  241 bits (616), Expect = 3e-61
 Identities = 137/247 (55%), Positives = 160/247 (64%), Gaps = 8/247 (3%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRSVVAGGGESFDYAGRCR-----GGE 229
           MLGVLC +RP+PWLL   SL + H+S         S  A       ++  C+     G  
Sbjct: 1   MLGVLCATRPKPWLL---SLVHVHASLPRLPHSPLSPSASPPPRRRHSTACKLFLSGGAA 57

Query: 230 ASIWKVILPVGR---RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLYG 400
           ASIW  I+P G    R  V   H+ +      GEGSWNVAWDARPARWLH PDSAWLL+G
Sbjct: 58  ASIWHAIMPRGDDGLRRGVVAVHDLK------GEGSWNVAWDARPARWLHRPDSAWLLFG 111

Query: 401 VFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGVPADGRCLF 580
           V ACLA P     D ++  +   VD++ G        D     S +YRVTGVPADGRCLF
Sbjct: 112 VCACLAPPP-GCVDADTNSAGIAVDESCGLLDKEREEDEV---SADYRVTGVPADGRCLF 167

Query: 581 RAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQ 760
           RAIAH ACLRNG+KAPDENRQ ELADELRA+VV ELLKR++E EWFIE DFD Y++RIQQ
Sbjct: 168 RAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQ 227

Query: 761 PYVWGGE 781
           PYVWGGE
Sbjct: 228 PYVWGGE 234


>ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Populus trichocarpa]
           gi|222865463|gb|EEF02594.1| hypothetical protein
           POPTR_0010s24050g [Populus trichocarpa]
          Length = 318

 Score =  240 bits (613), Expect = 6e-61
 Identities = 136/256 (53%), Positives = 161/256 (62%), Gaps = 17/256 (6%)
 Frame = +2

Query: 65  MLGVLCGSRPRP-WLLTSLSLSYAHSSPALYSDRWRSVVAGGGESF----DYAGRCR--- 220
           MLGVLC +RP+P W+L SL   +  +    ++   R  +   G S      ++  C    
Sbjct: 1   MLGVLC-ARPKPNWILNSLFTHFHLNHHHHHNSNNRLSLHLSGSSTAARRHHSNLCSADS 59

Query: 221 --GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLL 394
             GG A+IW VI P         W      +   GEGSWN AWD RPARWLH PDSAWLL
Sbjct: 60  GCGGAAAIWHVIQPAD-------WRRRTERRSVRGEGSWNAAWDGRPARWLHRPDSAWLL 112

Query: 395 YGVFACLALPLLDYSDFNS--EVSTSDVDKADGFSTNVVGSDAADCCSP-----NYRVTG 553
           +GV ACLA  +   SD N+  +V   + ++ DG   N    DA    S      +Y+VTG
Sbjct: 113 FGVCACLAPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDAKQDNSDATVGSDYKVTG 172

Query: 554 VPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDF 733
           V ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E EWFIE DF
Sbjct: 173 VLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDF 232

Query: 734 DAYVKRIQQPYVWGGE 781
           DAYVKRIQQPYVWGGE
Sbjct: 233 DAYVKRIQQPYVWGGE 248


>ref|XP_002315401.2| hypothetical protein POPTR_0010s24050g [Populus trichocarpa]
           gi|550330486|gb|EEF01572.2| hypothetical protein
           POPTR_0010s24050g [Populus trichocarpa]
          Length = 303

 Score =  240 bits (613), Expect = 6e-61
 Identities = 136/256 (53%), Positives = 161/256 (62%), Gaps = 17/256 (6%)
 Frame = +2

Query: 65  MLGVLCGSRPRP-WLLTSLSLSYAHSSPALYSDRWRSVVAGGGESF----DYAGRCR--- 220
           MLGVLC +RP+P W+L SL   +  +    ++   R  +   G S      ++  C    
Sbjct: 1   MLGVLC-ARPKPNWILNSLFTHFHLNHHHHHNSNNRLSLHLSGSSTAARRHHSNLCSADS 59

Query: 221 --GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLL 394
             GG A+IW VI P         W      +   GEGSWN AWD RPARWLH PDSAWLL
Sbjct: 60  GCGGAAAIWHVIQPAD-------WRRRTERRSVRGEGSWNAAWDGRPARWLHRPDSAWLL 112

Query: 395 YGVFACLALPLLDYSDFNS--EVSTSDVDKADGFSTNVVGSDAADCCSP-----NYRVTG 553
           +GV ACLA  +   SD N+  +V   + ++ DG   N    DA    S      +Y+VTG
Sbjct: 113 FGVCACLAPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDAKQDNSDATVGSDYKVTG 172

Query: 554 VPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDF 733
           V ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E EWFIE DF
Sbjct: 173 VLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDF 232

Query: 734 DAYVKRIQQPYVWGGE 781
           DAYVKRIQQPYVWGGE
Sbjct: 233 DAYVKRIQQPYVWGGE 248


>ref|XP_007010220.1| Cysteine proteinases superfamily protein isoform 2 [Theobroma
           cacao] gi|508727133|gb|EOY19030.1| Cysteine proteinases
           superfamily protein isoform 2 [Theobroma cacao]
          Length = 330

 Score =  240 bits (613), Expect = 6e-61
 Identities = 141/267 (52%), Positives = 166/267 (62%), Gaps = 28/267 (10%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSL-------SYAHSSPAL-YSDRWRSVVAGGGESFDYAGRCR 220
           MLGVLC   P+PW+L SLSL       ++ H S  + +   +  + A       ++  CR
Sbjct: 1   MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHHSTACR 60

Query: 221 -----GGEASIWKVILPVGRRTAVFGWHEHEVAKMAG--GEGSWNVAWDARPARWLHHPD 379
                GG ASIW  ILP G      G    EV K     GEGSWNVAWDARPARWLH PD
Sbjct: 61  LGGSDGGAASIWHAILPCGGGGG--GRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPD 118

Query: 380 SAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAAD----------CC 529
           SAWLL+GV ACLA P++++ D N +      DK +G   N+V   +AD            
Sbjct: 119 SAWLLFGVCACLA-PMIEFVDVNPDAD----DKIEGAELNLVSRLSADEKSSSSSSSVAA 173

Query: 530 SPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQ---VVQELLKRK 700
           + N +VTGV ADGRCLFRAIAH ACLR+G+ APDEN Q ELADELRAQ   VV ELLKR+
Sbjct: 174 ADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVSLVVNELLKRR 233

Query: 701 KEVEWFIEEDFDAYVKRIQQPYVWGGE 781
           +E EWFIE DFDAYVK IQQPYVWGGE
Sbjct: 234 EETEWFIEGDFDAYVKEIQQPYVWGGE 260


>ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125498 [Populus
           euphratica]
          Length = 320

 Score =  240 bits (612), Expect = 8e-61
 Identities = 135/258 (52%), Positives = 163/258 (63%), Gaps = 19/258 (7%)
 Frame = +2

Query: 65  MLGVLCGSRPRP-WLLTSL----SLSYAHSSPALYSDRWRSVVAGGGESF--DYAGRCR- 220
           MLGVLC +RP+P W+L SL     L++ H      ++R    ++G   +    ++  C  
Sbjct: 1   MLGVLC-ARPKPNWILNSLFTHFHLNHHHHQHHNSNNRLSLHLSGSSTAARRHHSSLCSA 59

Query: 221 ----GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAW 388
               GG A+IW VI P         W      +   GEGSWN AWD RPARWLH PDSAW
Sbjct: 60  DSGCGGAAAIWHVIQPAD-------WRRRTERRSVRGEGSWNAAWDGRPARWLHRPDSAW 112

Query: 389 LLYGVFACLALPLLDYSDFNS--EVSTSDVDKADGFSTNVVGSDAADCCSPN-----YRV 547
           LL+GV AC+   +   SD N+  +V   + ++ DG   N    DA    S +     Y+V
Sbjct: 113 LLFGVCACVTPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDARQDSSDSTVGSDYKV 172

Query: 548 TGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEE 727
           TGV ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E EWFIE 
Sbjct: 173 TGVLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEG 232

Query: 728 DFDAYVKRIQQPYVWGGE 781
           DFDAYVKRIQQPYVWGGE
Sbjct: 233 DFDAYVKRIQQPYVWGGE 250


>ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Populus trichocarpa]
           gi|222850861|gb|EEE88408.1| hypothetical protein
           POPTR_0008s02620g [Populus trichocarpa]
          Length = 326

 Score =  240 bits (612), Expect = 8e-61
 Identities = 137/264 (51%), Positives = 161/264 (60%), Gaps = 25/264 (9%)
 Frame = +2

Query: 65  MLGVLCGSRPRP-WLLTSLSLSYAHSSP--------ALYSDRWRSVVAGGGESFDYAGRC 217
           MLGVLC +RP+P W+L SL   + H           +L+     +       SF  A   
Sbjct: 1   MLGVLC-ARPKPNWILNSLFTHFHHQHHHHQSNDRLSLHLPHSFTAARRHHSSFCSADCG 59

Query: 218 RGGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLY 397
            GG A+IW V+ P         W      +   GEGSWNVAWD RPARWLH PDSAWLL+
Sbjct: 60  GGGAAAIWHVVQPAD-------WRRRRGRRSVRGEGSWNVAWDGRPARWLHRPDSAWLLF 112

Query: 398 GVFACLALPLLDYSDFNSE--------VSTSDVDKADGFSTNV--VGSD------AADCC 529
           GV ACLA  +  + D N E        V   + ++ DG   N   V SD      ++   
Sbjct: 113 GVCACLAPAIELFCDVNIEGGENVVVDVDHQEKERIDGGDLNASAVNSDDVKQDSSSSTA 172

Query: 530 SPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEV 709
             +Y+VTGV ADGRCLFRAIAHMACLRNG++APDENRQ ELADELRAQVV ELLKR++E 
Sbjct: 173 GSDYKVTGVLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREET 232

Query: 710 EWFIEEDFDAYVKRIQQPYVWGGE 781
           EWFIE DFDAYVKRIQQPYVWGGE
Sbjct: 233 EWFIEGDFDAYVKRIQQPYVWGGE 256


>ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
           melo]
          Length = 313

 Score =  239 bits (611), Expect = 1e-60
 Identities = 139/255 (54%), Positives = 171/255 (67%), Gaps = 16/255 (6%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRS-VVAGGGESFD-----YAGRCR-- 220
           MLGVLC +RP+PW+L SLS ++ H S   +    +S ++      FD     ++  C+  
Sbjct: 1   MLGVLC-ARPKPWILVSLS-NFIHGSAVYHHHHHQSRLLVQSPIQFDRRQRHHSSACKLA 58

Query: 221 -GGEASIWKVILPVGR-------RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHP 376
            GG ASIW  ILP G        R A+   H HE      GEGSWNVAWDARPARWLH P
Sbjct: 59  GGGAASIWHAILPSGAGSSSNLCRPAI---HCHE----RKGEGSWNVAWDARPARWLHRP 111

Query: 377 DSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGV 556
           DSAWLL+GV AC+A   LD+ D + E  + D  K +   ++    +  D  S +YRVTGV
Sbjct: 112 DSAWLLFGVCACIAP--LDWVDASHEAVSLD-QKKEVCESSGPEFNQNDESSADYRVTGV 168

Query: 557 PADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFD 736
            ADGRCLFRAIAH ACLR+G++APD++RQ ELADELRA+VV ELLKR+KE EW+IE DFD
Sbjct: 169 LADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFD 228

Query: 737 AYVKRIQQPYVWGGE 781
           AYVKRIQQP+VWGGE
Sbjct: 229 AYVKRIQQPFVWGGE 243


>ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris]
           gi|561017018|gb|ESW15822.1| hypothetical protein
           PHAVU_007G105100g [Phaseolus vulgaris]
          Length = 305

 Score =  239 bits (611), Expect = 1e-60
 Identities = 136/245 (55%), Positives = 158/245 (64%), Gaps = 6/245 (2%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSY---AHSSPALYSDRWRSVVAGGGESFDYAGRCRGGEAS 235
           MLGVLC +RPRPWL + +  S     H+S +L +   R   +   + F  AG    G AS
Sbjct: 17  MLGVLCATRPRPWLFSHVHASLPRLVHASVSLSASPPRRHHSSACKIFGSAG----GAAS 72

Query: 236 IWKVILPVGR---RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLLYGVF 406
           IW  I+P      R  V   H+ +      GEGSWNVAWD RPARWLH PDSAWLL+GV 
Sbjct: 73  IWHAIMPRSGDRFRRGVVPVHDLK------GEGSWNVAWDTRPARWLHRPDSAWLLFGVC 126

Query: 407 ACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGVPADGRCLFRA 586
           ACLA P     D  ++     VD++ G       +D AD     YRVTGVPADGRCLFRA
Sbjct: 127 ACLAPP--GCVDVVTDFEAVAVDESCGVLKVEASADYAD-----YRVTGVPADGRCLFRA 179

Query: 587 IAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPY 766
           IAH  CLRNG+KAPDEN Q ELADELRA+VV ELLKR++E EWFIE DFD YVKRIQQP+
Sbjct: 180 IAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKRIQQPF 239

Query: 767 VWGGE 781
           VWGGE
Sbjct: 240 VWGGE 244


>ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
           sativus] gi|700197033|gb|KGN52210.1| hypothetical
           protein Csa_5G615810 [Cucumis sativus]
          Length = 313

 Score =  239 bits (609), Expect = 2e-60
 Identities = 138/255 (54%), Positives = 171/255 (67%), Gaps = 16/255 (6%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRS-VVAGGGESFD-----YAGRCR-- 220
           MLGVLC +RP+PW+L SLS ++ H S   +    +S ++      FD     ++  C+  
Sbjct: 1   MLGVLC-ARPKPWILVSLS-NFIHGSAVYHHHHHQSRLLVQSPIQFDRRQRHHSSACKLA 58

Query: 221 -GGEASIWKVILPVGR-------RTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHP 376
            GG ASIW  I+P G        R A+   H HE      GEGSWNVAWDARPARWLH P
Sbjct: 59  GGGAASIWHAIMPSGAGSSSNLCRPAI---HCHE----RKGEGSWNVAWDARPARWLHRP 111

Query: 377 DSAWLLYGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADCCSPNYRVTGV 556
           DSAWLL+GV AC+A   LD+ D + E  + D  K +   ++    +  D  S +YRVTGV
Sbjct: 112 DSAWLLFGVCACIAP--LDWVDASHEAVSLD-QKKEVCESSGPEFNQNDESSADYRVTGV 168

Query: 557 PADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFD 736
            ADGRCLFRAIAH ACLR+G++APD++RQ ELADELRA+VV ELLKR+KE EW+IE DFD
Sbjct: 169 LADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFD 228

Query: 737 AYVKRIQQPYVWGGE 781
           AYVKRIQQP+VWGGE
Sbjct: 229 AYVKRIQQPFVWGGE 243


>ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3g57810-like [Glycine
           max] gi|734312743|gb|KHN00921.1| OTU domain-containing
           protein [Glycine soja]
          Length = 294

 Score =  238 bits (607), Expect = 3e-60
 Identities = 135/250 (54%), Positives = 160/250 (64%), Gaps = 11/250 (4%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDRWRSVVAGGGESFDYAGRCR-----GGE 229
           MLGVLC +R +PWL      S  H+S    S    S  A       ++  C+     GG 
Sbjct: 1   MLGVLCATRSKPWLF-----SLVHASLPRLSHAPLSPSASPPPRRRHSTACKLFLSAGGA 55

Query: 230 ASIWKVILPV-----GRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPDSAWLL 394
           ASIW  I+P      G R  V  +H+ +      GEGSWNVAWDARPARWLH PDSAWLL
Sbjct: 56  ASIWHAIMPRVNDDDGFRRGVVAFHDMK------GEGSWNVAWDARPARWLHRPDSAWLL 109

Query: 395 YGVFACLALPLLDYSDFNSEVSTSDVDKADGFSTNVVGSDAADC-CSPNYRVTGVPADGR 571
           +GV ACLA P     D ++      VD+    S  ++  +  +   S +YRVTGVPADGR
Sbjct: 110 FGVCACLAPPS-SCVDADTNTDAIAVDE----SCRLLDKEREEYEVSADYRVTGVPADGR 164

Query: 572 CLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKR 751
           CLFRAIAH ACLRNG+KAPDENRQ ELADELRA+VV EL+KR++E EWFIE DFD YV+R
Sbjct: 165 CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYVQR 224

Query: 752 IQQPYVWGGE 781
           IQQPYVWGGE
Sbjct: 225 IQQPYVWGGE 234


>gb|KJB73389.1| hypothetical protein B456_011G230700 [Gossypium raimondii]
          Length = 263

 Score =  237 bits (605), Expect = 5e-60
 Identities = 139/257 (54%), Positives = 164/257 (63%), Gaps = 18/257 (7%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDR-----WRS----VVAGGGESFDYAGRC 217
           MLGVLC   P+PW+L SLSL  AH   A +        W S    + A       ++  C
Sbjct: 1   MLGVLCARPPKPWILNSLSL-IAHGGSAAHHHENRLLHWPSHFADLSAANRRCRHHSTAC 59

Query: 218 R------GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPD 379
           R      GG ASIW  ILP G    V    +        GEGSWNV+WDARPARWL   D
Sbjct: 60  RLGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDARPARWLRS-D 118

Query: 380 SAWLLYGVFACLA-LPLLDYSDFNSEVS--TSDVDKADGFSTNVVGSDAADCCSPNYRVT 550
           SAWLL+GV ACLA +P+ ++ D N +    T     +D  S+N + S AA   + NY+VT
Sbjct: 119 SAWLLFGVCACLAPMPMDEFDDVNLDADNKTDASLNSDENSSNHLSSVAA---ADNYKVT 175

Query: 551 GVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEED 730
           G+ ADGRCLFRAIAH ACLR+G++APDENRQ ELADELRAQVV ELLKR++E EWFIE D
Sbjct: 176 GILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNELLKRREETEWFIEGD 235

Query: 731 FDAYVKRIQQPYVWGGE 781
           FDAYVK IQQPYVWGGE
Sbjct: 236 FDAYVKEIQQPYVWGGE 252


>ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777394 [Gossypium
           raimondii] gi|763806450|gb|KJB73388.1| hypothetical
           protein B456_011G230700 [Gossypium raimondii]
          Length = 319

 Score =  237 bits (605), Expect = 5e-60
 Identities = 139/257 (54%), Positives = 164/257 (63%), Gaps = 18/257 (7%)
 Frame = +2

Query: 65  MLGVLCGSRPRPWLLTSLSLSYAHSSPALYSDR-----WRS----VVAGGGESFDYAGRC 217
           MLGVLC   P+PW+L SLSL  AH   A +        W S    + A       ++  C
Sbjct: 1   MLGVLCARPPKPWILNSLSL-IAHGGSAAHHHENRLLHWPSHFADLSAANRRCRHHSTAC 59

Query: 218 R------GGEASIWKVILPVGRRTAVFGWHEHEVAKMAGGEGSWNVAWDARPARWLHHPD 379
           R      GG ASIW  ILP G    V    +        GEGSWNV+WDARPARWL   D
Sbjct: 60  RLGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDARPARWLRS-D 118

Query: 380 SAWLLYGVFACLA-LPLLDYSDFNSEVS--TSDVDKADGFSTNVVGSDAADCCSPNYRVT 550
           SAWLL+GV ACLA +P+ ++ D N +    T     +D  S+N + S AA   + NY+VT
Sbjct: 119 SAWLLFGVCACLAPMPMDEFDDVNLDADNKTDASLNSDENSSNHLSSVAA---ADNYKVT 175

Query: 551 GVPADGRCLFRAIAHMACLRNGDKAPDENRQTELADELRAQVVQELLKRKKEVEWFIEED 730
           G+ ADGRCLFRAIAH ACLR+G++APDENRQ ELADELRAQVV ELLKR++E EWFIE D
Sbjct: 176 GILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNELLKRREETEWFIEGD 235

Query: 731 FDAYVKRIQQPYVWGGE 781
           FDAYVK IQQPYVWGGE
Sbjct: 236 FDAYVKEIQQPYVWGGE 252

