BLASTX nr result

ID: Forsythia23_contig00019291 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00019291
         (971 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240...   345   2e-92
ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098...   344   6e-92
ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253...   328   3e-87
ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606...   325   4e-86
ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245...   321   6e-85
ref|XP_007010219.1| Cysteine proteinases superfamily protein iso...   320   9e-85
ref|XP_008232087.1| PREDICTED: OTU domain-containing protein At3...   318   4e-84
ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prun...   318   5e-84
ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Popu...   317   8e-84
ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Popu...   317   8e-84
ref|XP_011101645.1| PREDICTED: OTU domain-containing protein 6B-...   317   1e-83
ref|XP_007010220.1| Cysteine proteinases superfamily protein iso...   315   4e-83
ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125...   314   5e-83
ref|XP_011010409.1| PREDICTED: uncharacterized protein LOC105115...   310   7e-82
ref|XP_008345443.1| PREDICTED: uncharacterized protein LOC103408...   308   3e-81
ref|XP_004307032.1| PREDICTED: OTU domain-containing protein At3...   308   4e-81
ref|XP_010032108.1| PREDICTED: OTU domain-containing protein At3...   306   1e-80
ref|XP_012089989.1| PREDICTED: uncharacterized protein LOC105648...   305   4e-80
ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3...   302   3e-79
gb|KHG26701.1| hypothetical protein F383_04817 [Gossypium arboreum]   301   3e-79

>ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240043 [Nicotiana
           sylvestris]
          Length = 328

 Score =  345 bits (886), Expect = 2e-92
 Identities = 165/223 (73%), Positives = 187/223 (83%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFST 187
           GEGSWNVAWD RPARWLH+PDSAWLL+GV +CLA P LD  D NS V     + + GFS+
Sbjct: 104 GEGSWNVAWDTRPARWLHNPDSAWLLFGVCSCLAAPSLDLPDSNSDVVAPIENMSQGFSS 163

Query: 188 SVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQV 367
           + V SD AD  S NY VTGVPADGRCLFRAIAHMACLRNG+ APDEN Q ELADELRAQV
Sbjct: 164 NTVNSDEADRNSANYTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQV 223

Query: 368 VQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSGS 547
           V ELLKR+KE EWFIE DFDAYV+RI++PYVWGGEPELLMASHVL++PISV+M++RSSGS
Sbjct: 224 VDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSPISVYMVDRSSGS 283

Query: 548 LMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKI 676
           L+ I+NYGE Y K+ ENPINVLFHGYGHYDILETI +   QK+
Sbjct: 284 LINISNYGEEYRKEGENPINVLFHGYGHYDILETISEKGHQKL 326


>ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098494 [Nicotiana
           tomentosiformis]
          Length = 328

 Score =  344 bits (882), Expect = 6e-92
 Identities = 163/215 (75%), Positives = 184/215 (85%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFST 187
           GEGSWNVAWD RPARWLH+PDSAWLL+GV +CLA P LD  D NS+V     +K+ GFS+
Sbjct: 104 GEGSWNVAWDTRPARWLHNPDSAWLLFGVCSCLAAPTLDLPDSNSEVVAPIENKSQGFSS 163

Query: 188 SVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQV 367
           + V SD  D  S NY VTGVPADGRCLFRAIAHMACLRNG+ APDEN Q ELADELRAQV
Sbjct: 164 NTVNSDEVDRNSANYTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQV 223

Query: 368 VQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSGS 547
           V ELLKR+KE EWFIE DFDAYV+RI++PYVWGGEPELLMASHVL++PISV+M++RSSGS
Sbjct: 224 VDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSPISVYMVDRSSGS 283

Query: 548 LMKIANYGEGYSKDEENPINVLFHGYGHYDILETI 652
           L+ I+NYGE Y K+ ENPINVLFHGYGHYDILETI
Sbjct: 284 LINISNYGEEYRKEGENPINVLFHGYGHYDILETI 318


>ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253339 [Solanum
           lycopersicum]
          Length = 338

 Score =  328 bits (841), Expect = 3e-87
 Identities = 162/224 (72%), Positives = 187/224 (83%), Gaps = 1/224 (0%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDY-SDLNSKVSTSDVDKADGFS 184
           GEGSWNV WD+RPARWLH+PDSAWLL+GV +CLA P LD   D NS V+   +DK     
Sbjct: 118 GEGSWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANSDVAVP-IDK----Q 172

Query: 185 TSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQ 364
           ++V  SD  D  S NYRVTGVPADGRCLFRAIAHMACLRNG++APDEN Q ELADELRAQ
Sbjct: 173 SAVNSSDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQ 232

Query: 365 VVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSG 544
           VV ELLKR+KE EWFIE DFDAYV+RI++PYVWGGEPELLMASHVL++ ISV+M++RSSG
Sbjct: 233 VVDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSAISVYMVDRSSG 292

Query: 545 SLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKI 676
           SL+ I+NYGE Y K+ E+PINVLFHGYGHYDILETIP+   QK+
Sbjct: 293 SLINISNYGEEYRKEGESPINVLFHGYGHYDILETIPEKIHQKL 336


>ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606023 isoform X1 [Solanum
           tuberosum]
          Length = 338

 Score =  325 bits (832), Expect = 4e-86
 Identities = 161/224 (71%), Positives = 185/224 (82%), Gaps = 1/224 (0%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDY-SDLNSKVSTSDVDKADGFS 184
           GEGSWNV WD+RPARWLH+PDSAWLL+GV +CLA P LD   D N  V+   +DK     
Sbjct: 118 GEGSWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANFDVAVP-IDK----Q 172

Query: 185 TSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQ 364
           + V  SD  D  S NYRVTGVPADGRCLFRAIAHMACLRNG++APDEN Q ELADELRAQ
Sbjct: 173 SVVNSSDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQ 232

Query: 365 VVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSG 544
           VV ELLKR+KE EWFIE DFDAYV+RI++PYVWGGEPELLMASHVL++ ISV+M++RSSG
Sbjct: 233 VVDELLKRRKEAEWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSSISVYMVDRSSG 292

Query: 545 SLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKI 676
           SL+ I+NYGE Y K+ E+PINVLFHGYGHYDILETIP+   QK+
Sbjct: 293 SLINISNYGEEYRKEGESPINVLFHGYGHYDILETIPEKIHQKL 336


>ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245448 [Vitis vinifera]
           gi|296090402|emb|CBI40221.3| unnamed protein product
           [Vitis vinifera]
          Length = 317

 Score =  321 bits (822), Expect = 6e-85
 Identities = 161/223 (72%), Positives = 182/223 (81%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFST 187
           GEGSWNVAWDARPARWLH PDSAWLL+GV ACLA   LD  D++++V   D DK +G + 
Sbjct: 96  GEGSWNVAWDARPARWLHRPDSAWLLFGVCACLAP--LDSFDVDNEVVAVD-DKIEGCNQ 152

Query: 188 SVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQV 367
               SD  +  S +YRVTGVPADGRCLFRAIAH ACLR+G++APDEN QTELAD+LRAQV
Sbjct: 153 VNEISDENNNSSADYRVTGVPADGRCLFRAIAHSACLRSGEEAPDENRQTELADDLRAQV 212

Query: 368 VQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSGS 547
           V ELLKR++E EWFIE +FDAYVKRIQQPYVWGGEPEL+MASHVL+ PISVFMI RSSG 
Sbjct: 213 VDELLKRREETEWFIEGNFDAYVKRIQQPYVWGGEPELIMASHVLKMPISVFMIGRSSGD 272

Query: 548 LMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKI 676
           L  IANYG+ Y  D E+PINVLFHGYGHYDILET  D S QK+
Sbjct: 273 LKNIANYGKEYRIDNESPINVLFHGYGHYDILETFSDHSYQKL 315


>ref|XP_007010219.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma
           cacao] gi|508727132|gb|EOY19029.1| Cysteine proteinases
           superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  320 bits (820), Expect = 9e-85
 Identities = 157/234 (67%), Positives = 184/234 (78%), Gaps = 10/234 (4%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFST 187
           GEGSWNVAWDARPARWLH PDSAWLL+GV ACLA P++++ D+N        DK +G   
Sbjct: 98  GEGSWNVAWDARPARWLHRPDSAWLLFGVCACLA-PMIEFVDVNPDAD----DKIEGAEL 152

Query: 188 SVVGSDAAD----------CCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQT 337
           ++V   +AD            + N +VTGV ADGRCLFRAIAH ACLR+G+ APDEN Q 
Sbjct: 153 NLVSRLSADEKSSSSSSSVAAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQR 212

Query: 338 ELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPIS 517
           ELADELRAQVV ELLKR++E EWFIE DFDAYVK IQQPYVWGGEPE+LMASHVL+TPIS
Sbjct: 213 ELADELRAQVVNELLKRREETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPIS 272

Query: 518 VFMIERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKIN 679
           V+MI RSS +L KIA YGE Y KD+ENPINVLFHGYGHYDILE++P+ +C ++N
Sbjct: 273 VYMIPRSSSNLTKIAKYGEEYQKDKENPINVLFHGYGHYDILESLPEQNCAQVN 326


>ref|XP_008232087.1| PREDICTED: OTU domain-containing protein At3g57810-like [Prunus
           mume]
          Length = 329

 Score =  318 bits (815), Expect = 4e-84
 Identities = 157/228 (68%), Positives = 182/228 (79%), Gaps = 3/228 (1%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLA-LPLLDYS--DLNSKVSTSDVDKADG 178
           GEGSWN AWDARPARWLH PDSAWLL+GV  CLA +   D S  D N  VS  + +  D 
Sbjct: 103 GEGSWNAAWDARPARWLHRPDSAWLLFGVCNCLAPIDWADDSTPDGNDGVSNENAESFDS 162

Query: 179 FSTSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELR 358
             ++    +  D  S +YRVTGVPADGRCLFRAIAH+ACLRNG++APDEN Q +LADELR
Sbjct: 163 KCSAASDQNNIDS-SADYRVTGVPADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELR 221

Query: 359 AQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERS 538
           AQVV ELLKR++E EWFIE DFDAYVKR+QQPYVWGGEPELLMASHVL+TPISVFMI+RS
Sbjct: 222 AQVVDELLKRREETEWFIEGDFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMIDRS 281

Query: 539 SGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
           S  L+ IANYGE Y K+EE PINVLFHGYGHYDIL++  + S +K+N+
Sbjct: 282 SAGLVNIANYGEDYRKEEEKPINVLFHGYGHYDILDSFSEQSLKKLNM 329


>ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prunus persica]
           gi|462416935|gb|EMJ21672.1| hypothetical protein
           PRUPE_ppa008484mg [Prunus persica]
          Length = 329

 Score =  318 bits (814), Expect = 5e-84
 Identities = 157/228 (68%), Positives = 182/228 (79%), Gaps = 3/228 (1%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLA-LPLLDYS--DLNSKVSTSDVDKADG 178
           GEGSWN AWDARPARWLH PDSAWLL+GV  CLA +   D S  D N  VS  + +  D 
Sbjct: 103 GEGSWNAAWDARPARWLHRPDSAWLLFGVCNCLAPIDWADDSTPDGNDGVSNENAESFDS 162

Query: 179 FSTSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELR 358
             ++    +  D  S +YRVTGVPADGRCLFRAIAH+ACLRNG++APDEN Q +LADELR
Sbjct: 163 KCSAAPDQNNIDS-SADYRVTGVPADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELR 221

Query: 359 AQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERS 538
           AQVV ELLKR++E EWFIE DFDAYVKR+QQPYVWGGEPELLMASHVL+TPISVFMI+RS
Sbjct: 222 AQVVDELLKRREETEWFIEGDFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMIDRS 281

Query: 539 SGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
           S  L+ IANYGE Y K+EE PINVLFHGYGHYDIL++  + S +K+N+
Sbjct: 282 SAGLVNIANYGEEYRKEEEKPINVLFHGYGHYDILDSFSEQSLKKLNM 329


>ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Populus trichocarpa]
           gi|222865463|gb|EEF02594.1| hypothetical protein
           POPTR_0010s24050g [Populus trichocarpa]
          Length = 318

 Score =  317 bits (812), Expect = 8e-84
 Identities = 158/232 (68%), Positives = 180/232 (77%), Gaps = 7/232 (3%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNS--KVSTSDVDKADGF 181
           GEGSWN AWD RPARWLH PDSAWLL+GV ACLA  +   SD+N+   V   + ++ DG 
Sbjct: 87  GEGSWNAAWDGRPARWLHRPDSAWLLFGVCACLAPAIEFLSDVNNIDDVDHQEKERIDGG 146

Query: 182 STSVVGSDAADCCSP-----NYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELA 346
             +    DA    S      +Y+VTGV ADGRCLFRAIAHMACLRNG++APDEN Q ELA
Sbjct: 147 DLNASSDDAKQDNSDATVGSDYKVTGVLADGRCLFRAIAHMACLRNGEEAPDENRQRELA 206

Query: 347 DELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFM 526
           DELRAQVV ELLKR++E EWFIE DFDAYVKRIQQPYVWGGEPELLMASHVL+T ISVFM
Sbjct: 207 DELRAQVVDELLKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTMISVFM 266

Query: 527 IERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
            +R++G+L+ I NYGE Y KDE NPINVLFHGYGHYDILET P  S QK ++
Sbjct: 267 RDRTTGNLVNIVNYGEEYQKDEVNPINVLFHGYGHYDILETTPGQSYQKADI 318


>ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Populus trichocarpa]
           gi|222850861|gb|EEE88408.1| hypothetical protein
           POPTR_0008s02620g [Populus trichocarpa]
          Length = 326

 Score =  317 bits (812), Expect = 8e-84
 Identities = 159/241 (65%), Positives = 186/241 (77%), Gaps = 16/241 (6%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSK--------VSTSDV 163
           GEGSWNVAWD RPARWLH PDSAWLL+GV ACLA  +  + D+N +        V   + 
Sbjct: 86  GEGSWNVAWDGRPARWLHRPDSAWLLFGVCACLAPAIELFCDVNIEGGENVVVDVDHQEK 145

Query: 164 DKADG--FSTSVVGSD------AADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAP 319
           ++ DG   + S V SD      ++     +Y+VTGV ADGRCLFRAIAHMACLRNG++AP
Sbjct: 146 ERIDGGDLNASAVNSDDVKQDSSSSTAGSDYKVTGVLADGRCLFRAIAHMACLRNGEEAP 205

Query: 320 DENLQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHV 499
           DEN Q ELADELRAQVV ELLKR++E EWFIE DFDAYVKRIQQPYVWGGEPELLMASHV
Sbjct: 206 DENRQRELADELRAQVVDELLKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHV 265

Query: 500 LRTPISVFMIERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKIN 679
           L+T ISVFM +R++G+L+ IANYGE Y KDE NPINVLFHGYGHYDILET P  S +K++
Sbjct: 266 LKTMISVFMRDRTTGNLVNIANYGEEYRKDEVNPINVLFHGYGHYDILETTPGQSYKKVD 325

Query: 680 V 682
           +
Sbjct: 326 L 326


>ref|XP_011101645.1| PREDICTED: OTU domain-containing protein 6B-like [Sesamum indicum]
          Length = 284

 Score =  317 bits (811), Expect = 1e-83
 Identities = 159/216 (73%), Positives = 173/216 (80%)
 Frame = +2

Query: 5   GGEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFS 184
           GGEGSWNVAWDARPARWLHHP+SAWLL+      A P +D SD     +  D  K+D   
Sbjct: 86  GGEGSWNVAWDARPARWLHHPESAWLLFA-----AAPAID-SDPIPNPAAEDELKSDVIC 139

Query: 185 TSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQ 364
                         NYRVTGV ADGRCLFRA+AHMACLRNG++APDEN Q ELADELRAQ
Sbjct: 140 --------------NYRVTGVVADGRCLFRAVAHMACLRNGEEAPDENRQRELADELRAQ 185

Query: 365 VVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSG 544
           VV+ELLKR+KEVEWFIEEDFD YVKRIQ+PYVWGGEPELLM+SHVLRTPISVFM ERSSG
Sbjct: 186 VVEELLKRRKEVEWFIEEDFDVYVKRIQEPYVWGGEPELLMSSHVLRTPISVFMKERSSG 245

Query: 545 SLMKIANYGEGYSKDEENPINVLFHGYGHYDILETI 652
           SLM IANYGE Y KDE+NPINVLFHGYGHYDILET+
Sbjct: 246 SLMNIANYGEEYRKDEDNPINVLFHGYGHYDILETL 281


>ref|XP_007010220.1| Cysteine proteinases superfamily protein isoform 2 [Theobroma
           cacao] gi|508727133|gb|EOY19030.1| Cysteine proteinases
           superfamily protein isoform 2 [Theobroma cacao]
          Length = 330

 Score =  315 bits (806), Expect = 4e-83
 Identities = 157/237 (66%), Positives = 184/237 (77%), Gaps = 13/237 (5%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFST 187
           GEGSWNVAWDARPARWLH PDSAWLL+GV ACLA P++++ D+N        DK +G   
Sbjct: 98  GEGSWNVAWDARPARWLHRPDSAWLLFGVCACLA-PMIEFVDVNPDAD----DKIEGAEL 152

Query: 188 SVVGSDAAD----------CCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQT 337
           ++V   +AD            + N +VTGV ADGRCLFRAIAH ACLR+G+ APDEN Q 
Sbjct: 153 NLVSRLSADEKSSSSSSSVAAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQR 212

Query: 338 ELADELRAQV---VQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRT 508
           ELADELRAQV   V ELLKR++E EWFIE DFDAYVK IQQPYVWGGEPE+LMASHVL+T
Sbjct: 213 ELADELRAQVSLVVNELLKRREETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKT 272

Query: 509 PISVFMIERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKIN 679
           PISV+MI RSS +L KIA YGE Y KD+ENPINVLFHGYGHYDILE++P+ +C ++N
Sbjct: 273 PISVYMIPRSSSNLTKIAKYGEEYQKDKENPINVLFHGYGHYDILESLPEQNCAQVN 329


>ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125498 [Populus
           euphratica]
          Length = 320

 Score =  314 bits (805), Expect = 5e-83
 Identities = 156/232 (67%), Positives = 179/232 (77%), Gaps = 7/232 (3%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNS--KVSTSDVDKADGF 181
           GEGSWN AWD RPARWLH PDSAWLL+GV AC+   +   SD+N+   V   + ++ DG 
Sbjct: 89  GEGSWNAAWDGRPARWLHRPDSAWLLFGVCACVTPAIEFLSDVNNIDDVDHQEKERIDGG 148

Query: 182 STSVVGSDAADCCSPN-----YRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELA 346
             +    DA    S +     Y+VTGV ADGRCLFRAIAHMACLRNG++APDEN Q ELA
Sbjct: 149 DLNASSDDARQDSSDSTVGSDYKVTGVLADGRCLFRAIAHMACLRNGEEAPDENRQRELA 208

Query: 347 DELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFM 526
           DELRAQVV ELLKR++E EWFIE DFDAYVKRIQQPYVWGGEPELLMASHVL+T ISVFM
Sbjct: 209 DELRAQVVDELLKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTMISVFM 268

Query: 527 IERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
            +R++G+L+ I NYGE Y KDE NPINVLFHGYGHYDILET P  S QK ++
Sbjct: 269 RDRTTGNLVNIVNYGEEYQKDEVNPINVLFHGYGHYDILETTPGQSYQKEDI 320


>ref|XP_011010409.1| PREDICTED: uncharacterized protein LOC105115268 isoform X1 [Populus
           euphratica] gi|743932247|ref|XP_011010410.1| PREDICTED:
           uncharacterized protein LOC105115268 isoform X2 [Populus
           euphratica]
          Length = 326

 Score =  310 bits (795), Expect = 7e-82
 Identities = 158/241 (65%), Positives = 181/241 (75%), Gaps = 16/241 (6%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSK--------VSTSDV 163
           GEGSWNVAWD RPARWL  PDSAWLL+GV ACLA  +  + D+N +        V   D 
Sbjct: 86  GEGSWNVAWDGRPARWLCRPDSAWLLFGVCACLAPAIELFCDVNIEGRENVVVDVDHKDK 145

Query: 164 DKADG--FSTSVVGSD------AADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAP 319
           +  DG   + S V SD      +      +Y+VTGV ADGRCLFRAIAHMACLRNG+ AP
Sbjct: 146 ESIDGGDLNASAVNSDDVKQDSSGSTVGSDYKVTGVLADGRCLFRAIAHMACLRNGEDAP 205

Query: 320 DENLQTELADELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHV 499
           DEN Q ELADELRAQVV ELLKR++E EWFIE DFDAYVKRIQQPYVWGGEPELLMASHV
Sbjct: 206 DENRQRELADELRAQVVDELLKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHV 265

Query: 500 LRTPISVFMIERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKIN 679
           L+T ISVFM +R++G+L+ I NYGE Y KDE NPINVLFHGYGHYDILET P  S +K++
Sbjct: 266 LKTMISVFMRDRTTGNLVNIVNYGEEYQKDEVNPINVLFHGYGHYDILETSPGQSYKKVD 325

Query: 680 V 682
           +
Sbjct: 326 L 326


>ref|XP_008345443.1| PREDICTED: uncharacterized protein LOC103408366 [Malus domestica]
          Length = 323

 Score =  308 bits (790), Expect = 3e-81
 Identities = 154/232 (66%), Positives = 177/232 (76%), Gaps = 8/232 (3%)
 Frame = +2

Query: 11  EGSWNVAWDARPARWLHHPDSAWLLYGVFACLA--------LPLLDYSDLNSKVSTSDVD 166
           EGSWN AWDARPARWLH PDSAWLL+GV +CLA         P       NSK       
Sbjct: 99  EGSWNAAWDARPARWLHRPDSAWLLFGVRSCLAPTNWAVDSAPGEXNDVYNSKTDCDS-- 156

Query: 167 KADGFSTSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELA 346
           K+     +++ S AAD     YRVTGV ADGRCLFRAIAH+ACLRNG++APDEN Q +LA
Sbjct: 157 KSSSSPENIIDSSAAD-----YRVTGVLADGRCLFRAIAHVACLRNGEEAPDENRQRDLA 211

Query: 347 DELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFM 526
           DELR+QVV ELLKR+KE EWFIE DFDAYVKR+QQPYVWGGEPELLMASHVL+TPISVFM
Sbjct: 212 DELRSQVVDELLKRRKETEWFIEGDFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFM 271

Query: 527 IERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
           ++RSS  L+ IA YGE Y K+EE PINVLFHGYGHYDILE+  + S QK+++
Sbjct: 272 VDRSSSGLVNIAKYGEEYQKEEEKPINVLFHGYGHYDILESFSEQSLQKLSM 323


>ref|XP_004307032.1| PREDICTED: OTU domain-containing protein At3g57810-like [Fragaria
           vesca subsp. vesca]
          Length = 324

 Score =  308 bits (789), Expect = 4e-81
 Identities = 156/230 (67%), Positives = 179/230 (77%), Gaps = 5/230 (2%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDY-----SDLNSKVSTSDVDKA 172
           GEGSWN A DARPARWLH PDSAWLL+GV  CLA   +D+     S  N +VS +  +  
Sbjct: 100 GEGSWNAALDARPARWLHRPDSAWLLFGVCNCLAP--IDWGSTTNSTTNDEVSNNKTEAC 157

Query: 173 DGFSTSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADE 352
           D  S S + SD     +P+YRVTGV ADGRCLFRAIAH+ACLRNG++ PDEN Q ELADE
Sbjct: 158 D--SKSSITSDV-QLETPDYRVTGVLADGRCLFRAIAHVACLRNGEEPPDENRQRELADE 214

Query: 353 LRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIE 532
           LRAQVV ELLKR++E EWFIE DFDAYVKRIQQPYVWGGEPELLMASHV + PISV+M++
Sbjct: 215 LRAQVVDELLKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVKKAPISVYMVD 274

Query: 533 RSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
           RSSG L+ IA YGE Y K EE PINVLFHGYGHYDILE+  + S QK+N+
Sbjct: 275 RSSGGLVNIAKYGEEYGKQEEKPINVLFHGYGHYDILESFSEQSLQKVNM 324


>ref|XP_010032108.1| PREDICTED: OTU domain-containing protein At3g57810-like [Eucalyptus
           grandis] gi|629085145|gb|KCW51502.1| hypothetical
           protein EUGRSUZ_J01018 [Eucalyptus grandis]
          Length = 314

 Score =  306 bits (785), Expect = 1e-80
 Identities = 152/219 (69%), Positives = 169/219 (77%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFST 187
           GEGSWNVAWDARPARWLH PDSAWLL+GV ACLA            V  + V+  D    
Sbjct: 102 GEGSWNVAWDARPARWLHRPDSAWLLFGVCACLAPVDAAEPSREEVVPEARVEDRDSL-- 159

Query: 188 SVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQV 367
                D A   SP+YRVTGV ADGRCLFRAIAH ACLR G+ APD+N Q ELADELRAQV
Sbjct: 160 -----DEAKRSSPDYRVTGVLADGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQV 214

Query: 368 VQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSGS 547
           V ELLKR++E EW IE DFDAY++RIQQPYVWGGEPELLMASHVL+TPISVFM++RSSG+
Sbjct: 215 VAELLKRREETEWAIEGDFDAYIERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGN 274

Query: 548 LMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPS 664
           L+ +A YGE Y KDEE PINVLFHGYGHYDILE+ P  S
Sbjct: 275 LVNVAKYGEEYRKDEEIPINVLFHGYGHYDILESFPGQS 313


>ref|XP_012089989.1| PREDICTED: uncharacterized protein LOC105648266 [Jatropha curcas]
           gi|643739215|gb|KDP45029.1| hypothetical protein
           JCGZ_01529 [Jatropha curcas]
          Length = 331

 Score =  305 bits (780), Expect = 4e-80
 Identities = 154/226 (68%), Positives = 182/226 (80%), Gaps = 1/226 (0%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFS- 184
           GEGSWNVAWDARPARWLH PDSAWLL+GV   LA P+   +D+N++      D ++G   
Sbjct: 108 GEGSWNVAWDARPARWLHRPDSAWLLFGVRGWLA-PIGIGTDVNNESVVVAEDNSNGSDD 166

Query: 185 TSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQ 364
           T+  G D     S N++VTGV ADGRCLFRAIAH ACLR+G++APDEN Q ELADELRAQ
Sbjct: 167 TNSNGIDFNRDSSANFKVTGVLADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQ 226

Query: 365 VVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSG 544
           VV ELLKR++E EWFIE DFDAYV+RIQQPY WGGEPELLMASHVL+T +SVFMI+R+SG
Sbjct: 227 VVDELLKRREESEWFIEGDFDAYVERIQQPYAWGGEPELLMASHVLKTMVSVFMIDRTSG 286

Query: 545 SLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
           +L+ IANYGE Y KD+ NPINVLFHGYGHYDILET  + + QK+N+
Sbjct: 287 NLVNIANYGEEYRKDQVNPINVLFHGYGHYDILET-SEQNYQKLNI 331


>ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
           sativus] gi|700197033|gb|KGN52210.1| hypothetical
           protein Csa_5G615810 [Cucumis sativus]
          Length = 313

 Score =  302 bits (773), Expect = 3e-79
 Identities = 152/225 (67%), Positives = 178/225 (79%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLALPLLDYSDLNSKVSTSDVDKADGFST 187
           GEGSWNVAWDARPARWLH PDSAWLL+GV AC+A   LD+ D + +  + D  K +   +
Sbjct: 92  GEGSWNVAWDARPARWLHRPDSAWLLFGVCACIAP--LDWVDASHEAVSLD-QKKEVCES 148

Query: 188 SVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELADELRAQV 367
           S    +  D  S +YRVTGV ADGRCLFRAIAH ACLR+G++APD++ Q ELADELRA+V
Sbjct: 149 SGPEFNQNDESSADYRVTGVLADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKV 208

Query: 368 VQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFMIERSSGS 547
           V ELLKR+KE EW+IE DFDAYVKRIQQP+VWGGEPELLMASHVL+TPISVFM ERSS  
Sbjct: 209 VDELLKRRKETEWYIEGDFDAYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDG 268

Query: 548 LMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPDPSCQKINV 682
           L+ IA YG+ Y K EE+PINVLFHGYGHYDILET  D    K+++
Sbjct: 269 LINIAKYGQEYQKGEESPINVLFHGYGHYDILETSSDKVSLKLSM 313


>gb|KHG26701.1| hypothetical protein F383_04817 [Gossypium arboreum]
          Length = 319

 Score =  301 bits (772), Expect = 3e-79
 Identities = 149/224 (66%), Positives = 180/224 (80%), Gaps = 7/224 (3%)
 Frame = +2

Query: 8   GEGSWNVAWDARPARWLHHPDSAWLLYGVFACLA-LPLLDYSDLN------SKVSTSDVD 166
           GEGSWNV+WDARPARWL  PDSAWLL+GV ACLA +P+ ++ D+N      +  S +  +
Sbjct: 99  GEGSWNVSWDARPARWLR-PDSAWLLFGVCACLAPMPMDEFDDVNLDADNKTDASLNSDE 157

Query: 167 KADGFSTSVVGSDAADCCSPNYRVTGVPADGRCLFRAIAHMACLRNGDKAPDENLQTELA 346
           K+    +SV  +D       N++VTG+ ADGRCLFRAIAH ACLR+G++APDEN Q ELA
Sbjct: 158 KSSNHLSSVAAAD-------NFKVTGILADGRCLFRAIAHGACLRSGEEAPDENRQRELA 210

Query: 347 DELRAQVVQELLKRKKEVEWFIEEDFDAYVKRIQQPYVWGGEPELLMASHVLRTPISVFM 526
           DELRAQVV ELLKR++E EW+IE DFDAYVK IQQPYVWGGEPELLMASHVL+T ISV+M
Sbjct: 211 DELRAQVVNELLKRREETEWYIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTRISVYM 270

Query: 527 IERSSGSLMKIANYGEGYSKDEENPINVLFHGYGHYDILETIPD 658
           I RSSG+L+ IA YGE Y K++ENPINVLFHGYGHYDILE++P+
Sbjct: 271 IHRSSGNLINIAKYGEEYQKEKENPINVLFHGYGHYDILESLPE 314

