BLASTX nr result

ID: Akebia25_contig00021078 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00021078
         (760 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004485897.1| PREDICTED: cysteine proteinase RD21a-like [C...   382   e-104
ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis v...   377   e-102
emb|CBI30692.3| unnamed protein product [Vitis vinifera]              377   e-102
ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula] gi...   376   e-102
ref|XP_003590277.1| Cysteine protease [Medicago truncatula] gi|3...   376   e-102
ref|XP_007211825.1| hypothetical protein PRUPE_ppa004381mg [Prun...   374   e-101
gb|AHM88211.1| CAC1 protein [Malus hybrid cultivar]                   369   e-100
gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]            369   e-100
ref|XP_007025364.1| Xylem bark cysteine peptidase 3 isoform 2 [T...   368   1e-99
ref|XP_007025363.1| Xylem bark cysteine peptidase 3 isoform 1 [T...   368   1e-99
ref|XP_002317418.2| hypothetical protein POPTR_0011s07310g [Popu...   365   9e-99
ref|XP_002305743.2| hypothetical protein POPTR_0004s05640g [Popu...   363   5e-98
ref|XP_002317417.2| hypothetical protein POPTR_0011s07300g [Popu...   362   1e-97
gb|ABK95906.1| unknown [Populus trichocarpa]                          362   1e-97
ref|XP_004293953.1| PREDICTED: cysteine proteinase RD21a-like [F...   361   1e-97
ref|XP_006449509.1| hypothetical protein CICLE_v10015066mg [Citr...   361   2e-97
ref|XP_006467643.1| PREDICTED: cysteine proteinase RD21a-like [C...   360   3e-97
gb|AAP41846.1| cysteine protease [Anthurium andraeanum]               359   7e-97
ref|XP_006852404.1| hypothetical protein AMTR_s00021p00031000 [A...   357   2e-96
ref|XP_007148042.1| hypothetical protein PHAVU_006G175500g [Phas...   357   3e-96

>ref|XP_004485897.1| PREDICTED: cysteine proteinase RD21a-like [Cicer arietinum]
          Length = 492

 Score =  382 bits (980), Expect = e-104
 Identities = 172/252 (68%), Positives = 204/252 (80%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCW+FSSTGA+EG+NAIVTGDLISLSEQELVDCD+TNDGC+GGYMD+AFEW+I NGGID
Sbjct: 157 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDSTNDGCDGGYMDYAFEWVINNGGID 216

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TESSYPYTG +GTCNVT+EETK+VTIDGY D+A  +S +LCA + QPIS GIDGS++DFQ
Sbjct: 217 TESSYPYTGVDGTCNVTKEETKVVTIDGYTDVAQSDSGVLCATVKQPISAGIDGSSLDFQ 276

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCSS+P+DIDHAVLIVGYGS+ D+DYWIVKNSWGT+WG+EGY+YIRRNT+L
Sbjct: 277 LYTGGIYDGDCSSDPDDIDHAVLIVGYGSKGDEDYWIVKNSWGTNWGIEGYIYIRRNTNL 336

Query: 543 EYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDET 722
           +YGVCAIN MASYPTK                               +CGD+++C +D+T
Sbjct: 337 KYGVCAINYMASYPTKESSAVSPTSPPSPPSPPSPLPPPPPPSPSPSECGDFSYCHADQT 396

Query: 723 CCCIDEFYDICL 758
           CCC  E +D CL
Sbjct: 397 CCCNLELFDFCL 408


>ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  377 bits (968), Expect = e-102
 Identities = 177/262 (67%), Positives = 200/262 (76%), Gaps = 10/262 (3%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGA+EGINAIVTGDLISLSEQELVDCD TN GCEGGYMD+AFEW+I NGGID
Sbjct: 158 GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVISNGGID 217

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           +ES YPYTG +GTCN T+E+TK+V+IDGY+D+   +SALLCA +NQPISVG+DGSA+DFQ
Sbjct: 218 SESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESDSALLCAAVNQPISVGMDGSALDFQ 277

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIY GDCS +P+DIDHAVLIVGYGSED +DYWI KNSWGTSWGMEGY YI+RNTDL
Sbjct: 278 LYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSEDYWICKNSWGTSWGMEGYFYIKRNTDL 337

Query: 543 EYGVCAINDMASYPTK----------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCG 692
            YG CAIN MASYPTK                                         +CG
Sbjct: 338 PYGECAINAMASYPTKESSSPSPYPSPAVPPPPPPPPSPPPPPPPSPPPPSPGPSPSECG 397

Query: 693 DYTFCPSDETCCCIDEFYDICL 758
           D+++CPSDETCCCI EFYD CL
Sbjct: 398 DFSYCPSDETCCCIYEFYDFCL 419


>emb|CBI30692.3| unnamed protein product [Vitis vinifera]
          Length = 377

 Score =  377 bits (968), Expect = e-102
 Identities = 177/262 (67%), Positives = 200/262 (76%), Gaps = 10/262 (3%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGA+EGINAIVTGDLISLSEQELVDCD TN GCEGGYMD+AFEW+I NGGID
Sbjct: 34  GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVISNGGID 93

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           +ES YPYTG +GTCN T+E+TK+V+IDGY+D+   +SALLCA +NQPISVG+DGSA+DFQ
Sbjct: 94  SESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESDSALLCAAVNQPISVGMDGSALDFQ 153

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIY GDCS +P+DIDHAVLIVGYGSED +DYWI KNSWGTSWGMEGY YI+RNTDL
Sbjct: 154 LYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSEDYWICKNSWGTSWGMEGYFYIKRNTDL 213

Query: 543 EYGVCAINDMASYPTK----------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCG 692
            YG CAIN MASYPTK                                         +CG
Sbjct: 214 PYGECAINAMASYPTKESSSPSPYPSPAVPPPPPPPPSPPPPPPPSPPPPSPGPSPSECG 273

Query: 693 DYTFCPSDETCCCIDEFYDICL 758
           D+++CPSDETCCCI EFYD CL
Sbjct: 274 DFSYCPSDETCCCIYEFYDFCL 295


>ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
           gi|355502731|gb|AES83934.1| Cysteine proteinase
           [Medicago truncatula]
          Length = 475

 Score =  376 bits (965), Expect = e-102
 Identities = 171/252 (67%), Positives = 200/252 (79%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCW+FSSTGA+EG+NAIVTGDLISLSEQELVDCD TNDGCEGGYMD+AFEW+I NGGID
Sbjct: 146 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGID 205

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TE+ YPY G  GTCNVT+EETK+VTIDGY D+   +SAL CA + QPISVGIDGS +DFQ
Sbjct: 206 TEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQPISVGIDGSTLDFQ 265

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCSSNP+DIDHAVLIVGYGS+ +QDYWIVKNSWGTSWG+EG++YIRRNT+L
Sbjct: 266 LYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIYIRRNTNL 325

Query: 543 EYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDET 722
           +YGVCAIN MAS+PTK                               +CGD+++C ++ET
Sbjct: 326 KYGVCAINYMASFPTK----ESTSISPTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEET 381

Query: 723 CCCIDEFYDICL 758
           CCC+ E +D CL
Sbjct: 382 CCCLYELFDFCL 393


>ref|XP_003590277.1| Cysteine protease [Medicago truncatula] gi|355479325|gb|AES60528.1|
           Cysteine protease [Medicago truncatula]
          Length = 514

 Score =  376 bits (965), Expect = e-102
 Identities = 171/252 (67%), Positives = 200/252 (79%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCW+FSSTGA+EG+NAIVTGDLISLSEQELVDCD TNDGCEGGYMD+AFEW+I NGGID
Sbjct: 206 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGID 265

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TE+ YPY G  GTCNVT+EETK+VTIDGY D+   +SAL CA + QPISVGIDGS +DFQ
Sbjct: 266 TEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQPISVGIDGSTLDFQ 325

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCSSNP+DIDHAVLIVGYGS+ +QDYWIVKNSWGTSWG+EG++YIRRNT+L
Sbjct: 326 LYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIYIRRNTNL 385

Query: 543 EYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDET 722
           +YGVCAIN MAS+PTK                               +CGD+++C ++ET
Sbjct: 386 KYGVCAINYMASFPTK----ESTSISPTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEET 441

Query: 723 CCCIDEFYDICL 758
           CCC+ E +D CL
Sbjct: 442 CCCLYELFDFCL 453


>ref|XP_007211825.1| hypothetical protein PRUPE_ppa004381mg [Prunus persica]
           gi|462407690|gb|EMJ13024.1| hypothetical protein
           PRUPE_ppa004381mg [Prunus persica]
          Length = 513

 Score =  374 bits (961), Expect = e-101
 Identities = 173/261 (66%), Positives = 198/261 (75%), Gaps = 9/261 (3%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFS+TGA+EGINAI TG+LISLSEQELVDCD TN+GC+GGYMD+AFEW+I NGGID
Sbjct: 171 GSCWAFSTTGAIEGINAIATGELISLSEQELVDCDGTNEGCDGGYMDYAFEWVIDNGGID 230

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TE +YPYTG +GTCNVT+EETK+VTIDGYED+   +  LLCA + QP SVGIDGSA DFQ
Sbjct: 231 TEKNYPYTGVDGTCNVTKEETKVVTIDGYEDVGETDGDLLCAAVQQPFSVGIDGSAWDFQ 290

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCS NP+DIDHA L+VGYGSE D+DYWIVKNSWGTSWGM+GY+YIRRNT+L
Sbjct: 291 LYTGGIYDGDCSDNPDDIDHAPLVVGYGSEGDEDYWIVKNSWGTSWGMDGYIYIRRNTNL 350

Query: 543 EYGVCAINDMASYPTK---------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGD 695
           +YGVCAIN MASYPTK                                         CGD
Sbjct: 351 KYGVCAINAMASYPTKESSAPSPTAPPPPPTPVSPPPPPTPPTPVTPPPPPSPSPSDCGD 410

Query: 696 YTFCPSDETCCCIDEFYDICL 758
           +++CPSDETCCC+ EF D CL
Sbjct: 411 FSYCPSDETCCCLFEFLDYCL 431


>gb|AHM88211.1| CAC1 protein [Malus hybrid cultivar]
          Length = 511

 Score =  369 bits (948), Expect = e-100
 Identities = 174/260 (66%), Positives = 195/260 (75%), Gaps = 8/260 (3%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCD+TN GCEGGYMD+AFEW+I NGGID
Sbjct: 170 GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDSTNYGCEGGYMDYAFEWVISNGGID 229

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TE+ YPYTG +G C+  +EE K VTIDGYED+   +  LLCA + QPISVG+DGSA DFQ
Sbjct: 230 TETDYPYTGVDGQCSTAKEENKAVTIDGYEDVGETDGDLLCASVQQPISVGMDGSAWDFQ 289

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCSS+P+DIDHAVLIVGYGSE D+DYWIVKNSWGTSWGM+GY+YIRRNT L
Sbjct: 290 LYTGGIYDGDCSSDPDDIDHAVLIVGYGSEGDEDYWIVKNSWGTSWGMDGYIYIRRNTSL 349

Query: 543 EYGVCAINDMASYPTK--------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDY 698
            YGVCAIN MASYPTK                                        CGD+
Sbjct: 350 TYGVCAINAMASYPTKESSAPSPASPPPPPAPPSPPPPPPPPPSPPPPPPSPSPSDCGDF 409

Query: 699 TFCPSDETCCCIDEFYDICL 758
           ++CPSD+TCCC+ EFY  CL
Sbjct: 410 SYCPSDQTCCCLYEFYGFCL 429


>gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  369 bits (947), Expect = e-100
 Identities = 169/259 (65%), Positives = 197/259 (76%), Gaps = 7/259 (2%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGA+EGINA+  GDLISLSEQELVDCD+TNDGCEGGYMD+AFEW++ NGGID
Sbjct: 169 GSCWAFSSTGAIEGINALANGDLISLSEQELVDCDSTNDGCEGGYMDYAFEWVMSNGGID 228

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TE+ YPYTG +GTCN T+EETK V+IDGYED+A EESAL CA+L QPISVGIDG A+DFQ
Sbjct: 229 TETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAEEESALFCAVLKQPISVGIDGGAIDFQ 288

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCS +P+DIDHAVL+VGYG+E  ++YWI+KNSWGT WGM+GY YI+RNT  
Sbjct: 289 LYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWIIKNSWGTDWGMKGYAYIKRNTSK 348

Query: 543 EYGVCAINDMASYPTK-------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYT 701
           +YGVCAIN MASYPTK                                      QCGD++
Sbjct: 349 DYGVCAINAMASYPTKESSAPSPYPSPAVPPPPPPPPPPPSPPPPPPPPSPSPTQCGDFS 408

Query: 702 FCPSDETCCCIDEFYDICL 758
           +C + ETCCCI EF+D CL
Sbjct: 409 YCAATETCCCIFEFFDYCL 427


>ref|XP_007025364.1| Xylem bark cysteine peptidase 3 isoform 2 [Theobroma cacao]
           gi|508780730|gb|EOY27986.1| Xylem bark cysteine
           peptidase 3 isoform 2 [Theobroma cacao]
          Length = 343

 Score =  368 bits (944), Expect = 1e-99
 Identities = 166/258 (64%), Positives = 202/258 (78%), Gaps = 6/258 (2%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGA+EGINA+VTG+LISLSEQEL+DCD+TN GC+GGYMD+AFEW+I NGGID
Sbjct: 4   GSCWAFSSTGAMEGINALVTGNLISLSEQELMDCDSTNYGCDGGYMDYAFEWVINNGGID 63

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           +E+ YPY G +GTCN+T+EETK+V+IDGY+D+   +SALLCA++ QP+SVGID S++DFQ
Sbjct: 64  SEADYPYEGVDGTCNITKEETKVVSIDGYKDVEESDSALLCAVVQQPVSVGIDASSIDFQ 123

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GI+DG CS NP+DIDHAVLIVGYGSED +DYWIVKNSWGTSWGM+GY Y++R+TDL
Sbjct: 124 LYTGGIFDGSCSDNPDDIDHAVLIVGYGSEDGEDYWIVKNSWGTSWGMDGYFYLKRDTDL 183

Query: 543 EYGVCAINDMASYPTK------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTF 704
            YGVCA+N MASYPTK                                     +CGD+++
Sbjct: 184 PYGVCAVNAMASYPTKESSSPSPYPSPSVPPPPPPPSTPPPPPPPPPPSPSPSECGDFSY 243

Query: 705 CPSDETCCCIDEFYDICL 758
           CPSDETCCC+ EFYD CL
Sbjct: 244 CPSDETCCCLFEFYDYCL 261


>ref|XP_007025363.1| Xylem bark cysteine peptidase 3 isoform 1 [Theobroma cacao]
           gi|508780729|gb|EOY27985.1| Xylem bark cysteine
           peptidase 3 isoform 1 [Theobroma cacao]
          Length = 501

 Score =  368 bits (944), Expect = 1e-99
 Identities = 166/258 (64%), Positives = 202/258 (78%), Gaps = 6/258 (2%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGA+EGINA+VTG+LISLSEQEL+DCD+TN GC+GGYMD+AFEW+I NGGID
Sbjct: 162 GSCWAFSSTGAMEGINALVTGNLISLSEQELMDCDSTNYGCDGGYMDYAFEWVINNGGID 221

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           +E+ YPY G +GTCN+T+EETK+V+IDGY+D+   +SALLCA++ QP+SVGID S++DFQ
Sbjct: 222 SEADYPYEGVDGTCNITKEETKVVSIDGYKDVEESDSALLCAVVQQPVSVGIDASSIDFQ 281

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GI+DG CS NP+DIDHAVLIVGYGSED +DYWIVKNSWGTSWGM+GY Y++R+TDL
Sbjct: 282 LYTGGIFDGSCSDNPDDIDHAVLIVGYGSEDGEDYWIVKNSWGTSWGMDGYFYLKRDTDL 341

Query: 543 EYGVCAINDMASYPTK------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTF 704
            YGVCA+N MASYPTK                                     +CGD+++
Sbjct: 342 PYGVCAVNAMASYPTKESSSPSPYPSPSVPPPPPPPSTPPPPPPPPPPSPSPSECGDFSY 401

Query: 705 CPSDETCCCIDEFYDICL 758
           CPSDETCCC+ EFYD CL
Sbjct: 402 CPSDETCCCLFEFYDYCL 419


>ref|XP_002317418.2| hypothetical protein POPTR_0011s07310g [Populus trichocarpa]
           gi|550327862|gb|EEE98030.2| hypothetical protein
           POPTR_0011s07310g [Populus trichocarpa]
          Length = 503

 Score =  365 bits (937), Expect = 9e-99
 Identities = 170/258 (65%), Positives = 196/258 (75%), Gaps = 6/258 (2%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCW+FS+TGA+EGINAIVTGDLISLSEQELVDCD T+ GCEGGYMD+AFEW+I NGGID
Sbjct: 163 GSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTTDYGCEGGYMDYAFEWVINNGGID 222

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TE++YPYTG +GTCN T+EE K+V+IDGY D+   +SALLCA + QPISVG+DGSA+DFQ
Sbjct: 223 TEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETDSALLCATVQQPISVGMDGSALDFQ 282

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCS +PNDIDHAVLIVGYGSE+ +DYWIVKNSWGT WGMEGY YI+RNTDL
Sbjct: 283 LYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFYIKRNTDL 342

Query: 543 EYGVCAINDMASYPTK------IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTF 704
            YGVCAIN  ASYPTK                                      CGD+ +
Sbjct: 343 PYGVCAINAEASYPTKESSSPSPTSPPSPPSPLSPPPPPPPTPVPPPPCPQPSDCGDFAY 402

Query: 705 CPSDETCCCIDEFYDICL 758
           CPSDETCCCI + +D C+
Sbjct: 403 CPSDETCCCILKVFDYCI 420


>ref|XP_002305743.2| hypothetical protein POPTR_0004s05640g [Populus trichocarpa]
           gi|550340399|gb|EEE86254.2| hypothetical protein
           POPTR_0004s05640g [Populus trichocarpa]
          Length = 506

 Score =  363 bits (931), Expect = 5e-98
 Identities = 170/257 (66%), Positives = 195/257 (75%), Gaps = 5/257 (1%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCW+FS+TGA+EGINAIVT DLISLSEQELVDCD TN GCE GYMD+AFEW+I NGGID
Sbjct: 167 GSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCERGYMDYAFEWVINNGGID 226

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TE++YPYTG +GTCN  +EE K+V+IDGY+D+   +SALLCA   QPISVGIDGSA+DFQ
Sbjct: 227 TEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETDSALLCAAAQQPISVGIDGSAIDFQ 286

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIYDGDCS +P+DIDHAVLIVGYGSE+ +DYWIVKNSWGTSWG+EGY YI+RNTDL
Sbjct: 287 LYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYIKRNTDL 346

Query: 543 EYGVCAINDMASYPTK-----IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFC 707
            YGVCAIN MASYPTK                                     CGD+++C
Sbjct: 347 PYGVCAINAMASYPTKEASAQSPTSPPSPPSPPPPPPPPPTPVPPPPSPQPSDCGDFSYC 406

Query: 708 PSDETCCCIDEFYDICL 758
           PSDETCCCI   +D CL
Sbjct: 407 PSDETCCCILNVFDYCL 423


>ref|XP_002317417.2| hypothetical protein POPTR_0011s07300g [Populus trichocarpa]
           gi|550327861|gb|EEE98029.2| hypothetical protein
           POPTR_0011s07300g [Populus trichocarpa]
          Length = 498

 Score =  362 bits (928), Expect = 1e-97
 Identities = 167/253 (66%), Positives = 193/253 (76%), Gaps = 1/253 (0%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATND-GCEGGYMDHAFEWIIGNGGI 179
           GSCW+FS+TGA+E INAIVTGDLISLSEQELVDCD TN+ GCEGG MD AF+W+IGNGGI
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGI 218

Query: 180 DTESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDF 359
           DTE+ YPYTG +GTCN  +EE K+V+I+GY D+ P +SALLCA + QPISVG+DGSA+DF
Sbjct: 219 DTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQPISVGMDGSALDF 278

Query: 360 QLYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTD 539
           QLYT GIYDGDCS +PNDIDHA+LIVGYGSE+D+DYWIVKNSWGT WGMEGY YIRRNT 
Sbjct: 279 QLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYIRRNTS 338

Query: 540 LEYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDE 719
             YGVCAIN  ASYPTK+                               CGD +FCPSDE
Sbjct: 339 KPYGVCAINADASYPTKVPSPPSPPSPPPPPSPPPPPPSPPPPCPQPSDCGDSSFCPSDE 398

Query: 720 TCCCIDEFYDICL 758
           TCCCI + +  C+
Sbjct: 399 TCCCILKLFSSCI 411


>gb|ABK95906.1| unknown [Populus trichocarpa]
          Length = 498

 Score =  362 bits (928), Expect = 1e-97
 Identities = 167/253 (66%), Positives = 193/253 (76%), Gaps = 1/253 (0%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATND-GCEGGYMDHAFEWIIGNGGI 179
           GSCW+FS+TGA+E INAIVTGDLISLSEQELVDCD TN+ GCEGG MD AF+W+IGNGGI
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGI 218

Query: 180 DTESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDF 359
           DTE+ YPYTG +GTCN  +EE K+V+I+GY D+ P +SALLCA + QPISVG+DGSA+DF
Sbjct: 219 DTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQPISVGMDGSALDF 278

Query: 360 QLYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTD 539
           QLYT GIYDGDCS +PNDIDHA+LIVGYGSE+D+DYWIVKNSWGT WGMEGY YIRRNT 
Sbjct: 279 QLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYIRRNTS 338

Query: 540 LEYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDE 719
             YGVCAIN  ASYPTK+                               CGD +FCPSDE
Sbjct: 339 KPYGVCAINADASYPTKVPSPPSPPSPPPPPSPPPPPPSPPPPCPQPSDCGDSSFCPSDE 398

Query: 720 TCCCIDEFYDICL 758
           TCCCI + +  C+
Sbjct: 399 TCCCILKLFSSCI 411


>ref|XP_004293953.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
           vesca]
          Length = 519

 Score =  361 bits (927), Expect = 1e-97
 Identities = 171/268 (63%), Positives = 194/268 (72%), Gaps = 17/268 (6%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTG +EGINA+VTGDLISLSEQELVDCD TN GC GGYMD+AFEW+I NGGID
Sbjct: 169 GSCWAFSSTGGIEGINALVTGDLISLSEQELVDCDTTNYGCSGGYMDYAFEWVISNGGID 228

Query: 183 TESSYPYT---GYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAM 353
           TE+ YPYT   G+ GTCNVT+EETK+VTIDGY D+   E+ L  A+L QPISVGIDGS  
Sbjct: 229 TEADYPYTSTTGFGGTCNVTKEETKVVTIDGYTDVEETETGLFNAVLQQPISVGIDGSTW 288

Query: 354 DFQLYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRN 533
           DFQLY+ GIYDGDCS +PN+IDHAVLIVGYGSE  +DYWIVKNSWGTSWGMEGY Y+RRN
Sbjct: 289 DFQLYSSGIYDGDCSDDPNNIDHAVLIVGYGSESGEDYWIVKNSWGTSWGMEGYFYLRRN 348

Query: 534 TDLEYGVCAINDMASYPTK--------------IXXXXXXXXXXXXXXXXXXXXXXXXXX 671
           TDL YGVCA+N MASYPTK                                         
Sbjct: 349 TDLPYGVCAVNAMASYPTKESSAPTPYPSPTPPPPPTPVSPPPPPVTPPPPTPVTPPPPS 408

Query: 672 XXXXQCGDYTFCPSDETCCCIDEFYDIC 755
               QCGD+++CP+DETCCC+ EF+D C
Sbjct: 409 PSPSQCGDFSYCPADETCCCLYEFFDYC 436


>ref|XP_006449509.1| hypothetical protein CICLE_v10015066mg [Citrus clementina]
           gi|557552120|gb|ESR62749.1| hypothetical protein
           CICLE_v10015066mg [Citrus clementina]
          Length = 485

 Score =  361 bits (926), Expect = 2e-97
 Identities = 165/251 (65%), Positives = 195/251 (77%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCW+FS+TGA+EGINA+VTGDLISLSEQELVDCD T+ GC+GGYMD+AFEW+I NGGID
Sbjct: 152 GSCWSFSTTGAIEGINALVTGDLISLSEQELVDCDTTSCGCDGGYMDYAFEWVINNGGID 211

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TES YPYTG +GTCN+T+EETK+V+IDGY+D+ P +SALLCA + QPISVG+ GSA+DFQ
Sbjct: 212 TESDYPYTGVDGTCNITKEETKVVSIDGYKDVEPSDSALLCAAVQQPISVGMVGSAIDFQ 271

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIY+GDCS++P  IDHAVLIVGYGSE+ +DYWIVKNSWGTSWG++GY YI R+T L
Sbjct: 272 LYTSGIYNGDCSNDPYYIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIDGYFYITRDTSL 331

Query: 543 EYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDET 722
           EYG CAIN MASYP K                               QCGD+++CPS ET
Sbjct: 332 EYGKCAINAMASYPIKESYAPSPYSPPSEPPPLPSPPPPPPPSPSPSQCGDFSYCPSGET 391

Query: 723 CCCIDEFYDIC 755
           CCCI  F D C
Sbjct: 392 CCCIFGFLDFC 402


>ref|XP_006467643.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 485

 Score =  360 bits (924), Expect = 3e-97
 Identities = 165/251 (65%), Positives = 194/251 (77%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCW+FS+TGA+EGINA+VTGDLISLSEQELVDCD T+ GC+GGYMD+AFEW+I NGGID
Sbjct: 152 GSCWSFSTTGAIEGINALVTGDLISLSEQELVDCDTTSYGCDGGYMDYAFEWVINNGGID 211

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TES YPYTG +GTCN+T+EETK+V+IDGY+D+ P +SALLCA + QPISVG+ GSA DFQ
Sbjct: 212 TESDYPYTGVDGTCNITKEETKVVSIDGYKDVEPSDSALLCAAVQQPISVGMVGSASDFQ 271

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LYT GIY+GDCS++P  IDHAVLIVGYGSE+ +DYWIVKNSWGTSWG++GY YI R+T L
Sbjct: 272 LYTSGIYNGDCSNDPYYIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIDGYFYITRDTSL 331

Query: 543 EYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDET 722
           EYG CAIN MASYP K                               QCGD+++CPS ET
Sbjct: 332 EYGKCAINAMASYPIKESYAPSPYSPPSEPPPLPSPPPPPPPSPSPSQCGDFSYCPSGET 391

Query: 723 CCCIDEFYDIC 755
           CCCI  F D C
Sbjct: 392 CCCIFGFLDFC 402


>gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  359 bits (921), Expect = 7e-97
 Identities = 166/253 (65%), Positives = 193/253 (76%), Gaps = 1/253 (0%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGA+EGINAI TG+LISLSEQELVDCD TN+GC+GGYMD+AFEW+I NGGID
Sbjct: 168 GSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNEGCDGGYMDYAFEWVINNGGID 227

Query: 183 TESSYPYTGY-NGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDF 359
           +E++YPYTG  +  CN T+EE K+V+IDGYED+A  ESALLCA + QP+SVGIDGS++DF
Sbjct: 228 SEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVATSESALLCAAVQQPVSVGIDGSSLDF 287

Query: 360 QLYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTD 539
           QLY  GIYDGDCS NP+DIDHAVL+VGYG +   DYWIVKNSWGT WGM+GY+YIRRNT 
Sbjct: 288 QLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGTDYWIVKNSWGTDWGMQGYIYIRRNTG 347

Query: 540 LEYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPSDE 719
           L YGVCAI+ MASYPTK                               QCGDY++CPSDE
Sbjct: 348 LPYGVCAIDAMASYPTK-QFAPAATPPSPAPPPPSPPPPPTPPSPSPSQCGDYSYCPSDE 406

Query: 720 TCCCIDEFYDICL 758
           TCCC+ E    CL
Sbjct: 407 TCCCLVELGGFCL 419


>ref|XP_006852404.1| hypothetical protein AMTR_s00021p00031000 [Amborella trichopoda]
           gi|548856015|gb|ERN13871.1| hypothetical protein
           AMTR_s00021p00031000 [Amborella trichopoda]
          Length = 501

 Score =  357 bits (917), Expect = 2e-96
 Identities = 170/265 (64%), Positives = 195/265 (73%), Gaps = 13/265 (4%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFS TGA+E IN IVT +LISLSEQELVDCD+TNDGC+GGYMD+AF+W+I N GID
Sbjct: 152 GSCWAFSVTGAIESINEIVTSELISLSEQELVDCDSTNDGCDGGYMDYAFQWVIQNEGID 211

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           TES Y YTG +GTCN  +EE K+V+IDGYED+  EESALLCA++NQPISVGIDGSA+DFQ
Sbjct: 212 TESDYSYTGQDGTCNTEKEEKKVVSIDGYEDVEEEESALLCAVVNQPISVGIDGSAIDFQ 271

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LY+ GIYDG CSSNP+DIDHAVLIVGY S+ D+DYWIVKNSWGTSWG+ GY+YIRRNTDL
Sbjct: 272 LYSGGIYDGLCSSNPDDIDHAVLIVGYASQGDEDYWIVKNSWGTSWGINGYIYIRRNTDL 331

Query: 543 EYGVCAINDMASYPTKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ------------ 686
           EYGVCAIN MASYPTK                                            
Sbjct: 332 EYGVCAINSMASYPTKESTSPSPMPSPGAPPPPSTTPPPPPPPPPPPPPTPPSPPGPSPV 391

Query: 687 -CGDYTFCPSDETCCCIDEFYDICL 758
            CGD+++C S ETCCC+ E Y ICL
Sbjct: 392 ICGDFSYCDSGETCCCLLELYGICL 416


>ref|XP_007148042.1| hypothetical protein PHAVU_006G175500g [Phaseolus vulgaris]
           gi|561021265|gb|ESW20036.1| hypothetical protein
           PHAVU_006G175500g [Phaseolus vulgaris]
          Length = 507

 Score =  357 bits (915), Expect = 3e-96
 Identities = 163/254 (64%), Positives = 195/254 (76%), Gaps = 3/254 (1%)
 Frame = +3

Query: 3   GSCWAFSSTGAVEGINAIVTGDLISLSEQELVDCDATNDGCEGGYMDHAFEWIIGNGGID 182
           GSCWAFSSTGA+EGINA+VTGDL+SLSEQELVDCD+TN+GC GG MD+AFEW++ NGGID
Sbjct: 172 GSCWAFSSTGAIEGINALVTGDLVSLSEQELVDCDSTNEGCYGGLMDYAFEWVMHNGGID 231

Query: 183 TESSYPYTGYNGTCNVTEEETKIVTIDGYEDLAPEESALLCALLNQPISVGIDGSAMDFQ 362
           +E+ YPYTG +  CNVT+E+TK+V+IDGY D+   +++LLCA   QPISV IDGS++DFQ
Sbjct: 232 SETEYPYTGVDARCNVTKEKTKVVSIDGYSDVGQSDNSLLCATAKQPISVAIDGSSLDFQ 291

Query: 363 LYTEGIYDGDCSSNPNDIDHAVLIVGYGSEDDQDYWIVKNSWGTSWGMEGYVYIRRNTDL 542
           LY  GIYDGDCSS+P+DIDHAVLIVGYGSEDD+DYWIVKNSWGTSWGMEGY+YIRRNTDL
Sbjct: 292 LYAGGIYDGDCSSDPDDIDHAVLIVGYGSEDDEDYWIVKNSWGTSWGMEGYIYIRRNTDL 351

Query: 543 EYGVCAINDMASYPTK---IXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQCGDYTFCPS 713
           +YGVCAIN MASYPTK                                  +CGD+++C +
Sbjct: 352 KYGVCAINYMASYPTKEITAPSPSSSPSPPSPSPPQPLPPPPPPPPPPPIRCGDFSYCSA 411

Query: 714 DETCCCIDEFYDIC 755
            ETCCC+  F   C
Sbjct: 412 SETCCCLYGFSGFC 425


Top