BLASTX nr result

ID: Akebia25_contig00038871 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00038871
         (658 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004307346.1| PREDICTED: cysteine proteinase 15A-like [Fra...   280   2e-73
ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vit...   274   2e-71
gb|EXB96194.1| putative cysteine proteinase A494 [Morus notabilis]    273   5e-71
ref|XP_007218137.1| hypothetical protein PRUPE_ppa007240mg [Prun...   266   6e-69
ref|XP_006436511.1| hypothetical protein CICLE_v10031865mg [Citr...   262   6e-68
ref|XP_007010012.1| Papain family cysteine protease [Theobroma c...   262   8e-68
ref|XP_002533377.1| cysteine protease, putative [Ricinus communi...   258   9e-67
ref|XP_006361690.1| PREDICTED: cysteine proteinase 15A-like [Sol...   254   1e-65
ref|XP_004496226.1| PREDICTED: cysteine proteinase 15A-like [Cic...   254   1e-65
emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]   250   3e-64
emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]          250   3e-64
ref|NP_001236888.1| cysteine proteinase precursor [Glycine max] ...   249   7e-64
ref|XP_004250044.1| PREDICTED: cysteine proteinase 15A-like isof...   248   2e-63
ref|XP_004250043.1| PREDICTED: cysteine proteinase 15A-like isof...   248   2e-63
gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]             246   4e-63
ref|XP_002316398.1| hypothetical protein POPTR_0010s23510g [Popu...   244   2e-62
ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cuc...   243   3e-62
ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cuc...   243   3e-62
ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th...   243   3e-62
ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab...   241   1e-61

>ref|XP_004307346.1| PREDICTED: cysteine proteinase 15A-like [Fragaria vesca subsp.
           vesca]
          Length = 376

 Score =  280 bits (717), Expect = 2e-73
 Identities = 138/217 (63%), Positives = 163/217 (75%), Gaps = 6/217 (2%)
 Frame = +2

Query: 23  ILTSTVGVTVLTCALMVSAVFHKTLEDPKIHQITEEDQINRILAGNWALGTEKNFQIFMK 202
           +LT  VGV VL CAL +      + +DP IHQ+T+ D        N  LGTEK F+IFM+
Sbjct: 10  LLTCLVGVAVLICALTLCVALDVSPQDPIIHQVTDHDT-------NKLLGTEKEFKIFME 62

Query: 203 KYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTG--- 373
           KYGK+Y + +EY+HR+G+FAKN++RAAEHQ LDPTAVHGVTPF DL EEEF+R++TG   
Sbjct: 63  KYGKKYPTMKEYMHRLGVFAKNMIRAAEHQALDPTAVHGVTPFSDLEEEEFKRMYTGVRG 122

Query: 374 ---LSSGGSPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIE 544
              L++GG       ST  L+DV GLP SFDWREKGAVT+VK QG CGSCWAFSTTG IE
Sbjct: 123 APGLTNGGGGAG---STVELLDVSGLPESFDWREKGAVTEVKMQGGCGSCWAFSTTGAIE 179

Query: 545 GANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           GANFIATGKL+ LSEQQLVDCDH CDA EKD+CD GC
Sbjct: 180 GANFIATGKLVSLSEQQLVDCDHQCDAKEKDACDRGC 216


>ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  274 bits (700), Expect = 2e-71
 Identities = 136/214 (63%), Positives = 165/214 (77%), Gaps = 4/214 (1%)
 Frame = +2

Query: 26  LTSTVGVT-VLTCALMVSAVF---HKTLEDPKIHQITEEDQINRILAGNWALGTEKNFQI 193
           LT  +GV  +LTCAL  SA+    H T  DP I Q+T+    +R    +  LGTEK F++
Sbjct: 5   LTCALGVAALLTCALAASAISLHEHDTPWDPNIVQVTDGHS-HRKFGVDGVLGTEKEFRM 63

Query: 194 FMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTG 373
           FM+KYGKEYSS EEY+HR+GIFAKN++RAAEHQ LDPTA+HGVTPF DLSEEEFER+FTG
Sbjct: 64  FMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSDLSEEEFERMFTG 123

Query: 374 LSSGGSPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGAN 553
           +         ++ TA  ++V+GLP SFDWREKGAVT+VK QGTCGSCWAFSTTG +EGA+
Sbjct: 124 VVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAH 183

Query: 554 FIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           FI+T KLL LSEQQLVDCDHMCD  +K +CD+GC
Sbjct: 184 FISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGC 217


>gb|EXB96194.1| putative cysteine proteinase A494 [Morus notabilis]
          Length = 372

 Score =  273 bits (697), Expect = 5e-71
 Identities = 138/215 (64%), Positives = 160/215 (74%), Gaps = 5/215 (2%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTLEDPKIHQITEEDQINRILAGNWALGTEKNFQIFMKK 205
           LT  V    LTCAL ++       +DP IHQ+T  D ++   +GN  LGTEK F+IFM+K
Sbjct: 4   LTCAVVAAALTCALTLTMALG--YQDPVIHQVT--DDLHPKFSGNELLGTEKKFKIFMEK 59

Query: 206 YGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSSG 385
           YGK Y S EEY+ R+GIFA+N++RAAEHQ LDPTAVHGVTPF DLSEEEFE ++TG   G
Sbjct: 60  YGKTYGSREEYVRRLGIFARNMLRAAEHQALDPTAVHGVTPFSDLSEEEFESMYTGFRGG 119

Query: 386 GS--PLNKISSTAPLMDVEG---LPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGA 550
                 N  +S A   +V+G   LP SFDWREKGAVTDVK QG+CGSCWAFSTTG +EGA
Sbjct: 120 PGFGHGNVANSAAAAAEVDGGGHLPESFDWREKGAVTDVKMQGSCGSCWAFSTTGAVEGA 179

Query: 551 NFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           NFIATGKLL LSEQQLVDCDH CDA EKD+CDNGC
Sbjct: 180 NFIATGKLLNLSEQQLVDCDHTCDATEKDACDNGC 214


>ref|XP_007218137.1| hypothetical protein PRUPE_ppa007240mg [Prunus persica]
           gi|462414599|gb|EMJ19336.1| hypothetical protein
           PRUPE_ppa007240mg [Prunus persica]
          Length = 377

 Score =  266 bits (679), Expect = 6e-69
 Identities = 132/209 (63%), Positives = 164/209 (78%), Gaps = 5/209 (2%)
 Frame = +2

Query: 44  VTVLTCALM--VSAVFHKTLEDPKIHQITEEDQINRILAGNWALGTEKNFQIFMKKYGKE 217
           V VLTCAL+  +S   H   +DP IHQ+T+  +          LGTE++FQ+F++KYGK+
Sbjct: 17  VAVLTCALISTLSFALHDAAQDPIIHQVTDNHRP--------LLGTERSFQMFIEKYGKK 68

Query: 218 YSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSS--GGS 391
           YS+ +EY+HR+GIFAKN++RAAEHQ LDPTAVHGVTPF DLSEEEFER++TG+ +    S
Sbjct: 69  YSTRKEYMHRLGIFAKNMVRAAEHQALDPTAVHGVTPFSDLSEEEFERMYTGMRAVPPAS 128

Query: 392 PLNKISSTAP-LMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANFIATG 568
             + +S +AP +MDV  LP +FDWREKGAVT+VK QG CGSCWAFSTTG IEGANFIATG
Sbjct: 129 SNDGVSDSAPPVMDVGDLPENFDWREKGAVTEVKMQGGCGSCWAFSTTGAIEGANFIATG 188

Query: 569 KLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           KLL LSEQQLVDCD+ CDA +K +CD+GC
Sbjct: 189 KLLSLSEQQLVDCDNTCDAKDKTACDSGC 217


>ref|XP_006436511.1| hypothetical protein CICLE_v10031865mg [Citrus clementina]
           gi|568864541|ref|XP_006485655.1| PREDICTED: cysteine
           proteinase 1-like [Citrus sinensis]
           gi|557538707|gb|ESR49751.1| hypothetical protein
           CICLE_v10031865mg [Citrus clementina]
          Length = 373

 Score =  262 bits (670), Expect = 6e-68
 Identities = 128/214 (59%), Positives = 163/214 (76%), Gaps = 3/214 (1%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTLEDPKIHQITEEDQINRILAGNWALGTEKNFQIFMKK 205
           LT  +GVT+LT AL +S+      ++P I Q+T  D    +L G+    TE NF+IFM+K
Sbjct: 9   LTCAIGVTLLTYALTLSSAL--VPQNPTIRQVT--DNPGHLLLGS---ATENNFKIFMQK 61

Query: 206 YGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSSG 385
           Y K Y++ E+Y+HR+GIFAKN++RAAEHQ+LDPTAVHGVTPF DLSEEEFE ++TG+  G
Sbjct: 62  YEKSYATREDYVHRLGIFAKNMIRAAEHQLLDPTAVHGVTPFSDLSEEEFESMYTGMKGG 121

Query: 386 GSPLNK---ISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANF 556
              ++     SS+  +M+++GLP +FDWR+KGAVT+VK QG CGSCWAFSTTG +EGANF
Sbjct: 122 PPEMDGGGLESSSVKMMEIDGLPENFDWRDKGAVTEVKMQGACGSCWAFSTTGAVEGANF 181

Query: 557 IATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGCN 658
           IATG LL LSEQQLVDCDH CD  EK +CDNGC+
Sbjct: 182 IATGNLLSLSEQQLVDCDHSCDIKEKGTCDNGCS 215


>ref|XP_007010012.1| Papain family cysteine protease [Theobroma cacao]
           gi|508726925|gb|EOY18822.1| Papain family cysteine
           protease [Theobroma cacao]
          Length = 382

 Score =  262 bits (669), Expect = 8e-68
 Identities = 131/220 (59%), Positives = 164/220 (74%), Gaps = 9/220 (4%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTL--EDPKIHQITEE--DQINRILAGNWALGTEKNFQI 193
           LT T  +  L  +L++S     T   ++P I Q+T+     +NR  + N+     K FQ+
Sbjct: 9   LTCTTAIAALIFSLILSLCLALTEIPQEPTILQVTDNLIPTLNRKFSRNYV---HKEFQV 65

Query: 194 FMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTG 373
           F++KYGK YS+TEEY+HR+GIFAKNL+RAAEHQVLDPTAVHGVT F DLSEEEFERL+TG
Sbjct: 66  FVEKYGKNYSTTEEYMHRLGIFAKNLIRAAEHQVLDPTAVHGVTQFSDLSEEEFERLYTG 125

Query: 374 LSSGGSP-----LNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGT 538
           +  G +      ++ + S A +++V+GLP SFDWREKGAVT+VK QG CGSCWAFSTTG 
Sbjct: 126 VKGGMAAAAPRMMDGVGSEAEMVEVDGLPESFDWREKGAVTEVKMQGACGSCWAFSTTGA 185

Query: 539 IEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGCN 658
           IEGANF+ATGKLL LSEQQLVDCD MCD  +K +CDNGC+
Sbjct: 186 IEGANFVATGKLLSLSEQQLVDCDQMCDIKDKTACDNGCS 225


>ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
           gi|223526784|gb|EEF29008.1| cysteine protease, putative
           [Ricinus communis]
          Length = 381

 Score =  258 bits (660), Expect = 9e-67
 Identities = 123/191 (64%), Positives = 150/191 (78%), Gaps = 3/191 (1%)
 Frame = +2

Query: 92  TLEDPKIHQITEEDQI---NRILAGNWALGTEKNFQIFMKKYGKEYSSTEEYLHRMGIFA 262
           TL+DP I Q+T++  +   NR   G     TE+NF++FM KY KEY + EEY+HR+G+FA
Sbjct: 36  TLQDPTILQVTDDPSVTLSNRKFLGT---NTEENFKMFMIKYDKEYDTREEYMHRLGVFA 92

Query: 263 KNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSSGGSPLNKISSTAPLMDVEGL 442
           KNL+RAAEHQVLDPTAVHG+TPF DL+EEEFER++TG+  GG+   +  +    ++  GL
Sbjct: 93  KNLIRAAEHQVLDPTAVHGITPFMDLTEEEFERMYTGVVGGGAVGAEGVTATSFLETAGL 152

Query: 443 PSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANFIATGKLLKLSEQQLVDCDHMCD 622
           PSSFDWR+KGAVTDVK QG CGSCWAFSTTG IEGANFIATGKLL LSEQQLVDCD +CD
Sbjct: 153 PSSFDWRKKGAVTDVKMQGACGSCWAFSTTGAIEGANFIATGKLLNLSEQQLVDCDRVCD 212

Query: 623 AAEKDSCDNGC 655
             EK +CD+GC
Sbjct: 213 IKEKTACDDGC 223


>ref|XP_006361690.1| PREDICTED: cysteine proteinase 15A-like [Solanum tuberosum]
          Length = 386

 Score =  254 bits (650), Expect = 1e-65
 Identities = 128/222 (57%), Positives = 162/222 (72%), Gaps = 12/222 (5%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTL-----EDPKIHQITEEDQINRILAG---NWALGT-- 175
           LT  +GVT+LTCA  +    H ++     E+ KI Q+T++        G   +  LGT  
Sbjct: 7   LTYALGVTILTCAFSLLPFHHTSVAAAIPEEFKIRQVTDDQNPTTTAHGGSNHHLLGTPA 66

Query: 176 EKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEF 355
           E  F+ F+++Y KEYS+ EEY+HR+G+F KNL++AAEHQ LDPTAVHGVT F DL+ EEF
Sbjct: 67  EHRFKSFIQEYNKEYSTREEYIHRLGVFVKNLLKAAEHQALDPTAVHGVTQFSDLTSEEF 126

Query: 356 ERLFTGLSSG--GSPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFST 529
           ER++ G+  G   S L ++ S AP M+V+ LP SFDWREKGAVTDVK QG+CGSCWAFST
Sbjct: 127 ERMYMGVKGGDRSSLLGEVGSHAPPMEVKDLPKSFDWREKGAVTDVKMQGSCGSCWAFST 186

Query: 530 TGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           TG+IEGANFIATGKLL LSEQQLVDCD+ CD  +K +CD+GC
Sbjct: 187 TGSIEGANFIATGKLLNLSEQQLVDCDNTCDKKDKKACDSGC 228


>ref|XP_004496226.1| PREDICTED: cysteine proteinase 15A-like [Cicer arietinum]
          Length = 387

 Score =  254 bits (650), Expect = 1e-65
 Identities = 125/212 (58%), Positives = 154/212 (72%), Gaps = 2/212 (0%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTLEDPKIHQITEEDQINRI-LAGNWALGTEKNFQIFMK 202
           LT    +T+  C L +S   H+  E+   H+   ++   ++ L  N  L TEK F++FM+
Sbjct: 9   LTCYSRITIFLCVLTLSTSLHRFSEN---HETLIQNVARKLELKDNELLKTEKKFKVFME 65

Query: 203 KYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSS 382
            Y K YS+ EEYL R+GIFA+N+++AAEHQVLDPTA+HG+T F DLSEEEFER +TG+  
Sbjct: 66  DYSKRYSTREEYLLRLGIFARNMVKAAEHQVLDPTAIHGITQFSDLSEEEFERFYTGVKG 125

Query: 383 GGS-PLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANFI 559
           GG    N     AP +DVEGLP +FDWREKGAVT VK QG CGSCWAFSTTG++EGANF+
Sbjct: 126 GGLLASNAAGEVAPPLDVEGLPENFDWREKGAVTGVKMQGKCGSCWAFSTTGSVEGANFL 185

Query: 560 ATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           ATGKLL LSEQQLVDCD+ CD  EK SCDNGC
Sbjct: 186 ATGKLLSLSEQQLVDCDNKCDITEKTSCDNGC 217


>emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score =  250 bits (638), Expect = 3e-64
 Identities = 114/163 (69%), Positives = 137/163 (84%)
 Frame = +2

Query: 167 LGTEKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSE 346
           +G EK F++FM+KYGKEYSS EEY+HR+GIFAKN++RAAEHQ LDP A+HGVTPF DLSE
Sbjct: 1   MGGEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSE 60

Query: 347 EEFERLFTGLSSGGSPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFS 526
           EEFER+FTG+         ++ TA  ++V+GLP SFDWREKGAVT+VK QGTCGSCWAFS
Sbjct: 61  EEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFS 120

Query: 527 TTGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           TTG +EGA+FI+T KLL LSEQQLVDCDHMCD  +K +CD+GC
Sbjct: 121 TTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGC 163


>emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
          Length = 379

 Score =  250 bits (638), Expect = 3e-64
 Identities = 124/212 (58%), Positives = 151/212 (71%), Gaps = 1/212 (0%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTLEDPKIHQITEEDQINRI-LAGNWALGTEKNFQIFMK 202
           LT    V +  CAL +S+  H        H+   +D   ++ L  N  L TEK F++FMK
Sbjct: 9   LTRYARVAIFLCALTLSSSLH--------HETLIQDVARKLELKDNDLLTTEKKFKLFMK 60

Query: 203 KYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSS 382
            Y K+YS+TEEYL R+GIFAKN+++AAEHQ LDPTA+HGVT F DLSEEEFER +TG   
Sbjct: 61  DYSKKYSTTEEYLLRLGIFAKNMVKAAEHQALDPTAIHGVTQFSDLSEEEFERFYTGFKG 120

Query: 383 GGSPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANFIA 562
           G    N     AP +DV+G P +FDWREKGAVT +K+QG CGSCWAF+TTG+IEGANF+A
Sbjct: 121 GFPSSNAAGGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGANFLA 180

Query: 563 TGKLLKLSEQQLVDCDHMCDAAEKDSCDNGCN 658
           TGKL+ LSEQQLVDCD+ CD   K SCDNGCN
Sbjct: 181 TGKLVSLSEQQLVDCDNKCDIT-KTSCDNGCN 211


>ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
           gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine
           max] gi|300507422|gb|ADK24076.1| cysteine proteinase
           [Glycine max] gi|300507425|gb|ADK24077.1| cysteine
           proteinase [Glycine max] gi|1096153|prf||2111244A Cys
           protease
          Length = 380

 Score =  249 bits (635), Expect = 7e-64
 Identities = 123/207 (59%), Positives = 154/207 (74%), Gaps = 2/207 (0%)
 Frame = +2

Query: 44  VTVLTCALMVSAVFHKTLEDPKIHQITEEDQINRILAG-NWALGTEKNFQIFMKKYGKEY 220
           V++  CAL +SA    T         T +D   ++  G N  L TEK F++FM+ YG+ Y
Sbjct: 15  VSLFLCALTLSAAHGST---------TVQDIARKLKLGDNELLRTEKKFKVFMENYGRSY 65

Query: 221 SSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSSG-GSPL 397
           S+ EEYL R+GIFA+N++RAAEHQ LDPTAVHGVT F DL+E+EFE+L+TG++ G  S  
Sbjct: 66  STEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSDLTEDEFEKLYTGVNGGFPSSN 125

Query: 398 NKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANFIATGKLL 577
           N     AP ++V+GLP +FDWREKGAVT+VK QG CGSCWAFSTTG+IEGANF+ATGKL+
Sbjct: 126 NAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLV 185

Query: 578 KLSEQQLVDCDHMCDAAEKDSCDNGCN 658
            LSEQQL+DCD+ CD  EK SCDNGCN
Sbjct: 186 SLSEQQLLDCDNKCDITEKTSCDNGCN 212


>ref|XP_004250044.1| PREDICTED: cysteine proteinase 15A-like isoform 2 [Solanum
           lycopersicum]
          Length = 386

 Score =  248 bits (632), Expect = 2e-63
 Identities = 127/222 (57%), Positives = 159/222 (71%), Gaps = 12/222 (5%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTL-----EDPKIHQITEEDQINRILAG---NWALGT-- 175
           LT  + VT+LTCA  +    H +      E+ KI Q+T+         G   +  LGT  
Sbjct: 7   LTYALSVTILTCAFSLLPFHHTSAAAAVPEEFKIRQVTDGRNPTTTAHGGSNHHLLGTPA 66

Query: 176 EKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEF 355
           E  F+ F+++Y KEYS+ EEY+HR+G+F KNL+RAAEHQ LDPTAVHGVT F DL+ EEF
Sbjct: 67  EHRFKSFIQEYNKEYSTREEYVHRLGVFVKNLLRAAEHQALDPTAVHGVTQFSDLTSEEF 126

Query: 356 ERLFTGLSSGG--SPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFST 529
           ER++ G+  G   S L +  S AP M+V+ LP+SFDWREKGAVTDVK QG+CGSCWAFST
Sbjct: 127 ERMYMGVKGGDRTSLLREFGSHAPPMEVKDLPNSFDWREKGAVTDVKMQGSCGSCWAFST 186

Query: 530 TGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           TG+IEGANFIATGKLL LSEQQLVDCD+ CD  ++ +CD+GC
Sbjct: 187 TGSIEGANFIATGKLLNLSEQQLVDCDNTCDKKDRKACDSGC 228


>ref|XP_004250043.1| PREDICTED: cysteine proteinase 15A-like isoform 1 [Solanum
           lycopersicum]
          Length = 425

 Score =  248 bits (632), Expect = 2e-63
 Identities = 127/222 (57%), Positives = 159/222 (71%), Gaps = 12/222 (5%)
 Frame = +2

Query: 26  LTSTVGVTVLTCALMVSAVFHKTL-----EDPKIHQITEEDQINRILAG---NWALGT-- 175
           LT  + VT+LTCA  +    H +      E+ KI Q+T+         G   +  LGT  
Sbjct: 7   LTYALSVTILTCAFSLLPFHHTSAAAAVPEEFKIRQVTDGRNPTTTAHGGSNHHLLGTPA 66

Query: 176 EKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEF 355
           E  F+ F+++Y KEYS+ EEY+HR+G+F KNL+RAAEHQ LDPTAVHGVT F DL+ EEF
Sbjct: 67  EHRFKSFIQEYNKEYSTREEYVHRLGVFVKNLLRAAEHQALDPTAVHGVTQFSDLTSEEF 126

Query: 356 ERLFTGLSSGG--SPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFST 529
           ER++ G+  G   S L +  S AP M+V+ LP+SFDWREKGAVTDVK QG+CGSCWAFST
Sbjct: 127 ERMYMGVKGGDRTSLLREFGSHAPPMEVKDLPNSFDWREKGAVTDVKMQGSCGSCWAFST 186

Query: 530 TGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           TG+IEGANFIATGKLL LSEQQLVDCD+ CD  ++ +CD+GC
Sbjct: 187 TGSIEGANFIATGKLLNLSEQQLVDCDNTCDKKDRKACDSGC 228


>gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
          Length = 397

 Score =  246 bits (629), Expect = 4e-63
 Identities = 129/232 (55%), Positives = 161/232 (69%), Gaps = 20/232 (8%)
 Frame = +2

Query: 23  ILTSTVGVTVLTCALMVSAVFHKTLE----DP-KIHQITEEDQIN----RILAGNWALGT 175
           +LT T+ +T+L+CAL+ S  F   ++    DP  I Q+T+         R  A +  LGT
Sbjct: 9   MLTCTLAITLLSCALISSTTFQHEIQYRVQDPLMIRQVTDNHHHRHHPGRSSANHRLLGT 68

Query: 176 --EKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEE 349
             E +F+ F+++Y K YS+ EEY+HR+GIFAKNL++AAEHQ +DP+A+HGVT F DL+EE
Sbjct: 69  TTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQAMDPSAIHGVTQFSDLTEE 128

Query: 350 EFERLFTGLSSG---------GSPLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGT 502
           EFE  + GL  G         G      S+   +MDV  LP SFDWREKGAVT+VK+QG 
Sbjct: 129 EFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSDLPESFDWREKGAVTEVKTQGR 188

Query: 503 CGSCWAFSTTGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGCN 658
           CGSCWAFSTTG IEGANFIATGKLL LSEQQLVDCDHMCD  EKD CD+GC+
Sbjct: 189 CGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCS 240


>ref|XP_002316398.1| hypothetical protein POPTR_0010s23510g [Populus trichocarpa]
           gi|222865438|gb|EEF02569.1| hypothetical protein
           POPTR_0010s23510g [Populus trichocarpa]
          Length = 327

 Score =  244 bits (623), Expect = 2e-62
 Identities = 112/164 (68%), Positives = 136/164 (82%), Gaps = 1/164 (0%)
 Frame = +2

Query: 167 LGTEKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSE 346
           LGTE+ F++F+K++ KEY++ EEY+HR GIF KNL+RA EHQ LDPTA+HGVTPF DL+E
Sbjct: 8   LGTEEKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDPTAIHGVTPFMDLTE 67

Query: 347 EEFERLFTGLSSGGS-PLNKISSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAF 523
           EEFER++ G+  GG+ P+ K   +   MD  GLP SFDWREKGAVTDVK QG+CGSCWAF
Sbjct: 68  EEFERMYAGVLGGGTVPVEK--GSVSFMDASGLPDSFDWREKGAVTDVKIQGSCGSCWAF 125

Query: 524 STTGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGC 655
           STTG++EGANFIATGKLL LSEQQLVDCD +CD  +K SCD+GC
Sbjct: 126 STTGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGC 169


>ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  243 bits (621), Expect = 3e-62
 Identities = 123/226 (54%), Positives = 161/226 (71%), Gaps = 7/226 (3%)
 Frame = +2

Query: 2   RNEVRMRILTSTVGVTVLTCALMVSAVFHKTL--EDPK-IHQITEEDQINRILAGNWALG 172
           R++ +M    + +    ++ AL++SA+   T    DP+ + Q+T+ +  N + AG+    
Sbjct: 30  RSKNKMATAVTLLLACAISLALLISAIPSATALRRDPEFLRQVTDGEIFNNLPAGS---- 85

Query: 173 TEKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEE 352
            E+ F +FM+KYGK Y + +EYLHR GIF KNL+RAAEHQ LDPTAVHGVT F DLSEEE
Sbjct: 86  -ERKFVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEE 144

Query: 353 FERLFTGL--SSGGSPLNKISSTAPLM--DVEGLPSSFDWREKGAVTDVKSQGTCGSCWA 520
           FER+F G+   +GG  L +++    +   +V+GLP  FDWR+KGAVT+VK QGTCGSCWA
Sbjct: 145 FERMFMGVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWA 204

Query: 521 FSTTGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGCN 658
           FST G +EGANFIATG LL LSEQQLVDCDH CD  +K +C+NGCN
Sbjct: 205 FSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCN 250


>ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  243 bits (621), Expect = 3e-62
 Identities = 123/226 (54%), Positives = 161/226 (71%), Gaps = 7/226 (3%)
 Frame = +2

Query: 2   RNEVRMRILTSTVGVTVLTCALMVSAVFHKTL--EDPK-IHQITEEDQINRILAGNWALG 172
           R++ +M    + +    ++ AL++SA+   T    DP+ + Q+T+ +  N + AG+    
Sbjct: 30  RSKNKMATAVTLLLACAISLALLISAIPSATALRRDPEFLRQVTDGEIFNNLPAGS---- 85

Query: 173 TEKNFQIFMKKYGKEYSSTEEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEE 352
            E+ F +FM+KYGK Y + +EYLHR GIF KNL+RAAEHQ LDPTAVHGVT F DLSEEE
Sbjct: 86  -ERKFVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEE 144

Query: 353 FERLFTGL--SSGGSPLNKISSTAPLM--DVEGLPSSFDWREKGAVTDVKSQGTCGSCWA 520
           FER+F G+   +GG  L +++    +   +V+GLP  FDWR+KGAVT+VK QGTCGSCWA
Sbjct: 145 FERMFMGVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWA 204

Query: 521 FSTTGTIEGANFIATGKLLKLSEQQLVDCDHMCDAAEKDSCDNGCN 658
           FST G +EGANFIATG LL LSEQQLVDCDH CD  +K +C+NGCN
Sbjct: 205 FSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCN 250


>ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
           gi|17979125|gb|AAL49820.1| putative cysteine proteinase
           [Arabidopsis thaliana] gi|332645795|gb|AEE79316.1|
           Papain family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  243 bits (621), Expect = 3e-62
 Identities = 114/203 (56%), Positives = 154/203 (75%), Gaps = 1/203 (0%)
 Frame = +2

Query: 50  VLTCALMVSAVFHKTLEDPKIHQITEEDQINRILAGNWALGTEKNFQIFMKKYGKEYSST 229
           ++TC ++   V   ++ED  I Q+T +++  RI        TE  F++FM  YGK YS+ 
Sbjct: 9   LITCIILFCHVV-ASVEDLTIRQVTADNR--RIRPNLLGTHTESKFRLFMSDYGKNYSTR 65

Query: 230 EEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSS-GGSPLNKI 406
           EEY+HR+GIFAKN+++AAEHQ++DP+AVHGVT F DL+EEEF+R++TG++  GGS    +
Sbjct: 66  EEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTV 125

Query: 407 SSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANFIATGKLLKLS 586
            + AP+++V+GLP  FDWREKG VT+VK+QG CGSCWAFSTTG  EGA+F++TGKLL LS
Sbjct: 126 GAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLS 185

Query: 587 EQQLVDCDHMCDAAEKDSCDNGC 655
           EQQLVDCD  CD  +K +CDNGC
Sbjct: 186 EQQLVDCDQACDPKDKKACDNGC 208


>ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata] gi|297322116|gb|EFH52537.1| hypothetical protein
           ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata]
          Length = 368

 Score =  241 bits (615), Expect = 1e-61
 Identities = 114/204 (55%), Positives = 155/204 (75%), Gaps = 2/204 (0%)
 Frame = +2

Query: 50  VLTCALMVSAVFHKTLEDPKIHQITEEDQINRILAGNWALGTEKNFQIFMKKYGKEYSST 229
           ++TC +    V   ++ED  I Q+T +++  R+        TE  F++FM  YGK YS+ 
Sbjct: 9   LITCIIFFCHVV-ASVEDLTIRQVTADER--RVRPNLLGTHTESKFRVFMSDYGKNYSTR 65

Query: 230 EEYLHRMGIFAKNLMRAAEHQVLDPTAVHGVTPFFDLSEEEFERLFTGLSS-GGSPLNKI 406
           EEY+HR+GIFAKN+++AAEHQ++DPTAVHGVT F DL+EEEF+R++TG++  GGS  + +
Sbjct: 66  EEYIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGHAV 125

Query: 407 SSTAPLMDVEGLPSSFDWREKGAVTDVKSQGTCGSCWAFSTTGTIEGANFIATGKLLKLS 586
            + AP+++V+GLP  FDWREKG VT+VK+QG CGSCWAFSTTG  EGA+F++TGKLL LS
Sbjct: 126 GAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLS 185

Query: 587 EQQLVDCDH-MCDAAEKDSCDNGC 655
           EQQLVDCD  +CD  +K +CDNGC
Sbjct: 186 EQQLVDCDQAVCDPKDKKACDNGC 209


Top