BLASTX nr result

ID: Akebia27_contig00017535 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00017535
         (1490 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          586   e-164
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   582   e-163
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   575   e-161
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   573   e-161
ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas...   570   e-160
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   570   e-160
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   568   e-159
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   563   e-158
gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulga...   562   e-157
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  560   e-157
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   559   e-156
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   556   e-156
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   551   e-154
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   551   e-154
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   550   e-154
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   550   e-154
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   549   e-153
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   548   e-153
ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun...   547   e-153
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   544   e-152

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  586 bits (1510), Expect = e-164
 Identities = 279/426 (65%), Positives = 327/426 (76%), Gaps = 9/426 (2%)
 Frame = +3

Query: 60   FSSFTSD---LFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNA 230
            FSS +S+   LF +WC+Q+GK Y+S+EEKL+R  VF+DN  F+T HN+  NS+YTL+LNA
Sbjct: 19   FSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNA 78

Query: 231  FADLTHHEFKSSRFGLSLAAS---NLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 401
            FADLTHHEFK+SR GLS AAS   N+DRSN Q+  F    DVP+S+DWRK GAVT VKDQ
Sbjct: 79   FADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVA--DVPASVDWRKNGAVTQVKDQ 136

Query: 402  ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 581
             +CGACW+FSATGAIEGIN+IV+GSLVSLSEQELVDCDK+YN+GC GG+MDYAFQFV+DN
Sbjct: 137  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196

Query: 582  KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGS 758
             GIDTE+DYPYQ +  SCNK KLKRHVVTIDGY D+P + E+E+++AVA+QPVSVGICGS
Sbjct: 197  HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256

Query: 759  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 938
            ERAFQ YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGMDGYMHM RN+
Sbjct: 257  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316

Query: 939  GDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 1112
            G   G+CGIN LA                   +C L T+CG GETCCC   + GIC  WK
Sbjct: 317  GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376

Query: 1113 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSL 1292
            CCE++SAVCCKD R CCP DYPVCDT   +C K   N+T ++    K  S GK   W+SL
Sbjct: 377  CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKF-AKNSSSGKFRSWSSL 435

Query: 1293 FEAWNL 1310
             E W L
Sbjct: 436  LEGWIL 441


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  582 bits (1501), Expect = e-163
 Identities = 272/417 (65%), Positives = 326/417 (78%), Gaps = 3/417 (0%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242
            SS  S LF +WC+++GK+Y+S+EE+ +R  VFEDN  F+T HN+  NS+Y+LALNAFADL
Sbjct: 22   SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81

Query: 243  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422
            THHEFK+SR GLS A  NL   N ++TG  V  D+P+SIDWR KG VT VKDQ SCGACW
Sbjct: 82   THHEFKTSRLGLSAAPLNLAHRNLEITG--VVGDIPASIDWRNKGVVTNVKDQGSCGACW 139

Query: 423  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 602
            +FSATGAIEGIN+IV+GSLVSLSEQEL++CDK+YN GCGGGLMDYAFQFV++N GIDTE+
Sbjct: 140  SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199

Query: 603  DYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQSY 779
            DYPY+A+  +CNK+++KR VVTID Y D+P + E++++QAVA+QPVSVGICGSERAFQ Y
Sbjct: 200  DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259

Query: 780  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 959
            SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM GYMHM RN+G+  G+C
Sbjct: 260  SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVC 319

Query: 960  GINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133
            GIN LA                   KC+LLTYC AGETCCC R+  GIC  WKCC ++SA
Sbjct: 320  GINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSA 379

Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304
            VCCKD   CCPHDYPVCDT   +CFK   N+T ++ +E K  + GK G WNSL EAW
Sbjct: 380  VCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK--TSGKFGSWNSLPEAW 434


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  575 bits (1483), Expect = e-161
 Identities = 273/418 (65%), Positives = 317/418 (75%), Gaps = 3/418 (0%)
 Frame = +3

Query: 66   SFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLT 245
            S  S LF +WC+Q+GK YSSEEEK YR  VFE+N AF+T HN + NS+Y+LALNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 246  HHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWA 425
            HHEFK+SR GLS AA    R N QL G  + RD+P+S+DWR KGAVT VKDQ SCGACW+
Sbjct: 84   HHEFKASRLGLSAAAIEGSRPNLQLPG--LVRDIPASMDWRTKGAVTKVKDQGSCGACWS 141

Query: 426  FSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKD 605
            FSATGAIEGIN+IV+G+LVSLSEQELVDCD++YNSGC GGLMDYA+QFV+DN GID E+D
Sbjct: 142  FSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEED 201

Query: 606  YPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYS 782
            YPY  +  +CNK K KR VVTIDGY  +P+  E+ ++QAVA QPVSVGICGSERAFQ YS
Sbjct: 202  YPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYS 261

Query: 783  KGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 962
            KGIF G CS+SLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GY+HM RN+GD  G+CG
Sbjct: 262  KGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCG 321

Query: 963  INTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1136
            IN LA                   KC L TYC AGETCCCT R+ GICF WKCCE++SAV
Sbjct: 322  INMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAV 381

Query: 1137 CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSLFEAWNL 1310
            CCKD+R CCP+DYPVCDTK   C K   N+T ++  E K+ S  K   W    E W L
Sbjct: 382  CCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFE-KRHSTRKFSSWRPFVENWVL 438


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  573 bits (1476), Expect = e-161
 Identities = 270/412 (65%), Positives = 321/412 (77%), Gaps = 3/412 (0%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242
            +S  S+LF  WC ++GK+YSS EEKLYR  VF DN  F+T+HNN++NS+YTL+LN++ADL
Sbjct: 22   TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81

Query: 243  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422
            THHEFK SR G S A  N      Q    S+ RDVP S+DWRKKGAVT VKDQ SCGACW
Sbjct: 82   THHEFKVSRLGFSPALRNFRPVLPQEP--SLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139

Query: 423  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 602
            +FSATGA+EGINQI++GSL+SLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDTE 
Sbjct: 140  SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199

Query: 603  DYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEE-EIMQAVASQPVSVGICGSERAFQSY 779
            DYPYQA+  SC K+KL+R+VVTIDGY D+PS +E +++QAVA+QPVSVGICGSERAFQ Y
Sbjct: 200  DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259

Query: 780  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 959
            SKGIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGKSWGMDGYMHM RN+G+  G+C
Sbjct: 260  SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319

Query: 960  GINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133
            GIN LA                   KCS+LT C AGETCCC ++ LG+C  WKCC ++SA
Sbjct: 320  GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379

Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289
            VCCKD R CCP DYP+CDT   LC K T+N T  + LE +  S G  G W+S
Sbjct: 380  VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS-GSSGTWSS 430


>ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris]
            gi|561009128|gb|ESW08035.1| hypothetical protein
            PHAVU_009G013000g [Phaseolus vulgaris]
          Length = 428

 Score =  570 bits (1469), Expect = e-160
 Identities = 273/412 (66%), Positives = 310/412 (75%), Gaps = 3/412 (0%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 233
            +S TSDLF  WC+++ K YSSEEEK YRF VFEDN AF++ HN   N  NSTYTL+LNAF
Sbjct: 20   ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 79

Query: 234  ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 413
            ADLTHHEFK+SR G S +     R  +Q     +    PS IDWR+ GAVTPVKDQASCG
Sbjct: 80   ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 137

Query: 414  ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 593
            ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMDYA+QFV+DNKGID
Sbjct: 138  ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDNKGID 197

Query: 594  TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 773
            TE DYPYQA+   CNK+KLKRH+VTID Y DLP  EEE+++AVASQPVSVGICGSERAFQ
Sbjct: 198  TEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEELLKAVASQPVSVGICGSERAFQ 257

Query: 774  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953
             YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNTGD  G
Sbjct: 258  LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNTGDPKG 317

Query: 954  ICGINTLAXXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133
            ICGINTLA                 +C+L T+C  GETCCC +  LGICF WKCC + SA
Sbjct: 318  ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 377

Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289
            VCCKD R CCP DYP+CDT+   C K T  +T +      K    K  GW S
Sbjct: 378  VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 427


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  570 bits (1469), Expect = e-160
 Identities = 269/420 (64%), Positives = 321/420 (76%), Gaps = 5/420 (1%)
 Frame = +3

Query: 60   FSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 239
            + S  ++LF +WC+Q+GK YSSE+EK  R  +FEDN AF+T HNNM NS++TL+LNAFAD
Sbjct: 21   YCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80

Query: 240  LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 416
            LTH EFK+S  G S A+ + DR  N+ +      RDVP+SIDWRKKGAVT VKDQASCGA
Sbjct: 81   LTHQEFKASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGA 140

Query: 417  CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 596
            CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT
Sbjct: 141  CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200

Query: 597  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 773
            EKDYPY+ +   CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ
Sbjct: 201  EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260

Query: 774  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953
             YS GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG
Sbjct: 261  LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320

Query: 954  ICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1127
            ICGIN LA                   +CSLLTYC AGETCCC   +LGIC  WKCC  +
Sbjct: 321  ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380

Query: 1128 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304
            SAVCC DHR+CCP +YP+CD+    C  + T N T  + +E  +GS  K G W+S  + W
Sbjct: 381  SAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE-MRGSSWKFGSWSSFIDVW 439


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  568 bits (1464), Expect = e-159
 Identities = 268/420 (63%), Positives = 321/420 (76%), Gaps = 5/420 (1%)
 Frame = +3

Query: 60   FSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 239
            + S  ++LF +WC+Q+GK YSSE+EK  R  +FEDN AF+T HNNM NS++TL+LNAFAD
Sbjct: 21   YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80

Query: 240  LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 416
            LTH EFK+S  G S A+ + DR  N+ +      RDVP+SIDWRKKGAVT VKDQASCGA
Sbjct: 81   LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140

Query: 417  CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 596
            CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT
Sbjct: 141  CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200

Query: 597  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 773
            EKDYPY+ +   CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ
Sbjct: 201  EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260

Query: 774  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953
             YS GIF G CSTSLDHAVLI+GY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG
Sbjct: 261  LYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320

Query: 954  ICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1127
            ICGIN LA                   +CSLLTYC  GETCCC   +LGIC  WKCC  +
Sbjct: 321  ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFS 380

Query: 1128 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304
            SAVCC DHR+CCP +YP+CD+    C  + T N T  + +E  +GS  K G W+S  +AW
Sbjct: 381  SAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE-MRGSSWKFGSWSSFIDAW 439


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  563 bits (1452), Expect = e-158
 Identities = 265/416 (63%), Positives = 322/416 (77%), Gaps = 8/416 (1%)
 Frame = +3

Query: 57   CFSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFA 236
            C  S  SDLF +WC+QNGK YSSE+E++YRF VFE+N A+IT HN+ ENS+YTL LNA++
Sbjct: 20   CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79

Query: 237  DLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 401
            DLTHHEF++S  GLS +A++  R     S S  TG     D PSS+DWR+KGAVT VK+Q
Sbjct: 80   DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVTDVKNQ 139

Query: 402  ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 581
             SCGACW+FSATGA+EGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ N
Sbjct: 140  GSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIKN 199

Query: 582  KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGS 758
             GIDTEKDYP++ +  +CNKNKL+RHVVTIDGYTD+P  +E+ +++AVA+QPVSVGICGS
Sbjct: 200  GGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICGS 259

Query: 759  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 938
             RAFQSYSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWG SWG++GY+HM RN+
Sbjct: 260  ARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRNS 319

Query: 939  GDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 1112
            G+Q GICGIN LA                   KCS+ T CG GETCCC  + LGIC  WK
Sbjct: 320  GNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSWK 379

Query: 1113 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGG 1280
            CC ++SAVCCKD R CCP DYP+CDT   LC K   N+T+V+   +K+   GK GG
Sbjct: 380  CCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQ-QPQKEAFTGKFGG 434


>gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulgaris]
          Length = 467

 Score =  562 bits (1449), Expect = e-157
 Identities = 270/412 (65%), Positives = 309/412 (75%), Gaps = 3/412 (0%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 233
            +S TSDLF  WC+++ K YSSEEEK YRF VFEDN AF++ HN   N  NSTYTL+LNAF
Sbjct: 59   ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 118

Query: 234  ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 413
            ADLTHHEFK+SR G S +     R  +Q     +    PS IDWR+ GAVTPVKDQASCG
Sbjct: 119  ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 176

Query: 414  ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 593
            ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMD+A+QFV+DNKGID
Sbjct: 177  ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDNKGID 236

Query: 594  TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 773
            TE DYPYQA+  SC+K+KLKR  VTI+ Y D+P  EEEI++AVASQPVSVGICGSERAFQ
Sbjct: 237  TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERAFQ 296

Query: 774  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953
             YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WG+DGY+HM RNTGD  G
Sbjct: 297  LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNTGDPKG 356

Query: 954  ICGINTLAXXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133
            ICGINTLA                 +C+L T+C  GETCCC +  LGICF WKCC + SA
Sbjct: 357  ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 416

Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289
            VCCKD R CCP DYP+CDT+   C K T  +T +      K    K  GW S
Sbjct: 417  VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 466


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  560 bits (1442), Expect = e-157
 Identities = 264/384 (68%), Positives = 306/384 (79%), Gaps = 4/384 (1%)
 Frame = +3

Query: 72   TSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 251
            +S LF +WCE++G++YSSEEE+LYR +VFEDNLAF+T HNNM NS+YTL+LNAFADLTHH
Sbjct: 26   SSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHH 85

Query: 252  EFKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 428
            EFKSSR G S A  S+L +  S+L      RDVP+S+DWRKKGAVT VKDQ SCGACWAF
Sbjct: 86   EFKSSRLGFSSALLSSLPKLGSKLLDL---RDVPASLDWRKKGAVTNVKDQGSCGACWAF 142

Query: 429  SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 608
            SATGAIEGIN+IV+GSLVSLSEQEL+DCD +YN+GC GGLMDYA+QFV+DN GIDTE+DY
Sbjct: 143  SATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDY 202

Query: 609  PYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSYSK 785
            PYQA+  SC K KLKR VVTIDGYTD+ P+   +++QAV +QPVSVGICGSERAFQ YSK
Sbjct: 203  PYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSK 262

Query: 786  GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 965
            GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWGK WGMDGY+HM RNTG+  G+CGI
Sbjct: 263  GIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGI 322

Query: 966  NTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1139
            N LA                   +CS    CG GETCCC+ R LG+CF WKCC +NSAVC
Sbjct: 323  NMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVC 382

Query: 1140 CKDHRFCCPHDYPVCDTKNKLCFK 1211
            CKD   CCP DYP+CDT+  +C K
Sbjct: 383  CKDKIHCCPQDYPLCDTQRNVCLK 406


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  559 bits (1440), Expect = e-156
 Identities = 270/419 (64%), Positives = 311/419 (74%), Gaps = 10/419 (2%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHN-----NMENSTYTLALN 227
            +S TS+LF  WC+++ K YSSEEEKLYR  VFEDN AF+  HN     N  NS+YTL+LN
Sbjct: 26   ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85

Query: 228  AFADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRD---VPSSIDWRKKGAVTPVKD 398
            AFADLTHHEFK++R GL L      R  +Q +     RD   +PS IDWR+ GAVTPVKD
Sbjct: 86   AFADLTHHEFKTTRLGLPLTLLRFKRPQNQQS-----RDLLHIPSQIDWRQSGAVTPVKD 140

Query: 399  QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 578
            QASCGACWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD +YNSGCGGGLMD+A+QFV+D
Sbjct: 141  QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVID 200

Query: 579  NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGS 758
            NKGIDTE DYPYQA+  SC+K+KLKR  VTI+ Y D+P  EEEI++AVASQPVSVGICGS
Sbjct: 201  NKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGS 260

Query: 759  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 938
            ER FQ YSKGIF G CST LDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RN+
Sbjct: 261  EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320

Query: 939  GDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 1112
            G+  GICGINTLA                   +C+L T+C  GETCCC +  LGICF WK
Sbjct: 321  GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380

Query: 1113 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289
            CC + SAVCCKD R CCP DYP+CDT+   C K T N T     E +  S  K  GW S
Sbjct: 381  CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSH-KSRGWKS 438


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  556 bits (1434), Expect = e-156
 Identities = 262/393 (66%), Positives = 313/393 (79%), Gaps = 5/393 (1%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242
            SS  S LF SW +++GK Y+S+E+KLYRF +FE+N  F+  HN+  NS+YTL+LNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 243  THHEFKSSRFGLSLAASN--LDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 416
            THHEFK+SR GLS  +++  L R N  L  F    DVP SIDWRKKGAV+ VKDQ +CGA
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGA 142

Query: 417  CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 596
            CW+FSATGAIEGIN+IV+GSLVSLSEQELVDCD++YN+GC GGLMDYA+QFV++N GIDT
Sbjct: 143  CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDT 202

Query: 597  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 773
            E+DYPYQA+  +CNK KLKRHVVTIDGYTD+P + E+E+++AVA+QPVSVGICGSERAFQ
Sbjct: 203  EEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQ 262

Query: 774  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 953
             YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WG++GYM+M RN+G+  G
Sbjct: 263  LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322

Query: 954  ICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1127
            +CGIN LA                   KC L T CG GETCCCTRR+ G+CF WKCCE++
Sbjct: 323  LCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELD 382

Query: 1128 SAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNS 1226
            SAVCCKD   CCPHDYPVCDTK  +C K ++ S
Sbjct: 383  SAVCCKDGLHCCPHDYPVCDTKRNMCLKVSIFS 415


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  551 bits (1421), Expect = e-154
 Identities = 261/417 (62%), Positives = 315/417 (75%), Gaps = 8/417 (1%)
 Frame = +3

Query: 54   ICFSSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAF 233
            +C  S  SDLF +WC+QNGK YSSE+E++YRF VFE+N A+IT HN+  NS+YTL LNA+
Sbjct: 19   LCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAY 78

Query: 234  ADLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKD 398
            +DLTHHEF++S  GLS +A++  R     S S   G     D PSS+DWR KGAVT VK+
Sbjct: 79   SDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVTNVKN 138

Query: 399  QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 578
            Q SCGACW+FSATGAIEGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ 
Sbjct: 139  QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198

Query: 579  NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICG 755
            N GIDTEKDYP++ K  +CNKNKL+R VVTIDGYTD+P  +E+ +++AVA+QPVSVGICG
Sbjct: 199  NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258

Query: 756  SERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARN 935
            S RAFQSYSKGIF G C T LDHAVLIVGYGSENG DYWI+KNSWG SWG++GY+HM RN
Sbjct: 259  SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318

Query: 936  TGDQLGICGINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHW 1109
            +G+Q GICG+N LA                   KCS  T CG GETCCC  + LGIC  W
Sbjct: 319  SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378

Query: 1110 KCCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGG 1280
            KCC ++SAVCCKD R CCP DYP+CDT   LC K   N+T+V+   +K+   GK GG
Sbjct: 379  KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQ-QPQKEPFTGKFGG 434


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  551 bits (1420), Expect = e-154
 Identities = 261/409 (63%), Positives = 312/409 (76%), Gaps = 6/409 (1%)
 Frame = +3

Query: 75   SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254
            S+LF+ WC+++GK Y SEEE+  R  +F+DN  F+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 255  FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 431
            FK+SR GLS++A S +  S  Q  G SV   VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 432  ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 611
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 612  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 788
            YQ +  +C K+KLK+ VVTID Y  + S +E+ +M+AVA+QPVSVGICGSERAFQ YS+G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266

Query: 789  IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 968
            IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT +  G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 969  TLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1142
             LA                   KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 1143 KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGS--FGKLGGW 1283
            KD R CCPHDYPVCDT   LC K T N T +K   +K  S   G+   W
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  550 bits (1418), Expect = e-154
 Identities = 265/415 (63%), Positives = 311/415 (74%), Gaps = 5/415 (1%)
 Frame = +3

Query: 75   SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254
            ++LF+ WC ++GK Y SEEE+ +R  +F DN  F+T HN++ NSTY+L+LNAFADLTHHE
Sbjct: 34   AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHE 93

Query: 255  FKSSRFGLSLAASNLDRSNSQLTGFS--VFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 428
            FK+SR GLS  + +L  +  Q  G S  V   VP S+DWRKKGAVT VKDQ SCGACW+F
Sbjct: 94   FKASRLGLSAPSPSL-MAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 152

Query: 429  SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 608
            SATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDY
Sbjct: 153  SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 212

Query: 609  PYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK 785
            PYQ +  +C K+KLK+ VVTID Y  + S  E+ +M+AVASQPVSVGICGSERAFQ YS 
Sbjct: 213  PYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSS 272

Query: 786  GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 965
            GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+  G+CGI
Sbjct: 273  GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGI 332

Query: 966  NTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1139
            N LA                   KC+L TYC +GETCCC R L G+CF WKCCE+ SAVC
Sbjct: 333  NMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVC 392

Query: 1140 CKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNSLFEAW 1304
            CKD R CCP DYPVCDT   LC K T N T +K    KK S  KLG     FE W
Sbjct: 393  CKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPF-WKKNSSNKLG----RFEEW 442


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  550 bits (1418), Expect = e-154
 Identities = 261/409 (63%), Positives = 311/409 (76%), Gaps = 6/409 (1%)
 Frame = +3

Query: 75   SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254
            S+LF+ WC+++GK Y SEEE+  R  +F+DN  F+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 255  FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 431
            FK+SR GLS++A S +  S  Q  G SV   VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 432  ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 611
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 612  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 788
            YQ +  +C K+KLK+ VVTID Y  + S +E+ +M+AVA+QPVSVGICGSERAFQ YS G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSG 266

Query: 789  IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 968
            IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT +  G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 969  TLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1142
             LA                   KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 1143 KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGS--FGKLGGW 1283
            KD R CCPHDYPVCDT   LC K T N T +K   +K  S   G+   W
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  549 bits (1415), Expect = e-153
 Identities = 262/411 (63%), Positives = 312/411 (75%), Gaps = 8/411 (1%)
 Frame = +3

Query: 75   SDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 254
            S+LF+ WC+++GK Y SEEE+  R  +F+DN  F+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 255  FKSSRFGLSLAASNLDR-SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 431
            FK+SR GLS++AS+L   S  Q  G +    VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSASSLIMASKGQSLGGNA--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 432  ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 611
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 612  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK- 785
            YQ +  +C K+KLK+ VVTID Y  + S +E+ + +AVA+QPVSVGICGSERAFQ YS+ 
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266

Query: 786  -GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 962
             GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+  GICG
Sbjct: 267  SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICG 326

Query: 963  INTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1136
            IN LA                   KC+L TYC AGETCCC R L G+CF WKCCE+ SAV
Sbjct: 327  INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAV 386

Query: 1137 CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGS--FGKLGGW 1283
            CC D R CCPHDYPVCDT   LC K T N T +K   +K  S   G+  GW
Sbjct: 387  CCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSSNKLGRFEGW 437


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  548 bits (1412), Expect = e-153
 Identities = 265/412 (64%), Positives = 305/412 (74%), Gaps = 6/412 (1%)
 Frame = +3

Query: 72   TSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 251
            TS LF  WC+Q+GK Y SE+EK YRF+VFEDN AF+  HN + NS+YTL+LNAFADLTHH
Sbjct: 26   TSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHH 85

Query: 252  EFKSSRFGL---SLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422
            EFK++R GL   SL     +R   Q      F  VPS IDWRK GAV+ VKDQ SCGACW
Sbjct: 86   EFKATRLGLPPSSLLRFKFNRFQDQQRSDD-FLQVPSEIDWRKNGAVSIVKDQGSCGACW 144

Query: 423  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 602
            +FSATGAIEGIN+IV+GSLVSLSEQELVDCD TYNSGC GGLMDYA+QF++DN GIDTE+
Sbjct: 145  SFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEE 204

Query: 603  DYPYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSY 779
            DYPYQA+   C K+KLKR VVTIDGYTD+ P+ E+++++AVA QPVSVGICGS RAFQ Y
Sbjct: 205  DYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLY 264

Query: 780  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 959
            SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNT    G+C
Sbjct: 265  SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLC 324

Query: 960  GINTLA--XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133
            GIN LA                   KC+L TYC  GETCCC ++ LGICF WKCC V SA
Sbjct: 325  GINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSA 384

Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKKGSFGKLGGWNS 1289
            VCCKD R CCP DYPVCD  N  C K   N T++   + K+  F +   W S
Sbjct: 385  VCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSD-KEDPFHQTRDWRS 435


>ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica]
            gi|462420299|gb|EMJ24562.1| hypothetical protein
            PRUPE_ppa005615mg [Prunus persica]
          Length = 451

 Score =  547 bits (1410), Expect = e-153
 Identities = 268/429 (62%), Positives = 318/429 (74%), Gaps = 13/429 (3%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242
            S  TS+LF  WC+Q GK+YSS +EKLYR SVFEDNLAF+T HN+M NS+YTL+LN F+DL
Sbjct: 26   SQTTSELFEVWCKQYGKSYSSAQEKLYRLSVFEDNLAFVTQHNDMGNSSYTLSLNDFSDL 85

Query: 243  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422
            THHEFKSSR G S +  +L   + +    SV RD+PSS+DWRKKGAVT VKDQ SCGACW
Sbjct: 86   THHEFKSSRLGFSPSFLSLKLKSDRKP--SVVRDLPSSLDWRKKGAVTNVKDQGSCGACW 143

Query: 423  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 599
            AFS TGAIEGIN+IV+GSL+SLSEQELVDCD+ Y N+GC GGLMD AF+FV+DN GIDTE
Sbjct: 144  AFSTTGAIEGINKIVTGSLISLSEQELVDCDRVYPNNGCNGGLMDDAFRFVIDNNGIDTE 203

Query: 600  KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQS 776
            +DYPY+    +C K KLKR+ VTID YTD+PS +EE ++QAVASQPVSVGI GS+  FQ 
Sbjct: 204  EDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVASQPVSVGISGSDMGFQL 263

Query: 777  YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 956
            YSKGIFNG CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GYMHM R+  +  GI
Sbjct: 264  YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGMNGYMHMLRDHSNPKGI 323

Query: 957  CGINTLA-XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1133
            CGINTLA                  +C + T+C AGETCCC +R++GICF W+CCE++SA
Sbjct: 324  CGINTLASYPIKTGENPPLPPPGPTRCDIFTHCAAGETCCCAKRVVGICFSWRCCELDSA 383

Query: 1134 VCCKDHRFCCPHDYPVCDTKNKLCFKG---------TVNSTMVKGLERKKGSFGKLG-GW 1283
            VCCKD R CCP DYP+CDT+  LC +             +   K LE  +GS  K G GW
Sbjct: 384  VCCKDQRHCCPRDYPICDTERTLCLQSNEQLSTQSHATGNLTSKALE-SRGSLRKSGRGW 442

Query: 1284 NSLFEAWNL 1310
             S+   W L
Sbjct: 443  GSMIRDWIL 451


>ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 441

 Score =  544 bits (1401), Expect = e-152
 Identities = 264/419 (63%), Positives = 313/419 (74%), Gaps = 12/419 (2%)
 Frame = +3

Query: 63   SSFTSDLFNSWCEQNGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 242
            SS +S+LF +WC+Q GK+YSS+EEKLYR S+FE NLAFIT HN++ NS+YTL+LN+F+DL
Sbjct: 25   SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84

Query: 243  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 422
            THHEFK+SR G S     L R +      SV R VPSSIDWRK GAVT VKDQ SCGACW
Sbjct: 85   THHEFKASRLGFSPTFLRLYRKSDPKP--SVVRHVPSSIDWRKNGAVTNVKDQGSCGACW 142

Query: 423  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 599
            +FSATGAIEGIN+IV+GSLVSLSEQEL+DCD+ Y NSGC GGLMD AFQF++DN GIDTE
Sbjct: 143  SFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDNNGIDTE 202

Query: 600  KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSK-EEEIMQAVASQPVSVGICGSERAFQS 776
            +DYPYQ    +CNK KLKRHVVTIDGYTD+P+  EE++++AVA+QPVSVGI GS R FQ 
Sbjct: 203  EDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGSGREFQF 262

Query: 777  YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 956
            YSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+GY+H+ R+  +  G+
Sbjct: 263  YSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDHSNSKGL 322

Query: 957  CGINTLA-----XXXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCE 1121
            CGIN LA                      KC L + CG GETCCC R++LGIC  W+CCE
Sbjct: 323  CGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICLSWRCCE 382

Query: 1122 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTM----VKGLERKKG-SFGKLGGW 1283
              SAVCCKD   CCPHDYP+CDT+   C +   N TM    ++G  RK   S  KL  W
Sbjct: 383  FTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTMRANEIRGSLRKSSRSKAKLSYW 441

