BLASTX nr result

ID: Akebia24_contig00008837 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00008837
         (1437 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          598   e-168
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   594   e-167
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   581   e-163
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   578   e-162
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   578   e-162
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   577   e-162
ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas...   575   e-161
gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulga...   567   e-159
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   563   e-158
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  563   e-158
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   562   e-157
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   562   e-157
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   561   e-157
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   561   e-157
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   561   e-157
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   560   e-157
ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun...   554   e-155
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   550   e-154
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   550   e-154
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   549   e-153

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  598 bits (1543), Expect = e-168
 Identities = 284/426 (66%), Positives = 330/426 (77%), Gaps = 9/426 (2%)
 Frame = +3

Query: 45   FSSFTSD---LFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNA 215
            FSS +S+   LF +WC QHGK Y+S+EEKL+R  VF+DN DF+T HN+  NS+YTL+LNA
Sbjct: 19   FSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNA 78

Query: 216  FADLTHHEFKSSRFGLSLAAS---NLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQ 386
            FADLTHHEFK+SR GLS AAS   N+DRSN Q+  F  + DVP+S+DWRK GAVT VKDQ
Sbjct: 79   FADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDF--VADVPASVDWRKNGAVTQVKDQ 136

Query: 387  ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 566
             +CGACW+FSATGAIEGIN+IV+GSLVSLSEQELVDCDK+YN+GC GG+MDYAFQFV+DN
Sbjct: 137  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196

Query: 567  KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGS 743
             GIDTE+DYPYQ +  SCNK KLKRHVVTIDGY D+P + E+E+++AVA+QPVSVGICGS
Sbjct: 197  HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256

Query: 744  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 923
            ERAFQ YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGMDGYMHM RN+
Sbjct: 257  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316

Query: 924  GDQLGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWK 1097
            G   G+CGIN LASY                  C L T+CG GETCCC   + GIC  WK
Sbjct: 317  GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376

Query: 1098 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSL 1277
            CCE++SAVCCKD R CCP DYPVCDT   +C K  GN+T ++    K  S GK   W+SL
Sbjct: 377  CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKF-AKNSSSGKFRSWSSL 435

Query: 1278 FEAWNL 1295
             E W L
Sbjct: 436  LEGWIL 441


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  594 bits (1532), Expect = e-167
 Identities = 276/417 (66%), Positives = 329/417 (78%), Gaps = 3/417 (0%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227
            SS  S LF +WC +HGK+Y+S+EE+ +R  VFEDN DF+T HN+  NS+Y+LALNAFADL
Sbjct: 22   SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81

Query: 228  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407
            THHEFK+SR GLS A  NL   N ++TG  V+ D+P+SIDWR KG VT VKDQ SCGACW
Sbjct: 82   THHEFKTSRLGLSAAPLNLAHRNLEITG--VVGDIPASIDWRNKGVVTNVKDQGSCGACW 139

Query: 408  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 587
            +FSATGAIEGIN+IV+GSLVSLSEQEL++CDK+YN GCGGGLMDYAFQFV++N GIDTE+
Sbjct: 140  SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199

Query: 588  DYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQSY 764
            DYPY+A+  +CNK+++KR VVTID Y D+P + E++++QAVA+QPVSVGICGSERAFQ Y
Sbjct: 200  DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259

Query: 765  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 944
            SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM GYMHM RN+G+  G+C
Sbjct: 260  SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVC 319

Query: 945  GINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118
            GIN LASY                  C+LLTYC AGETCCC R+  GIC  WKCC ++SA
Sbjct: 320  GINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSA 379

Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289
            VCCKD   CCPHDYPVCDT   +CFK  GN+T ++ +E K  + GK G WNSL EAW
Sbjct: 380  VCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK--TSGKFGSWNSLPEAW 434


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  581 bits (1498), Expect = e-163
 Identities = 275/418 (65%), Positives = 318/418 (76%), Gaps = 3/418 (0%)
 Frame = +3

Query: 51   SFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLT 230
            S  S LF +WC QHGK YSSEEEK YR  VFE+N  F+T HN + NS+Y+LALNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 231  HHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWA 410
            HHEFK+SR GLS AA    R N QL G  ++RD+P+S+DWR KGAVT VKDQ SCGACW+
Sbjct: 84   HHEFKASRLGLSAAAIEGSRPNLQLPG--LVRDIPASMDWRTKGAVTKVKDQGSCGACWS 141

Query: 411  FSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKD 590
            FSATGAIEGIN+IV+G+LVSLSEQELVDCD++YNSGC GGLMDYA+QFV+DN GID E+D
Sbjct: 142  FSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEED 201

Query: 591  YPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYS 767
            YPY  +  +CNK K KR VVTIDGY  +P+  E+ ++QAVA QPVSVGICGSERAFQ YS
Sbjct: 202  YPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYS 261

Query: 768  KGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 947
            KGIF G CS+SLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GY+HM RN+GD  G+CG
Sbjct: 262  KGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCG 321

Query: 948  INTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1121
            IN LASY                  C L TYC AGETCCCT R+ GICF WKCCE++SAV
Sbjct: 322  INMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAV 381

Query: 1122 CCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAWNL 1295
            CCKD+R CCP+DYPVCDTK   C K  GN+T ++  E K+ S  K   W    E W L
Sbjct: 382  CCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFE-KRHSTRKFSSWRPFVENWVL 438


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  578 bits (1491), Expect = e-162
 Identities = 273/420 (65%), Positives = 322/420 (76%), Gaps = 5/420 (1%)
 Frame = +3

Query: 45   FSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFAD 224
            + S  ++LF +WC QHGK YSSE+EK  R  +FEDN  F+T HNNM NS++TL+LNAFAD
Sbjct: 21   YCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80

Query: 225  LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 401
            LTH EFK+S  G S A+ + DR  N+ +     LRDVP+SIDWRKKGAVT VKDQASCGA
Sbjct: 81   LTHQEFKASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGA 140

Query: 402  CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 581
            CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT
Sbjct: 141  CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200

Query: 582  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 758
            EKDYPY+ +   CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ
Sbjct: 201  EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260

Query: 759  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938
             YS GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG
Sbjct: 261  LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320

Query: 939  ICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1112
            ICGIN LASY                  CSLLTYC AGETCCC   +LGIC  WKCC  +
Sbjct: 321  ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380

Query: 1113 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289
            SAVCC DHR+CCP +YP+CD+    C  + TGN T  + +E  +GS  K G W+S  + W
Sbjct: 381  SAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE-MRGSSWKFGSWSSFIDVW 439


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  578 bits (1491), Expect = e-162
 Identities = 272/412 (66%), Positives = 322/412 (78%), Gaps = 3/412 (0%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227
            +S  S+LF  WC +HGK+YSS EEKLYR  VF DN +F+T+HNN++NS+YTL+LN++ADL
Sbjct: 22   TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81

Query: 228  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407
            THHEFK SR G S A  N      Q    S+ RDVP S+DWRKKGAVT VKDQ SCGACW
Sbjct: 82   THHEFKVSRLGFSPALRNFRPVLPQEP--SLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139

Query: 408  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 587
            +FSATGA+EGINQI++GSL+SLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDTE 
Sbjct: 140  SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199

Query: 588  DYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEE-EIMQAVASQPVSVGICGSERAFQSY 764
            DYPYQA+  SC K+KL+R+VVTIDGY D+PS +E +++QAVA+QPVSVGICGSERAFQ Y
Sbjct: 200  DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259

Query: 765  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 944
            SKGIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGKSWGMDGYMHM RN+G+  G+C
Sbjct: 260  SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319

Query: 945  GINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118
            GIN LASY                  CS+LT C AGETCCC ++ LG+C  WKCC ++SA
Sbjct: 320  GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379

Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 1274
            VCCKD R CCP DYP+CDT   LC K T N T  + LE +  S G  G W+S
Sbjct: 380  VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS-GSSGTWSS 430


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  577 bits (1486), Expect = e-162
 Identities = 272/420 (64%), Positives = 322/420 (76%), Gaps = 5/420 (1%)
 Frame = +3

Query: 45   FSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFAD 224
            + S  ++LF +WC QHGK YSSE+EK  R  +FEDN  F+T HNNM NS++TL+LNAFAD
Sbjct: 21   YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80

Query: 225  LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 401
            LTH EFK+S  G S A+ + DR  N+ +     LRDVP+SIDWRKKGAVT VKDQASCGA
Sbjct: 81   LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140

Query: 402  CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 581
            CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT
Sbjct: 141  CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200

Query: 582  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 758
            EKDYPY+ +   CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ
Sbjct: 201  EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260

Query: 759  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938
             YS GIF G CSTSLDHAVLI+GY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG
Sbjct: 261  LYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320

Query: 939  ICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1112
            ICGIN LASY                  CSLLTYC  GETCCC   +LGIC  WKCC  +
Sbjct: 321  ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFS 380

Query: 1113 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289
            SAVCC DHR+CCP +YP+CD+    C  + TGN T  + +E  +GS  K G W+S  +AW
Sbjct: 381  SAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE-MRGSSWKFGSWSSFIDAW 439


>ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris]
            gi|561009128|gb|ESW08035.1| hypothetical protein
            PHAVU_009G013000g [Phaseolus vulgaris]
          Length = 428

 Score =  575 bits (1482), Expect = e-161
 Identities = 278/413 (67%), Positives = 311/413 (75%), Gaps = 4/413 (0%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN---NMENSTYTLALNAF 218
            +S TSDLF  WC +H K YSSEEEK YRF VFEDN  F++ HN   N  NSTYTL+LNAF
Sbjct: 20   ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 79

Query: 219  ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCG 398
            ADLTHHEFK+SR G S +     R  +Q      L   PS IDWR+ GAVTPVKDQASCG
Sbjct: 80   ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRH--LLHNPSQIDWRQSGAVTPVKDQASCG 137

Query: 399  ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 578
            ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMDYA+QFV+DNKGID
Sbjct: 138  ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDNKGID 197

Query: 579  TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 758
            TE DYPYQA+   CNK+KLKRH+VTID Y DLP  EEE+++AVASQPVSVGICGSERAFQ
Sbjct: 198  TEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEELLKAVASQPVSVGICGSERAFQ 257

Query: 759  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938
             YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNTGD  G
Sbjct: 258  LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNTGDPKG 317

Query: 939  ICGINTLASYXXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118
            ICGINTLASY                C+L T+C  GETCCC +  LGICF WKCC + SA
Sbjct: 318  ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 377

Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGT-GNSTMVKGLERKKGSFGKLGGWNS 1274
            VCCKD R CCP DYP+CDT+   C K T G +T+  G    K    K  GW S
Sbjct: 378  VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTITSG---NKDISNKPRGWKS 427


>gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulgaris]
          Length = 467

 Score =  567 bits (1462), Expect = e-159
 Identities = 275/413 (66%), Positives = 310/413 (75%), Gaps = 4/413 (0%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN---NMENSTYTLALNAF 218
            +S TSDLF  WC +H K YSSEEEK YRF VFEDN  F++ HN   N  NSTYTL+LNAF
Sbjct: 59   ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 118

Query: 219  ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCG 398
            ADLTHHEFK+SR G S +     R  +Q      L   PS IDWR+ GAVTPVKDQASCG
Sbjct: 119  ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRH--LLHNPSQIDWRQSGAVTPVKDQASCG 176

Query: 399  ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 578
            ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMD+A+QFV+DNKGID
Sbjct: 177  ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDNKGID 236

Query: 579  TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 758
            TE DYPYQA+  SC+K+KLKR  VTI+ Y D+P  EEEI++AVASQPVSVGICGSERAFQ
Sbjct: 237  TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERAFQ 296

Query: 759  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938
             YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WG+DGY+HM RNTGD  G
Sbjct: 297  LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNTGDPKG 356

Query: 939  ICGINTLASYXXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118
            ICGINTLASY                C+L T+C  GETCCC +  LGICF WKCC + SA
Sbjct: 357  ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 416

Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGT-GNSTMVKGLERKKGSFGKLGGWNS 1274
            VCCKD R CCP DYP+CDT+   C K T G +T+  G    K    K  GW S
Sbjct: 417  VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTITSG---NKDISNKPRGWKS 466


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  563 bits (1452), Expect = e-158
 Identities = 271/416 (65%), Positives = 309/416 (74%), Gaps = 7/416 (1%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN-----NMENSTYTLALN 212
            +S TS+LF  WC +H K YSSEEEKLYR  VFEDN  F+  HN     N  NS+YTL+LN
Sbjct: 26   ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85

Query: 213  AFADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQAS 392
            AFADLTHHEFK++R GL L      R  +Q +    L  +PS IDWR+ GAVTPVKDQAS
Sbjct: 86   AFADLTHHEFKTTRLGLPLTLLRFKRPQNQQS--RDLLHIPSQIDWRQSGAVTPVKDQAS 143

Query: 393  CGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKG 572
            CGACWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD +YNSGCGGGLMD+A+QFV+DNKG
Sbjct: 144  CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKG 203

Query: 573  IDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERA 752
            IDTE DYPYQA+  SC+K+KLKR  VTI+ Y D+P  EEEI++AVASQPVSVGICGSER 
Sbjct: 204  IDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERE 263

Query: 753  FQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQ 932
            FQ YSKGIF G CST LDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RN+G+ 
Sbjct: 264  FQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNS 323

Query: 933  LGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCE 1106
             GICGINTLASY                  C+L T+C  GETCCC +  LGICF WKCC 
Sbjct: 324  KGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCG 383

Query: 1107 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 1274
            + SAVCCKD R CCP DYP+CDT+   C K T N T     E +  S  K  GW S
Sbjct: 384  LTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSH-KSRGWKS 438


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  563 bits (1451), Expect = e-158
 Identities = 266/384 (69%), Positives = 306/384 (79%), Gaps = 4/384 (1%)
 Frame = +3

Query: 57   TSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHH 236
            +S LF +WC +HG++YSSEEE+LYR +VFEDNL F+T HNNM NS+YTL+LNAFADLTHH
Sbjct: 26   SSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHH 85

Query: 237  EFKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAF 413
            EFKSSR G S A  S+L +  S+L     LRDVP+S+DWRKKGAVT VKDQ SCGACWAF
Sbjct: 86   EFKSSRLGFSSALLSSLPKLGSKLLD---LRDVPASLDWRKKGAVTNVKDQGSCGACWAF 142

Query: 414  SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 593
            SATGAIEGIN+IV+GSLVSLSEQEL+DCD +YN+GC GGLMDYA+QFV+DN GIDTE+DY
Sbjct: 143  SATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDY 202

Query: 594  PYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSYSK 770
            PYQA+  SC K KLKR VVTIDGYTD+ P+   +++QAV +QPVSVGICGSERAFQ YSK
Sbjct: 203  PYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSK 262

Query: 771  GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 950
            GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWGK WGMDGY+HM RNTG+  G+CGI
Sbjct: 263  GIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGI 322

Query: 951  NTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1124
            N LASY                  CS    CG GETCCC+ R LG+CF WKCC +NSAVC
Sbjct: 323  NMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVC 382

Query: 1125 CKDHRFCCPHDYPVCDTKNKLCFK 1196
            CKD   CCP DYP+CDT+  +C K
Sbjct: 383  CKDKIHCCPQDYPLCDTQRNVCLK 406


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  562 bits (1448), Expect = e-157
 Identities = 269/415 (64%), Positives = 314/415 (75%), Gaps = 5/415 (1%)
 Frame = +3

Query: 60   SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239
            ++LF+ WC +HGK Y SEEE+ +R  +F DN DF+T HN++ NSTY+L+LNAFADLTHHE
Sbjct: 34   AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHE 93

Query: 240  FKSSRFGLSLAASNLDRSNSQLTGFS--VLRDVPSSIDWRKKGAVTPVKDQASCGACWAF 413
            FK+SR GLS  + +L  +  Q  G S  V   VP S+DWRKKGAVT VKDQ SCGACW+F
Sbjct: 94   FKASRLGLSAPSPSL-MAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 152

Query: 414  SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 593
            SATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDY
Sbjct: 153  SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 212

Query: 594  PYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK 770
            PYQ +  +C K+KLK+ VVTID Y  + S  E+ +M+AVASQPVSVGICGSERAFQ YS 
Sbjct: 213  PYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSS 272

Query: 771  GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 950
            GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+  G+CGI
Sbjct: 273  GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGI 332

Query: 951  NTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1124
            N LASY                  C+L TYC +GETCCC R L G+CF WKCCE+ SAVC
Sbjct: 333  NMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVC 392

Query: 1125 CKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289
            CKD R CCP DYPVCDT   LC K TGN T +K    KK S  KLG     FE W
Sbjct: 393  CKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPF-WKKNSSNKLG----RFEEW 442


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  562 bits (1448), Expect = e-157
 Identities = 265/409 (64%), Positives = 314/409 (76%), Gaps = 6/409 (1%)
 Frame = +3

Query: 60   SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239
            S+LF+ WC +HGK Y SEEE+  R  +F+DN DF+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 240  FKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 416
            FK+SR GLS++A S +  S  Q  G SV   VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 417  ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 596
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 597  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 773
            YQ +  +C K+KLK+ VVTID Y  + S +E+ +M+AVA+QPVSVGICGSERAFQ YS+G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266

Query: 774  IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 953
            IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT +  G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 954  TLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1127
             LASY                  C+L TYC +GETCCC R L G+CF WKCCE+ SAVCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 1128 KDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 1268
            KD R CCPHDYPVCDT   LC K TGN T +K   +K  S   G+   W
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  561 bits (1447), Expect = e-157
 Identities = 265/416 (63%), Positives = 324/416 (77%), Gaps = 8/416 (1%)
 Frame = +3

Query: 42   CFSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFA 221
            C  S  SDLF +WC Q+GK YSSE+E++YRF VFE+N  +IT HN+ ENS+YTL LNA++
Sbjct: 20   CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79

Query: 222  DLTHHEFKSSRFGLSLAASNLDRSNSQLTGFS---VLRDV--PSSIDWRKKGAVTPVKDQ 386
            DLTHHEF++S  GLS +A++  R   + +G S   VL DV  PSS+DWR+KGAVT VK+Q
Sbjct: 80   DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVTDVKNQ 139

Query: 387  ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 566
             SCGACW+FSATGA+EGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ N
Sbjct: 140  GSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIKN 199

Query: 567  KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGS 743
             GIDTEKDYP++ +  +CNKNKL+RHVVTIDGYTD+P  +E+ +++AVA+QPVSVGICGS
Sbjct: 200  GGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICGS 259

Query: 744  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 923
             RAFQSYSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWG SWG++GY+HM RN+
Sbjct: 260  ARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRNS 319

Query: 924  GDQLGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWK 1097
            G+Q GICGIN LASY                  CS+ T CG GETCCC  + LGIC  WK
Sbjct: 320  GNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSWK 379

Query: 1098 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGG 1265
            CC ++SAVCCKD R CCP DYP+CDT   LC K   N+T+V+   +K+   GK GG
Sbjct: 380  CCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQ-QPQKEAFTGKFGG 434


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  561 bits (1446), Expect = e-157
 Identities = 265/409 (64%), Positives = 313/409 (76%), Gaps = 6/409 (1%)
 Frame = +3

Query: 60   SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239
            S+LF+ WC +HGK Y SEEE+  R  +F+DN DF+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 240  FKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 416
            FK+SR GLS++A S +  S  Q  G SV   VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 417  ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 596
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 597  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 773
            YQ +  +C K+KLK+ VVTID Y  + S +E+ +M+AVA+QPVSVGICGSERAFQ YS G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSG 266

Query: 774  IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 953
            IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT +  G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 954  TLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1127
             LASY                  C+L TYC +GETCCC R L G+CF WKCCE+ SAVCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 1128 KDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 1268
            KD R CCPHDYPVCDT   LC K TGN T +K   +K  S   G+   W
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  561 bits (1445), Expect = e-157
 Identities = 262/388 (67%), Positives = 312/388 (80%), Gaps = 5/388 (1%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227
            SS  S LF SW  +HGK Y+S+E+KLYRF +FE+N +F+  HN+  NS+YTL+LNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 228  THHEFKSSRFGLSLAASN--LDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 401
            THHEFK+SR GLS  +++  L R N  L  F  + DVP SIDWRKKGAV+ VKDQ +CGA
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGA 142

Query: 402  CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 581
            CW+FSATGAIEGIN+IV+GSLVSLSEQELVDCD++YN+GC GGLMDYA+QFV++N GIDT
Sbjct: 143  CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDT 202

Query: 582  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 758
            E+DYPYQA+  +CNK KLKRHVVTIDGYTD+P + E+E+++AVA+QPVSVGICGSERAFQ
Sbjct: 203  EEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQ 262

Query: 759  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938
             YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WG++GYM+M RN+G+  G
Sbjct: 263  LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322

Query: 939  ICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1112
            +CGIN LAS+                  C L T CG GETCCCTRR+ G+CF WKCCE++
Sbjct: 323  LCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELD 382

Query: 1113 SAVCCKDHRFCCPHDYPVCDTKNKLCFK 1196
            SAVCCKD   CCPHDYPVCDTK  +C K
Sbjct: 383  SAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  560 bits (1443), Expect = e-157
 Identities = 266/411 (64%), Positives = 314/411 (76%), Gaps = 8/411 (1%)
 Frame = +3

Query: 60   SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239
            S+LF+ WC +HGK Y SEEE+  R  +F+DN DF+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 240  FKSSRFGLSLAASNLDR-SNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 416
            FK+SR GLS++AS+L   S  Q  G +    VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSASSLIMASKGQSLGGNA--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 417  ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 596
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 597  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK- 770
            YQ +  +C K+KLK+ VVTID Y  + S +E+ + +AVA+QPVSVGICGSERAFQ YS+ 
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266

Query: 771  -GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 947
             GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+  GICG
Sbjct: 267  SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICG 326

Query: 948  INTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1121
            IN LASY                  C+L TYC AGETCCC R L G+CF WKCCE+ SAV
Sbjct: 327  INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAV 386

Query: 1122 CCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 1268
            CC D R CCPHDYPVCDT   LC K TGN T +K   +K  S   G+  GW
Sbjct: 387  CCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSSNKLGRFEGW 437


>ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica]
            gi|462420299|gb|EMJ24562.1| hypothetical protein
            PRUPE_ppa005615mg [Prunus persica]
          Length = 451

 Score =  554 bits (1427), Expect = e-155
 Identities = 273/430 (63%), Positives = 322/430 (74%), Gaps = 14/430 (3%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227
            S  TS+LF  WC Q+GK+YSS +EKLYR SVFEDNL F+T HN+M NS+YTL+LN F+DL
Sbjct: 26   SQTTSELFEVWCKQYGKSYSSAQEKLYRLSVFEDNLAFVTQHNDMGNSSYTLSLNDFSDL 85

Query: 228  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407
            THHEFKSSR G S +  +L   + +    SV+RD+PSS+DWRKKGAVT VKDQ SCGACW
Sbjct: 86   THHEFKSSRLGFSPSFLSLKLKSDRKP--SVVRDLPSSLDWRKKGAVTNVKDQGSCGACW 143

Query: 408  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 584
            AFS TGAIEGIN+IV+GSL+SLSEQELVDCD+ Y N+GC GGLMD AF+FV+DN GIDTE
Sbjct: 144  AFSTTGAIEGINKIVTGSLISLSEQELVDCDRVYPNNGCNGGLMDDAFRFVIDNNGIDTE 203

Query: 585  KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQS 761
            +DYPY+    +C K KLKR+ VTID YTD+PS +EE ++QAVASQPVSVGI GS+  FQ 
Sbjct: 204  EDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVASQPVSVGISGSDMGFQL 263

Query: 762  YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 941
            YSKGIFNG CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GYMHM R+  +  GI
Sbjct: 264  YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGMNGYMHMLRDHSNPKGI 323

Query: 942  CGINTLASY-XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118
            CGINTLASY                 C + T+C AGETCCC +R++GICF W+CCE++SA
Sbjct: 324  CGINTLASYPIKTGENPPLPPPGPTRCDIFTHCAAGETCCCAKRVVGICFSWRCCELDSA 383

Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFK----------GTGNSTMVKGLERKKGSFGKLG-G 1265
            VCCKD R CCP DYP+CDT+  LC +           TGN T  K LE  +GS  K G G
Sbjct: 384  VCCKDQRHCCPRDYPICDTERTLCLQSNEQLSTQSHATGNLTS-KALE-SRGSLRKSGRG 441

Query: 1266 WNSLFEAWNL 1295
            W S+   W L
Sbjct: 442  WGSMIRDWIL 451


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  550 bits (1418), Expect = e-154
 Identities = 266/412 (64%), Positives = 305/412 (74%), Gaps = 6/412 (1%)
 Frame = +3

Query: 57   TSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHH 236
            TS LF  WC QHGK Y SE+EK YRF+VFEDN  F+  HN + NS+YTL+LNAFADLTHH
Sbjct: 26   TSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHH 85

Query: 237  EFKSSRFGL---SLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407
            EFK++R GL   SL     +R   Q      L+ VPS IDWRK GAV+ VKDQ SCGACW
Sbjct: 86   EFKATRLGLPPSSLLRFKFNRFQDQQRSDDFLQ-VPSEIDWRKNGAVSIVKDQGSCGACW 144

Query: 408  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 587
            +FSATGAIEGIN+IV+GSLVSLSEQELVDCD TYNSGC GGLMDYA+QF++DN GIDTE+
Sbjct: 145  SFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEE 204

Query: 588  DYPYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSY 764
            DYPYQA+   C K+KLKR VVTIDGYTD+ P+ E+++++AVA QPVSVGICGS RAFQ Y
Sbjct: 205  DYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLY 264

Query: 765  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 944
            SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNT    G+C
Sbjct: 265  SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLC 324

Query: 945  GINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118
            GIN LASY                  C+L TYC  GETCCC ++ LGICF WKCC V SA
Sbjct: 325  GINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSA 384

Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 1274
            VCCKD R CCP DYPVCD  N  C K   N T++   + K+  F +   W S
Sbjct: 385  VCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSD-KEDPFHQTRDWRS 435


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  550 bits (1418), Expect = e-154
 Identities = 262/417 (62%), Positives = 318/417 (76%), Gaps = 8/417 (1%)
 Frame = +3

Query: 39   ICFSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAF 218
            +C  S  SDLF +WC Q+GK YSSE+E++YRF VFE+N  +IT HN+  NS+YTL LNA+
Sbjct: 19   LCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAY 78

Query: 219  ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFS---VLRDV--PSSIDWRKKGAVTPVKD 383
            +DLTHHEF++S  GLS +A++  R   + +G S   VL DV  PSS+DWR KGAVT VK+
Sbjct: 79   SDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVTNVKN 138

Query: 384  QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 563
            Q SCGACW+FSATGAIEGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ 
Sbjct: 139  QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198

Query: 564  NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICG 740
            N GIDTEKDYP++ K  +CNKNKL+R VVTIDGYTD+P  +E+ +++AVA+QPVSVGICG
Sbjct: 199  NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258

Query: 741  SERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARN 920
            S RAFQSYSKGIF G C T LDHAVLIVGYGSENG DYWI+KNSWG SWG++GY+HM RN
Sbjct: 259  SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318

Query: 921  TGDQLGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHW 1094
            +G+Q GICG+N LASY                  CS  T CG GETCCC  + LGIC  W
Sbjct: 319  SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378

Query: 1095 KCCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGG 1265
            KCC ++SAVCCKD R CCP DYP+CDT   LC K   N+T+V+   +K+   GK GG
Sbjct: 379  KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQ-QPQKEPFTGKFGG 434


>ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 441

 Score =  549 bits (1415), Expect = e-153
 Identities = 265/419 (63%), Positives = 315/419 (75%), Gaps = 12/419 (2%)
 Frame = +3

Query: 48   SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227
            SS +S+LF +WC Q+GK+YSS+EEKLYR S+FE NL FIT HN++ NS+YTL+LN+F+DL
Sbjct: 25   SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84

Query: 228  THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407
            THHEFK+SR G S     L R +      SV+R VPSSIDWRK GAVT VKDQ SCGACW
Sbjct: 85   THHEFKASRLGFSPTFLRLYRKSDPKP--SVVRHVPSSIDWRKNGAVTNVKDQGSCGACW 142

Query: 408  AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 584
            +FSATGAIEGIN+IV+GSLVSLSEQEL+DCD+ Y NSGC GGLMD AFQF++DN GIDTE
Sbjct: 143  SFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDNNGIDTE 202

Query: 585  KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSK-EEEIMQAVASQPVSVGICGSERAFQS 761
            +DYPYQ    +CNK KLKRHVVTIDGYTD+P+  EE++++AVA+QPVSVGI GS R FQ 
Sbjct: 203  EDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGSGREFQF 262

Query: 762  YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 941
            YSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+GY+H+ R+  +  G+
Sbjct: 263  YSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDHSNSKGL 322

Query: 942  CGINTLASY-----XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCE 1106
            CGIN LASY                     C L + CG GETCCC R++LGIC  W+CCE
Sbjct: 323  CGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICLSWRCCE 382

Query: 1107 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTM----VKGLERKKG-SFGKLGGW 1268
              SAVCCKD   CCPHDYP+CDT+   C +  GN TM    ++G  RK   S  KL  W
Sbjct: 383  FTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTMRANEIRGSLRKSSRSKAKLSYW 441


Top