BLASTX nr result

ID: Akebia26_contig00009116 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00009116
         (1478 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          592   e-166
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   588   e-165
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   582   e-163
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   578   e-162
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   578   e-162
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   576   e-161
ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas...   575   e-161
gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulga...   568   e-159
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  566   e-159
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   565   e-158
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   564   e-158
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   560   e-157
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   556   e-156
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   556   e-155
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   555   e-155
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   554   e-155
ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun...   554   e-155
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   553   e-155
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   552   e-154
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   548   e-153

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  592 bits (1527), Expect = e-166
 Identities = 282/426 (66%), Positives = 329/426 (77%), Gaps = 9/426 (2%)
 Frame = -2

Query: 1432 FSSFTSD---LFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNA 1262
            FSS +S+   LF +WC+QHGK Y+S+EEKL+R  VF+DN  F+T HN+  NS+YTL+LNA
Sbjct: 19   FSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNA 78

Query: 1261 FADLTHHEFKSSRFGLSLAAS---NLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 1091
            FADLTHHEFK+SR GLS AAS   N+DRSN Q+  F    DVP+S+DWRK GAVT VKDQ
Sbjct: 79   FADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVA--DVPASVDWRKNGAVTQVKDQ 136

Query: 1090 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 911
             +CGACW+FSATGAIEGIN+IV+GSLVSLSEQELVDCDK+YN+GC GG+MDYAFQFV+DN
Sbjct: 137  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196

Query: 910  KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGS 734
             GIDTE+DYPYQ +  SCNK KLKRHVVTIDGY D+P + E+E+++AVA+QPVSVGICGS
Sbjct: 197  HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256

Query: 733  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 554
            ERAFQ YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGMDGYMHM RN+
Sbjct: 257  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316

Query: 553  GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 380
            G   G+CGIN LASY                 +C L T+CG GETCCC   + GIC  WK
Sbjct: 317  GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376

Query: 379  CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSL 200
            CCE++SAVCCKD R CCP DYPVCDT   +C K   N+T ++    K  S GK   W+SL
Sbjct: 377  CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKF-AKNSSSGKFRSWSSL 435

Query: 199  FEAWNL 182
             E W L
Sbjct: 436  LEGWIL 441


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  588 bits (1515), Expect = e-165
 Identities = 275/417 (65%), Positives = 328/417 (78%), Gaps = 3/417 (0%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250
            SS  S LF +WC++HGK+Y+S+EE+ +R  VFEDN  F+T HN+  NS+Y+LALNAFADL
Sbjct: 22   SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81

Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070
            THHEFK+SR GLS A  NL   N ++TG  V  D+P+SIDWR KG VT VKDQ SCGACW
Sbjct: 82   THHEFKTSRLGLSAAPLNLAHRNLEITG--VVGDIPASIDWRNKGVVTNVKDQGSCGACW 139

Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 890
            +FSATGAIEGIN+IV+GSLVSLSEQEL++CDK+YN GCGGGLMDYAFQFV++N GIDTE+
Sbjct: 140  SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199

Query: 889  DYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQSY 713
            DYPY+A+  +CNK+++KR VVTID Y D+P + E++++QAVA+QPVSVGICGSERAFQ Y
Sbjct: 200  DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259

Query: 712  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 533
            SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM GYMHM RN+G+  G+C
Sbjct: 260  SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVC 319

Query: 532  GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359
            GIN LASY                 KC+LLTYC AGETCCC R+  GIC  WKCC ++SA
Sbjct: 320  GINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSA 379

Query: 358  VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188
            VCCKD   CCPHDYPVCDT   +CFK   N+T ++ +E K  + GK G WNSL EAW
Sbjct: 380  VCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK--TSGKFGSWNSLPEAW 434


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  582 bits (1500), Expect = e-163
 Identities = 277/418 (66%), Positives = 319/418 (76%), Gaps = 3/418 (0%)
 Frame = -2

Query: 1426 SFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLT 1247
            S  S LF +WC+QHGK YSSEEEK YR  VFE+N AF+T HN + NS+Y+LALNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 1246 HHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWA 1067
            HHEFK+SR GLS AA    R N QL G  + RD+P+S+DWR KGAVT VKDQ SCGACW+
Sbjct: 84   HHEFKASRLGLSAAAIEGSRPNLQLPG--LVRDIPASMDWRTKGAVTKVKDQGSCGACWS 141

Query: 1066 FSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKD 887
            FSATGAIEGIN+IV+G+LVSLSEQELVDCD++YNSGC GGLMDYA+QFV+DN GID E+D
Sbjct: 142  FSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEED 201

Query: 886  YPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYS 710
            YPY  +  +CNK K KR VVTIDGY  +P+  E+ ++QAVA QPVSVGICGSERAFQ YS
Sbjct: 202  YPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYS 261

Query: 709  KGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 530
            KGIF G CS+SLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GY+HM RN+GD  G+CG
Sbjct: 262  KGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCG 321

Query: 529  INTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 356
            IN LASY                 KC L TYC AGETCCCT R+ GICF WKCCE++SAV
Sbjct: 322  INMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAV 381

Query: 355  CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSLFEAWNL 182
            CCKD+R CCP+DYPVCDTK   C K   N+T ++  E KR S  K   W    E W L
Sbjct: 382  CCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFE-KRHSTRKFSSWRPFVENWVL 438


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  578 bits (1491), Expect = e-162
 Identities = 274/412 (66%), Positives = 323/412 (78%), Gaps = 3/412 (0%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250
            +S  S+LF  WC +HGK+YSS EEKLYR  VF DN  F+T+HNN++NS+YTL+LN++ADL
Sbjct: 22   TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81

Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070
            THHEFK SR G S A  N      Q    S+ RDVP S+DWRKKGAVT VKDQ SCGACW
Sbjct: 82   THHEFKVSRLGFSPALRNFRPVLPQEP--SLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139

Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 890
            +FSATGA+EGINQI++GSL+SLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDTE 
Sbjct: 140  SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199

Query: 889  DYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEE-EIMQAVASQPVSVGICGSERAFQSY 713
            DYPYQA+  SC K+KL+R+VVTIDGY D+PS +E +++QAVA+QPVSVGICGSERAFQ Y
Sbjct: 200  DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259

Query: 712  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 533
            SKGIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGKSWGMDGYMHM RN+G+  G+C
Sbjct: 260  SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319

Query: 532  GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359
            GIN LASY                 KCS+LT C AGETCCC ++ LG+C  WKCC ++SA
Sbjct: 320  GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379

Query: 358  VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203
            VCCKD R CCP DYP+CDT   LC K T+N T  + LE  R S G  G W+S
Sbjct: 380  VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILE-NRSSSGSSGTWSS 430


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  578 bits (1489), Expect = e-162
 Identities = 273/420 (65%), Positives = 323/420 (76%), Gaps = 5/420 (1%)
 Frame = -2

Query: 1432 FSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 1253
            + S  ++LF +WC+QHGK YSSE+EK  R  +FEDN AF+T HNNM NS++TL+LNAFAD
Sbjct: 21   YCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80

Query: 1252 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 1076
            LTH EFK+S  G S A+ + DR  N+ +      RDVP+SIDWRKKGAVT VKDQASCGA
Sbjct: 81   LTHQEFKASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGA 140

Query: 1075 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 896
            CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT
Sbjct: 141  CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200

Query: 895  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 719
            EKDYPY+ +   CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ
Sbjct: 201  EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260

Query: 718  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539
             YS GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG
Sbjct: 261  LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320

Query: 538  ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 365
            ICGIN LASY                 +CSLLTYC AGETCCC   +LGIC  WKCC  +
Sbjct: 321  ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380

Query: 364  SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188
            SAVCC DHR+CCP +YP+CD+    C  + T N T  + +E  RGS  K G W+S  + W
Sbjct: 381  SAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE-MRGSSWKFGSWSSFIDVW 439


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  576 bits (1484), Expect = e-161
 Identities = 272/420 (64%), Positives = 323/420 (76%), Gaps = 5/420 (1%)
 Frame = -2

Query: 1432 FSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 1253
            + S  ++LF +WC+QHGK YSSE+EK  R  +FEDN AF+T HNNM NS++TL+LNAFAD
Sbjct: 21   YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80

Query: 1252 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 1076
            LTH EFK+S  G S A+ + DR  N+ +      RDVP+SIDWRKKGAVT VKDQASCGA
Sbjct: 81   LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140

Query: 1075 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 896
            CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT
Sbjct: 141  CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200

Query: 895  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 719
            EKDYPY+ +   CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ
Sbjct: 201  EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260

Query: 718  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539
             YS GIF G CSTSLDHAVLI+GY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG
Sbjct: 261  LYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320

Query: 538  ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 365
            ICGIN LASY                 +CSLLTYC  GETCCC   +LGIC  WKCC  +
Sbjct: 321  ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFS 380

Query: 364  SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188
            SAVCC DHR+CCP +YP+CD+    C  + T N T  + +E  RGS  K G W+S  +AW
Sbjct: 381  SAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE-MRGSSWKFGSWSSFIDAW 439


>ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris]
            gi|561009128|gb|ESW08035.1| hypothetical protein
            PHAVU_009G013000g [Phaseolus vulgaris]
          Length = 428

 Score =  575 bits (1483), Expect = e-161
 Identities = 275/412 (66%), Positives = 312/412 (75%), Gaps = 3/412 (0%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 1259
            +S TSDLF  WC++H K YSSEEEK YRF VFEDN AF++ HN   N  NSTYTL+LNAF
Sbjct: 20   ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 79

Query: 1258 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 1079
            ADLTHHEFK+SR G S +     R  +Q     +    PS IDWR+ GAVTPVKDQASCG
Sbjct: 80   ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 137

Query: 1078 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 899
            ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMDYA+QFV+DNKGID
Sbjct: 138  ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDNKGID 197

Query: 898  TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 719
            TE DYPYQA+   CNK+KLKRH+VTID Y DLP  EEE+++AVASQPVSVGICGSERAFQ
Sbjct: 198  TEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEELLKAVASQPVSVGICGSERAFQ 257

Query: 718  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539
             YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNTGD  G
Sbjct: 258  LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNTGDPKG 317

Query: 538  ICGINTLASYXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359
            ICGINTLASY               +C+L T+C  GETCCC +  LGICF WKCC + SA
Sbjct: 318  ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 377

Query: 358  VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203
            VCCKD R CCP DYP+CDT+   C K T  +T +      +    K  GW S
Sbjct: 378  VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 427


>gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulgaris]
          Length = 467

 Score =  568 bits (1463), Expect = e-159
 Identities = 272/412 (66%), Positives = 311/412 (75%), Gaps = 3/412 (0%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 1259
            +S TSDLF  WC++H K YSSEEEK YRF VFEDN AF++ HN   N  NSTYTL+LNAF
Sbjct: 59   ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 118

Query: 1258 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 1079
            ADLTHHEFK+SR G S +     R  +Q     +    PS IDWR+ GAVTPVKDQASCG
Sbjct: 119  ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 176

Query: 1078 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 899
            ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMD+A+QFV+DNKGID
Sbjct: 177  ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDNKGID 236

Query: 898  TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 719
            TE DYPYQA+  SC+K+KLKR  VTI+ Y D+P  EEEI++AVASQPVSVGICGSERAFQ
Sbjct: 237  TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERAFQ 296

Query: 718  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539
             YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WG+DGY+HM RNTGD  G
Sbjct: 297  LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNTGDPKG 356

Query: 538  ICGINTLASYXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359
            ICGINTLASY               +C+L T+C  GETCCC +  LGICF WKCC + SA
Sbjct: 357  ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 416

Query: 358  VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203
            VCCKD R CCP DYP+CDT+   C K T  +T +      +    K  GW S
Sbjct: 417  VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 466


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  566 bits (1459), Expect = e-159
 Identities = 267/384 (69%), Positives = 308/384 (80%), Gaps = 4/384 (1%)
 Frame = -2

Query: 1420 TSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 1241
            +S LF +WCE+HG++YSSEEE+LYR +VFEDNLAF+T HNNM NS+YTL+LNAFADLTHH
Sbjct: 26   SSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHH 85

Query: 1240 EFKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 1064
            EFKSSR G S A  S+L +  S+L      RDVP+S+DWRKKGAVT VKDQ SCGACWAF
Sbjct: 86   EFKSSRLGFSSALLSSLPKLGSKLLDL---RDVPASLDWRKKGAVTNVKDQGSCGACWAF 142

Query: 1063 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 884
            SATGAIEGIN+IV+GSLVSLSEQEL+DCD +YN+GC GGLMDYA+QFV+DN GIDTE+DY
Sbjct: 143  SATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDY 202

Query: 883  PYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSYSK 707
            PYQA+  SC K KLKR VVTIDGYTD+ P+   +++QAV +QPVSVGICGSERAFQ YSK
Sbjct: 203  PYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSK 262

Query: 706  GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 527
            GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWGK WGMDGY+HM RNTG+  G+CGI
Sbjct: 263  GIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGI 322

Query: 526  NTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 353
            N LASY                 +CS    CG GETCCC+ R LG+CF WKCC +NSAVC
Sbjct: 323  NMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVC 382

Query: 352  CKDHRFCCPHDYPVCDTKNKLCFK 281
            CKD   CCP DYP+CDT+  +C K
Sbjct: 383  CKDKIHCCPQDYPLCDTQRNVCLK 406


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  565 bits (1456), Expect = e-158
 Identities = 273/419 (65%), Positives = 313/419 (74%), Gaps = 10/419 (2%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHN-----NMENSTYTLALN 1265
            +S TS+LF  WC++H K YSSEEEKLYR  VFEDN AF+  HN     N  NS+YTL+LN
Sbjct: 26   ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85

Query: 1264 AFADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRD---VPSSIDWRKKGAVTPVKD 1094
            AFADLTHHEFK++R GL L      R  +Q +     RD   +PS IDWR+ GAVTPVKD
Sbjct: 86   AFADLTHHEFKTTRLGLPLTLLRFKRPQNQQS-----RDLLHIPSQIDWRQSGAVTPVKD 140

Query: 1093 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 914
            QASCGACWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD +YNSGCGGGLMD+A+QFV+D
Sbjct: 141  QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVID 200

Query: 913  NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGS 734
            NKGIDTE DYPYQA+  SC+K+KLKR  VTI+ Y D+P  EEEI++AVASQPVSVGICGS
Sbjct: 201  NKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGS 260

Query: 733  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 554
            ER FQ YSKGIF G CST LDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RN+
Sbjct: 261  EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320

Query: 553  GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 380
            G+  GICGINTLASY                 +C+L T+C  GETCCC +  LGICF WK
Sbjct: 321  GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380

Query: 379  CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203
            CC + SAVCCKD R CCP DYP+CDT+   C K T N T     E +  S  K  GW S
Sbjct: 381  CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSH-KSRGWKS 438


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  564 bits (1453), Expect = e-158
 Identities = 266/416 (63%), Positives = 323/416 (77%), Gaps = 8/416 (1%)
 Frame = -2

Query: 1435 CFSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFA 1256
            C  S  SDLF +WC+Q+GK YSSE+E++YRF VFE+N A+IT HN+ ENS+YTL LNA++
Sbjct: 20   CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79

Query: 1255 DLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 1091
            DLTHHEF++S  GLS +A++  R     S S  TG     D PSS+DWR+KGAVT VK+Q
Sbjct: 80   DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVTDVKNQ 139

Query: 1090 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 911
             SCGACW+FSATGA+EGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ N
Sbjct: 140  GSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIKN 199

Query: 910  KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGS 734
             GIDTEKDYP++ +  +CNKNKL+RHVVTIDGYTD+P  +E+ +++AVA+QPVSVGICGS
Sbjct: 200  GGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICGS 259

Query: 733  ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 554
             RAFQSYSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWG SWG++GY+HM RN+
Sbjct: 260  ARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRNS 319

Query: 553  GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 380
            G+Q GICGIN LASY                 KCS+ T CG GETCCC  + LGIC  WK
Sbjct: 320  GNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSWK 379

Query: 379  CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGG 212
            CC ++SAVCCKD R CCP DYP+CDT   LC K   N+T+V+   +K    GK GG
Sbjct: 380  CCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQ-QPQKEAFTGKFGG 434


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  560 bits (1444), Expect = e-157
 Identities = 264/393 (67%), Positives = 315/393 (80%), Gaps = 5/393 (1%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250
            SS  S LF SW ++HGK Y+S+E+KLYRF +FE+N  F+  HN+  NS+YTL+LNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 1249 THHEFKSSRFGLSLAASN--LDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 1076
            THHEFK+SR GLS  +++  L R N  L  F    DVP SIDWRKKGAV+ VKDQ +CGA
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGA 142

Query: 1075 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 896
            CW+FSATGAIEGIN+IV+GSLVSLSEQELVDCD++YN+GC GGLMDYA+QFV++N GIDT
Sbjct: 143  CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDT 202

Query: 895  EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 719
            E+DYPYQA+  +CNK KLKRHVVTIDGYTD+P + E+E+++AVA+QPVSVGICGSERAFQ
Sbjct: 203  EEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQ 262

Query: 718  SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539
             YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WG++GYM+M RN+G+  G
Sbjct: 263  LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322

Query: 538  ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 365
            +CGIN LAS+                 KC L T CG GETCCCTRR+ G+CF WKCCE++
Sbjct: 323  LCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELD 382

Query: 364  SAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNS 266
            SAVCCKD   CCPHDYPVCDTK  +C K ++ S
Sbjct: 383  SAVCCKDGLHCCPHDYPVCDTKRNMCLKVSIFS 415


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  556 bits (1434), Expect = e-156
 Identities = 264/409 (64%), Positives = 314/409 (76%), Gaps = 6/409 (1%)
 Frame = -2

Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238
            S+LF+ WC++HGK Y SEEE+  R  +F+DN  F+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 1237 FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1061
            FK+SR GLS++A S +  S  Q  G SV   VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 1060 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 881
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 880  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 704
            YQ +  +C K+KLK+ VVTID Y  + S +E+ +M+AVA+QPVSVGICGSERAFQ YS+G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266

Query: 703  IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 524
            IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT +  G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 523  TLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 350
             LASY                 KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 349  KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGS--FGKLGGW 209
            KD R CCPHDYPVCDT   LC K T N T +K   +K  S   G+   W
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  556 bits (1432), Expect = e-155
 Identities = 264/409 (64%), Positives = 313/409 (76%), Gaps = 6/409 (1%)
 Frame = -2

Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238
            S+LF+ WC++HGK Y SEEE+  R  +F+DN  F+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 1237 FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1061
            FK+SR GLS++A S +  S  Q  G SV   VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 1060 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 881
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 880  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 704
            YQ +  +C K+KLK+ VVTID Y  + S +E+ +M+AVA+QPVSVGICGSERAFQ YS G
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSG 266

Query: 703  IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 524
            IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT +  G+CGIN
Sbjct: 267  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326

Query: 523  TLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 350
             LASY                 KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC
Sbjct: 327  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386

Query: 349  KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGS--FGKLGGW 209
            KD R CCPHDYPVCDT   LC K T N T +K   +K  S   G+   W
Sbjct: 387  KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  555 bits (1431), Expect = e-155
 Identities = 267/415 (64%), Positives = 313/415 (75%), Gaps = 5/415 (1%)
 Frame = -2

Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238
            ++LF+ WC +HGK Y SEEE+ +R  +F DN  F+T HN++ NSTY+L+LNAFADLTHHE
Sbjct: 34   AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHE 93

Query: 1237 FKSSRFGLSLAASNLDRSNSQLTGFS--VFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 1064
            FK+SR GLS  + +L  +  Q  G S  V   VP S+DWRKKGAVT VKDQ SCGACW+F
Sbjct: 94   FKASRLGLSAPSPSL-MAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 152

Query: 1063 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 884
            SATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDY
Sbjct: 153  SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 212

Query: 883  PYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK 707
            PYQ +  +C K+KLK+ VVTID Y  + S  E+ +M+AVASQPVSVGICGSERAFQ YS 
Sbjct: 213  PYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSS 272

Query: 706  GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 527
            GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+  G+CGI
Sbjct: 273  GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGI 332

Query: 526  NTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 353
            N LASY                 KC+L TYC +GETCCC R L G+CF WKCCE+ SAVC
Sbjct: 333  NMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVC 392

Query: 352  CKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188
            CKD R CCP DYPVCDT   LC K T N T +K   +K  S  KLG     FE W
Sbjct: 393  CKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSS-NKLG----RFEEW 442


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  554 bits (1428), Expect = e-155
 Identities = 265/411 (64%), Positives = 314/411 (76%), Gaps = 8/411 (1%)
 Frame = -2

Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238
            S+LF+ WC++HGK Y SEEE+  R  +F+DN  F+T HN + N+TY+L+LNAFADLTHHE
Sbjct: 29   SELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88

Query: 1237 FKSSRFGLSLAASNLDR-SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1061
            FK+SR GLS++AS+L   S  Q  G +    VP S+DWRKKGAVT VKDQ SCGACW+FS
Sbjct: 89   FKASRLGLSVSASSLIMASKGQSLGGNA--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 1060 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 881
            ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP
Sbjct: 147  ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 880  YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK- 707
            YQ +  +C K+KLK+ VVTID Y  + S +E+ + +AVA+QPVSVGICGSERAFQ YS+ 
Sbjct: 207  YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266

Query: 706  -GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 530
             GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+  GICG
Sbjct: 267  SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICG 326

Query: 529  INTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 356
            IN LASY                 KC+L TYC AGETCCC R L G+CF WKCCE+ SAV
Sbjct: 327  INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAV 386

Query: 355  CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGS--FGKLGGW 209
            CC D R CCPHDYPVCDT   LC K T N T +K   +K  S   G+  GW
Sbjct: 387  CCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSSNKLGRFEGW 437


>ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica]
            gi|462420299|gb|EMJ24562.1| hypothetical protein
            PRUPE_ppa005615mg [Prunus persica]
          Length = 451

 Score =  554 bits (1427), Expect = e-155
 Identities = 271/429 (63%), Positives = 321/429 (74%), Gaps = 13/429 (3%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250
            S  TS+LF  WC+Q+GK+YSS +EKLYR SVFEDNLAF+T HN+M NS+YTL+LN F+DL
Sbjct: 26   SQTTSELFEVWCKQYGKSYSSAQEKLYRLSVFEDNLAFVTQHNDMGNSSYTLSLNDFSDL 85

Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070
            THHEFKSSR G S +  +L   + +    SV RD+PSS+DWRKKGAVT VKDQ SCGACW
Sbjct: 86   THHEFKSSRLGFSPSFLSLKLKSDRKP--SVVRDLPSSLDWRKKGAVTNVKDQGSCGACW 143

Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 893
            AFS TGAIEGIN+IV+GSL+SLSEQELVDCD+ Y N+GC GGLMD AF+FV+DN GIDTE
Sbjct: 144  AFSTTGAIEGINKIVTGSLISLSEQELVDCDRVYPNNGCNGGLMDDAFRFVIDNNGIDTE 203

Query: 892  KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQS 716
            +DYPY+    +C K KLKR+ VTID YTD+PS +EE ++QAVASQPVSVGI GS+  FQ 
Sbjct: 204  EDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVASQPVSVGISGSDMGFQL 263

Query: 715  YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 536
            YSKGIFNG CSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GYMHM R+  +  GI
Sbjct: 264  YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGMNGYMHMLRDHSNPKGI 323

Query: 535  CGINTLASY-XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359
            CGINTLASY                +C + T+C AGETCCC +R++GICF W+CCE++SA
Sbjct: 324  CGINTLASYPIKTGENPPLPPPGPTRCDIFTHCAAGETCCCAKRVVGICFSWRCCELDSA 383

Query: 358  VCCKDHRFCCPHDYPVCDTKNKLCFKG---------TVNSTMVKGLERKRGSFGKLG-GW 209
            VCCKD R CCP DYP+CDT+  LC +             +   K LE  RGS  K G GW
Sbjct: 384  VCCKDQRHCCPRDYPICDTERTLCLQSNEQLSTQSHATGNLTSKALE-SRGSLRKSGRGW 442

Query: 208  NSLFEAWNL 182
             S+   W L
Sbjct: 443  GSMIRDWIL 451


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  553 bits (1425), Expect = e-155
 Identities = 268/412 (65%), Positives = 306/412 (74%), Gaps = 6/412 (1%)
 Frame = -2

Query: 1420 TSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 1241
            TS LF  WC+QHGK Y SE+EK YRF+VFEDN AF+  HN + NS+YTL+LNAFADLTHH
Sbjct: 26   TSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHH 85

Query: 1240 EFKSSRFGL---SLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070
            EFK++R GL   SL     +R   Q      F  VPS IDWRK GAV+ VKDQ SCGACW
Sbjct: 86   EFKATRLGLPPSSLLRFKFNRFQDQQRSDD-FLQVPSEIDWRKNGAVSIVKDQGSCGACW 144

Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 890
            +FSATGAIEGIN+IV+GSLVSLSEQELVDCD TYNSGC GGLMDYA+QF++DN GIDTE+
Sbjct: 145  SFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEE 204

Query: 889  DYPYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSY 713
            DYPYQA+   C K+KLKR VVTIDGYTD+ P+ E+++++AVA QPVSVGICGS RAFQ Y
Sbjct: 205  DYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLY 264

Query: 712  SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 533
            SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNT    G+C
Sbjct: 265  SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLC 324

Query: 532  GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359
            GIN LASY                 KC+L TYC  GETCCC ++ LGICF WKCC V SA
Sbjct: 325  GINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSA 384

Query: 358  VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203
            VCCKD R CCP DYPVCD  N  C K   N T++   + K   F +   W S
Sbjct: 385  VCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSD-KEDPFHQTRDWRS 435


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  552 bits (1422), Expect = e-154
 Identities = 262/417 (62%), Positives = 316/417 (75%), Gaps = 8/417 (1%)
 Frame = -2

Query: 1438 ICFSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAF 1259
            +C  S  SDLF +WC+Q+GK YSSE+E++YRF VFE+N A+IT HN+  NS+YTL LNA+
Sbjct: 19   LCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAY 78

Query: 1258 ADLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKD 1094
            +DLTHHEF++S  GLS +A++  R     S S   G     D PSS+DWR KGAVT VK+
Sbjct: 79   SDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVTNVKN 138

Query: 1093 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 914
            Q SCGACW+FSATGAIEGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ 
Sbjct: 139  QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198

Query: 913  NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICG 737
            N GIDTEKDYP++ K  +CNKNKL+R VVTIDGYTD+P  +E+ +++AVA+QPVSVGICG
Sbjct: 199  NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258

Query: 736  SERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARN 557
            S RAFQSYSKGIF G C T LDHAVLIVGYGSENG DYWI+KNSWG SWG++GY+HM RN
Sbjct: 259  SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318

Query: 556  TGDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHW 383
            +G+Q GICG+N LASY                 KCS  T CG GETCCC  + LGIC  W
Sbjct: 319  SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378

Query: 382  KCCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGG 212
            KCC ++SAVCCKD R CCP DYP+CDT   LC K   N+T+V+   +K    GK GG
Sbjct: 379  KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQ-QPQKEPFTGKFGG 434


>ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 441

 Score =  548 bits (1411), Expect = e-153
 Identities = 266/419 (63%), Positives = 316/419 (75%), Gaps = 12/419 (2%)
 Frame = -2

Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250
            SS +S+LF +WC+Q+GK+YSS+EEKLYR S+FE NLAFIT HN++ NS+YTL+LN+F+DL
Sbjct: 25   SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84

Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070
            THHEFK+SR G S     L R +      SV R VPSSIDWRK GAVT VKDQ SCGACW
Sbjct: 85   THHEFKASRLGFSPTFLRLYRKSDPKP--SVVRHVPSSIDWRKNGAVTNVKDQGSCGACW 142

Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 893
            +FSATGAIEGIN+IV+GSLVSLSEQEL+DCD+ Y NSGC GGLMD AFQF++DN GIDTE
Sbjct: 143  SFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDNNGIDTE 202

Query: 892  KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSK-EEEIMQAVASQPVSVGICGSERAFQS 716
            +DYPYQ    +CNK KLKRHVVTIDGYTD+P+  EE++++AVA+QPVSVGI GS R FQ 
Sbjct: 203  EDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGSGREFQF 262

Query: 715  YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 536
            YSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+GY+H+ R+  +  G+
Sbjct: 263  YSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDHSNSKGL 322

Query: 535  CGINTLASY-----XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCE 371
            CGIN LASY                    KC L + CG GETCCC R++LGIC  W+CCE
Sbjct: 323  CGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICLSWRCCE 382

Query: 370  VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTM----VKGLERKRG-SFGKLGGW 209
              SAVCCKD   CCPHDYP+CDT+   C +   N TM    ++G  RK   S  KL  W
Sbjct: 383  FTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTMRANEIRGSLRKSSRSKAKLSYW 441


Top