BLASTX nr result
ID: Akebia26_contig00009116
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia26_contig00009116 (1478 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 592 e-166 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 588 e-165 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 582 e-163 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 578 e-162 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 578 e-162 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 576 e-161 ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas... 575 e-161 gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulga... 568 e-159 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 566 e-159 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 565 e-158 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 564 e-158 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 560 e-157 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 556 e-156 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 556 e-155 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 555 e-155 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 554 e-155 ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun... 554 e-155 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 553 e-155 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 552 e-154 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 548 e-153 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 592 bits (1527), Expect = e-166 Identities = 282/426 (66%), Positives = 329/426 (77%), Gaps = 9/426 (2%) Frame = -2 Query: 1432 FSSFTSD---LFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNA 1262 FSS +S+ LF +WC+QHGK Y+S+EEKL+R VF+DN F+T HN+ NS+YTL+LNA Sbjct: 19 FSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNA 78 Query: 1261 FADLTHHEFKSSRFGLSLAAS---NLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 1091 FADLTHHEFK+SR GLS AAS N+DRSN Q+ F DVP+S+DWRK GAVT VKDQ Sbjct: 79 FADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVA--DVPASVDWRKNGAVTQVKDQ 136 Query: 1090 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 911 +CGACW+FSATGAIEGIN+IV+GSLVSLSEQELVDCDK+YN+GC GG+MDYAFQFV+DN Sbjct: 137 GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196 Query: 910 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGS 734 GIDTE+DYPYQ + SCNK KLKRHVVTIDGY D+P + E+E+++AVA+QPVSVGICGS Sbjct: 197 HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256 Query: 733 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 554 ERAFQ YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGMDGYMHM RN+ Sbjct: 257 ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316 Query: 553 GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 380 G G+CGIN LASY +C L T+CG GETCCC + GIC WK Sbjct: 317 GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376 Query: 379 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSL 200 CCE++SAVCCKD R CCP DYPVCDT +C K N+T ++ K S GK W+SL Sbjct: 377 CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKF-AKNSSSGKFRSWSSL 435 Query: 199 FEAWNL 182 E W L Sbjct: 436 LEGWIL 441 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 588 bits (1515), Expect = e-165 Identities = 275/417 (65%), Positives = 328/417 (78%), Gaps = 3/417 (0%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250 SS S LF +WC++HGK+Y+S+EE+ +R VFEDN F+T HN+ NS+Y+LALNAFADL Sbjct: 22 SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81 Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070 THHEFK+SR GLS A NL N ++TG V D+P+SIDWR KG VT VKDQ SCGACW Sbjct: 82 THHEFKTSRLGLSAAPLNLAHRNLEITG--VVGDIPASIDWRNKGVVTNVKDQGSCGACW 139 Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 890 +FSATGAIEGIN+IV+GSLVSLSEQEL++CDK+YN GCGGGLMDYAFQFV++N GIDTE+ Sbjct: 140 SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199 Query: 889 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQSY 713 DYPY+A+ +CNK+++KR VVTID Y D+P + E++++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259 Query: 712 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 533 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM GYMHM RN+G+ G+C Sbjct: 260 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVC 319 Query: 532 GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359 GIN LASY KC+LLTYC AGETCCC R+ GIC WKCC ++SA Sbjct: 320 GINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSA 379 Query: 358 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188 VCCKD CCPHDYPVCDT +CFK N+T ++ +E K + GK G WNSL EAW Sbjct: 380 VCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK--TSGKFGSWNSLPEAW 434 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 582 bits (1500), Expect = e-163 Identities = 277/418 (66%), Positives = 319/418 (76%), Gaps = 3/418 (0%) Frame = -2 Query: 1426 SFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLT 1247 S S LF +WC+QHGK YSSEEEK YR VFE+N AF+T HN + NS+Y+LALNAFADLT Sbjct: 24 SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83 Query: 1246 HHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWA 1067 HHEFK+SR GLS AA R N QL G + RD+P+S+DWR KGAVT VKDQ SCGACW+ Sbjct: 84 HHEFKASRLGLSAAAIEGSRPNLQLPG--LVRDIPASMDWRTKGAVTKVKDQGSCGACWS 141 Query: 1066 FSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKD 887 FSATGAIEGIN+IV+G+LVSLSEQELVDCD++YNSGC GGLMDYA+QFV+DN GID E+D Sbjct: 142 FSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEED 201 Query: 886 YPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYS 710 YPY + +CNK K KR VVTIDGY +P+ E+ ++QAVA QPVSVGICGSERAFQ YS Sbjct: 202 YPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYS 261 Query: 709 KGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 530 KGIF G CS+SLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GY+HM RN+GD G+CG Sbjct: 262 KGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCG 321 Query: 529 INTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 356 IN LASY KC L TYC AGETCCCT R+ GICF WKCCE++SAV Sbjct: 322 INMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAV 381 Query: 355 CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSLFEAWNL 182 CCKD+R CCP+DYPVCDTK C K N+T ++ E KR S K W E W L Sbjct: 382 CCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFE-KRHSTRKFSSWRPFVENWVL 438 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 578 bits (1491), Expect = e-162 Identities = 274/412 (66%), Positives = 323/412 (78%), Gaps = 3/412 (0%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250 +S S+LF WC +HGK+YSS EEKLYR VF DN F+T+HNN++NS+YTL+LN++ADL Sbjct: 22 TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81 Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070 THHEFK SR G S A N Q S+ RDVP S+DWRKKGAVT VKDQ SCGACW Sbjct: 82 THHEFKVSRLGFSPALRNFRPVLPQEP--SLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139 Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 890 +FSATGA+EGINQI++GSL+SLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDTE Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199 Query: 889 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEE-EIMQAVASQPVSVGICGSERAFQSY 713 DYPYQA+ SC K+KL+R+VVTIDGY D+PS +E +++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259 Query: 712 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 533 SKGIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGKSWGMDGYMHM RN+G+ G+C Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319 Query: 532 GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359 GIN LASY KCS+LT C AGETCCC ++ LG+C WKCC ++SA Sbjct: 320 GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379 Query: 358 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203 VCCKD R CCP DYP+CDT LC K T+N T + LE R S G G W+S Sbjct: 380 VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILE-NRSSSGSSGTWSS 430 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 578 bits (1489), Expect = e-162 Identities = 273/420 (65%), Positives = 323/420 (76%), Gaps = 5/420 (1%) Frame = -2 Query: 1432 FSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 1253 + S ++LF +WC+QHGK YSSE+EK R +FEDN AF+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 1252 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 1076 LTH EFK+S G S A+ + DR N+ + RDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 1075 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 896 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 895 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 719 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 718 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539 YS GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 538 ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 365 ICGIN LASY +CSLLTYC AGETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380 Query: 364 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188 SAVCC DHR+CCP +YP+CD+ C + T N T + +E RGS K G W+S + W Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE-MRGSSWKFGSWSSFIDVW 439 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 576 bits (1484), Expect = e-161 Identities = 272/420 (64%), Positives = 323/420 (76%), Gaps = 5/420 (1%) Frame = -2 Query: 1432 FSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFAD 1253 + S ++LF +WC+QHGK YSSE+EK R +FEDN AF+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 1252 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 1076 LTH EFK+S G S A+ + DR N+ + RDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 1075 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 896 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 895 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 719 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 718 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539 YS GIF G CSTSLDHAVLI+GY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 538 ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 365 ICGIN LASY +CSLLTYC GETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFS 380 Query: 364 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188 SAVCC DHR+CCP +YP+CD+ C + T N T + +E RGS K G W+S +AW Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE-MRGSSWKFGSWSSFIDAW 439 >ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] gi|561009128|gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] Length = 428 Score = 575 bits (1483), Expect = e-161 Identities = 275/412 (66%), Positives = 312/412 (75%), Gaps = 3/412 (0%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 1259 +S TSDLF WC++H K YSSEEEK YRF VFEDN AF++ HN N NSTYTL+LNAF Sbjct: 20 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 79 Query: 1258 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 1079 ADLTHHEFK+SR G S + R +Q + PS IDWR+ GAVTPVKDQASCG Sbjct: 80 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 137 Query: 1078 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 899 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMDYA+QFV+DNKGID Sbjct: 138 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDNKGID 197 Query: 898 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 719 TE DYPYQA+ CNK+KLKRH+VTID Y DLP EEE+++AVASQPVSVGICGSERAFQ Sbjct: 198 TEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEELLKAVASQPVSVGICGSERAFQ 257 Query: 718 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNTGD G Sbjct: 258 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNTGDPKG 317 Query: 538 ICGINTLASYXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359 ICGINTLASY +C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 318 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 377 Query: 358 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203 VCCKD R CCP DYP+CDT+ C K T +T + + K GW S Sbjct: 378 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 427 >gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulgaris] Length = 467 Score = 568 bits (1463), Expect = e-159 Identities = 272/412 (66%), Positives = 311/412 (75%), Gaps = 3/412 (0%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHN---NMENSTYTLALNAF 1259 +S TSDLF WC++H K YSSEEEK YRF VFEDN AF++ HN N NSTYTL+LNAF Sbjct: 59 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 118 Query: 1258 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCG 1079 ADLTHHEFK+SR G S + R +Q + PS IDWR+ GAVTPVKDQASCG Sbjct: 119 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRHLLHN--PSQIDWRQSGAVTPVKDQASCG 176 Query: 1078 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 899 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMD+A+QFV+DNKGID Sbjct: 177 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDNKGID 236 Query: 898 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 719 TE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGSERAFQ Sbjct: 237 TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERAFQ 296 Query: 718 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WG+DGY+HM RNTGD G Sbjct: 297 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNTGDPKG 356 Query: 538 ICGINTLASYXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359 ICGINTLASY +C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 357 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 416 Query: 358 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203 VCCKD R CCP DYP+CDT+ C K T +T + + K GW S Sbjct: 417 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTI--TSGNKDISNKPRGWKS 466 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 566 bits (1459), Expect = e-159 Identities = 267/384 (69%), Positives = 308/384 (80%), Gaps = 4/384 (1%) Frame = -2 Query: 1420 TSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 1241 +S LF +WCE+HG++YSSEEE+LYR +VFEDNLAF+T HNNM NS+YTL+LNAFADLTHH Sbjct: 26 SSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHH 85 Query: 1240 EFKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 1064 EFKSSR G S A S+L + S+L RDVP+S+DWRKKGAVT VKDQ SCGACWAF Sbjct: 86 EFKSSRLGFSSALLSSLPKLGSKLLDL---RDVPASLDWRKKGAVTNVKDQGSCGACWAF 142 Query: 1063 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 884 SATGAIEGIN+IV+GSLVSLSEQEL+DCD +YN+GC GGLMDYA+QFV+DN GIDTE+DY Sbjct: 143 SATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDY 202 Query: 883 PYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSYSK 707 PYQA+ SC K KLKR VVTIDGYTD+ P+ +++QAV +QPVSVGICGSERAFQ YSK Sbjct: 203 PYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSK 262 Query: 706 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 527 GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWGK WGMDGY+HM RNTG+ G+CGI Sbjct: 263 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGI 322 Query: 526 NTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 353 N LASY +CS CG GETCCC+ R LG+CF WKCC +NSAVC Sbjct: 323 NMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVC 382 Query: 352 CKDHRFCCPHDYPVCDTKNKLCFK 281 CKD CCP DYP+CDT+ +C K Sbjct: 383 CKDKIHCCPQDYPLCDTQRNVCLK 406 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 565 bits (1456), Expect = e-158 Identities = 273/419 (65%), Positives = 313/419 (74%), Gaps = 10/419 (2%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHN-----NMENSTYTLALN 1265 +S TS+LF WC++H K YSSEEEKLYR VFEDN AF+ HN N NS+YTL+LN Sbjct: 26 ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85 Query: 1264 AFADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRD---VPSSIDWRKKGAVTPVKD 1094 AFADLTHHEFK++R GL L R +Q + RD +PS IDWR+ GAVTPVKD Sbjct: 86 AFADLTHHEFKTTRLGLPLTLLRFKRPQNQQS-----RDLLHIPSQIDWRQSGAVTPVKD 140 Query: 1093 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 914 QASCGACWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD +YNSGCGGGLMD+A+QFV+D Sbjct: 141 QASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVID 200 Query: 913 NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGS 734 NKGIDTE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGS Sbjct: 201 NKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGS 260 Query: 733 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 554 ER FQ YSKGIF G CST LDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RN+ Sbjct: 261 EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320 Query: 553 GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 380 G+ GICGINTLASY +C+L T+C GETCCC + LGICF WK Sbjct: 321 GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380 Query: 379 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203 CC + SAVCCKD R CCP DYP+CDT+ C K T N T E + S K GW S Sbjct: 381 CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSH-KSRGWKS 438 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 564 bits (1453), Expect = e-158 Identities = 266/416 (63%), Positives = 323/416 (77%), Gaps = 8/416 (1%) Frame = -2 Query: 1435 CFSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFA 1256 C S SDLF +WC+Q+GK YSSE+E++YRF VFE+N A+IT HN+ ENS+YTL LNA++ Sbjct: 20 CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79 Query: 1255 DLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQ 1091 DLTHHEF++S GLS +A++ R S S TG D PSS+DWR+KGAVT VK+Q Sbjct: 80 DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVTDVKNQ 139 Query: 1090 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 911 SCGACW+FSATGA+EGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ N Sbjct: 140 GSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIKN 199 Query: 910 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGS 734 GIDTEKDYP++ + +CNKNKL+RHVVTIDGYTD+P +E+ +++AVA+QPVSVGICGS Sbjct: 200 GGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICGS 259 Query: 733 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 554 RAFQSYSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWG SWG++GY+HM RN+ Sbjct: 260 ARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRNS 319 Query: 553 GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 380 G+Q GICGIN LASY KCS+ T CG GETCCC + LGIC WK Sbjct: 320 GNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSWK 379 Query: 379 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGG 212 CC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K GK GG Sbjct: 380 CCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQ-QPQKEAFTGKFGG 434 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 560 bits (1444), Expect = e-157 Identities = 264/393 (67%), Positives = 315/393 (80%), Gaps = 5/393 (1%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250 SS S LF SW ++HGK Y+S+E+KLYRF +FE+N F+ HN+ NS+YTL+LNAFADL Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84 Query: 1249 THHEFKSSRFGLSLAASN--LDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGA 1076 THHEFK+SR GLS +++ L R N L F DVP SIDWRKKGAV+ VKDQ +CGA Sbjct: 85 THHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGA 142 Query: 1075 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 896 CW+FSATGAIEGIN+IV+GSLVSLSEQELVDCD++YN+GC GGLMDYA+QFV++N GIDT Sbjct: 143 CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDT 202 Query: 895 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 719 E+DYPYQA+ +CNK KLKRHVVTIDGYTD+P + E+E+++AVA+QPVSVGICGSERAFQ Sbjct: 203 EEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQ 262 Query: 718 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 539 YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WG++GYM+M RN+G+ G Sbjct: 263 LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322 Query: 538 ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 365 +CGIN LAS+ KC L T CG GETCCCTRR+ G+CF WKCCE++ Sbjct: 323 LCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELD 382 Query: 364 SAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNS 266 SAVCCKD CCPHDYPVCDTK +C K ++ S Sbjct: 383 SAVCCKDGLHCCPHDYPVCDTKRNMCLKVSIFS 415 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 556 bits (1434), Expect = e-156 Identities = 264/409 (64%), Positives = 314/409 (76%), Gaps = 6/409 (1%) Frame = -2 Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238 S+LF+ WC++HGK Y SEEE+ R +F+DN F+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 1237 FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1061 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1060 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 881 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 880 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 704 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS+G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266 Query: 703 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 524 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 523 TLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 350 LASY KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 349 KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGS--FGKLGGW 209 KD R CCPHDYPVCDT LC K T N T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 556 bits (1432), Expect = e-155 Identities = 264/409 (64%), Positives = 313/409 (76%), Gaps = 6/409 (1%) Frame = -2 Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238 S+LF+ WC++HGK Y SEEE+ R +F+DN F+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 1237 FKSSRFGLSLAA-SNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1061 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1060 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 881 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 880 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 704 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSG 266 Query: 703 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 524 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 523 TLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 350 LASY KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 349 KDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGS--FGKLGGW 209 KD R CCPHDYPVCDT LC K T N T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 555 bits (1431), Expect = e-155 Identities = 267/415 (64%), Positives = 313/415 (75%), Gaps = 5/415 (1%) Frame = -2 Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238 ++LF+ WC +HGK Y SEEE+ +R +F DN F+T HN++ NSTY+L+LNAFADLTHHE Sbjct: 34 AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHE 93 Query: 1237 FKSSRFGLSLAASNLDRSNSQLTGFS--VFRDVPSSIDWRKKGAVTPVKDQASCGACWAF 1064 FK+SR GLS + +L + Q G S V VP S+DWRKKGAVT VKDQ SCGACW+F Sbjct: 94 FKASRLGLSAPSPSL-MAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 152 Query: 1063 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 884 SATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDY Sbjct: 153 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 212 Query: 883 PYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK 707 PYQ + +C K+KLK+ VVTID Y + S E+ +M+AVASQPVSVGICGSERAFQ YS Sbjct: 213 PYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSS 272 Query: 706 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 527 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ G+CGI Sbjct: 273 GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGI 332 Query: 526 NTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 353 N LASY KC+L TYC +GETCCC R L G+CF WKCCE+ SAVC Sbjct: 333 NMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVC 392 Query: 352 CKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNSLFEAW 188 CKD R CCP DYPVCDT LC K T N T +K +K S KLG FE W Sbjct: 393 CKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSS-NKLG----RFEEW 442 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 554 bits (1428), Expect = e-155 Identities = 265/411 (64%), Positives = 314/411 (76%), Gaps = 8/411 (1%) Frame = -2 Query: 1417 SDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHHE 1238 S+LF+ WC++HGK Y SEEE+ R +F+DN F+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 1237 FKSSRFGLSLAASNLDR-SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1061 FK+SR GLS++AS+L S Q G + VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSASSLIMASKGQSLGGNA--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1060 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 881 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 880 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK- 707 YQ + +C K+KLK+ VVTID Y + S +E+ + +AVA+QPVSVGICGSERAFQ YS+ Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266 Query: 706 -GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 530 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ GICG Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICG 326 Query: 529 INTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 356 IN LASY KC+L TYC AGETCCC R L G+CF WKCCE+ SAV Sbjct: 327 INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAV 386 Query: 355 CCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGS--FGKLGGW 209 CC D R CCPHDYPVCDT LC K T N T +K +K S G+ GW Sbjct: 387 CCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSSNKLGRFEGW 437 >ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] gi|462420299|gb|EMJ24562.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] Length = 451 Score = 554 bits (1427), Expect = e-155 Identities = 271/429 (63%), Positives = 321/429 (74%), Gaps = 13/429 (3%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250 S TS+LF WC+Q+GK+YSS +EKLYR SVFEDNLAF+T HN+M NS+YTL+LN F+DL Sbjct: 26 SQTTSELFEVWCKQYGKSYSSAQEKLYRLSVFEDNLAFVTQHNDMGNSSYTLSLNDFSDL 85 Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070 THHEFKSSR G S + +L + + SV RD+PSS+DWRKKGAVT VKDQ SCGACW Sbjct: 86 THHEFKSSRLGFSPSFLSLKLKSDRKP--SVVRDLPSSLDWRKKGAVTNVKDQGSCGACW 143 Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 893 AFS TGAIEGIN+IV+GSL+SLSEQELVDCD+ Y N+GC GGLMD AF+FV+DN GIDTE Sbjct: 144 AFSTTGAIEGINKIVTGSLISLSEQELVDCDRVYPNNGCNGGLMDDAFRFVIDNNGIDTE 203 Query: 892 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQS 716 +DYPY+ +C K KLKR+ VTID YTD+PS +EE ++QAVASQPVSVGI GS+ FQ Sbjct: 204 EDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVASQPVSVGISGSDMGFQL 263 Query: 715 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 536 YSKGIFNG CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GYMHM R+ + GI Sbjct: 264 YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGMNGYMHMLRDHSNPKGI 323 Query: 535 CGINTLASY-XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359 CGINTLASY +C + T+C AGETCCC +R++GICF W+CCE++SA Sbjct: 324 CGINTLASYPIKTGENPPLPPPGPTRCDIFTHCAAGETCCCAKRVVGICFSWRCCELDSA 383 Query: 358 VCCKDHRFCCPHDYPVCDTKNKLCFKG---------TVNSTMVKGLERKRGSFGKLG-GW 209 VCCKD R CCP DYP+CDT+ LC + + K LE RGS K G GW Sbjct: 384 VCCKDQRHCCPRDYPICDTERTLCLQSNEQLSTQSHATGNLTSKALE-SRGSLRKSGRGW 442 Query: 208 NSLFEAWNL 182 S+ W L Sbjct: 443 GSMIRDWIL 451 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 553 bits (1425), Expect = e-155 Identities = 268/412 (65%), Positives = 306/412 (74%), Gaps = 6/412 (1%) Frame = -2 Query: 1420 TSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADLTHH 1241 TS LF WC+QHGK Y SE+EK YRF+VFEDN AF+ HN + NS+YTL+LNAFADLTHH Sbjct: 26 TSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHH 85 Query: 1240 EFKSSRFGL---SLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070 EFK++R GL SL +R Q F VPS IDWRK GAV+ VKDQ SCGACW Sbjct: 86 EFKATRLGLPPSSLLRFKFNRFQDQQRSDD-FLQVPSEIDWRKNGAVSIVKDQGSCGACW 144 Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 890 +FSATGAIEGIN+IV+GSLVSLSEQELVDCD TYNSGC GGLMDYA+QF++DN GIDTE+ Sbjct: 145 SFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEE 204 Query: 889 DYPYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSY 713 DYPYQA+ C K+KLKR VVTIDGYTD+ P+ E+++++AVA QPVSVGICGS RAFQ Y Sbjct: 205 DYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLY 264 Query: 712 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 533 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNT G+C Sbjct: 265 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLC 324 Query: 532 GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 359 GIN LASY KC+L TYC GETCCC ++ LGICF WKCC V SA Sbjct: 325 GINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSA 384 Query: 358 VCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGGWNS 203 VCCKD R CCP DYPVCD N C K N T++ + K F + W S Sbjct: 385 VCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSD-KEDPFHQTRDWRS 435 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 552 bits (1422), Expect = e-154 Identities = 262/417 (62%), Positives = 316/417 (75%), Gaps = 8/417 (1%) Frame = -2 Query: 1438 ICFSSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAF 1259 +C S SDLF +WC+Q+GK YSSE+E++YRF VFE+N A+IT HN+ NS+YTL LNA+ Sbjct: 19 LCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAY 78 Query: 1258 ADLTHHEFKSSRFGLSLAASNLDR-----SNSQLTGFSVFRDVPSSIDWRKKGAVTPVKD 1094 +DLTHHEF++S GLS +A++ R S S G D PSS+DWR KGAVT VK+ Sbjct: 79 SDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVTNVKN 138 Query: 1093 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 914 Q SCGACW+FSATGAIEGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ Sbjct: 139 QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198 Query: 913 NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICG 737 N GIDTEKDYP++ K +CNKNKL+R VVTIDGYTD+P +E+ +++AVA+QPVSVGICG Sbjct: 199 NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258 Query: 736 SERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARN 557 S RAFQSYSKGIF G C T LDHAVLIVGYGSENG DYWI+KNSWG SWG++GY+HM RN Sbjct: 259 SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318 Query: 556 TGDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHW 383 +G+Q GICG+N LASY KCS T CG GETCCC + LGIC W Sbjct: 319 SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378 Query: 382 KCCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTMVKGLERKRGSFGKLGG 212 KCC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K GK GG Sbjct: 379 KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQ-QPQKEPFTGKFGG 434 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 548 bits (1411), Expect = e-153 Identities = 266/419 (63%), Positives = 316/419 (75%), Gaps = 12/419 (2%) Frame = -2 Query: 1429 SSFTSDLFNSWCEQHGKNYSSEEEKLYRFSVFEDNLAFITNHNNMENSTYTLALNAFADL 1250 SS +S+LF +WC+Q+GK+YSS+EEKLYR S+FE NLAFIT HN++ NS+YTL+LN+F+DL Sbjct: 25 SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84 Query: 1249 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVFRDVPSSIDWRKKGAVTPVKDQASCGACW 1070 THHEFK+SR G S L R + SV R VPSSIDWRK GAVT VKDQ SCGACW Sbjct: 85 THHEFKASRLGFSPTFLRLYRKSDPKP--SVVRHVPSSIDWRKNGAVTNVKDQGSCGACW 142 Query: 1069 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 893 +FSATGAIEGIN+IV+GSLVSLSEQEL+DCD+ Y NSGC GGLMD AFQF++DN GIDTE Sbjct: 143 SFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDNNGIDTE 202 Query: 892 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSK-EEEIMQAVASQPVSVGICGSERAFQS 716 +DYPYQ +CNK KLKRHVVTIDGYTD+P+ EE++++AVA+QPVSVGI GS R FQ Sbjct: 203 EDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGSGREFQF 262 Query: 715 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 536 YSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+GY+H+ R+ + G+ Sbjct: 263 YSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDHSNSKGL 322 Query: 535 CGINTLASY-----XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCE 371 CGIN LASY KC L + CG GETCCC R++LGIC W+CCE Sbjct: 323 CGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICLSWRCCE 382 Query: 370 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTVNSTM----VKGLERKRG-SFGKLGGW 209 SAVCCKD CCPHDYP+CDT+ C + N TM ++G RK S KL W Sbjct: 383 FTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTMRANEIRGSLRKSSRSKAKLSYW 441