BLASTX nr result
ID: Akebia25_contig00008507
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00008507 (1475 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 598 e-168 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 594 e-167 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 581 e-163 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 578 e-162 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 578 e-162 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 577 e-162 ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas... 575 e-161 gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulga... 567 e-159 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 563 e-158 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 563 e-158 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 562 e-157 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 562 e-157 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 561 e-157 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 561 e-157 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 561 e-157 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 560 e-157 ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun... 554 e-155 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 550 e-154 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 550 e-154 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 549 e-153 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 598 bits (1543), Expect = e-168 Identities = 284/426 (66%), Positives = 331/426 (77%), Gaps = 9/426 (2%) Frame = -1 Query: 1430 FSSFTSD---LFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNA 1260 FSS +S+ LF +WC QHGK Y+S+EEKL+R VF+DN DF+T HN+ NS+YTL+LNA Sbjct: 19 FSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNA 78 Query: 1259 FADLTHHEFKSSRFGLSLAAS---NLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQ 1089 FADLTHHEFK+SR GLS AAS N+DRSN Q+ F + DVP+S+DWRK GAVT VKDQ Sbjct: 79 FADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDF--VADVPASVDWRKNGAVTQVKDQ 136 Query: 1088 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 909 +CGACW+FSATGAIEGIN+IV+GSLVSLSEQELVDCDK+YN+GC GG+MDYAFQFV+DN Sbjct: 137 GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196 Query: 908 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGS 732 GIDTE+DYPYQ + SCNK KLKRHVVTIDGY D+P + E+E+++AVA+QPVSVGICGS Sbjct: 197 HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256 Query: 731 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 552 ERAFQ YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGMDGYMHM RN+ Sbjct: 257 ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316 Query: 551 GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 378 G G+CGIN LASY +C L T+CG GETCCC + GIC WK Sbjct: 317 GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376 Query: 377 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSL 198 CCE++SAVCCKD R CCP DYPVCDT +C K GN+T ++ K S GK W+SL Sbjct: 377 CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKF-AKNSSSGKFRSWSSL 435 Query: 197 FEAWNL 180 E W L Sbjct: 436 LEGWIL 441 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 594 bits (1532), Expect = e-167 Identities = 277/417 (66%), Positives = 330/417 (79%), Gaps = 3/417 (0%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 1248 SS S LF +WC +HGK+Y+S+EE+ +R VFEDN DF+T HN+ NS+Y+LALNAFADL Sbjct: 22 SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81 Query: 1247 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 1068 THHEFK+SR GLS A NL N ++TG V+ D+P+SIDWR KG VT VKDQ SCGACW Sbjct: 82 THHEFKTSRLGLSAAPLNLAHRNLEITG--VVGDIPASIDWRNKGVVTNVKDQGSCGACW 139 Query: 1067 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 888 +FSATGAIEGIN+IV+GSLVSLSEQEL++CDK+YN GCGGGLMDYAFQFV++N GIDTE+ Sbjct: 140 SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199 Query: 887 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQSY 711 DYPY+A+ +CNK+++KR VVTID Y D+P + E++++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259 Query: 710 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 531 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM GYMHM RN+G+ G+C Sbjct: 260 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVC 319 Query: 530 GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 357 GIN LASY KC+LLTYC AGETCCC R+ GIC WKCC ++SA Sbjct: 320 GINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSA 379 Query: 356 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 186 VCCKD CCPHDYPVCDT +CFK GN+T ++ +E K + GK G WNSL EAW Sbjct: 380 VCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK--TSGKFGSWNSLPEAW 434 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 581 bits (1498), Expect = e-163 Identities = 276/418 (66%), Positives = 319/418 (76%), Gaps = 3/418 (0%) Frame = -1 Query: 1424 SFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLT 1245 S S LF +WC QHGK YSSEEEK YR VFE+N F+T HN + NS+Y+LALNAFADLT Sbjct: 24 SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83 Query: 1244 HHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWA 1065 HHEFK+SR GLS AA R N QL G ++RD+P+S+DWR KGAVT VKDQ SCGACW+ Sbjct: 84 HHEFKASRLGLSAAAIEGSRPNLQLPG--LVRDIPASMDWRTKGAVTKVKDQGSCGACWS 141 Query: 1064 FSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKD 885 FSATGAIEGIN+IV+G+LVSLSEQELVDCD++YNSGC GGLMDYA+QFV+DN GID E+D Sbjct: 142 FSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEED 201 Query: 884 YPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYS 708 YPY + +CNK K KR VVTIDGY +P+ E+ ++QAVA QPVSVGICGSERAFQ YS Sbjct: 202 YPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYS 261 Query: 707 KGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 528 KGIF G CS+SLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GY+HM RN+GD G+CG Sbjct: 262 KGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCG 321 Query: 527 INTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 354 IN LASY KC L TYC AGETCCCT R+ GICF WKCCE++SAV Sbjct: 322 INMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAV 381 Query: 353 CCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAWNL 180 CCKD+R CCP+DYPVCDTK C K GN+T ++ E K+ S K W E W L Sbjct: 382 CCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFE-KRHSTRKFSSWRPFVENWVL 438 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 578 bits (1491), Expect = e-162 Identities = 273/420 (65%), Positives = 323/420 (76%), Gaps = 5/420 (1%) Frame = -1 Query: 1430 FSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFAD 1251 + S ++LF +WC QHGK YSSE+EK R +FEDN F+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 1250 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 1074 LTH EFK+S G S A+ + DR N+ + LRDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 1073 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 894 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 893 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 717 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 716 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 537 YS GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 536 ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 363 ICGIN LASY +CSLLTYC AGETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380 Query: 362 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 186 SAVCC DHR+CCP +YP+CD+ C + TGN T + +E +GS K G W+S + W Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE-MRGSSWKFGSWSSFIDVW 439 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 578 bits (1491), Expect = e-162 Identities = 273/412 (66%), Positives = 323/412 (78%), Gaps = 3/412 (0%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 1248 +S S+LF WC +HGK+YSS EEKLYR VF DN +F+T+HNN++NS+YTL+LN++ADL Sbjct: 22 TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81 Query: 1247 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 1068 THHEFK SR G S A N Q S+ RDVP S+DWRKKGAVT VKDQ SCGACW Sbjct: 82 THHEFKVSRLGFSPALRNFRPVLPQEP--SLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139 Query: 1067 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 888 +FSATGA+EGINQI++GSL+SLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDTE Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199 Query: 887 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEE-EIMQAVASQPVSVGICGSERAFQSY 711 DYPYQA+ SC K+KL+R+VVTIDGY D+PS +E +++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259 Query: 710 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 531 SKGIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGKSWGMDGYMHM RN+G+ G+C Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319 Query: 530 GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 357 GIN LASY KCS+LT C AGETCCC ++ LG+C WKCC ++SA Sbjct: 320 GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379 Query: 356 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 201 VCCKD R CCP DYP+CDT LC K T N T + LE + S G G W+S Sbjct: 380 VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS-GSSGTWSS 430 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 577 bits (1486), Expect = e-162 Identities = 272/420 (64%), Positives = 323/420 (76%), Gaps = 5/420 (1%) Frame = -1 Query: 1430 FSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFAD 1251 + S ++LF +WC QHGK YSSE+EK R +FEDN F+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 1250 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 1074 LTH EFK+S G S A+ + DR N+ + LRDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 1073 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 894 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 893 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 717 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 716 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 537 YS GIF G CSTSLDHAVLI+GY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 536 ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 363 ICGIN LASY +CSLLTYC GETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFS 380 Query: 362 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 186 SAVCC DHR+CCP +YP+CD+ C + TGN T + +E +GS K G W+S +AW Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE-MRGSSWKFGSWSSFIDAW 439 >ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] gi|561009128|gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] Length = 428 Score = 575 bits (1482), Expect = e-161 Identities = 278/413 (67%), Positives = 312/413 (75%), Gaps = 4/413 (0%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN---NMENSTYTLALNAF 1257 +S TSDLF WC +H K YSSEEEK YRF VFEDN F++ HN N NSTYTL+LNAF Sbjct: 20 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 79 Query: 1256 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCG 1077 ADLTHHEFK+SR G S + R +Q L PS IDWR+ GAVTPVKDQASCG Sbjct: 80 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRH--LLHNPSQIDWRQSGAVTPVKDQASCG 137 Query: 1076 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 897 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMDYA+QFV+DNKGID Sbjct: 138 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDNKGID 197 Query: 896 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 717 TE DYPYQA+ CNK+KLKRH+VTID Y DLP EEE+++AVASQPVSVGICGSERAFQ Sbjct: 198 TEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEELLKAVASQPVSVGICGSERAFQ 257 Query: 716 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 537 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNTGD G Sbjct: 258 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNTGDPKG 317 Query: 536 ICGINTLASYXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 357 ICGINTLASY +C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 318 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 377 Query: 356 VCCKDHRFCCPHDYPVCDTKNKLCFKGT-GNSTMVKGLERKKGSFGKLGGWNS 201 VCCKD R CCP DYP+CDT+ C K T G +T+ G K K GW S Sbjct: 378 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTITSG---NKDISNKPRGWKS 427 >gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulgaris] Length = 467 Score = 567 bits (1462), Expect = e-159 Identities = 275/413 (66%), Positives = 311/413 (75%), Gaps = 4/413 (0%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN---NMENSTYTLALNAF 1257 +S TSDLF WC +H K YSSEEEK YRF VFEDN F++ HN N NSTYTL+LNAF Sbjct: 59 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 118 Query: 1256 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCG 1077 ADLTHHEFK+SR G S + R +Q L PS IDWR+ GAVTPVKDQASCG Sbjct: 119 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRH--LLHNPSQIDWRQSGAVTPVKDQASCG 176 Query: 1076 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 897 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMD+A+QFV+DNKGID Sbjct: 177 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDNKGID 236 Query: 896 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 717 TE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGSERAFQ Sbjct: 237 TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERAFQ 296 Query: 716 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 537 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WG+DGY+HM RNTGD G Sbjct: 297 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNTGDPKG 356 Query: 536 ICGINTLASYXXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 357 ICGINTLASY +C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 357 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 416 Query: 356 VCCKDHRFCCPHDYPVCDTKNKLCFKGT-GNSTMVKGLERKKGSFGKLGGWNS 201 VCCKD R CCP DYP+CDT+ C K T G +T+ G K K GW S Sbjct: 417 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTITSG---NKDISNKPRGWKS 466 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 563 bits (1452), Expect = e-158 Identities = 271/416 (65%), Positives = 310/416 (74%), Gaps = 7/416 (1%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN-----NMENSTYTLALN 1263 +S TS+LF WC +H K YSSEEEKLYR VFEDN F+ HN N NS+YTL+LN Sbjct: 26 ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85 Query: 1262 AFADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQAS 1083 AFADLTHHEFK++R GL L R +Q + L +PS IDWR+ GAVTPVKDQAS Sbjct: 86 AFADLTHHEFKTTRLGLPLTLLRFKRPQNQQS--RDLLHIPSQIDWRQSGAVTPVKDQAS 143 Query: 1082 CGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKG 903 CGACWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD +YNSGCGGGLMD+A+QFV+DNKG Sbjct: 144 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKG 203 Query: 902 IDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERA 723 IDTE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGSER Sbjct: 204 IDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERE 263 Query: 722 FQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQ 543 FQ YSKGIF G CST LDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RN+G+ Sbjct: 264 FQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNS 323 Query: 542 LGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCE 369 GICGINTLASY +C+L T+C GETCCC + LGICF WKCC Sbjct: 324 KGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCG 383 Query: 368 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 201 + SAVCCKD R CCP DYP+CDT+ C K T N T E + S K GW S Sbjct: 384 LTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSH-KSRGWKS 438 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 563 bits (1451), Expect = e-158 Identities = 266/384 (69%), Positives = 307/384 (79%), Gaps = 4/384 (1%) Frame = -1 Query: 1418 TSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHH 1239 +S LF +WC +HG++YSSEEE+LYR +VFEDNL F+T HNNM NS+YTL+LNAFADLTHH Sbjct: 26 SSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHH 85 Query: 1238 EFKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAF 1062 EFKSSR G S A S+L + S+L LRDVP+S+DWRKKGAVT VKDQ SCGACWAF Sbjct: 86 EFKSSRLGFSSALLSSLPKLGSKLLD---LRDVPASLDWRKKGAVTNVKDQGSCGACWAF 142 Query: 1061 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 882 SATGAIEGIN+IV+GSLVSLSEQEL+DCD +YN+GC GGLMDYA+QFV+DN GIDTE+DY Sbjct: 143 SATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDY 202 Query: 881 PYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSYSK 705 PYQA+ SC K KLKR VVTIDGYTD+ P+ +++QAV +QPVSVGICGSERAFQ YSK Sbjct: 203 PYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSK 262 Query: 704 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 525 GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWGK WGMDGY+HM RNTG+ G+CGI Sbjct: 263 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGI 322 Query: 524 NTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 351 N LASY +CS CG GETCCC+ R LG+CF WKCC +NSAVC Sbjct: 323 NMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVC 382 Query: 350 CKDHRFCCPHDYPVCDTKNKLCFK 279 CKD CCP DYP+CDT+ +C K Sbjct: 383 CKDKIHCCPQDYPLCDTQRNVCLK 406 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 562 bits (1448), Expect = e-157 Identities = 270/415 (65%), Positives = 315/415 (75%), Gaps = 5/415 (1%) Frame = -1 Query: 1415 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 1236 ++LF+ WC +HGK Y SEEE+ +R +F DN DF+T HN++ NSTY+L+LNAFADLTHHE Sbjct: 34 AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHE 93 Query: 1235 FKSSRFGLSLAASNLDRSNSQLTGFS--VLRDVPSSIDWRKKGAVTPVKDQASCGACWAF 1062 FK+SR GLS + +L + Q G S V VP S+DWRKKGAVT VKDQ SCGACW+F Sbjct: 94 FKASRLGLSAPSPSL-MAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 152 Query: 1061 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 882 SATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDY Sbjct: 153 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 212 Query: 881 PYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK 705 PYQ + +C K+KLK+ VVTID Y + S E+ +M+AVASQPVSVGICGSERAFQ YS Sbjct: 213 PYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSS 272 Query: 704 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 525 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ G+CGI Sbjct: 273 GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGI 332 Query: 524 NTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 351 N LASY KC+L TYC +GETCCC R L G+CF WKCCE+ SAVC Sbjct: 333 NMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVC 392 Query: 350 CKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 186 CKD R CCP DYPVCDT LC K TGN T +K KK S KLG FE W Sbjct: 393 CKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPF-WKKNSSNKLG----RFEEW 442 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 562 bits (1448), Expect = e-157 Identities = 266/409 (65%), Positives = 315/409 (77%), Gaps = 6/409 (1%) Frame = -1 Query: 1415 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 1236 S+LF+ WC +HGK Y SEEE+ R +F+DN DF+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 1235 FKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1059 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1058 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 879 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 878 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 702 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS+G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266 Query: 701 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 522 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 521 TLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 348 LASY KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 347 KDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 207 KD R CCPHDYPVCDT LC K TGN T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 561 bits (1447), Expect = e-157 Identities = 266/416 (63%), Positives = 325/416 (78%), Gaps = 8/416 (1%) Frame = -1 Query: 1433 CFSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFA 1254 C S SDLF +WC Q+GK YSSE+E++YRF VFE+N +IT HN+ ENS+YTL LNA++ Sbjct: 20 CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79 Query: 1253 DLTHHEFKSSRFGLSLAASNLDRSNSQLTGFS---VLRDV--PSSIDWRKKGAVTPVKDQ 1089 DLTHHEF++S GLS +A++ R + +G S VL DV PSS+DWR+KGAVT VK+Q Sbjct: 80 DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVTDVKNQ 139 Query: 1088 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 909 SCGACW+FSATGA+EGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ N Sbjct: 140 GSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIKN 199 Query: 908 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGS 732 GIDTEKDYP++ + +CNKNKL+RHVVTIDGYTD+P +E+ +++AVA+QPVSVGICGS Sbjct: 200 GGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICGS 259 Query: 731 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 552 RAFQSYSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWG SWG++GY+HM RN+ Sbjct: 260 ARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRNS 319 Query: 551 GDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWK 378 G+Q GICGIN LASY KCS+ T CG GETCCC + LGIC WK Sbjct: 320 GNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSWK 379 Query: 377 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGG 210 CC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K+ GK GG Sbjct: 380 CCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQ-QPQKEAFTGKFGG 434 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 561 bits (1446), Expect = e-157 Identities = 266/409 (65%), Positives = 314/409 (76%), Gaps = 6/409 (1%) Frame = -1 Query: 1415 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 1236 S+LF+ WC +HGK Y SEEE+ R +F+DN DF+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 1235 FKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1059 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1058 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 879 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 878 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 702 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSG 266 Query: 701 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 522 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 521 TLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 348 LASY KC+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 347 KDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 207 KD R CCPHDYPVCDT LC K TGN T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 561 bits (1445), Expect = e-157 Identities = 263/388 (67%), Positives = 313/388 (80%), Gaps = 5/388 (1%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 1248 SS S LF SW +HGK Y+S+E+KLYRF +FE+N +F+ HN+ NS+YTL+LNAFADL Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84 Query: 1247 THHEFKSSRFGLSLAASN--LDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 1074 THHEFK+SR GLS +++ L R N L F + DVP SIDWRKKGAV+ VKDQ +CGA Sbjct: 85 THHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGA 142 Query: 1073 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 894 CW+FSATGAIEGIN+IV+GSLVSLSEQELVDCD++YN+GC GGLMDYA+QFV++N GIDT Sbjct: 143 CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDT 202 Query: 893 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 717 E+DYPYQA+ +CNK KLKRHVVTIDGYTD+P + E+E+++AVA+QPVSVGICGSERAFQ Sbjct: 203 EEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQ 262 Query: 716 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 537 YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WG++GYM+M RN+G+ G Sbjct: 263 LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322 Query: 536 ICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 363 +CGIN LAS+ KC L T CG GETCCCTRR+ G+CF WKCCE++ Sbjct: 323 LCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELD 382 Query: 362 SAVCCKDHRFCCPHDYPVCDTKNKLCFK 279 SAVCCKD CCPHDYPVCDTK +C K Sbjct: 383 SAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 560 bits (1443), Expect = e-157 Identities = 267/411 (64%), Positives = 315/411 (76%), Gaps = 8/411 (1%) Frame = -1 Query: 1415 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 1236 S+LF+ WC +HGK Y SEEE+ R +F+DN DF+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 1235 FKSSRFGLSLAASNLDR-SNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 1059 FK+SR GLS++AS+L S Q G + VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSASSLIMASKGQSLGGNA--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 1058 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 879 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 878 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK- 705 YQ + +C K+KLK+ VVTID Y + S +E+ + +AVA+QPVSVGICGSERAFQ YS+ Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266 Query: 704 -GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 528 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ GICG Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICG 326 Query: 527 INTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 354 IN LASY KC+L TYC AGETCCC R L G+CF WKCCE+ SAV Sbjct: 327 INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAV 386 Query: 353 CCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 207 CC D R CCPHDYPVCDT LC K TGN T +K +K S G+ GW Sbjct: 387 CCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSSNKLGRFEGW 437 >ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] gi|462420299|gb|EMJ24562.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] Length = 451 Score = 554 bits (1427), Expect = e-155 Identities = 273/430 (63%), Positives = 323/430 (75%), Gaps = 14/430 (3%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 1248 S TS+LF WC Q+GK+YSS +EKLYR SVFEDNL F+T HN+M NS+YTL+LN F+DL Sbjct: 26 SQTTSELFEVWCKQYGKSYSSAQEKLYRLSVFEDNLAFVTQHNDMGNSSYTLSLNDFSDL 85 Query: 1247 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 1068 THHEFKSSR G S + +L + + SV+RD+PSS+DWRKKGAVT VKDQ SCGACW Sbjct: 86 THHEFKSSRLGFSPSFLSLKLKSDRKP--SVVRDLPSSLDWRKKGAVTNVKDQGSCGACW 143 Query: 1067 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 891 AFS TGAIEGIN+IV+GSL+SLSEQELVDCD+ Y N+GC GGLMD AF+FV+DN GIDTE Sbjct: 144 AFSTTGAIEGINKIVTGSLISLSEQELVDCDRVYPNNGCNGGLMDDAFRFVIDNNGIDTE 203 Query: 890 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQS 714 +DYPY+ +C K KLKR+ VTID YTD+PS +EE ++QAVASQPVSVGI GS+ FQ Sbjct: 204 EDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVASQPVSVGISGSDMGFQL 263 Query: 713 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 534 YSKGIFNG CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GYMHM R+ + GI Sbjct: 264 YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGMNGYMHMLRDHSNPKGI 323 Query: 533 CGINTLASY-XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 357 CGINTLASY +C + T+C AGETCCC +R++GICF W+CCE++SA Sbjct: 324 CGINTLASYPIKTGENPPLPPPGPTRCDIFTHCAAGETCCCAKRVVGICFSWRCCELDSA 383 Query: 356 VCCKDHRFCCPHDYPVCDTKNKLCFK----------GTGNSTMVKGLERKKGSFGKLG-G 210 VCCKD R CCP DYP+CDT+ LC + TGN T K LE +GS K G G Sbjct: 384 VCCKDQRHCCPRDYPICDTERTLCLQSNEQLSTQSHATGNLTS-KALE-SRGSLRKSGRG 441 Query: 209 WNSLFEAWNL 180 W S+ W L Sbjct: 442 WGSMIRDWIL 451 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 550 bits (1418), Expect = e-154 Identities = 267/412 (64%), Positives = 306/412 (74%), Gaps = 6/412 (1%) Frame = -1 Query: 1418 TSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHH 1239 TS LF WC QHGK Y SE+EK YRF+VFEDN F+ HN + NS+YTL+LNAFADLTHH Sbjct: 26 TSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHH 85 Query: 1238 EFKSSRFGL---SLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 1068 EFK++R GL SL +R Q L+ VPS IDWRK GAV+ VKDQ SCGACW Sbjct: 86 EFKATRLGLPPSSLLRFKFNRFQDQQRSDDFLQ-VPSEIDWRKNGAVSIVKDQGSCGACW 144 Query: 1067 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 888 +FSATGAIEGIN+IV+GSLVSLSEQELVDCD TYNSGC GGLMDYA+QF++DN GIDTE+ Sbjct: 145 SFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEE 204 Query: 887 DYPYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSY 711 DYPYQA+ C K+KLKR VVTIDGYTD+ P+ E+++++AVA QPVSVGICGS RAFQ Y Sbjct: 205 DYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLY 264 Query: 710 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 531 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNT G+C Sbjct: 265 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLC 324 Query: 530 GINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 357 GIN LASY KC+L TYC GETCCC ++ LGICF WKCC V SA Sbjct: 325 GINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSA 384 Query: 356 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 201 VCCKD R CCP DYPVCD N C K N T++ + K+ F + W S Sbjct: 385 VCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSD-KEDPFHQTRDWRS 435 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 550 bits (1418), Expect = e-154 Identities = 263/417 (63%), Positives = 319/417 (76%), Gaps = 8/417 (1%) Frame = -1 Query: 1436 ICFSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAF 1257 +C S SDLF +WC Q+GK YSSE+E++YRF VFE+N +IT HN+ NS+YTL LNA+ Sbjct: 19 LCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAY 78 Query: 1256 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFS---VLRDV--PSSIDWRKKGAVTPVKD 1092 +DLTHHEF++S GLS +A++ R + +G S VL DV PSS+DWR KGAVT VK+ Sbjct: 79 SDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVTNVKN 138 Query: 1091 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 912 Q SCGACW+FSATGAIEGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ Sbjct: 139 QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198 Query: 911 NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICG 735 N GIDTEKDYP++ K +CNKNKL+R VVTIDGYTD+P +E+ +++AVA+QPVSVGICG Sbjct: 199 NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258 Query: 734 SERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARN 555 S RAFQSYSKGIF G C T LDHAVLIVGYGSENG DYWI+KNSWG SWG++GY+HM RN Sbjct: 259 SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318 Query: 554 TGDQLGICGINTLASY--XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHW 381 +G+Q GICG+N LASY KCS T CG GETCCC + LGIC W Sbjct: 319 SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378 Query: 380 KCCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGG 210 KCC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K+ GK GG Sbjct: 379 KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQ-QPQKEPFTGKFGG 434 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 549 bits (1415), Expect = e-153 Identities = 266/419 (63%), Positives = 316/419 (75%), Gaps = 12/419 (2%) Frame = -1 Query: 1427 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 1248 SS +S+LF +WC Q+GK+YSS+EEKLYR S+FE NL FIT HN++ NS+YTL+LN+F+DL Sbjct: 25 SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84 Query: 1247 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 1068 THHEFK+SR G S L R + SV+R VPSSIDWRK GAVT VKDQ SCGACW Sbjct: 85 THHEFKASRLGFSPTFLRLYRKSDPKP--SVVRHVPSSIDWRKNGAVTNVKDQGSCGACW 142 Query: 1067 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 891 +FSATGAIEGIN+IV+GSLVSLSEQEL+DCD+ Y NSGC GGLMD AFQF++DN GIDTE Sbjct: 143 SFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDNNGIDTE 202 Query: 890 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSK-EEEIMQAVASQPVSVGICGSERAFQS 714 +DYPYQ +CNK KLKRHVVTIDGYTD+P+ EE++++AVA+QPVSVGI GS R FQ Sbjct: 203 EDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGSGREFQF 262 Query: 713 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 534 YSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+GY+H+ R+ + G+ Sbjct: 263 YSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDHSNSKGL 322 Query: 533 CGINTLASY-----XXXXXXXXXXXXXXXKCSLLTYCGAGETCCCTRRLLGICFHWKCCE 369 CGIN LASY KC L + CG GETCCC R++LGIC W+CCE Sbjct: 323 CGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICLSWRCCE 382 Query: 368 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTM----VKGLERKKG-SFGKLGGW 207 SAVCCKD CCPHDYP+CDT+ C + GN TM ++G RK S KL W Sbjct: 383 FTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTMRANEIRGSLRKSSRSKAKLSYW 441