BLASTX nr result
ID: Akebia24_contig00008837
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia24_contig00008837 (1437 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 598 e-168 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 594 e-167 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 581 e-163 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 578 e-162 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 578 e-162 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 577 e-162 ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phas... 575 e-161 gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulga... 567 e-159 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 563 e-158 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 563 e-158 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 562 e-157 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 562 e-157 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 561 e-157 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 561 e-157 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 561 e-157 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 560 e-157 ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prun... 554 e-155 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 550 e-154 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 550 e-154 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 549 e-153 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 598 bits (1543), Expect = e-168 Identities = 284/426 (66%), Positives = 330/426 (77%), Gaps = 9/426 (2%) Frame = +3 Query: 45 FSSFTSD---LFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNA 215 FSS +S+ LF +WC QHGK Y+S+EEKL+R VF+DN DF+T HN+ NS+YTL+LNA Sbjct: 19 FSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNA 78 Query: 216 FADLTHHEFKSSRFGLSLAAS---NLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQ 386 FADLTHHEFK+SR GLS AAS N+DRSN Q+ F + DVP+S+DWRK GAVT VKDQ Sbjct: 79 FADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDF--VADVPASVDWRKNGAVTQVKDQ 136 Query: 387 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 566 +CGACW+FSATGAIEGIN+IV+GSLVSLSEQELVDCDK+YN+GC GG+MDYAFQFV+DN Sbjct: 137 GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN 196 Query: 567 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGS 743 GIDTE+DYPYQ + SCNK KLKRHVVTIDGY D+P + E+E+++AVA+QPVSVGICGS Sbjct: 197 HGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256 Query: 744 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 923 ERAFQ YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGMDGYMHM RN+ Sbjct: 257 ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316 Query: 924 GDQLGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWK 1097 G G+CGIN LASY C L T+CG GETCCC + GIC WK Sbjct: 317 GSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWK 376 Query: 1098 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSL 1277 CCE++SAVCCKD R CCP DYPVCDT +C K GN+T ++ K S GK W+SL Sbjct: 377 CCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKF-AKNSSSGKFRSWSSL 435 Query: 1278 FEAWNL 1295 E W L Sbjct: 436 LEGWIL 441 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 594 bits (1532), Expect = e-167 Identities = 276/417 (66%), Positives = 329/417 (78%), Gaps = 3/417 (0%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227 SS S LF +WC +HGK+Y+S+EE+ +R VFEDN DF+T HN+ NS+Y+LALNAFADL Sbjct: 22 SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADL 81 Query: 228 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407 THHEFK+SR GLS A NL N ++TG V+ D+P+SIDWR KG VT VKDQ SCGACW Sbjct: 82 THHEFKTSRLGLSAAPLNLAHRNLEITG--VVGDIPASIDWRNKGVVTNVKDQGSCGACW 139 Query: 408 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 587 +FSATGAIEGIN+IV+GSLVSLSEQEL++CDK+YN GCGGGLMDYAFQFV++N GIDTE+ Sbjct: 140 SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199 Query: 588 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQSY 764 DYPY+A+ +CNK+++KR VVTID Y D+P + E++++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259 Query: 765 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 944 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM GYMHM RN+G+ G+C Sbjct: 260 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVC 319 Query: 945 GINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118 GIN LASY C+LLTYC AGETCCC R+ GIC WKCC ++SA Sbjct: 320 GINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSA 379 Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289 VCCKD CCPHDYPVCDT +CFK GN+T ++ +E K + GK G WNSL EAW Sbjct: 380 VCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGK--TSGKFGSWNSLPEAW 434 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 581 bits (1498), Expect = e-163 Identities = 275/418 (65%), Positives = 318/418 (76%), Gaps = 3/418 (0%) Frame = +3 Query: 51 SFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLT 230 S S LF +WC QHGK YSSEEEK YR VFE+N F+T HN + NS+Y+LALNAFADLT Sbjct: 24 SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83 Query: 231 HHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWA 410 HHEFK+SR GLS AA R N QL G ++RD+P+S+DWR KGAVT VKDQ SCGACW+ Sbjct: 84 HHEFKASRLGLSAAAIEGSRPNLQLPG--LVRDIPASMDWRTKGAVTKVKDQGSCGACWS 141 Query: 411 FSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKD 590 FSATGAIEGIN+IV+G+LVSLSEQELVDCD++YNSGC GGLMDYA+QFV+DN GID E+D Sbjct: 142 FSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEED 201 Query: 591 YPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYS 767 YPY + +CNK K KR VVTIDGY +P+ E+ ++QAVA QPVSVGICGSERAFQ YS Sbjct: 202 YPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYS 261 Query: 768 KGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 947 KGIF G CS+SLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GY+HM RN+GD G+CG Sbjct: 262 KGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCG 321 Query: 948 INTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1121 IN LASY C L TYC AGETCCCT R+ GICF WKCCE++SAV Sbjct: 322 INMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAV 381 Query: 1122 CCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAWNL 1295 CCKD+R CCP+DYPVCDTK C K GN+T ++ E K+ S K W E W L Sbjct: 382 CCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFE-KRHSTRKFSSWRPFVENWVL 438 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 578 bits (1491), Expect = e-162 Identities = 273/420 (65%), Positives = 322/420 (76%), Gaps = 5/420 (1%) Frame = +3 Query: 45 FSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFAD 224 + S ++LF +WC QHGK YSSE+EK R +FEDN F+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 225 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 401 LTH EFK+S G S A+ + DR N+ + LRDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 402 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 581 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 582 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 758 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 759 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938 YS GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 939 ICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1112 ICGIN LASY CSLLTYC AGETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFS 380 Query: 1113 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289 SAVCC DHR+CCP +YP+CD+ C + TGN T + +E +GS K G W+S + W Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIE-MRGSSWKFGSWSSFIDVW 439 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 578 bits (1491), Expect = e-162 Identities = 272/412 (66%), Positives = 322/412 (78%), Gaps = 3/412 (0%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227 +S S+LF WC +HGK+YSS EEKLYR VF DN +F+T+HNN++NS+YTL+LN++ADL Sbjct: 22 TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81 Query: 228 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407 THHEFK SR G S A N Q S+ RDVP S+DWRKKGAVT VKDQ SCGACW Sbjct: 82 THHEFKVSRLGFSPALRNFRPVLPQEP--SLPRDVPDSLDWRKKGAVTAVKDQGSCGACW 139 Query: 408 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 587 +FSATGA+EGINQI++GSL+SLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDTE Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199 Query: 588 DYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEE-EIMQAVASQPVSVGICGSERAFQSY 764 DYPYQA+ SC K+KL+R+VVTIDGY D+PS +E +++QAVA+QPVSVGICGSERAFQ Y Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259 Query: 765 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 944 SKGIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGKSWGMDGYMHM RN+G+ G+C Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVC 319 Query: 945 GINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118 GIN LASY CS+LT C AGETCCC ++ LG+C WKCC ++SA Sbjct: 320 GINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSA 379 Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 1274 VCCKD R CCP DYP+CDT LC K T N T + LE + S G G W+S Sbjct: 380 VCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS-GSSGTWSS 430 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 577 bits (1486), Expect = e-162 Identities = 272/420 (64%), Positives = 322/420 (76%), Gaps = 5/420 (1%) Frame = +3 Query: 45 FSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFAD 224 + S ++LF +WC QHGK YSSE+EK R +FEDN F+T HNNM NS++TL+LNAFAD Sbjct: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80 Query: 225 LTHHEFKSSRFGLSLAASNLDRS-NSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 401 LTH EFK+S G S A+ + DR N+ + LRDVP+SIDWRKKGAVT VKDQASCGA Sbjct: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140 Query: 402 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 581 CWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD++YNSGCGGGLMDYA+QFV+ N GIDT Sbjct: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200 Query: 582 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 758 EKDYPY+ + CNK KL RH+VTIDGY D+P + E++++QAV +QPVSVGICGSERAFQ Sbjct: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260 Query: 759 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938 YS GIF G CSTSLDHAVLI+GY SENGVDYWI+KNSWG+SWGM+GYMHM RNTG+ LG Sbjct: 261 LYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320 Query: 939 ICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1112 ICGIN LASY CSLLTYC GETCCC +LGIC WKCC + Sbjct: 321 ICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFS 380 Query: 1113 SAVCCKDHRFCCPHDYPVCDTKNKLCF-KGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289 SAVCC DHR+CCP +YP+CD+ C + TGN T + +E +GS K G W+S +AW Sbjct: 381 SAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIE-MRGSSWKFGSWSSFIDAW 439 >ref|XP_007136041.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] gi|561009128|gb|ESW08035.1| hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] Length = 428 Score = 575 bits (1482), Expect = e-161 Identities = 278/413 (67%), Positives = 311/413 (75%), Gaps = 4/413 (0%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN---NMENSTYTLALNAF 218 +S TSDLF WC +H K YSSEEEK YRF VFEDN F++ HN N NSTYTL+LNAF Sbjct: 20 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 79 Query: 219 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCG 398 ADLTHHEFK+SR G S + R +Q L PS IDWR+ GAVTPVKDQASCG Sbjct: 80 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRH--LLHNPSQIDWRQSGAVTPVKDQASCG 137 Query: 399 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 578 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMDYA+QFV+DNKGID Sbjct: 138 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDNKGID 197 Query: 579 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 758 TE DYPYQA+ CNK+KLKRH+VTID Y DLP EEE+++AVASQPVSVGICGSERAFQ Sbjct: 198 TEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEELLKAVASQPVSVGICGSERAFQ 257 Query: 759 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNTGD G Sbjct: 258 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNTGDPKG 317 Query: 939 ICGINTLASYXXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118 ICGINTLASY C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 318 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 377 Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGT-GNSTMVKGLERKKGSFGKLGGWNS 1274 VCCKD R CCP DYP+CDT+ C K T G +T+ G K K GW S Sbjct: 378 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTITSG---NKDISNKPRGWKS 427 >gb|AGV54418.1| oryzain alpha chain-like protein [Phaseolus vulgaris] Length = 467 Score = 567 bits (1462), Expect = e-159 Identities = 275/413 (66%), Positives = 310/413 (75%), Gaps = 4/413 (0%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN---NMENSTYTLALNAF 218 +S TSDLF WC +H K YSSEEEK YRF VFEDN F++ HN N NSTYTL+LNAF Sbjct: 59 ASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLNAF 118 Query: 219 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCG 398 ADLTHHEFK+SR G S + R +Q L PS IDWR+ GAVTPVKDQASCG Sbjct: 119 ADLTHHEFKTSRLGFSPSLLRFKRVQNQQPRH--LLHNPSQIDWRQSGAVTPVKDQASCG 176 Query: 399 ACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGID 578 ACWAFSATGAIEGIN+IV+GSL SLSEQELVDCD +YNSGC GGLMD+A+QFV+DNKGID Sbjct: 177 ACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDNKGID 236 Query: 579 TEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERAFQ 758 TE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGSERAFQ Sbjct: 237 TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERAFQ 296 Query: 759 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938 YS+GIF+G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WG+DGY+HM RNTGD G Sbjct: 297 LYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNTGDPKG 356 Query: 939 ICGINTLASYXXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118 ICGINTLASY C+L T+C GETCCC + LGICF WKCC + SA Sbjct: 357 ICGINTLASYPIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSA 416 Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGT-GNSTMVKGLERKKGSFGKLGGWNS 1274 VCCKD R CCP DYP+CDT+ C K T G +T+ G K K GW S Sbjct: 417 VCCKDKRHCCPRDYPICDTEKSQCLKITNGTTTITSG---NKDISNKPRGWKS 466 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 563 bits (1452), Expect = e-158 Identities = 271/416 (65%), Positives = 309/416 (74%), Gaps = 7/416 (1%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHN-----NMENSTYTLALN 212 +S TS+LF WC +H K YSSEEEKLYR VFEDN F+ HN N NS+YTL+LN Sbjct: 26 ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85 Query: 213 AFADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQAS 392 AFADLTHHEFK++R GL L R +Q + L +PS IDWR+ GAVTPVKDQAS Sbjct: 86 AFADLTHHEFKTTRLGLPLTLLRFKRPQNQQS--RDLLHIPSQIDWRQSGAVTPVKDQAS 143 Query: 393 CGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKG 572 CGACWAFSATGAIEGIN+IV+GSLVSLSEQEL+DCD +YNSGCGGGLMD+A+QFV+DNKG Sbjct: 144 CGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKG 203 Query: 573 IDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEEIMQAVASQPVSVGICGSERA 752 IDTE DYPYQA+ SC+K+KLKR VTI+ Y D+P EEEI++AVASQPVSVGICGSER Sbjct: 204 IDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSERE 263 Query: 753 FQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQ 932 FQ YSKGIF G CST LDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RN+G+ Sbjct: 264 FQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNS 323 Query: 933 LGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCE 1106 GICGINTLASY C+L T+C GETCCC + LGICF WKCC Sbjct: 324 KGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCG 383 Query: 1107 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 1274 + SAVCCKD R CCP DYP+CDT+ C K T N T E + S K GW S Sbjct: 384 LTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSH-KSRGWKS 438 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 563 bits (1451), Expect = e-158 Identities = 266/384 (69%), Positives = 306/384 (79%), Gaps = 4/384 (1%) Frame = +3 Query: 57 TSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHH 236 +S LF +WC +HG++YSSEEE+LYR +VFEDNL F+T HNNM NS+YTL+LNAFADLTHH Sbjct: 26 SSQLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHH 85 Query: 237 EFKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAF 413 EFKSSR G S A S+L + S+L LRDVP+S+DWRKKGAVT VKDQ SCGACWAF Sbjct: 86 EFKSSRLGFSSALLSSLPKLGSKLLD---LRDVPASLDWRKKGAVTNVKDQGSCGACWAF 142 Query: 414 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 593 SATGAIEGIN+IV+GSLVSLSEQEL+DCD +YN+GC GGLMDYA+QFV+DN GIDTE+DY Sbjct: 143 SATGAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDY 202 Query: 594 PYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSYSK 770 PYQA+ SC K KLKR VVTIDGYTD+ P+ +++QAV +QPVSVGICGSERAFQ YSK Sbjct: 203 PYQARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSK 262 Query: 771 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 950 GIF G CSTSLDHAVLIVGY SENGVDYWI+KNSWGK WGMDGY+HM RNTG+ G+CGI Sbjct: 263 GIFTGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGI 322 Query: 951 NTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1124 N LASY CS CG GETCCC+ R LG+CF WKCC +NSAVC Sbjct: 323 NMLASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVC 382 Query: 1125 CKDHRFCCPHDYPVCDTKNKLCFK 1196 CKD CCP DYP+CDT+ +C K Sbjct: 383 CKDKIHCCPQDYPLCDTQRNVCLK 406 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 562 bits (1448), Expect = e-157 Identities = 269/415 (64%), Positives = 314/415 (75%), Gaps = 5/415 (1%) Frame = +3 Query: 60 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239 ++LF+ WC +HGK Y SEEE+ +R +F DN DF+T HN++ NSTY+L+LNAFADLTHHE Sbjct: 34 AELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHE 93 Query: 240 FKSSRFGLSLAASNLDRSNSQLTGFS--VLRDVPSSIDWRKKGAVTPVKDQASCGACWAF 413 FK+SR GLS + +L + Q G S V VP S+DWRKKGAVT VKDQ SCGACW+F Sbjct: 94 FKASRLGLSAPSPSL-MAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 152 Query: 414 SATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDY 593 SATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDY Sbjct: 153 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 212 Query: 594 PYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK 770 PYQ + +C K+KLK+ VVTID Y + S E+ +M+AVASQPVSVGICGSERAFQ YS Sbjct: 213 PYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSS 272 Query: 771 GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGI 950 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ G+CGI Sbjct: 273 GIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGI 332 Query: 951 NTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVC 1124 N LASY C+L TYC +GETCCC R L G+CF WKCCE+ SAVC Sbjct: 333 NMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVC 392 Query: 1125 CKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNSLFEAW 1289 CKD R CCP DYPVCDT LC K TGN T +K KK S KLG FE W Sbjct: 393 CKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPF-WKKNSSNKLG----RFEEW 442 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 562 bits (1448), Expect = e-157 Identities = 265/409 (64%), Positives = 314/409 (76%), Gaps = 6/409 (1%) Frame = +3 Query: 60 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239 S+LF+ WC +HGK Y SEEE+ R +F+DN DF+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 240 FKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 416 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 417 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 596 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 597 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 773 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS+G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRG 266 Query: 774 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 953 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 954 TLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1127 LASY C+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 1128 KDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 1268 KD R CCPHDYPVCDT LC K TGN T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 561 bits (1447), Expect = e-157 Identities = 265/416 (63%), Positives = 324/416 (77%), Gaps = 8/416 (1%) Frame = +3 Query: 42 CFSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFA 221 C S SDLF +WC Q+GK YSSE+E++YRF VFE+N +IT HN+ ENS+YTL LNA++ Sbjct: 20 CTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYITEHNSKENSSYTLGLNAYS 79 Query: 222 DLTHHEFKSSRFGLSLAASNLDRSNSQLTGFS---VLRDV--PSSIDWRKKGAVTPVKDQ 386 DLTHHEF++S GLS +A++ R + +G S VL DV PSS+DWR+KGAVT VK+Q Sbjct: 80 DLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVDAPSSLDWREKGAVTDVKNQ 139 Query: 387 ASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDN 566 SCGACW+FSATGA+EGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ N Sbjct: 140 GSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSYNEGCGGGLMDYAFEFVIKN 199 Query: 567 KGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGS 743 GIDTEKDYP++ + +CNKNKL+RHVVTIDGYTD+P +E+ +++AVA+QPVSVGICGS Sbjct: 200 GGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICGS 259 Query: 744 ERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNT 923 RAFQSYSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWG SWG++GY+HM RN+ Sbjct: 260 ARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYIHMQRNS 319 Query: 924 GDQLGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWK 1097 G+Q GICGIN LASY CS+ T CG GETCCC + LGIC WK Sbjct: 320 GNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCGQGETCCCGSKFLGICLSWK 379 Query: 1098 CCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGG 1265 CC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K+ GK GG Sbjct: 380 CCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIVQ-QPQKEAFTGKFGG 434 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 561 bits (1446), Expect = e-157 Identities = 265/409 (64%), Positives = 313/409 (76%), Gaps = 6/409 (1%) Frame = +3 Query: 60 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239 S+LF+ WC +HGK Y SEEE+ R +F+DN DF+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 240 FKSSRFGLSLAA-SNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 416 FK+SR GLS++A S + S Q G SV VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 417 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 596 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 597 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSKG 773 YQ + +C K+KLK+ VVTID Y + S +E+ +M+AVA+QPVSVGICGSERAFQ YS G Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSG 266 Query: 774 IFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICGIN 953 IF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNT + G+CGIN Sbjct: 267 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGIN 326 Query: 954 TLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAVCC 1127 LASY C+L TYC +GETCCC R L G+CF WKCCE+ SAVCC Sbjct: 327 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCC 386 Query: 1128 KDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 1268 KD R CCPHDYPVCDT LC K TGN T +K +K S G+ W Sbjct: 387 KDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSKQLGRFEEW 435 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 561 bits (1445), Expect = e-157 Identities = 262/388 (67%), Positives = 312/388 (80%), Gaps = 5/388 (1%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227 SS S LF SW +HGK Y+S+E+KLYRF +FE+N +F+ HN+ NS+YTL+LNAFADL Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84 Query: 228 THHEFKSSRFGLSLAASN--LDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGA 401 THHEFK+SR GLS +++ L R N L F + DVP SIDWRKKGAV+ VKDQ +CGA Sbjct: 85 THHEFKASRLGLSAFSTSGKLSRRNFPLHDF--VGDVPISIDWRKKGAVSQVKDQGNCGA 142 Query: 402 CWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDT 581 CW+FSATGAIEGIN+IV+GSLVSLSEQELVDCD++YN+GC GGLMDYA+QFV++N GIDT Sbjct: 143 CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDT 202 Query: 582 EKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLP-SKEEEIMQAVASQPVSVGICGSERAFQ 758 E+DYPYQA+ +CNK KLKRHVVTIDGYTD+P + E+E+++AVA+QPVSVGICGSERAFQ Sbjct: 203 EEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQ 262 Query: 759 SYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLG 938 YSKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WG++GYM+M RN+G+ G Sbjct: 263 LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322 Query: 939 ICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVN 1112 +CGIN LAS+ C L T CG GETCCCTRR+ G+CF WKCCE++ Sbjct: 323 LCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELD 382 Query: 1113 SAVCCKDHRFCCPHDYPVCDTKNKLCFK 1196 SAVCCKD CCPHDYPVCDTK +C K Sbjct: 383 SAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 560 bits (1443), Expect = e-157 Identities = 266/411 (64%), Positives = 314/411 (76%), Gaps = 8/411 (1%) Frame = +3 Query: 60 SDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHHE 239 S+LF+ WC +HGK Y SEEE+ R +F+DN DF+T HN + N+TY+L+LNAFADLTHHE Sbjct: 29 SELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHE 88 Query: 240 FKSSRFGLSLAASNLDR-SNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACWAFS 416 FK+SR GLS++AS+L S Q G + VP S+DWRKKGAVT VKDQ SCGACW+FS Sbjct: 89 FKASRLGLSVSASSLIMASKGQSLGGNA--KVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146 Query: 417 ATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEKDYP 596 ATGA+EGINQIV+G L+SLSEQEL+DCDK+YN+GC GGLMDYAF+FV+ N GIDTEKDYP Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206 Query: 597 YQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQSYSK- 770 YQ + +C K+KLK+ VVTID Y + S +E+ + +AVA+QPVSVGICGSERAFQ YS+ Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266 Query: 771 -GIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGICG 947 GIF+G CSTSLDHAVLIVGYGS+NGVDYWI+KNSWGKSWGMDG+MHM RNTG+ GICG Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICG 326 Query: 948 INTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSAV 1121 IN LASY C+L TYC AGETCCC R L G+CF WKCCE+ SAV Sbjct: 327 INMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAV 386 Query: 1122 CCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGS--FGKLGGW 1268 CC D R CCPHDYPVCDT LC K TGN T +K +K S G+ GW Sbjct: 387 CCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSSNKLGRFEGW 437 >ref|XP_007223363.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] gi|462420299|gb|EMJ24562.1| hypothetical protein PRUPE_ppa005615mg [Prunus persica] Length = 451 Score = 554 bits (1427), Expect = e-155 Identities = 273/430 (63%), Positives = 322/430 (74%), Gaps = 14/430 (3%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227 S TS+LF WC Q+GK+YSS +EKLYR SVFEDNL F+T HN+M NS+YTL+LN F+DL Sbjct: 26 SQTTSELFEVWCKQYGKSYSSAQEKLYRLSVFEDNLAFVTQHNDMGNSSYTLSLNDFSDL 85 Query: 228 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407 THHEFKSSR G S + +L + + SV+RD+PSS+DWRKKGAVT VKDQ SCGACW Sbjct: 86 THHEFKSSRLGFSPSFLSLKLKSDRKP--SVVRDLPSSLDWRKKGAVTNVKDQGSCGACW 143 Query: 408 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 584 AFS TGAIEGIN+IV+GSL+SLSEQELVDCD+ Y N+GC GGLMD AF+FV+DN GIDTE Sbjct: 144 AFSTTGAIEGINKIVTGSLISLSEQELVDCDRVYPNNGCNGGLMDDAFRFVIDNNGIDTE 203 Query: 585 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICGSERAFQS 761 +DYPY+ +C K KLKR+ VTID YTD+PS +EE ++QAVASQPVSVGI GS+ FQ Sbjct: 204 EDYPYKGWDDTCIKKKLKRNAVTIDDYTDVPSNDEEQLLQAVASQPVSVGISGSDMGFQL 263 Query: 762 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 941 YSKGIFNG CSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GYMHM R+ + GI Sbjct: 264 YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGMNGYMHMLRDHSNPKGI 323 Query: 942 CGINTLASY-XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118 CGINTLASY C + T+C AGETCCC +R++GICF W+CCE++SA Sbjct: 324 CGINTLASYPIKTGENPPLPPPGPTRCDIFTHCAAGETCCCAKRVVGICFSWRCCELDSA 383 Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFK----------GTGNSTMVKGLERKKGSFGKLG-G 1265 VCCKD R CCP DYP+CDT+ LC + TGN T K LE +GS K G G Sbjct: 384 VCCKDQRHCCPRDYPICDTERTLCLQSNEQLSTQSHATGNLTS-KALE-SRGSLRKSGRG 441 Query: 1266 WNSLFEAWNL 1295 W S+ W L Sbjct: 442 WGSMIRDWIL 451 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 550 bits (1418), Expect = e-154 Identities = 266/412 (64%), Positives = 305/412 (74%), Gaps = 6/412 (1%) Frame = +3 Query: 57 TSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADLTHH 236 TS LF WC QHGK Y SE+EK YRF+VFEDN F+ HN + NS+YTL+LNAFADLTHH Sbjct: 26 TSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHH 85 Query: 237 EFKSSRFGL---SLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407 EFK++R GL SL +R Q L+ VPS IDWRK GAV+ VKDQ SCGACW Sbjct: 86 EFKATRLGLPPSSLLRFKFNRFQDQQRSDDFLQ-VPSEIDWRKNGAVSIVKDQGSCGACW 144 Query: 408 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVDNKGIDTEK 587 +FSATGAIEGIN+IV+GSLVSLSEQELVDCD TYNSGC GGLMDYA+QF++DN GIDTE+ Sbjct: 145 SFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEE 204 Query: 588 DYPYQAKGTSCNKNKLKRHVVTIDGYTDL-PSKEEEIMQAVASQPVSVGICGSERAFQSY 764 DYPYQA+ C K+KLKR VVTIDGYTD+ P+ E+++++AVA QPVSVGICGS RAFQ Y Sbjct: 205 DYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLY 264 Query: 765 SKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGIC 944 SKGIF G CSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GY+HM RNT G+C Sbjct: 265 SKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLC 324 Query: 945 GINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCEVNSA 1118 GIN LASY C+L TYC GETCCC ++ LGICF WKCC V SA Sbjct: 325 GINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSA 384 Query: 1119 VCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGGWNS 1274 VCCKD R CCP DYPVCD N C K N T++ + K+ F + W S Sbjct: 385 VCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSD-KEDPFHQTRDWRS 435 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 550 bits (1418), Expect = e-154 Identities = 262/417 (62%), Positives = 318/417 (76%), Gaps = 8/417 (1%) Frame = +3 Query: 39 ICFSSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAF 218 +C S SDLF +WC Q+GK YSSE+E++YRF VFE+N +IT HN+ NS+YTL LNA+ Sbjct: 19 LCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYITEHNSKGNSSYTLGLNAY 78 Query: 219 ADLTHHEFKSSRFGLSLAASNLDRSNSQLTGFS---VLRDV--PSSIDWRKKGAVTPVKD 383 +DLTHHEF++S GLS +A++ R + +G S VL DV PSS+DWR KGAVT VK+ Sbjct: 79 SDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVDAPSSLDWRDKGAVTNVKN 138 Query: 384 QASCGACWAFSATGAIEGINQIVSGSLVSLSEQELVDCDKTYNSGCGGGLMDYAFQFVVD 563 Q SCGACW+FSATGAIEGIN+I +GSLVSLSEQEL+DCD++YN GCGGGLMDYAF+FV+ Sbjct: 139 QGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSYNQGCGGGLMDYAFEFVIK 198 Query: 564 NKGIDTEKDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSKEEE-IMQAVASQPVSVGICG 740 N GIDTEKDYP++ K +CNKNKL+R VVTIDGYTD+P +E+ +++AVA+QPVSVGICG Sbjct: 199 NGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDEDKLLKAVATQPVSVGICG 258 Query: 741 SERAFQSYSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARN 920 S RAFQSYSKGIF G C T LDHAVLIVGYGSENG DYWI+KNSWG SWG++GY+HM RN Sbjct: 259 SARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIKNSWGTSWGINGYIHMQRN 318 Query: 921 TGDQLGICGINTLASY--XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHW 1094 +G+Q GICG+N LASY CS T CG GETCCC + LGIC W Sbjct: 319 SGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCGQGETCCCGLKFLGICLSW 378 Query: 1095 KCCEVNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTMVKGLERKKGSFGKLGG 1265 KCC ++SAVCCKD R CCP DYP+CDT LC K N+T+V+ +K+ GK GG Sbjct: 379 KCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIVQ-QPQKEPFTGKFGG 434 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 549 bits (1415), Expect = e-153 Identities = 265/419 (63%), Positives = 315/419 (75%), Gaps = 12/419 (2%) Frame = +3 Query: 48 SSFTSDLFNSWCAQHGKNYSSEEEKLYRFSVFEDNLDFITNHNNMENSTYTLALNAFADL 227 SS +S+LF +WC Q+GK+YSS+EEKLYR S+FE NL FIT HN++ NS+YTL+LN+F+DL Sbjct: 25 SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84 Query: 228 THHEFKSSRFGLSLAASNLDRSNSQLTGFSVLRDVPSSIDWRKKGAVTPVKDQASCGACW 407 THHEFK+SR G S L R + SV+R VPSSIDWRK GAVT VKDQ SCGACW Sbjct: 85 THHEFKASRLGFSPTFLRLYRKSDPKP--SVVRHVPSSIDWRKNGAVTNVKDQGSCGACW 142 Query: 408 AFSATGAIEGINQIVSGSLVSLSEQELVDCDKTY-NSGCGGGLMDYAFQFVVDNKGIDTE 584 +FSATGAIEGIN+IV+GSLVSLSEQEL+DCD+ Y NSGC GGLMD AFQF++DN GIDTE Sbjct: 143 SFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDNNGIDTE 202 Query: 585 KDYPYQAKGTSCNKNKLKRHVVTIDGYTDLPSK-EEEIMQAVASQPVSVGICGSERAFQS 761 +DYPYQ +CNK KLKRHVVTIDGYTD+P+ EE++++AVA+QPVSVGI GS R FQ Sbjct: 203 EDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGSGREFQF 262 Query: 762 YSKGIFNGACSTSLDHAVLIVGYGSENGVDYWILKNSWGKSWGMDGYMHMARNTGDQLGI 941 YSKGIF G CST+LDHAVLIVGYGSENGVDYWI+KNSWGK+WGM+GY+H+ R+ + G+ Sbjct: 263 YSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDHSNSKGL 322 Query: 942 CGINTLASY-----XXXXXXXXXXXXXXXXCSLLTYCGAGETCCCTRRLLGICFHWKCCE 1106 CGIN LASY C L + CG GETCCC R++LGIC W+CCE Sbjct: 323 CGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICLSWRCCE 382 Query: 1107 VNSAVCCKDHRFCCPHDYPVCDTKNKLCFKGTGNSTM----VKGLERKKG-SFGKLGGW 1268 SAVCCKD CCPHDYP+CDT+ C + GN TM ++G RK S KL W Sbjct: 383 FTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLTMRANEIRGSLRKSSRSKAKLSYW 441